The Data Warrior

Changing the world, one data model at a time. How can I help you?

Archive for the tag “Data Vault”

A Snow Storm of Snowflake Webinars

Good Monday Morning!

Been itching to learn more about the Snowflake Elastic Data Warehouse? Well, now is your chance.

Over the next two weeks we have a bunch of great webinars coming up so I figured I should just give you a an easy list to review with links to sign up. Here it is:

WEBINAR #1

Wednesday, 04/27/2016 10am PT 

CapSpecialty: Leveraging data to deliver faster business results linked to Key Performance Indicators 

Abstract:

CapSpecialty is upping its game to become the preferred provider of specialty insurance products using MicroStrategy Analytics and Snowflake Cloud Data Warehousing.

Featured partner: MicroStrategy

Hosted by: MicroStrategy

Featured Customer: CapSpecialty

Register here! 

WEBINAR #2

Wednesday, 04/27/2016 11am PT

4 Big Data Strategies You Can’t Go Without 

Abstract:

You’ve got questions about big data, our panel has answers.

When it comes to customer relationships, big data can usher in big opportunities or big problems. That’s why it’s vital for organizations to take a strategic approach to big data. 

They must clean their data, integrate it, maximize data value, comply with security and governance requirements, and make sure the right people have the right access to the data at the right times.

Media partner: CRM Magazine

Hosted by: CRM Magazine 

Featured partner: Informatica + Looker

Featured use case: Pitney Bowes

Featured presenter: Kent Graziano, The Data Warrior

Register here! 

WEBINAR #3

Thursday, 04/28/2016 10am PT

Using the Cloud For Speed-of-Thought Analytics on All Your Data

1.5 TB of data per day? No problem! Learn how Ask.com turned to Snowflake’s cloud-native data warehouse combined with Tableau’s data visualization solution to address their challenges.

Featured partner: Tableau

Hosted by: Snowflake

Featured Customer: Ask.com

Featured presenter: Jon Bock, VP Product and Marketing, Snowflake

Register here!

WEBINAR #4

Thursday, 05/05/2016  10am PT 

The Right Choice: Why Spark + a Cloud Data Warehouse = Success 

The first rule of data analytics for fast-growing companies? Measure all things. When putting in place a robust data analytics strategy to go from measurement to insight, you’ve got lots of options for tools — from databases and data warehouse options to new “big data” tools such as Hadoop, Spark, and their related components. But tools are nothing if you don’t know how to put them to use. 

Media partner: VentureBeat

Hosted by: VentureBeat

Featured Customer: Celtra

Featured presenter: Jon Bock, VP Product and Marketing, Snowflake

Register here!

The Data Warrior Live in Chicago!

Later this week on Thursday April 28th, I will be speaking about Data Vault and Agile Data Engineering at a special Snowflake half-day workshop in downtown Chicago. You can sign up for that here.

So, no excuse for not learning more about Snowflake in the coming weeks. Sogn up for one or more of these events today.

Have a good week!

Kent

The Data Warrior

4 Keys to Succeeding with Agile Data Warehousing in 2016

I have been out giving talks again on using agile methods for data warehouse and business intelligence projects, so I thought it was time for me to share my thoughts about the 4 key elements you need to be successful with an Agile DW project in 2016.

Adopt an Agile Methodology

By this I am talking about SCRUM, Kanban, ScrumBan, or DAD (Disciplined Agile Development), among others.

Go read the blogs, read the books, study these methods. Attend a conference (like Agile Tech in April). Figure out what will work for your organization’s culture and leverage the skills of your staff. One size does not fit all.

In past engagements I have used approaches primarily based on SCRUM and Kanban. Both have been very effective once we got our processes down.

If you need/want help, find a good agile coach.

Use an Agile Data Engineering Approach

If you want to develop your data warehouse in an agile, iterative manner, then you need a way to design your EDW repository that lends itself to this approach without causing huge re-engineering pains (known as refactoring) in future iterations.

The best way I have found is using the Data Vault modeling approach. It was designed specifically for building data warehouses in this manner. I have written much about this approach and give many talks showing examples of successful agile projects using Data Vault. And there is plenty of material available to help you learn how to do it (see the books on the sidebar of this blog).

Also keep an eye on Dan Linstedt’s twitter feed and blog for his training classes.

Use Data Warehouse Automation Software

No better way to get agile and deliver results fast, than to automate as much of your development work as possible. If you use repeatable patterns (like Data Vault) in your design methodology, then it is even easier to automate and greatly reduce your time to market.

There are two vendors in the market that I like a lot and have had some experience with. They are WhereScape and AnalytixDS. And both support not only “traditional” approaches to data warehousing (like automating the ETL for a Type 2 Slowly Changing Dimension) but they both also support Data Vault (and both will be at WWDVC 2016).

Which of these tools you might use depends on your approach, your current tools, and your skills.

If you are coming from a more traditional DW paradigm and use ETL tools like Informatica, Talend, or DataStage, then I would recommend you look at AnalytixDS Mapping Manager which allows you to generate your ETL code from source to target mappings.

If you are just getting started or are committed to more of a database-centric approach and want your ETL or ELT code to run in the database, then look at WhereScape’s products.

Both are great companies with knowledgable people and happy customers.

Your third option is to write your own automation routines. There are many shops doing that as well. Just be sure you have the appropriate skills in house and can allocate the upfront time to get going (a month or so at least).

Deploy on an Agile Data Warehouse Platform

So now that I have learned about Elastic Data Warehousing in the cloud, I can’t imagine trying to do an agile DW project any other way.

Of course I am referring to Snowflake Computing’s DWaaS (data warehouse as a service) offering. Yes, I might be a bit biased since I do work for them now, but…this tech is really good!

From a features perspective, what I am talking about is having a high powered, easily scalable database that supports BI and analytic workloads and does not require a ton of time to configure and tweak.

Why do I think that is a success criteria? Because I have spent way too many months on way too many “agile” projects waiting to get access to the hardware! Or I get access and we either run out of space (e.g., “we had no idea you need THAT much storage”) or we can’t properly test production level loads and queries because the development box does not have enough horsepower.

Taking advantage of the elasticity of the cloud solves both of these problems and the folks at Snowflake have successfully built an RDBMS in the cloud that specifically harnesses these features and leverages them for data warehouse and analytic workloads by providing the ability to scale up and scale down both storage and compute resources on demand.

That and its many other features, give me the infrastructure I need to get an agile data warehouse project off the ground almost instantly. And I can do a Data Vault on Snowflake too.

Very cool.

So what do you think? Are you ready to accelerate your team’s performance and adopt an agile approach to data warehousing?

I hope this post gives you a few ideas on how to make that happen.

Model on!

Kent

The Data Warrior

 

Fire Sale #2: Loading Your Data Vault with Informatica

As I mentioned the other day, Dan and Sanjay have been hard at work re-vamping the Learn Data Vault site and have relaunched it. In celebration they are offering amazing deals on their world class Data Vault Implementation classes.

The 2nd course that is ready to go is Implementing Data Vault with Informatica. This online course walks you through the details of implementing a Data Vault-based data warehouse using Informatica. It includes a ton of examples and templates that you can use in your own ETL work to make it very easy to crank out your load processes.

You can see the full outline for all the topics and modules here.

The Deal

So the deal is that originally the Retail Price was  $1,497. But right now with the re-launched site, this class is marked down to $997  for a short time (a savings of $500). The price will eventually go back to the higher price.

The Better Deal: A Fire Sale

Starting today until February 15th, Learn Data Vault is having a Fire Sale! You can take another $500 off with this special Data Warrior Coupon Code:  DWFS500OFF.

So with the mark down price plus my coupon you can get the Data Vault Informatica Implementation class for only $497 (a savings of $1,000).

But remember is is only good until February 15 (1 week from today!).

So please sign up ASAP before the coupon code expires.

Data Vault Rules!

Happy Coding!

Kent, The Data Warrior

Data Vault Master (CDVDM, CDVP2)

Would You Like to Load a Data Vault like a Master?

If you are about to embark on a Data Vault project this year, and not quite sure the best way to load your data into a Data Vault efficiently, I have a great opportunity for you.

Dan and Sanjay have been hard at work re-vamping the Learn Data Vault site and are now ready to relaunch with a brand new look and feel.

The first course out of the gate is the SQL Implementation course. This online course walks you through the details of implementing a Data Vault-based data warehouse using plain and simple SQL. These are the same techniques I have been successfully using for over 10 years on my Data Vault (and even Non-DV) projects. I learned them from Dan in my very first Data Vault class and have used them ever since.

They just work!

If you are building on a standard relational database platform (on premises or in the cloud), this class gives you all the patterns and code examples you need to move data into not only Hubs, Links, and Sats but your staging areas as well. It also includes the techniques for implementing very efficient change data capture (i.e., delta detection).

You can see the full outline for all the topics and modules here.

The Deal

So the deal is that originally the Retail Price was  $997. But right now with the re-launched site, this class is marked down to $797  for a short time (a savings of $200). This course has sold in the past for $1497 and they could raise it back up at anytime.

The Better Deal: A Fire Sale

Starting today until February 13th, Learn Data Vault is having a Fire Sale! You can take another $400 off with this special Data Warrior Coupon Code:  DWFS400OFF.

So with the mark down price plus my coupon you can get the Data Vault SQL Implementation class for  only $397 (a savings of $600).

But remember is is only good until February 13 (1 week from today!).

UPDATE: If you are reading this after the sale is over you can still get a discount by using my special blog reader discount code: Kent10S which will get you 20% off.

So don’t delay – sign up for the class today. It is the quickest way for your to get productive with your Data Vault implementation I know. (And it does not hurt that you can uses these techniques for lots of non-data vault systems too. I have.)

Why a Fire Sale?

So if this class is so good, why such great sale?

I asked Sanjay the same thing. This is what he told me:

  1. This is really a re-launch on a new platform and they really want to give it a workout.
  2. There may be some small errors on the site that they missed during QA. (The price will go up once they are sure it is completely error free)
  3. They need feedback on the new site to be sure it is the best it can be and this is the best way to get that feedback quickly – get people using the site!
  4. It’s Valentines Day!

So please sign up ASAP before the coupon code expires.

SQL Rules!

Happy Coding!

Kent, The Data Warrior

Data Vault Master (CDVDM, CDVP2)

The Data Warrior Speaks 2016: Updated

As expected, I have been booked to speak a few more places this year.

Here is my updated speaking schedule as of today:

RMOUG Training Days 2016 – February 9-11 in Denver, CO (I have 2 hour deep dive on Feb 9th). Register here.

TDWI Nashville – March 8th in Nashville (of course). I will be discussing how to apply Agile Methods to Data Warehousing. You can get more details (soon) and sign up here.

Tampa Analytics Professionals – March 22 at the St Pete College Epicenter. Again talking about how to apply Agile Methods to Data Warehousing. You can get details and sign up here.

Agile Alliance Technical Conference 2016 – April 7-9 at the Raleigh Marriott Crab Tree Valley in Raleigh, North Carolina. I will present Agile Data Engineering: Introduction to Data Vault Data Modeling on Thursday April 7th. The Super Early Bird and Early Bird rates are still available. Register here.

Enterprise Data World – April 17-22 at the Sheraton Marina in San Diego, California. Register early for discounts. My talk here will be Agile Data Warehousing: Building a Virtualized ODS.

Data Science Maryland Meetup – May 16th (Tentative). I expect to be talking about how to apply Agile Methods to Data Warehousing.Keep your eyes on the meetup page for details and to sign up.

World Wide Data Vault Consortium (WWDVC) – May 25-28 in Stowe, Vermont. I am now confirmed to speaking at WDVC for the 3rd time! And this year, Snowflake Computing will also be a sponsor. My talk this year will be Agile Data Warehousing: Building a Virtualized ODS with Oracle SDDM. Register here soon as this event has limited space.

ODTUG KScope16 – June 26-30 in Chicago, IL. Register early and be sure to book the hotel! My talk this year will be Data Warehousing in the Real World. I will also be running my annual Morning Chi Gung sessions.

And that is the first half of the year. I have nothing confirmed yet for the 2nd half, but am sure something will pop up.

Stay tuned.

I look forward to see y’all at one of these events.

Kent

The Data Warrior

P.S. I will also be working the Snowflake booth at both the Gartner BI and Analytics Summit this March in Dallas, and the HIMSS event in Las Vegas at the end of February. Stop by and say “hi” if you will be at either of these events.

Post Navigation