The Data Warrior

Changing the world, one data model at a time. How can I help you?

Archive for the tag “Data Vault”

How’s your surfing?

I read this today on one of my favorite blogs – Zenhabits, and it definitely spoke to me:

We are not walking a path, but surfing a sea.

Most people look at goal setting as picking a destination, then figuring out a path to get there. That assumes you’re walking on land that will change very little, and that while you will have unforeseen obstacles, you’ll be on stable ground and the destination won’t move. That’s not at all true — life is more like the sea, ever changing with no fixed paths or destinations, with swells and currents and waves that change everything at every moment. The ultimate skill, then, isn’t setting a destination (goal) or a path (plan), but surfing. In surfing, you take whatever waves come, learn to judge the waves as they come, learn to ride the wave as it changes, not as you planned. It’s going with the flow (literally), and changing what you do depending on how the flow changes. (via » Why We Overplan :zenhabits.)

For years, every time someone asked about how I got to where I am in my career, I often found myself at a loss to give them a seemingly satisfactory answer.

What Leo wrote above articulates really well what I have been doing (unconsciously) most of my life – going with the flow. I have only been on a surfboard once (yes, even data warriors surf), but the analogy fits really well in my mind. (BTW – a good downhill ski run or shooting some white water fits too.)

It “feels” like the right answer.

Oddly (or not?) it fits with a classic quote from my martial arts hero Bruce Lee: “Be water”

Pretty Zen, right?

So what does this have to do with data modeling, data warehousing, etc?

Mostly I have found in doing agile (or agile-like) projects, the team needs to be like water, or really like a surfer on the water, and go with the flow through the sprints and iterations.

Changing directions at a moment’s notice as the users’ needs and priorities change.

Embrace the change.

Doing so without judgment or expectation.

Flow around the obstacles and blockages – or risk crashing on the reef!

So, let go of all the goals and set-in-stone project plans. Embrace the flow and see where you might go.

Who knows, you might hang 10 on the biggest wave of your life!

Aloha.

Kent

P.S. If you want to learn to be a better data surfer, check out the Data Vault Learning Portal and learn how to implement the most agile data modeling technique around – Data Vault.

Want to be a Data Vault implementation Black Belt?

Are you tired of seeing failed data warehouse projects?

Tired of being part of the problem or having to clean up after someone else messed it up?

Well, now you can be part of the solution and kick implementation failures in the <you know what>.

I am pleased to tell you that my good friend, Dan Linstedt, creator of the Data Vault, has just launched a new, online, Data Vault training portal.

And it is now open to the public!

The first class you can get is on Data Vault implementation.

It is way cool!

The quality is excellent and the material is even better (including material I have never seen in a class). Dan provides tons of information about not only the right way to implement the Data Vault but gives examples of how he has done it and gives you code templates (for multiple databases) you can implement on a real project.

Why would you want to sign up for this training? Well, lots of reasons:

  1. You read the Data Vault modeling book, but can’t quite see how to load the model after it is built.
  2. It’s less expensive than face to face training. No time off or travel required!
  3. You can rewind and watch the training at your own pace (no need to feel behind or ahead of the rest of the class).
  4. You get access to the course for an entire year instead of 1 to 3 days in a lecture format. So you can watch it over and over again.
  5. You get to ask Dan questions directly (and you can even engage and interact with other students).
  6. Dan is going to host tele-seminars for members only where you can ask him any question (without having to pay his normal consulting fees).
  7. It is currently on sale at a huge discount.

This is really a great deal.

So, what are you waiting for?

Head on over to the site now and get started! (If you are ready to buy and want to skip the sales stuff, just scroll to the bottom and hit “add to cart”. So why are you still here?)

You can’t get this material, or direct access to the guy who invented it, anywhere else.

Doesn’t get much better than that.

Later.

Kent

More free stuff!

Hey gang,

I have been working hard over the past few weeks to find some of my old white papers so I could make them available to everyone on my blog site. Well, I finally found a few of them on some flash drives and figured out how to upload them here to WordPress.

If you look above you should now see a new menu item called “White papers”. Click that link to get access to the papers I have found so far.

They are FREE for you to download. I am not even asking you to “opt in” or anything.

I just ask that you respect the copyrights and tell folks where you found them (share on Facebook, LinkedIn, Tweet it, etc).

I know there are more, but I have to figure out which ones are still useful (or at least moderately so). So be sure to check back often to see what I have added.

If you remember any I did in the past that you might want a copy of, tell me in the comments (below) and I will see if I can find it.

Oh and as a bonus, I have also included a copy of my recent “Introduction to Data Vault Data Modeling” article just in case you have not read it yet.

Hope you find some of these useful. Have a great week!

Kent

P.S. I am thinking about publishing some of these, with minor revisions, to Kindle. Do you think that would be useful to any of you?

Is the Data Vault too complex?

This was a very interesting topic that came up on LinkedIn the other day, so I wanted to address it here too.

There seem to be quite a few people who think that Data Vault models are harder to build and manage than what we do today in most shops. So let me explain how I came to learn Data Vault Data Modeling.

Before learning Data Vault, I had successfully built several 3NF OLTP, 3NF DW, and Kimball-style Dimensional data warehouses (and wrote about it with Bill Inmon and Len Silverston in the original Data Model Resource Book).

In other words, I had a reasonable amount of experience in the subject area (data modeling and data warehousing).

I personally found Data Vault extremely easy to learn as a modeling technique (once I took the time to study it a bit). At the time that meant reading the old white papers, attending some lunch & learns with Dan Linstedt and then building a few sample models.

I was definitely skeptical at first (and asked lots of questions at the public lunch & learns). I did not care about MPP, scalability, or many of the other benefits Dan mentioned. I just knew from experience there were a few issues I had seen with the other approaches when it came to building a historical enterprise data store and was hoping Data Vault might be a solution.

In comparison to trying to learn how to design and load a Type 2 slowly changing dimension, Data Vault was a piece of cake (for me anyway).

Once I was convinced, I then introduced the technique to my team in Denver – who had virtually no data warehouse experience.

It was universal – everyone from the modelers to the DBAs to the ETL programmers found the technique very easy to learn.

Our investment: One week of training from Dan for 7 people and 3 or 4 days of follow-on consulting where Dan came in once a month (for a day) to do a QA review on our models and load routines and mentor us on any issues we were having.

Dan did not make much $$ off of us. 😦

Since then, I have found that experienced 3NF modelers pick up the technique in no time flat.

Why is that?

Because Data Vault relies on solid relational principles, experienced 3NF modelers seem to grasp it pretty fast.

Modelers who only have experience with star schemas, on the other hand, seem to have a bit of a hard time with the approach. For some of them it is a paradigm shift in modeling technique (i.e., it feels very unfamiliar – “too many tables and joins”); for others it is almost a dogmatic objection, as they were (sadly) taught that dimensional/star was the only “right” way to do data warehousing.

They are just not open to a new approach for any reason (sad but true). 😦

The biggest issue I have seen with clients is a reluctance to try the approach for fear of failure because they don’t personally know anyone (other than me) who has used the approach and because they think it is easier (and cheaper?) to find dimensional modelers.

This happens, even if they agree in concept that Data Vault sounds like a very valid and flexible modeling approach.

As we all know, it takes $$ to train people on star schema design too, so my advice is that if you have a team of people who know 3NF but don’t know dimensional, train them on Data Vault to build your EDW, then hire one or two dimensional modelers to build your end user reporting layer (i.e., data marts) off the Data Vault.

So that’s my 25 cent testimonial. (You get it for free!)

If you want to learn more about Data Vault, check out my presentations on SlideShare or click on the Super Charge book cover (below my picture in the sidebar) to buy the Data Vault modeling book.

Check it out and let me know what you think in the comments. How do we get people over the fear of trying Data Vault?

Talk to you later.

Kent

Data Vault Certification Class – Day 3 (It’s a wrap!)

Well, not so cold today in Montreal. Instead we got a very cold rain and snow mix. Yuk. (But I definitely want to come back in the summer!)

This morning, Dan dived into how to load the Data Vault with all new material he has not taught in the certification class before. We really got lucky by attending this class. I knew most of the concepts, and have implemented most of them, but his new slides are just killer. They really get the point across and cleared up a few points for me too.

Not only do the slides include sample SQL for various parts of the load, and the logical patterns, Dan even demonstrated some working code on his really cool, high-powered laptop. It was great for the class to see the Data Vault concepts put into practice. (And he of course had some more tales to tell.)

Cool Phrase for the Day

Short Circuit Boolean Evaluation: A mathematical concept that Dan laid on us, used to get very fast results from the Data Vault change detection code. We use it in doing column compares on attribute data to determine if a Satellite row has changes.

In Oracle it looks like this: decode(src_att1, trg_att1,0,1) = 1

In ANSI-SQL it is a bit longer but has the same effect:

CASE WHEN ((src_att1 IS NULL AND trg_att1 IS NULL) OR src_att1 = trg_att1)

THEN 1 ELSE 0 END = 0

I have been using this for years (learned it from Dan) but had no idea there was a math term for it.

Okay so I am a geek. 🙂
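For anyone who wants to try the column compare outside of class, here is a minimal sketch of the same null-safe check, run through Python's built-in sqlite3 module. The table names (src, trg), the key column (hub_key), and the helper function are my own assumptions for illustration; they are not from Dan's slides or code templates. It only shows the ANSI-style CASE pattern flagging rows whose attribute differs, with two NULLs treated as "no change".

```python
import sqlite3

# Null-safe "did this attribute change?" predicate, following the ANSI-style
# CASE pattern. Table aliases and column names are hypothetical.
CHANGED = """
CASE WHEN (s.att1 IS NULL AND t.att1 IS NULL) OR s.att1 = t.att1
     THEN 1 ELSE 0 END = 0
"""


def changed_keys(rows_src, rows_trg):
    """Return the keys whose att1 differs between source and target rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE src (hub_key INTEGER PRIMARY KEY, att1 TEXT)")
    conn.execute("CREATE TABLE trg (hub_key INTEGER PRIMARY KEY, att1 TEXT)")
    conn.executemany("INSERT INTO src VALUES (?, ?)", rows_src)
    conn.executemany("INSERT INTO trg VALUES (?, ?)", rows_trg)
    query = (
        "SELECT s.hub_key FROM src s "
        "JOIN trg t ON t.hub_key = s.hub_key "
        f"WHERE {CHANGED} ORDER BY s.hub_key"
    )
    keys = [row[0] for row in conn.execute(query)]
    conn.close()
    return keys


if __name__ == "__main__":
    source = [(1, "a"), (2, "b"), (3, None), (4, None), (5, "e")]
    target = [(1, "a"), (2, "x"), (3, None), (4, "d"), (5, None)]
    print(changed_keys(source, target))
```

With more than one attribute you would OR several of these compares together. Whether the database actually stops at the first compare that comes back true depends on the engine, so treat the speed-up as something to measure rather than assume.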

The Test

After all that cool stuff came the certification test.

Not easy. My hand cramped writing out the answers.

We get our results next week. (Dan has a fun weekend ahead of him doing a bunch of scoring).

I am sure everyone in class will do fine. As I said, they all seemed to get it.

Anyway, the class is over now and I am in a hotel in Vermont (where it is snowing now). I fly back to Houston in the morning.

I had a good week here in the northeast (despite the weather). It was definitely worth the time and money to come for this class. I met some great people, learned a lot, and got to spend time with my good friend Dan.

Watch out Montreal – you are about to be descended upon by a whole new batch of Data Vault experts.

It could change the way you do data warehousing.

Later.

Kent
