The Data Warrior

Changing the world, one data model at a time. How can I help you?

Archive for the tag “agile data warehouse”

Why Automation is No Longer a Choice for Your Data Architecture

Back in the saddle again for The Data Warrior! Here is a piece I just did for the folks at Wherescape about one of my favorite topics – Automation!

The world of data has changed for sure. Especially over the past several years. In fact, the pandemic accelerated some changes, like the migration to cloud-based data platforms. When everyone needed to be remote, it just made sense to move to the cloud and use a service for your data platform.

Along with that came more data, more data types, and an actual business needs to move faster. Companies had to adapt very quickly during the pandemic if they wanted to survive. Many did and thrived while others, well, not so much.

As the demand for data continues to grow at unprecedented rates, and as it becomes a non-negotiable asset for organizational success, the requirement to rapidly deliver value from that data (i.e., turn it into information for data-driven decision making) has become an imperative.

So how do we deliver value faster with our data warehouses, data meshes, and enterprise data hubs? Automate, automate, automate.

Check out the full post here – Why Automation is No Longer a Choice for Your Data Architecture

Enjoy!

Kent

The Data Warrior

Building a Real-time Data Vault in Snowflake?

Yes you can! The #DataCloud loves #DataVault!

In this day and age, with the ever-increasing availability and volume of data from many types of sources such as IoT, mobile devices, and weblogs, there is a growing need, and yes, demand, to go from batch load processes to streaming or “real-time” (RT) loading of data. Businesses are changing at an alarming rate and are becoming more competitive all the time. Those that can harness the value of their data faster to drive better business outcomes will be the ones to prevail.

One of the benefits of using the Data Vault 2.0 architecture is that it was designed from inception not only to accept data loaded using traditional batch mode (which was the prevailing mode in the early 2000s when Dan introduced Data Vault) but also to easily accept data loading in real or near-realtime (NRT). In the early 2000s, that was a nice-to-have aspect of the approach and meant the methodology was effectively future-proofed from that perspective. Still, few database systems had the capacity to support that kind of requirement. Today, RT or at least NRT loading is almost becoming a mandatory requirement for modern data platforms. Granted, not all loads or use cases need to be NRT, but most forward-thinking organizations need to onboard data for analytics in an NRT manner.

See all the details (and some code) in the full post over on Data Vault Alliance.

Happy Vaulting!

Kent

The Data Warrior

Data Vault 2.0 Automation with erwin and Snowflake

I am seeing a HUGE uptick in interest in Data Vault around the globe. Part of the interest is the need for agility in building a modern data platform. One of the benefits of the Data Vault 2.0 method is the repeatable patterns which lend themselves to automation.  I am please to pass on this great new post with details on how to automate building your Data Vault 2.0 architecture on Snowflake using erwin! Thanks to my buddy John Carter at erwin for taking this project on.

The Data Vault methodology can be applied to almost any data store and populated by almost any ETL or ELT data integration tool. As Snowflake Chief Technical Evangelist Kent Graziano mentions in one of his many blog posts, “DV (Data Vault) was developed specifically to address agility, flexibility, and scalability issues found in the other mainstream data modeling approaches used in the data warehousing space.” In other words, it enables you to build a scalable data warehouse that can incorporate disparate data sources over time. Traditional data warehousing typically requires refactoring to integrate new sources, but when implemented correctly, Data Vault 2.0 requires no refactoring.

Successfully implementing a Data Vault solution requires skilled resources and traditionally entails a lot of manual effort to define the Data Vault pipeline and create ETL (or ELT) code from scratch. The entire process can take months or even years, and it is often riddled with errors, slowing down the data pipeline. Automating design changes and the code to process data movement ensures organizations can accelerate development and deployment in a timely and cost-effective manner, speeding the time to value of the data.

Snowflake’s Data Cloud contains all the necessary components for building, populating, and managing Data Vault 2.0 solutions. erwin’s toolset models, maps, and automates the creation, population, and maintenance of Data Vault solutions on Snowflake. The combination of Snowflake and erwin provides an end-to-end solution for a governed Data Vault with powerful performance.

Get the rest of the details here: Data Vault Automation with erwin and Snowflake

Vault away my friends!

Kent

The Data Warrior

Better Data Modeling: Agile Data Engineering

You asked for it, you got it!

Ever since I wrote my Kindle book on Agile Data Engineering and Data Vault 2.0, many, many people have asked me to provide it in a hardcopy format. Well, I finally managed to find time to convert that ebook into a paperback book (I even corrected a few errors in the process).

If you forgot what the book was about, here is the description:

This book will give you a short introduction to Agile Data Engineering for Data Warehousing and Data Vault 2.0. I will explain why you should be trying to become Agile, some of the history and rationale for Data Vault 2.0, and then show you the basics for how to build a data warehouse model using the Data Vault 2.0 standards.In addition, I will cover some details about the Business Data Vault (what it is) and then how to build a virtual Information Mart off your Data Vault and Business Vault using the Data Vault 2.0 architecture.So if you want to start learning about Agile Data Engineering with Data Vault 2.0, this book is for you.

So here it is – Introduction to Agile Data Engineering – now available to purchase on Amazon.

Get your copy now. Next time you see me at an event, I will be happy to sign it for you. 🙂

Enjoy!

Kent

The Data Warrior

Adapting Agile Principles for Data Warehousing

Despite the clear benefits of data warehousing programs, business units often question what they perceive to be long delivery timelines and a lack of data accessibility. Nearly half (48%) of the respondents in a TDWI survey reported that personnel in their organizations spend at least 61% of their time finding and preparing data. Only 28% said that their business users and analysts can access and analyze new data, including external data, without close IT support.

Check out how to solve these type of issues, and more, by applying the principles of Agile to data warehousing in my newest post and ebook:

Adapting the Agile Manifesto Principles to Data Warehousing

Cheers.

Kent, The Data Warrior &

Chief Technical Evangelist, Snowflake

Post Navigation

%d bloggers like this: