The Data Warrior

Changing the world, one data model at a time. How can I help you?

Archive for the category “Big Data”

Snowflake and Spark, Part 2: Pushing Spark Query Processing to Snowflake

Here is the latest post on using Spark and the Snowflake cloud-native data warehouse.

Welcome to the second post in our ongoing blog series describing Snowflake’s integration with Spark. In Part 1, we discussed the value of using Spark and Snowflake together to power an integrated data processing platform, with a particular focus on ETL scenarios.

In this post, we change perspective and focus on performing some of the more resource-intensive processing in Snowflake instead of Spark, which results in significant performance improvements. As part of this, we walk you through the details of Snowflake’s ability to push query processing down from Spark into Snowflake. We also touch on how this pushdown can help you transition from a traditional ETL process to a more flexible and powerful ELT model.
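To make the idea of pushdown concrete, here is a minimal sketch. It does not use the Snowflake Connector for Spark at all: SQLite from the Python standard library stands in for the warehouse, and the `orders` table, its rows, and both function names are hypothetical. It only illustrates the general principle that sending the filter and aggregate to the database moves far fewer rows to the client than pulling everything and processing it locally.

```python
import sqlite3

# SQLite is a stand-in for the warehouse; this is NOT the Snowflake connector API.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("EU", 10.0), ("EU", 15.0), ("US", 7.5), ("US", 2.5), ("APAC", 4.0)],
)


def total_without_pushdown(conn, region):
    """Pull every row to the client, then filter and aggregate locally."""
    rows = conn.execute("SELECT region, amount FROM orders").fetchall()
    total = sum(amount for r, amount in rows if r == region)
    return len(rows), total


def total_with_pushdown(conn, region):
    """Let the database filter and aggregate; only one row comes back."""
    rows = conn.execute(
        "SELECT COALESCE(SUM(amount), 0) FROM orders WHERE region = ?",
        (region,),
    ).fetchall()
    return len(rows), rows[0][0]


rows_moved_a, total_a = total_without_pushdown(conn, "EU")
rows_moved_b, total_b = total_with_pushdown(conn, "EU")
print(rows_moved_a, total_a)
print(rows_moved_b, total_b)
```

Both approaches return the same total, but the second one transfers a single row instead of the whole table. How the real connector decides what to push down is covered in the linked post, not in this sketch.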

Read the rest: Snowflake and Spark, Part 2: Pushing Spark Query Processing to Snowflake

Enjoy!

Kent

The Data Warrior

Cloud Analytics Conference – London!

Next up on The Data Warrior speaking tour 2017 is the Snowflake Cloud Analytics Conference in London on June 1st!

Snowflake is kicking off this year’s Cloud Analytics City Tour with a blowout event in London, England. This will be a full-day, workshop-style event where you get to hear and learn from industry veterans and thought leaders like myself and the CEO of Snowflake Computing, Bob Muglia (to name just a few). In addition, we will have a Practitioner Panel discussion that includes several of our customers along with other industry thought leaders.

The unique value proposition for this event is that in the afternoon you can choose from two tracks of in-depth sessions related to implementing your BI solutions and your data warehouse in the cloud.

I will be presenting my talk Agile Methods and Data Warehousing: How to Deliver Faster. My highly seasoned colleagues from Snowflake (all industry experts) will teach you about loading data in the cloud, deploying BI in the cloud, and how to best use Snowflake to be successful with your cloud analytics program.

And of course there will be food, drinks, and networking.

You can find all the agenda details here along with the registration form. Use discount code DATAWARRIOR for 50% off the registration fee.  Sign up today!

This will be my first time ever in London, so if you are in the area, please come by, say “hi” and learn about the new world of Cloud Analytics.

Until then, cheers!

Kent

The Data Warrior

P.S. I will be in London the day before and after the event, so if you want to have a more detailed or personalized discussion of the benefits of cloud-native data warehousing, please reach out to me at kent.graziano@snowflake.net.

Snowflake and Spark, Part 1: Why Spark? 

Snowflake Computing is making great strides in the evolution of our Elastic DWaaS in the cloud. Here is a recent update from engineering and product management on our integration with Spark:

This is the first post in an ongoing series describing Snowflake’s integration with Spark. In this post, we introduce the Snowflake Connector for Spark (package available from Maven Central or Spark Packages, source code on GitHub) and make the case for using it to bring Spark and Snowflake together to power your data-driven solutions.

Read the rest of the post: Snowflake and Spark, Part 1: Why Spark?

Enjoy!

Kent

The Data Warrior

Building a Data Lake in the Cloud 

Hey fellow data warriors! Here is a new joint blog post I just did with fellow data warrior Dale Anderson from Talend! Check it out. I hope you find the concept compelling!

So you want to build a Data Lake? OK, sure, let’s talk about that. Perhaps you think a Data Lake will eliminate the need for a Data Warehouse and all your business users will merely lure business analytics from it easily. Maybe you think putting everything into Big Data technologies like Hadoop will resolve all your data challenges and deliver fast data processing, with Spark delivering cool Machine Learning insights that magically give you a competitive edge. And really, with NoSQL, nobody needs a data model anymore, right?

Avoid the data swamp! Use a modern cloud-based DWaaS (Snowflake) and a leading-edge Data Integration tool (Talend) to build a Governed Data Lake.

Read the rest: How to Build a Governed Data Lake in the Cloud with Snowflake and Talend

Enjoy!

Kent

The Data Warrior

Cloud Data Warehousing for Dummies

As we all know, cloud is the big thing these days. Getting bigger every day, it seems.

It may get even bigger than Big Data!

If you, like me, are a data warehousing or BI professional, you have probably been wondering how this all fits in the cloud world. You may have even heard of data warehousing “in the cloud”.

But what does that really mean? What is a cloud data warehouse?

Well, thanks to Snowflake Computing, it just got a little easier to answer this question.

They sponsored the development of a new book called Cloud Data Warehousing for Dummies. Yup, an actual Dummies guide for this. And yes, yours truly got to have a hand in editing and writing the book.

And the best part – it is FREE!

Researching and helping to write the book was very educational for me. I learned a lot in the process about what constitutes a cloud data warehouse, the difference between a platform in the cloud and a real service in the cloud, and what characteristics folks should look for when choosing one.

I also learned to say “on-premises” instead of “on-premise.” 🙂

Content

The chapters of the book cover:

  • An introduction to cloud data warehousing
  • Why the modern data warehouse emerged
  • The criteria for selecting a modern data warehouse
  • On-premises vs cloud data warehousing
  • Comparing cloud data warehousing solutions
  • A six-step guide to choosing a cloud data warehouse

It also includes several real-world customer case studies.

Even though Snowflake sponsored the book, it is vendor agnostic. It really is a book designed to get you introduced to the concepts and to get you thinking about what you might want in a cloud-based data warehousing system.

It is ideal for anyone who is considering making that transition to the cloud.

So head on over to this site and download your FREE copy today!

To infinity and beyond!

Kent

The Data Warrior (with his head in the clouds)

P.S. Forward this to a friend so they can download a copy too!

 

 
