All Things Open 2017 - Data Washing Machine


This year marks the second time I have attended All Things Open, and it continues to be awesome! There were some amazing keynote speakers, including Sara Chipps of Jewelbots and Kelsey Hightower of Google Cloud Platform.

Today, I was honored to present on how Marketing Operations at Red Hat tackles the problem of data quality. Specifically, we dived into how we abstracted the process that many data scientists and data engineers use in a more ad-hoc manner.

Below is the slide deck, along with a link to the GitHub repo:

https://www.slideshare.net/secret/KRDIMK9nJNikmW

https://github.com/rh-marketingops/dwm

Hope to see you at ATO 2018!


Feel free to connect with me!

 
