Author: research@theseattledataguy.com

3 Great Streaming Data Systems: Kafka, Flink And Storm

Back in my day, databases and applications used to only sync late at night while everyone was asleep. Now in the modern era, everyone expects their data the second it’s updated (if not somehow magically before the data occurs). Large corporations and Fortune 500 companies depend on this data to be able to predict consumer…


February 1, 2020

Joining Data in DynamoDB and S3 for Live Ad Hoc Analysis

Performing ad hoc analysis is a daily part of life for most data scientists and analysts on operations teams. They are often held back by not having direct and immediate access to their data because the data might not be in a data warehouse or it might be stored across…


January 10, 2020

Storing Data In The Cloud In 2020: Using RDS, S3 and Redshift

How companies manage their data is no longer limited to traditional relational databases. Amazon Web Services (AWS), for example, offers a diverse collection of options when it comes to storing data. We recently wrote a piece solely focused on Redshift, but we wanted to introduce a few more options.…


December 25, 2019

Building a Data Warehouse on Amazon Redshift

As an organization grows, its data storage, monitoring and analysis requirements also increase exponentially. Traditional data warehouses don’t always easily handle massive amounts of growth. Starting in the mid-2000s, this created a need for alternative solutions. One such solution is Amazon Redshift from Amazon Web Services. What is Amazon…


December 20, 2019

5 AWS Technologies That’ll Make Your Life Easier

Amazon Web Services (AWS) has simplified much of the developer workflow over the past decade. AWS allows engineers to command and control cloud-based infrastructure, data, and other technical components without the hassle of developing entire frameworks from scratch. Initially, AWS was launched to take care of online retail operations for Amazon, but…


December 7, 2019

Airbnb’s Airflow Versus Spotify’s Luigi

We recently wrote about ETLs and why they’re important. We wanted to provide an outline for what ETL tools are. You could refer to these ETL tools as workflow tools that help manage moving data from point A to point B. Two of these popular workflow tools are Luigi by Spotify and…


November 25, 2019

Healthcare Fraud Detection With Python

This April, a $1.5 billion Medicare scheme took advantage of hundreds of thousands of seniors in the US. In reality, this is just a small sliver of the billions of dollars that healthcare fraud costs both consumers and insurance providers annually. Healthcare fraud can come from many different directions. Some people might think of the…


November 6, 2019

What Are ETLs and Why Are They Important?

Creating a world of self-service analytics

The rise in self-service analytics is a significant selling point in the business intelligence world. Part of the point of creating self-service analytics is having easy access to the data from your organization. The question is how do you get your data from external application…


November 2, 2019

5 Use Cases for DynamoDB

Web-based applications face scaling challenges as their user base grows and their data traffic becomes more complex. Along with the modern complexity of business comes the need to process data faster and more robustly. Because of this, standard transactional databases aren’t always the best fit. Instead, databases such as DynamoDB have been designed…


October 30, 2019

What Is Predictive Modeling?

In this modern world, it is hard to imagine visiting a website that doesn’t automatically personalize what you see or predict what product you will want to buy. It seems like the whole world wide web already knows who we are. Well, this is what predictive modeling enables us to…


October 15, 2019