Author: research@theseattledataguy.com

data science consulting

5 FinTech Companies Disrupting Banking And Finance

Photo by Sean Pollock on Unsplash You may think of ATM workings as a revolutionary experience but since the advent of Fintech, the entire financial services domain has entered a new era. Whether you purchase a cup of coffee or manage your finances, fintech is everywhere. From payments via apps such as Payoneer or Paypal to getting reports,…
Read more


February 19, 2020 0

5 Great Libraries To Manage Big Data With Python

Python really is everywhere at this point. Although many gatekeepers argue whether a person is really a software developer if they don’t code in a language more difficult than Python, it still is everywhere. It’s used to automate, manage websites, analyze data, and wrangle big data. As data grows, the way we manage it becomes more and…
Read more


February 10, 2020 0

3 Great Streaming Data Systems: Kafka, Flink And Storm

Back in my day, databases and applications used to only sync late at night while everyone was asleep. Now in the modern era, everyone expects their data the second it’s updated (if not somehow magically before the data occurs). Large corporations and Fortune 500 companies depend on this data to be able to predict consumer…
Read more


February 1, 2020 0

Joining Data in DynamoDB and S3 for Live Ad Hoc Analysis

Photo by ZACHARY STAINES on Unsplash Performing ad hoc analysis is a daily part of life for most data scientists and analysts on operations teams. They are often held back by not having direct and immediate access to their data because the data might not be in a data warehouse or it might be stored across…
Read more


January 10, 2020 0

Storing Data In The Cloud In 2020: Using RDS, S3 and Redshift

Photo by Taylor Vick on Unsplash How companies manage their data is no longer limited to traditional relational databases. Amazon Web Services (AWS) for example offers a diverse collection of options when it comes to storing data. We recently wrote a piece solely focused on Redshift, but we wanted to introduce a few more options.…
Read more


December 25, 2019 0

Building a Data Warehouse on Amazon Redshift

Photo by Jezael Melgoza on Unsplash As an organization grows, its data storage, monitoring and analysis requirements also exponentially increase. Traditional data warehouse don’t always easily handle massive amounts of growth. This caused a need for alternative solutions, starting from the mid 2000s. One such solution is Amazon Redshift from Amazon Web Services. What is Amazon…
Read more


December 20, 2019 0

5 AWS Technologies That’ll Make Your Life Easier

Amazon Web Services (AWS) has simplified much of developers’ workflows and development over the past decade. AWS allows engineers to command and control cloud-based infrastructure, data, and other technical pieces of infrastructure without the hassle of developing entire frameworks from scratch. Initially, AWS was launched to take care of online retail operations for Amazon, but…
Read more


December 7, 2019 0
cloud consulting

Airbnb’s Airflow Versus Spotify’s Luigi

Photo by Marcin Jozwiak on Unsplash We recently wrote about ETLs and why they’re important. We wanted to provide an outline for what ETL tools are. You could refer to these ETL tools as workflow tools that help manage moving data from point A to point B. Two of these popular workflow tools are Luigi by Spotify and…
Read more


November 25, 2019 0

Healthcare Fraud Detection With Python

This April a 1.5 billion dollar medicare scheme took advantage of hundreds of thousands of seniors in the US. In reality, this is just a small sliver of the billions of dollars healthcare fraud costs both consumers and insurance providers annually. Healthcare fraud can come from many different directions. Some people might think of the…
Read more


November 6, 2019 0

What Are ETLs and Why Are They Important?

Creating a world of self-service analytics Photo by chuttersnap on Unsplash The rise in self-service analytics is a significant selling point in the business intelligence world. Part of the point of creating self-service analytics is having easy access to the data from your organization. The question is how do you get your data from external application…
Read more


November 2, 2019 0