Author: research@theseattledataguy.com

data city consulting

What Is Dask and How Can It Help You as a Data Scientist?

Photo by Nathan John on Unsplash Managing big data no longer means just buying bigger and faster servers It now also means needing to understand the concept of parallel computing. The list of tools and data systems that are helping manage this specific concept continues to grow on a yearly basis. Whether it be using AWS and querying…
Read more


February 26, 2020 0
data science consulting

Big data. Bigger results. Data-driven solutions for banks.

Photo by Sean Pollock on Unsplash Banks from Seattle to New York trust us with their data science solutions. Why’s that? The short answer: while the finance industry has been leveraging data science and statistics for decades, times are changing. Data science and financial strategy are evolving. This means teams need to stay current in order to maximize…
Read more


February 22, 2020 0
data science consulting

5 FinTech Companies Disrupting Banking And Finance

Photo by Sean Pollock on Unsplash You may think of ATM workings as a revolutionary experience but since the advent of Fintech, the entire financial services domain has entered a new era. Whether you purchase a cup of coffee or manage your finances, fintech is everywhere. From payments via apps such as Payoneer or Paypal to getting reports,…
Read more


February 19, 2020 0

5 Great Libraries To Manage Big Data With Python

Python really is everywhere at this point. Although many gatekeepers argue whether a person is really a software developer if they don’t code in a language more difficult than Python, it still is everywhere. It’s used to automate, manage websites, analyze data, and wrangle big data. As data grows, the way we manage it becomes more and…
Read more


February 10, 2020 0

3 Great Streaming Data Systems: Kafka, Flink And Storm

Back in my day, databases and applications used to only sync late at night while everyone was asleep. Now in the modern era, everyone expects their data the second it’s updated (if not somehow magically before the data occurs). Large corporations and Fortune 500 companies depend on this data to be able to predict consumer…
Read more


February 1, 2020 0

Joining Data in DynamoDB and S3 for Live Ad Hoc Analysis

Photo by ZACHARY STAINES on Unsplash Performing ad hoc analysis is a daily part of life for most data scientists and analysts on operations teams. They are often held back by not having direct and immediate access to their data because the data might not be in a data warehouse or it might be stored across…
Read more


January 10, 2020 0

Storing Data In The Cloud In 2020: Using RDS, S3 and Redshift

Photo by Taylor Vick on Unsplash How companies manage their data is no longer limited to traditional relational databases. Amazon Web Services (AWS) for example offers a diverse collection of options when it comes to storing data. We recently wrote a piece solely focused on Redshift, but we wanted to introduce a few more options.…
Read more


December 25, 2019 0

Building a Data Warehouse on Amazon Redshift

Photo by Jezael Melgoza on Unsplash As an organization grows, its data storage, monitoring and analysis requirements also exponentially increase. Traditional data warehouse don’t always easily handle massive amounts of growth. This caused a need for alternative solutions, starting from the mid 2000s. One such solution is Amazon Redshift from Amazon Web Services. What is Amazon…
Read more


December 20, 2019 0

5 AWS Technologies That’ll Make Your Life Easier

Amazon Web Services (AWS) has simplified much of developers’ workflows and development over the past decade. AWS allows engineers to command and control cloud-based infrastructure, data, and other technical pieces of infrastructure without the hassle of developing entire frameworks from scratch. Initially, AWS was launched to take care of online retail operations for Amazon, but…
Read more


December 7, 2019 0
cloud consulting

Airbnb’s Airflow Versus Spotify’s Luigi

Photo by Marcin Jozwiak on Unsplash We recently wrote about ETLs and why they’re important. We wanted to provide an outline for what ETL tools are. You could refer to these ETL tools as workflow tools that help manage moving data from point A to point B. Two of these popular workflow tools are Luigi by Spotify and…
Read more


November 25, 2019 0