Blog


Common Pitfalls in Deploying Airflow for Data Teams

If you’re a data engineer, you’ve likely at least heard of Airflow. Apache Airflow is one of the most popular open-source workflow orchestration tools for data pipelines. That popularity is what spurred me to write the article “Should You Use Airflow,” because there are plenty of people who don’t enjoy Airflow or…


November 27, 2023

Apache Druid: Who’s Using It and Why?

Over the past few decades, the need for faster data has grown. One catalyst was the push for better data and better decisions in advertising. In fact, adtech has driven much of the real-time data technology that we have today. For example, Reddit uses a real-time database to provide…


November 17, 2023

How To Set Up Your Data Analytics Team For Success – Centralized vs Decentralized vs Federated Data Teams

Success in the data world hinges on team setup. I’ve delved into onboarding and standards in previous articles, but never into the structure of data teams. Typically, there are three configurations: Centralized, Decentralized, and Federated. Most companies I’ve seen use a mix of these. While the newest tech breakthroughs…


September 13, 2023

What Is SSIS and Should You Use It?

SSIS, short for SQL Server Integration Services, is an essential data migration tool for modern businesses. As a key part of Microsoft’s SQL Server database software, it allows you to easily complete many complex tasks, including data extraction, merging, loading and transformation, aggregation, and more. It’s a comprehensive solution to your data management needs. In today’s business landscape,…


September 2, 2023

Why Is Data Modeling So Challenging – How To Data Model For Analytics

Learning how to data model from basic star schemas on the internet is like learning data science using the Iris data set. It works great as a toy example, but it doesn’t match real life at all. Data modeling in real life requires that you fully understand the data…


August 9, 2023

Data Warehouses Vs Operational Data Stores Vs Data Lakes – How To Store Your Data For Analytics

A few months ago, I uploaded a video where I discussed data warehouses, data lakes, and transactional databases. However, the world of data management is evolving rapidly, especially with the resurgence of AI and machine learning. There are numerous other methods that technical teams are utilizing to handle…


August 3, 2023

4 Alternatives to Fivetran: The Evolving Dynamics of the ETL & ELT Tool Market

The ETL & ELT tool market is experiencing continuous transformation, propelled by fluctuating pricing structures and the advent of inventive alternatives. This industry remains fiercely competitive due to these changing elements and a swiftly growing user base. In the following sections, we will explore four emerging alternatives to Fivetran. Of course, that is if you…


July 16, 2023

What Is Change Data Capture

Some data teams need to have their data near real-time for dashboards and reporting. So how can they implement a near real-time data pipeline? One possible choice is a method called change data capture, also known as CDC. I have seen companies employ multiple ways to use CDC or CDC-like approaches to pull data from…


July 9, 2023

7 Data Engineering Projects To Put On Your Resume

Starting new data engineering projects can be challenging. Data engineers can get stuck on finding the right data for their project or picking the right tools. Many of my YouTube followers agree: in a recent poll, they confirmed that starting a new data engineering project was difficult. Here were the key…


May 21, 2023

OLTP Vs OLAP – What Is The Difference

If you’re relying on your OLTP system to provide analytics, you might be in for a surprise. While it can work initially, these systems aren’t designed to handle complex analytical queries. Adding databases like MongoDB and Cassandra only makes matters worse, since they aren’t SQL-friendly, and SQL is the language most analysts and data practitioners are used to.…


May 8, 2023