Tag: data engineering

change data capture time

Terms You Should Know If You’re Planning To Use Change Data Capture

If you’ve worked in data long enough, then you’ve likely come across the term change data capture. Often called CDC, change data capture involves tracking and recording changes in a database as they happen, and then transmitting these changes to designated targets. This can be crucial because some pipelines, in particular batch pipelines, don’t capture…
Read more


April 29, 2024 0
spark vs flink

Apache Spark Vs Apache Flink – What Is The Difference?

As data increased in volume, velocity, and variety, so, in turn, did the need for tools that could help process and manage those larger data sets coming at us at ever faster speeds. As a result, frameworks such as Apache Spark and Apache Flink became popular due to their abilities to handle big data processing…
Read more


April 25, 2024 0
data engineering videos

10 Great Videos To Help You Learn Data Engineering

How data is structured, managed and processed will continue to grow in importance as the demand for AI and machine learning increase. It’s unavoidable that as businesses demand that their data teams implement AI, they will also realize that data engineers are a crucial piece of the data pipeline. That means, if you’re looking for…
Read more


April 20, 2024 0
data engineering project consulting

Common Pitfalls of Data Analytics Projects

Have you ever been part of a data or software project that seems stuck in a loop? Three weeks have passed, and although you arrive at work daily, exhausted, having tackled numerous issues, the project remains stagnant. Why? Then, suddenly, a new engineer or project manager steps in, reorganizes and prioritizes tasks, and just like…
Read more


April 12, 2024 0
apache druid architecture

Apache Druid’s Architecture – How Druid Processes Data In Real Time At Scale

Recently, I wrote an article diving into what Druid is and which companies are using it. Now I wanted to do a deeper dive into Apache Druid’s architecture. Apache Druid has several unique features that allow it to be used as a real-time OLAP. Everything from its various nodes and processes that each have unique…
Read more


March 11, 2024 0
real-time streaming consulting

5 Real-Time Data Processing and Analytics Technologies – And Where You Can Implement Them

No matter your industry, you’ll often need to make split-second business decisions in the digital age. Real-time data can help you do just that. It’s information that’s made available as soon as it’s created, meaning you don’t need to wait around for the insights you need. Real-time data processing can satisfy the ever-increasing demand for…
Read more


March 1, 2024 0
ssis migration project

Alternatives to SSIS(SQL Server Integration Services) – How To Migrate Away From SSIS

SQL Server Integration Services (SSIS) comes with a lot of functionality useful for extracting, transforming, and loading data. It can also play important roles in application development and other projects. But SSIS is far from the only platform that can provide these services. You might seek alternatives to SSIS because you want a more agile…
Read more


February 27, 2024 0
data quality

Why Your Team Needs To Implement Data Quality For Your AI Strategy

Companies that range from start-ups to enterprises are looking to implement AI and ML into their data strategy. With that it’s important not to forget about data quality. Regardless of how fancy or sophisticated a company’s AI model might be, poor data quality will break it. It will make the outputs of these models useless…
Read more


February 12, 2024 0
data ai ready consulting

Data Warehousing Essentials: A Guide To Data Warehousing

Photo by Tiger Lily Data warehouses and data lakes play a crucial role for many businesses. It gives businesses access to the data from all of their various systems. As well as often integrating data so that end-users can answer business critical questions. But if we take a step back and only focus on the…
Read more


February 11, 2024 0
cut data stack costs

Cutting Your Data Stack Costs: How To Approach It And Common Issues

I once had an engineer tell me that they essentially didn’t want to consider cost as they were building a solution. I was baffled. Don’t get me wrong, yes, when you’re building, you iterate and aim to improve your solutions cost. But from my perspective, I don’t think completely ignoring costs from day one is…
Read more


January 5, 2024 0