Data Lake Change Data Capture (CDC) using Apache Hudi on Amazon EMR, Part 2: Process

Easily process data changes over time from your database to your data lake using Apache Hudi on Amazon EMR

Manoj Kukreja
Towards Data Science
6 min read · Oct 22, 2020

Image by Gino Crescoli from Pixabay

In a previous article, we discussed how to seamlessly collect CDC data using AWS Database Migration Service (DMS).


Author, Big Data Engineering, Data Science, Data Lakes, Cloud Computing and IT security specialist.