Real-Time Feeding of a Data Lake with PostgreSQL and Debezium
PostgreSQL Users Group Belgium
Julien RIOU
February 13, 2024
Speaker
Julien RIOU
Open Source DBA
https://julien.riou.xyz
@jriou
@hachyderm.io
Summary
Who are we?
Internal databases
Data Lake
ETL
CDC
Other uses
The future
Who are we?
Internal databases
Statistics
3
DBMS (MySQL, MongoDB, PostgreSQL)
7
autonomous infrastructures worldwide
500+
servers
2000+
databases
100+
clusters
Highly secure environments
Cluster example
Mutualized environments
Analytics needs
Billing
Revenue
Enterprise strategy
KPIs
Fraud detection
Electrical consumption
Metadata analysis (from JIRA)
Work time detection of support teams
Mix of workloads