blue-star-01-1

Extreme scale engineering

Discover the latest trends and best practices impacting data-intensive applications. Register for access to all 60+ sessions available on demand.

Building a Database Replication Platform at Scale

Joy Gao17 minutes
Share this
Share this

Register for access to all 60+ sessions available on demand.

Fill out the form to watch this session from the Monster Scale Summit livestream. You’ll also get access to all available recordings.

In this Monster Scale Summit Presentation

Cross-system database replication at petabyte scale is far harder than it looks, especially when bridging transactional OLTP systems with column-oriented OLAP stores like ClickHouse. This talk covers the architectural decisions, trade-offs, and operational patterns behind ClickHouse's database integration platform, including how it handles the fundamental mismatch between transactional semantics and eventual consistency at scale.

Joy Gao, Software Engineer, ClickHouse

Joy is a software engineer at ClickHouse. She works on the data ingestion platform, ClickPipes, specializing in CDC-based replication from various databases to ClickHouse. Previously, she worked on infrastructure and platform engineering at Figma, including its real-time data system, LiveGraph. Before that, she worked on streaming and batch data pipelines at WePay, and was a contributor to Debezium and a committer for Airflow. She is passionate about software craftsmanship, reliability, performance, and FOSS. Outside of work, she is an avid kitesurfer.