Apache Iceberg Streaming Workload Updates: Copy-on-Write vs Merge-on-Read

Ryft


How do you handle updates in streaming Apache Iceberg workloads? 🧊

Two options:

1. Copy-on-Write (CoW): every update rewrites, in full, each data file that contains an updated row. Queries stay fast, but write latency is high.
2. Merge-on-Read (MoR): every update writes a delete file, which is merged with the data files at read time. Write latency stays lower, but queries slow down as delete files accumulate.

Most streaming workloads choose MoR for the latency benefits, but that means committing to more aggressive compaction and retention to manage delete files.

See what each strategy means for write latency, reads, and compaction → https://lnkd.in/dG8BUQvw
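
The trade-off between the two strategies can be sketched with a toy in-memory model. This is only an illustration under simplifying assumptions: `ToyTable` and its methods are hypothetical names, not the Iceberg API, "files" are plain dictionaries, and write cost is approximated by counting rows written.

```python
from dataclasses import dataclass, field


@dataclass
class ToyTable:
    """Hypothetical stand-in for a table; NOT the Iceberg API."""
    data_files: list[dict[int, str]]  # each "file" maps row id -> value
    # each "delete file" holds (data file index, row id) positions to skip
    delete_files: list[set[tuple[int, int]]] = field(default_factory=list)
    rows_written: int = 0  # rough proxy for write cost

    def _deleted(self, file_idx: int, row_id: int) -> bool:
        return any((file_idx, row_id) in d for d in self.delete_files)

    def update_cow(self, row_id: int, value: str) -> None:
        """Copy-on-Write: rewrite the whole file that holds the row."""
        for i, f in enumerate(self.data_files):
            if row_id in f:
                new_file = dict(f)  # full copy of the file
                new_file[row_id] = value
                self.data_files[i] = new_file
                self.rows_written += len(new_file)
                return
        raise KeyError(row_id)

    def update_mor(self, row_id: int, value: str) -> None:
        """Merge-on-Read: write a small delete file plus the new row."""
        for i, f in enumerate(self.data_files):
            if row_id in f and not self._deleted(i, row_id):
                self.delete_files.append({(i, row_id)})
                self.data_files.append({row_id: value})
                self.rows_written += 2  # one delete entry + one new row
                return
        raise KeyError(row_id)

    def read(self) -> dict[int, str]:
        """Scan every data file, skipping rows masked by delete files."""
        result: dict[int, str] = {}
        for i, f in enumerate(self.data_files):
            for row_id, value in f.items():
                if not self._deleted(i, row_id):
                    result[row_id] = value
        return result

    def compact(self) -> None:
        """Fold delete files back into a single clean data file."""
        merged = self.read()
        self.data_files = [merged]
        self.delete_files = []
        self.rows_written += len(merged)


cow = ToyTable([{i: "v0" for i in range(1000)}])
mor = ToyTable([{i: "v0" for i in range(1000)}])
for n in range(3):
    cow.update_cow(7, f"v{n + 1}")
    mor.update_mor(7, f"v{n + 1}")
```

After three updates to a single row, the CoW table has rewritten the 1,000-row file three times, while the MoR table has written only a handful of rows but now has to consult three delete files on every read until `compact()` runs. In Iceberg itself, the strategy is selected per operation through the table properties `write.update.mode`, `write.delete.mode`, and `write.merge.mode`, each set to `copy-on-write` or `merge-on-read`.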
