🌊 Open Lakehouse Storage & Management
- Apache Iceberg (
apache/iceberg): A high-performance open table format for massive analytic datasets. Iceberg brings SQL-like tables to big data engines (like Spark, Trino, and Flink) safely managing multiple tasks concurrently.
👉 Apache Iceberg GitHub Repository [1, 2] - Delta Lake (
delta-io/delta): An open-source storage layer that brings reliability to data lakes. It provides ACID transactions, scalable metadata handling, and unifies streaming and batch data processing on top of existing cloud storage.
👉 Delta Lake GitHub Repository [1, 2, 3, 4]