🌊 Open Lakehouse Storage & Management

  • Apache Iceberg (apache/iceberg): A high-performance open table format for massive analytic datasets. Iceberg brings SQL-like tables to big data engines (like Spark, Trino, and Flink) safely managing multiple tasks concurrently.
    👉 Apache Iceberg GitHub Repository [12]
  • Delta Lake (delta-io/delta): An open-source storage layer that brings reliability to data lakes. It provides ACID transactions, scalable metadata handling, and unifies streaming and batch data processing on top of existing cloud storage.
    👉 Delta Lake GitHub Repository [1234]

You may also like...