DuckLake: Lightweight Data Lake and Catalog in One

2025-05-27
DuckLake: Lightweight Data Lake and Catalog in One

DuckLake offers a lightweight, all-in-one solution for building a data lake and catalog. It enables a 'multiplayer DuckDB' setup with multiple DuckDB instances reading and writing the same dataset—a concurrency model not supported by standard DuckDB. Even if you only use DuckDB for your DuckLake entry point and catalog database, you still benefit from features like time-travel queries, data partitioning, and storing data across multiple files instead of a single, potentially huge, database file.

Development data lake