Lakehouse
Jump to navigation
Jump to search
External
- https://www.cidrdb.org/cidr2021/papers/cidr2021_paper17.pdf Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics by Michael Armbrust, Ali Ghodsi, Reynold Xin, Matei Zaharia
Overview
An architectural pattern used to implement data warehouses that is based on open direct-access data formats (such as Apache Parquet), has support for machine learning and data science and offers state-of-the-art performance. It is based on the concept of Data Lake.