Unity Catalog

Industry’s only universal catalog for data and AI
Colorful circular chart illustrating the 3 components of the Unity Catalog service: Tables, AI Assets, and Unstructured Data.

Interoperability

A multimodal interface provides interoperability across lakehouse formats such as Delta Lake and Apache Iceberg™, major cloud platforms, and various compute engines.

Openness

Open APIs and an open source server offer maximum flexibility and customer choice by ensuring broad interoperability.

Unified Governance for data and AI

Built-in governance and security for tabular and non-tabular data, as well as AI assets such as Gen AI tools.

Interoperability across any format and engine

Unity Catalog supports Delta Lake, Apache Iceberg™ via UniForm, Parquet, CSV, JSON, and many other formats. It also implements the Iceberg REST Catalog APIs to interoperate with a broad ecosystem.

Colorful circular chart illustrating components of the Unity Catalog including Delta Lake, Apache Hudi, and Apache Iceberg, divided into sectors for Tables, Functions, Models, and AI Assets.
Diagram illustrating the components of Unity Catalog's openness features, including Unity REST APIs, Iceberg REST catalog API, and Delta Sharing. The central graphic represents Unity Catalog's integration and accessibility, emphasizing data sharing and cataloging through standardized REST APIs and protocols.

Openness

Unity Catalog is Apache 2.0 licensed, including an OpenAPI specification, server and clients. Adoption of open standards maximizes flexibility and customer choice by ensuring extensive interoperability across various engines, tools, and platforms.

Multi-format support (Delta, Iceberg, and Hudi via UniForm)
Beyond tables - Unstructured data (Volumes) and AI assets (ML models, Gen AI tools)
Plugin support - extensible to Iceberg REST Catalog and HMS interface for client compatibility, plus additional plugins (e.g., integration with a new AI framework)
Delta Sharing open protocol for sharing tabular and non-tabular assets cross-domain
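
As a rough illustration of what an open REST interface means in practice, the sketch below builds request URLs for a locally running Unity Catalog server and lists a resource using only the Python standard library. The host, port, API prefix, and query parameter names are assumptions made for this example; the OpenAPI specification in the repository is the authoritative source for the actual paths and parameters.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumptions for illustration only: a server on localhost:8080 that exposes
# its REST API under this prefix. Verify both against the OpenAPI specification.
BASE_URL = "http://localhost:8080"
API_PREFIX = "/api/2.1/unity-catalog"


def build_url(resource, **params):
    """Build the URL for a catalog resource such as 'catalogs' or 'tables'."""
    url = f"{BASE_URL}{API_PREFIX}/{resource.strip('/')}"
    # Drop parameters that were not supplied, then encode the rest.
    query = urlencode({k: v for k, v in params.items() if v is not None})
    return f"{url}?{query}" if query else url


def list_resource(resource, **params):
    """Send a GET request and return the decoded JSON response body."""
    with urlopen(build_url(resource, **params), timeout=10) as resp:
        return json.load(resp)
```

With a server running, `list_resource("catalogs")` would return the JSON listing of catalogs, and `list_resource("tables", catalog_name="unity", schema_name="default")` would list the tables in one schema. The catalog and schema names here are placeholders.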

Unified governance for data and AI

Unity Catalog has built-in governance and security – with strong authentication, secure credential vending, and asset-level access control to protect your data and AI assets with a unified solution. Manage unstructured data, such as images and documents, and Gen AI tools with a single, universal catalog.
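
To make the combination of asset-level access control and credential vending more concrete, here is a toy model in Python. It is not Unity Catalog's implementation: the class names, the privilege strings, and the 15-minute lifetime are all hypothetical, chosen only to show the general pattern of checking a grant on a specific asset before handing out a short-lived, narrowly scoped credential.

```python
import secrets
from dataclasses import dataclass, field
from datetime import datetime, timedelta, timezone


@dataclass(frozen=True)
class TemporaryCredential:
    """A short-lived token scoped to one asset and one privilege (hypothetical)."""
    asset: str
    privilege: str
    token: str
    expires_at: datetime


@dataclass
class ToyCatalog:
    """Toy stand-in for a catalog that stores grants per (principal, asset)."""
    grants: dict = field(default_factory=dict)

    def grant(self, principal, asset, privilege):
        self.grants.setdefault((principal, asset), set()).add(privilege)

    def is_allowed(self, principal, asset, privilege):
        return privilege in self.grants.get((principal, asset), set())

    def vend_credential(self, principal, asset, privilege, ttl_minutes=15):
        # Asset-level check first: no grant on this asset, no credential.
        if not self.is_allowed(principal, asset, privilege):
            raise PermissionError(f"{principal} lacks {privilege} on {asset}")
        return TemporaryCredential(
            asset=asset,
            privilege=privilege,
            token=secrets.token_urlsafe(16),
            expires_at=datetime.now(timezone.utc) + timedelta(minutes=ttl_minutes),
        )
```

The design point the sketch tries to capture is that clients never hold long-lived storage keys: the catalog decides per asset and per principal, and anything it hands out expires on its own.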

Diagram showcasing Unity Catalog's data governance features, including strong authentication, asset-level access control, and credential vending. The image highlights secure interaction between Unity Catalog and external clients, ensuring robust access management and data security.

Be future-ready with a broad and open ecosystem

Unity Catalog OSS, with its universal interface, provides broad interoperability across the modern data stack, including all major cloud platforms, compute engines, and data and AI platforms, as well as data catalog and governance solutions.

Endorsed by industry leaders and innovators

“Microsoft is committed to the open-source community and empowering customers with choice. Databricks has been a strategic partner for years and it's great to see them open-sourcing Unity Catalog. We believe truly open standards with broad industry participation are in customers' best interests. Our collaboration with Databricks continues to elevate Microsoft Azure as the best choice for data and AI workloads.”
Jessica Hawk
CVP Data, AI and Digital Applications, Microsoft
"AWS welcomes Databricks' move to open source Unity Catalog. AWS is committed to working with the industry on open source solutions that enable choice and interoperability for customers.”
Chris Grusz
Managing Director of Technology Partnerships, AWS
"Google is committed to open, flexible solutions that empower customers to maximize the value of their data. Databricks’ strategy to open up the Unity Catalog standard for data and AI aligns very well with our strategy"
Ritika Suri
Director, Data and AI Technology Partnerships, Google Cloud
"Enterprise data is essential to developing accurate generative AI applications. NVIDIA  works closely with our partner ecosystem to support open-source offerings like Databricks  Unity Catalog, which can help customers curate efficient and powerful development  pipelines."
Pat Lee
VP of Strategic Enterprise Partnerships, NVIDIA
"Salesforce Data Cloud is built from the ground up on Open Standards with Apache Parquet and Apache Iceberg. Our zero copy innovations enable customers to unlock data, derive insights and orchestrate actions across the Customer 360. Databricks' embrace of Apache Iceberg via UniForm and Unity Catalog addresses key interoperability challenges between Delta Lake and Iceberg. We are excited to have Databricks as a member of our Zero Copy Partner Network and look forward to joint innovations with the new open Unity Catalog, delivering compelling customer value in structured data, unstructured data and AI models"
Raveendrnathan Loganathan
Executive Vice President of Software Engineering at Salesforce
"Databricks's decision to open source Unity Catalog is an exciting development for the data and AI community. We're excited to partner with Databricks to integrate Unity Catalog with LangChain, which allows our shared users to build advanced agents using Unity Catalog functions as tools."
Harrison Chase
CEO & Founder, LangChain
“Unstructured is the leading unstructured data ETL solution for LLMs - helping organizations transform their data from raw to RAG-ready. Our partnership with Unity Catalog OSS makes perfect sense, as we break down data silos and accelerate AI/ML development in enterprises. We are excited to partner with Databricks to develop this open standard for AI use cases and to standardize metadata for unstructured data – helping our customers operate at the cutting edge of AI.”
Brian Raymond
CEO & Founder, UnstructuredIO
“At Eventual, we have built Daft, the leading open source distributed query engine for multimodal data. We believe that unifying compute for tabular and unstructured data is not enough and that a multimodal catalog is crucial to build GenAI data lakehouses. We are excited to partner with Databricks and other AI innovators to develop the Unity Catalog open standard for modern data+AI workloads.”
Sammy Sidhu
CEO & Founder, Eventual Computing
“At Granica, we champion data democratization and freedom from vendor lock-in. Our Safe Room technology ensures privacy, trust, and safety in generative AI workflows while supporting open standards like Unity Catalog, Delta Lake, and Apache Iceberg. Unity Catalog's vendor-neutral architecture and robust governance solutions align with our vision of providing customers with flexibility and control over their data. We are excited to contribute to this open ecosystem, driving innovation and enabling customers to seamlessly work with their data across best-of-breed platforms”
Rahul Ponnala
CEO & Co-Founder, Granica
“Open sourcing Unity Catalog is a pivotal step towards a more collaborative and innovative data ecosystem. By making this technology accessible, Databricks is fostering an environment where the entire community can contribute to and benefit from enhanced data governance and management capabilities. This move aligns with our vision at Onehouse and Apache XTable (Incubating) to support open format interoperability that drives progress and innovation for all.”
Vinoth Chandar
CEO & Co-Founder, Onehouse
"Confluent's mission is to set data in motion and enable organizations to take advantage of their data everywhere. We're excited to see Databricks make a significant contribution to an open data ecosystem with Unity Catalog becoming open sourced. Tableflow on Confluent Cloud will enable easy delivery of real-time data to places like a data lake by turning data streams into Iceberg tables with a single click. By combining our industry-leading streaming capabilities with Databricks' robust data management solutions, customers will be able to put their data to work more effectively than ever."
Shaun Clowes
CPO, Confluent
"Together, Databricks and dbt Cloud help users break down data silos to collaborate effectively, simplify ETL to lower TCO with Delta Lake, and unify governance with Unity Catalog. We are thrilled to announce our support for Unity Catalog OSS and the open APIs. This partnership underscores our commitment to providing a unified data experience, empowering our community to achieve greater insights and drive innovation."
Mark Porter
CTO, dbt Labs
"We are thrilled to see Databricks open source Unity Catalog as an open standard for data and AI. This move will provide our customers with greater choice and flexibility in their data ecosystem, ensuring seamless integration and maximizing interoperability with Fivetran's platform as they ingest critical data to Databricks."
Anjan Kundavaram
CPO, Fivetran
"The exposure of native access patterns within Unity Catalog has transformed how our business is able to streamline access to data and apply governance rules at scale - with no performance impact. Databricks continued investment in a community to accelerate services to make data controls easier to build allows our customers to govern with greater ease and manage the massive volume of new data consumers being onboarded in the age of AI."
Matthew Carroll
CEO, Immuta
"We are excited to see the opportunity for our joint customers as Databricks open-sources Unity Catalog as an open standard for data and AI. With Unity Catalog OSS and the Informatica intelligent Data Management Cloud, customers can gain greater choice, flexibility and interoperability in their data ecosystems."
Brett Roscoe
GM and SVP Cloud Data Governance and Cloud Operations, Informatica
“Unity Catalog OSS represents a huge step forward for open source innovation, and Microsoft Fabric direct integration with the open APIs will give our customers more flexibility and interoperability than ever to optimize their data strategy.”
John Smith
VP, Microsoft Fabric
"We are excited to see Databricks open up Unity Catalog for data and AI. We look forward to the enhanced interoperability and governance that the open standard will bring, enabling customers to seamlessly integrate and manage their data across various tools and platforms."
Ganapathy Krishnamoorthy
VP, AWS Analytics Services
"We are thrilled to see Databricks open up the Unity Catalog standard for data and AI. This development aligns with our commitment to providing open, flexible solutions that empower customers to maximize the value of their data. We look forward to the enhanced interoperability and governance capabilities that Unity Catalog brings to customers."
Jane Smith
VP, Google BigQuery

Get started today!

View the GitHub repo or join the Unity Catalog open source community to learn more about the roadmap.