DataHub is an open-source metadata platform that provides end-to-end data discovery, data observability, and data governance for the modern data stack. Built on a stream-based architecture, it enables real-time metadata ingestion, search, and actionability, allowing data teams to understand the flow, quality, and usage of their data assets across all systems. It functions as a centralized metadata catalog and active metadata management system.




