Modern data lineage platforms like Collibra Lineage, MANTA, and Alation are built to map data flows between tables, reports, and dashboards. For AI governance, this lineage must extend deeper into the AI/ML pipeline: tracing from source system raw data, through feature engineering in tools like Databricks or SageMaker, into training datasets, and finally to specific model versions in registries like MLflow or SageMaker Model Registry. An AI integration injects intelligence into this extended graph, automatically documenting these new node types (features, models, prompts) and the transformations between them via APIs and metadata hooks.




