Provenance is a pre-training requirement. You cannot establish a verifiable chain of custody for a model's outputs if you did not instrument the data pipeline from the first collection event. This makes compliance with frameworks like the EU AI Act impossible.














