Building ML models at scale requires large, diverse datasets. Raw training data is often sensitive, siloed across organisations, and requires significant preprocessing before it can be used. Moving it creates risk; leaving it in place limits model quality.