Data Mixture Optimization: A Multi-fidelity Multi-scale Bayesian Framework
Published in NeurIPS 2025
Data mixture optimization should be principled rather than based on guesswork. Our results show that Bayesian optimization outperforms prior ad-hoc methods for choosing data mixtures.
