RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs

Publication
CoRR, 2025, vol. abs/2503.15888