a1: Steep Test-time Scaling Law via Environment Augmented Generation

Publication
arXiv preprint arXiv:2504.14597, 2025