STAIR Research Group | Scalable & Trustworthy AI Research
STAIR Research Group | Scalable & Trustworthy AI Research
People
Projects
Talks
Publications
Light
Dark
Automatic
Yilong Xu
Latest
Reward and Guidance through Rubrics: Promoting Exploration to Improve Multi-Domain Reasoning
Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models
Adaptive Token Biaser: Knowledge Editing via Biasing Key Entities
Cite
×