STAIR Research Group | Scalable & Trustworthy AI Research
Preprint
Reward and Guidance through Rubrics: Promoting Exploration to Improve Multi-Domain Reasoning
Baolong Bi, Shenghua Liu, Yiwei Wang, Siqian Tong, Lingrui Mei, Yuyao Ge, Yilong Xu, Jiafeng Guo, Xueqi Cheng
A Survey of Vibe Coding with Large Language Models
Yuyao Ge, Lingrui Mei, Zenghao Duan, Tianhao Li, Yujia Zheng, Yiwei Wang, Lexin Wang, Jiayu Yao, Tianyu Liu, Yujun Cai, Baolong Bi, Fangda Guo, Jiafeng Guo, Shenghua Liu, Xueqi Cheng
Not in Sync: Unveiling Temporal Bias in Audio Chat Models
Jiayu Yao, Shenghua Liu, Yiwei Wang, Rundong Cheng, Lingrui Mei, Baolong Bi, Zhen Xiong, Xueqi Cheng
Focusing by Contrastive Attention: Enhancing VLMs' Visual Reasoning
Yuyao Ge, Shenghua Liu, Yiwei Wang, Lingrui Mei, Baolong Bi, Xuanshan Zhou, Jiayu Yao, Jiafeng Guo, Xueqi Cheng
Context-DPO: Aligning Language Models for Context-Faithfulness
Baolong Bi, Shaohan Huang, Yiwei Wang, Tianchi Yang, Zihan Zhang, Haizhen Huang, Lingrui Mei, Junfeng Fang, Zehao Li, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang, Shenghua Liu
A Survey of Context Engineering for Large Language Models
Lingrui Mei, Jiayu Yao, Yuyao Ge, Yiwei Wang, Baolong Bi, Yujun Cai, Jiazhi Liu, Mingyu Li, Zhong-Zhi Li, Duzhen Zhang, Chenlin Zhou, Jiayi Mao, Tianze Xia, Jiafeng Guo, Shenghua Liu
RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs
Baolong Bi, Shenghua Liu, Xingzhang Ren, Dayiheng Liu, Junyang Lin, Yiwei Wang, Lingrui Mei, Junfeng Fang, Jiafeng Guo, Xueqi Cheng
Decoding by Contrasting Knowledge: Enhancing LLMs' Confidence on Edited Facts
Baolong Bi, Shenghua Liu, Lingrui Mei, Yiwei Wang, Pengliang Ji, Xueqi Cheng
Is Factuality Enhancement a Free Lunch For LLMs? Better Factuality Can Lead to Worse Context-Faithfulness
Baolong Bi, Shenghua Liu, Yiwei Wang, Lingrui Mei, Xueqi Cheng
a1: Steep Test-time Scaling Law via Environment Augmented Generation
Lingrui Mei, Shenghua Liu, Yiwei Wang, Baolong Bi, Yuyao Ge, Jun Wan, Yurong Wu, Xueqi Cheng