STAIR Research Group | Scalable & Trustworthy AI Research
STAIR Research Group | Scalable & Trustworthy AI Research
People
Projects
Talks
Publications
Light
Dark
Automatic
"Not Aligned" is Not "Malicious": Being Careful about Hallucinations of Large Language Models' Jailbreak
Lingrui Mei
,
Shenghua Liu
,
Yiwei Wang
,
Baolong Bi
,
Jiayi Mao
,
Xueqi Cheng
January 2024
Cite
DOI
PDF
Type
Preprint
Publication
CoRR, 2024, vol. abs/2406.11668
Cite
×