WeChat Mini Program
Old Version Features

PKU-SafeRLHF: Towards Multi-Level Safety Alignment for LLMs with Human Preference

Jiaming Ji,Donghai Hong,Borong Zhang,Boyuan Chen, Juntao Dai, Boren Zheng,Tianyi Qiu, Jiayi Zhou, Kaile Wang, Boxuan Li, Sirui Han, Yike Guo,Yaodong Yang

arxiv(2024)

Cited 0|Views20
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined