Shihan Dou

Cited by

	All	Since 2020
Citations	1869	1868
h-index	17	17
i10-index	21	21

Citations per year: 9 (2021), 32 (2022), 217 (2023), 1324 (2024), 282 (2025)

Public access

8 articles available

0 articles not available

Based on funding mandates

Co-authors

Huang Xuanjing (黄萱菁), Professor of Computer Science, Fudan University. Verified email at fudan.edu.cn
Qi Zhang (张奇), Professor of Computer Science, Fudan University. Verified email at fudan.edu.cn
Tao Gui (桂韬), Fudan University. Verified email at fudan.edu.cn
Rui Zheng, Fudan University. Verified email at fudan.edu.cn
Hai Jin, Huazhong University of Science and Technology. Verified email at hust.edu.cn
Xipeng Qiu (邱锡鹏), Professor of Computer Science, Fudan University. Verified email at fudan.edu.cn
Yueming Wu


Shihan Dou

Fudan University

Verified email at m.fudan.edu.cn

Alignment, RLHF, Reward Modeling


Title	Cited by	Year
The rise and potential of large language model based agents: A survey. Z Xi, W Chen, X Guo, W He, Y Ding, B Hong, M Zhang, J Wang, S Jin, ... Science China Information Sciences 68 (2), 121101, 2025	844	2025
Secrets of RLHF in large language models part I: PPO. R Zheng, S Dou, S Gao, Y Hua, W Shen, B Wang, Y Liu, S Jin, Q Liu, ... arXiv preprint arXiv:2307.04964, 2023	149*	2023
VulCNN: An image-inspired scalable vulnerability detection system. Y Wu, D Zou, S Dou, W Yang, D Xu, H Jin. Proceedings of the 44th International Conference on Software Engineering …, 2022	137	2022
Secrets of RLHF in Large Language Models Part II: Reward Modeling. B Wang, R Zheng, L Chen, Y Liu, S Dou, C Huang, W Shen, S Jin, E Zhou, ... arXiv preprint arXiv:2401.06080, 2024	96*	2024
LoRAMoE: Alleviate world knowledge forgetting in large language models via MoE-style plugin. S Dou, E Zhou, Y Liu, S Gao, J Zhao, W Shen, Y Zhou, Z Xi, X Wang, ... arXiv preprint arXiv:2312.09979, 2023	94*	2023
SCDetector: Software functional clone detection based on semantic tokens analysis. Y Wu, D Zou, S Dou, S Yang, W Yang, F Cheng, H Liang, H Jin. Proceedings of the 35th IEEE/ACM international conference on automated …, 2020	66	2020
IntDroid: Android malware detection based on API intimacy analysis. D Zou, Y Wu, S Yang, A Chauhan, W Yang, J Zhong, S Dou, H Jin. ACM Transactions on Software Engineering and Methodology (TOSEM) 30 (3), 1-32, 2021	48	2021
CodeChameleon: Personalized encryption framework for jailbreaking large language models. H Lv, X Wang, Y Zhang, C Huang, S Dou, J Ye, T Gui, Q Zhang, X Huang. arXiv preprint arXiv:2402.16717, 2024	43	2024
EasyJailbreak: A unified framework for jailbreaking large language models. W Zhou, X Wang, L Xiong, H Xia, Y Gu, M Chai, F Zhu, C Huang, S Dou, ... arXiv preprint arXiv:2403.12171, 2024	37	2024
Towards understanding the capability of large language models on code clone detection: A survey. S Dou, J Shan, H Jia, W Deng, Z Xi, W He, Y Wu, T Gui, Y Liu, X Huang. arXiv preprint arXiv:2308.01191, 2023	35*	2023
Loose lips sink ships: Mitigating length bias in reinforcement learning from human feedback. W Shen, R Zheng, W Zhan, J Zhao, S Dou, T Gui, Q Zhang, X Huang. arXiv preprint arXiv:2310.05199, 2023	33	2023
MINER: Improving out-of-vocabulary named entity recognition from an information theoretic perspective. X Wang, S Dou, L Xiong, Y Zou, Q Zhang, T Gui, L Qiao, Z Cheng, ... arXiv preprint arXiv:2204.04391, 2022	32	2022
StepCoder: Improve code generation with reinforcement learning from compiler feedback. S Dou, Y Liu, H Jia, L Xiong, E Zhou, W Shen, J Shan, C Huang, X Wang, ... arXiv preprint arXiv:2402.01391, 2024	27*	2024
Zhiheng Xi W Zhou, X Wang, L Xiong, H Xia, Y Gu, M Chai, F Zhu, C Huang, S Dou Rui Zheng, Songyang Gao, Yicheng Zou, Hang Yan, Yifan Le, Ruohui Wang, Lijun …, 2024	20	2024
ToolEyes: fine-grained evaluation for tool learning capabilities of large language models in real-world scenarios. J Ye, G Li, S Gao, C Huang, Y Wu, S Li, X Fan, S Dou, Q Zhang, T Gui, ... arXiv preprint arXiv:2401.00741, 2024	20	2024
Training large language models for reasoning through reverse curriculum reinforcement learning. Z Xi, W Chen, B Hong, S Jin, R Zheng, W He, Y Ding, S Liu, X Guo, ... arXiv preprint arXiv:2402.05808, 2024	19	2024
Open the Pandora's Box of LLMs: Jailbreaking LLMs through Representation Engineering. T Li, S Dou, W Liu, M Wu, C Lv, X Zheng, X Huang. arXiv preprint arXiv:2401.06824, 2024	18	2024
What's Wrong with Your Code Generated by Large Language Models? An Extensive Study. S Dou, H Jia, S Wu, H Zheng, W Zhou, M Wu, M Chai, J Fan, C Huang, ... arXiv preprint arXiv:2407.06153, 2024	17	2024
Contrastive learning for robust Android malware familial classification. Y Wu, S Dou, D Zou, W Yang, W Qiang, H Jin. IEEE Transactions on Dependable and Secure Computing, 2022	17	2022
MouSi: Poly-visual-expert vision-language models. X Fan, T Ji, C Jiang, S Li, S Jin, S Song, J Wang, B Hong, L Chen, ... arXiv preprint arXiv:2401.17221, 2024	14	2024
