Metadrive: Composing diverse driving scenarios for generalizable reinforcement learning Q Li, Z Peng, L Feng, Q Zhang, Z Xue, B Zhou IEEE transactions on pattern analysis and machine intelligence 45 (3), 3461-3475, 2022 | 203 | 2022 |
Trafficgen: Learning to generate diverse and realistic traffic scenarios L Feng, Q Li, Z Peng, S Tan, B Zhou 2023 IEEE International Conference on Robotics and Automation (ICRA), 3567-3575, 2023 | 76 | 2023 |
Reinforcement-learning-and belief-learning-based double auction mechanism for edge computing resource allocation Q Li, H Yao, T Mai, C Jiang, Y Zhang IEEE Internet of Things Journal 7 (7), 5976-5985, 2019 | 68 | 2019 |
Learning to simulate self-driven particles system with coordinated policy optimization Z Peng, Q Li, KM Hui, C Liu, B Zhou Advances in Neural Information Processing Systems 34, 10784-10797, 2021 | 67 | 2021 |
Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization Q Li, Z Peng, B Zhou International Conference on Learning Representations, 2022 | 46 | 2022 |
Safe driving via expert guided policy optimization Z Peng, Q Li, C Liu, B Zhou Conference on Robot Learning, 1554-1563, 2022 | 39 | 2022 |
ScenarioNet: Open-Source Platform for Large-Scale Traffic Scenario Simulation and Modeling Q Li, Z Peng, L Feng, C Duan, W Mo, B Zhou Advances in neural information processing systems, 2023 | 33 | 2023 |
Improving the generalization of end-to-end driving through procedural generation Q Li, Z Peng, Q Zhang, C Liu, B Zhou CVPR workshop, 2020 | 22 | 2020 |
Hybrid internal model: Learning agile legged locomotion with simulated robot response J Long, Z Wang, Q Li, L Cao, J Gao, J Pang The Twelfth International Conference on Learning Representations, 2024 | 17* | 2024 |
Cat: Closed-loop adversarial training for safe end-to-end driving L Zhang, Z Peng, Q Li, B Zhou Conference on Robot Learning, 2357-2372, 2023 | 15 | 2023 |
Learning from active human involvement through proxy value propagation ZM Peng, W Mo, C Duan, Q Li, B Zhou Advances in neural information processing systems 36, 2024 | 11 | 2024 |
Human-AI Shared Control via Policy Dissection Q Li, Z Peng, H Wu, L Feng, B Zhou Advances in Neural Information Processing Systems, 2022 | 10 | 2022 |
Guarded Policy Optimization with Imperfect Online Demonstrations Z Xue, Z Peng, Q Li, Z Liu, B Zhou International Conference on Learning Representations, 2023 | 8 | 2023 |
Learning H-Infinity Locomotion Control J Long, W Yu, Q Li, Z Wang, D Lin, J Pang arXiv preprint arXiv:2404.14405, 2024 | 3 | 2024 |
Metaurban: A simulation platform for embodied ai in urban spaces W Wu, H He, Y Wang, C Duan, J He, Z Liu, Q Li, B Zhou arXiv e-prints, arXiv: 2407.08725, 2024 | 2 | 2024 |
GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction X Chen, Q Li, T Wang, T Xue, J Pang Computer Vision and Pattern Recognition (CVPR), 2024 | 2 | 2024 |
A Holistic Framework Towards Vision-based Traffic Signal Control with Microscopic Simulation P He, Q Li, X Yuan, B Zhou arXiv preprint arXiv:2403.06884, 2024 | | 2024 |