📝 Publications

  • Notes: (*) indicates equal contribution and (†) indicates the corresponding author.

🎙 Multimodal Foundation Models

Arxiv Position Paper Towards Building Specialized Generalist AI with System 1 and System 2 Fusion, Kaiyan Zhang*, Biqing Qi*, Bowen Zhou.

Arxiv Survey Paper A Survey of Reinforcement Learning for Large Reasoning Models, Kaiyan Zhang, …, Zhiyuan Ma, Ganqu Cui, Zhiyuan Liu, Biqing Qi†, Ning Ding, Bowen Zhou.

Technical Report Multimodal Large Language Models InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency, Weiyun Wang, …, Biqing Qi, Jiaye Ge, Qipeng Guo, Wenwei Zhang, Wanli Ouyang, Limin Wang, Min Dou, Xizhou Zhu, Tong Lu, Dahua Lin, Jifeng Dai, Bowen Zhou, Weijie Su, Kai Chen, Yu Qiao, Wenhai Wang, Gen Luo.

Technical Report Hybrid Diffusion Language Models SDAR: A Synergistic Diffusion-AutoRegression Paradigm for Scalable Sequence Generation, Shuang Cheng, Yihan Bian, Dawei Liu, Yuhua Jiang, Yihao Liu, Linfeng Zhang, Wenhai Wang, Qipeng Guo, Kai Chen, Biqing Qi†, Bowen Zhou.

  • Low-Cost AR-to-BlockDiffusion
  • 2-4× Faster Inference
  • Advanced performance on science reasoning benchmarks (e.g., GPQA and ChemBench)

Arxiv Hybrid Model Architecture Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism, Yuhua Jiang, Shuang Cheng, Yihao Liu, Ermo Hua, Che Jiang, Weigao Sun, Yu Cheng, Feifei Gao, Biqing Qi†, Bowen Zhou.

CVPR 2024 Continual Learning Cognition-Inspired Interactive Continual Learning: Fast and Slow Thinking, Biqing Qi, Xinquan Chen, Junqi Gao, Dong Li, Jianxing Liu, Ligang Wu, Bowen Zhou.

  • This work was the first to propose the concept of interactive continual learning.
  • Instantiated through the Cognitive Complementarity Theory (System 1 and System 2).
  • An advanced continual learning framework with the novel structured key-value pairs memory unit.
  • A potential framework to develop Specialized Generalist AI.

ACL 2025 Alignment (Oral) Intuitive Fine-Tuning: Towards Unifying SFT and RLHF into a Single Process, Ermo Hua, Biqing Qi†, Kaiyan Zhang, Yue Yu, Ning Ding, Xingtai Lv, Kai Tian, Bowen Zhou.

NeurIPS 2025 Reasoning Reinforcement Learning TTRL: Test-Time Reinforcement Learning, Yuxin Zuo, Kaiyan Zhang, Shang Qu, Li Sheng, Xuekai Zhu, Biqing Qi, Youbang Sun, Ganqu Cui, Ning Ding, Bowen Zhou.

TCSVT 2025 Continual Learning Contrastive Augmented Graph2Graph Memory Interaction for Few Shot Continual Learning, Biqing Qi, Junqi Gao, Xingquan Chen, Dong Li, Jianxing Liu, Ligang Wu, Bowen Zhou.

ICML 2025 Position Embedding Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization, Ermo Hua, Che Jiang, Xingtai Lv, Kaiyan Zhang, Ning Ding, Youbang Sun, Biqing Qi†, Yuchen Fan, Xuekai Zhu, Bowen Zhou.

🌱 Multi-Agent Systems

Technical Report Multi-Agent Systems MARTI: A Framework for Multi-Agent LLM Systems Reinforced Training and Inference, Kaiyan Zhang, …, Youbang Sun, Zhiyuan Ma, Ganqu Cui, Lei Bai, Ning Ding, Biqing Qi†, Bowen Zhou.

CVPR 2025 Model Merging (Highlight) Less is More: Efficient Model Merging with Binary Task Switch, Biqing Qi, Fangyuan Li, Zhen Wang, Junqi Gao, Dong Li, Peng Ye, Bowen Zhou.

NeurIPS 2025 Model Merging Bohdi: Heterogeneous LLM Fusion with Automatic Data Exploration, Junqi Gao, Zhichang Guo, Dazhi Zhang, Dong Li, Runze Liu, Pengfei Li, Kai Tian, Biqing Qi†.

Arxiv Test Time Scaling Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling, Runze Liu, Junqi Gao, Jian Zhao, Kaiyan Zhang, Xiu Li, Biqing Qi†, Wanli Ouyang, Bowen Zhou.

Arxiv Test Time Scaling GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning, Jian Zhao, Runze Liu, Kaiyan Zhang, Zhimu Zhou, Junqi Gao, Dong Li, Jiafei Lyu, Zhouyi Qian, Biqing Qi†, Xiu Li, Bowen Zhou.

ACL 2025 Test Time Scaling Graph Counselor: Adaptive Graph Exploration via Multi-Agent Synergy to Enhance LLM Reasoning, Junqi Gao, Xiang Zou, Ying Ai, Dong Li, Yichen Niu, Biqing Qi†, Jianxing Liu.

👄 Applications

COLM 2024 Scientific Discovery Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation, Biqing Qi, Kaiyan Zhang, Kai Tian, Haoxiang Li, Zhang-Ren Chen, Sihang Zeng, Ermo Hua, Hu Jinfang, Bowen Zhou.

ACL 2025 Scientific Discovery Many Heads Are Better Than One: Improved Scientific Idea Generation by A LLM-Based Multi-Agent System, Haoyang Su, Renqi Chen, Shixiang Tang, Zhenfei Yin, Xinzhe Zheng, Jinzhe Li, Biqing Qi, Qi Wu, Hui Li, Wanli Ouyang, Philip Torr, Bowen Zhou, Nanqing Dong.

EMNLP 2025 Scientific Discovery ReviewRL: Towards Automated Scientific Review with RL, Sihang Zeng, Kai Tian, Kaiyan Zhang, Yuru Wang, Junqi Gao, Runze Liu, Sa Yang, Jingxuan Li, Xinwei Long, Jiaheng Ma, Biqing Qi†, Bowen Zhou.

Arxiv GUI Agents ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data, Zhaoyang Liu, JingJing Xie, Zichen Ding, Zehao Li, Bowen Yang, Zhenyu Wu, Xuehui Wang, Qiushi Sun, Shi Liu, Weiyun Wang, Shenglong Ye, Qingyun Li, Zeyue Tian, Gen Luo, Xiangyu Yue, Biqing Qi, Kai Chen, Bowen Zhou, Yu Qiao, Qifeng Chen, Wenhai Wang.

Arxiv GUI Agents ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows, Qiushi Sun, Zhoumianze Liu, Chang Ma, Zichen Ding, Fangzhi Xu, Zhangyue Yin, Haiteng Zhao, Zhenyu Wu, Kanzhi Cheng, Zhaoyang Liu, Jianing Wang, Qintong Li, Xiangru Tang, Tianbao Xie, Xiachong Feng, Xiang Li, Ben Kao, Wenhai Wang, Biqing Qi, Lingpeng Kong, Zhiyong Wu.

Arxiv GUI Agents OS-MAP: How Far Can Computer-Using Agents Go in Breadth and Depth?, Xuetian Chen, Yinghao Chen, Xinfeng Yuan, Zhuo Peng, Lu Chen, Yuekeng Li, Zhoujia Zhang, Yingqian Huang, Leyan Huang, Jiaqing Liang, Tianbao Xie, Zhiyong Wu, Qiushi Sun, Biqing Qi†, Bowen Zhou.

NeurIPS 2024 D&B Track Scientific Discovery (Spotlight) UltraMedical: Building Specialized Generalists in Biomedicine, Kaiyan Zhang, Sihang Zeng, Ermo Hua, Ning Ding, Zhang-Ren Chen, Zhiyuan Ma, Haoxiang Li, Ganqu Cui, Biqing Qi, Xuekai Zhu, Bowen Zhou.

Arxiv Scientific Discovery MolSpectLLM: A Molecular Foundation Model Bridging Spectroscopy, Molecule Elucidation, and 3D Structure Generation, Shuaike Shen, Jiaqing Xie, Zhuo Yang, Antong Zhang, Shuzhou Sun, Ben Gao, Tianfan Fu, Biqing Qi†, Yuqiang Li.

Arxiv Scientific Discovery Chem3DLLM: 3D Multimodal Large Language Models for Chemistry, Lei Jiang, Shuzhou Sun, Biqing Qi, Yuchen Fu, Xiaohua Xu, Yuqiang Li, Dongzhan Zhou, Tianfan Fu.

Arxiv Scientific Discovery SpectrumWorld: Artificial Intelligence Foundation for Spectroscopy, Zhuo Yang, Jiaqing Xie, Shuaike Shen, Daolang Wang, Yeyun Chen, Ben Gao, Shuzhou Sun, Biqing Qi, Dongzhan Zhou, Lei Bai, Linjiang Chen, Shufei Zhang, Jun Jiang, Tianfan Fu, Yuqiang Li.

Arxiv Embodied Agents CliMRS: Cooperative Large-Language-Model Driven Heterogeneous Multi-robot Systems, Siqi Song, Xuanbing Xie, Zonglin Li, Yuqiang Li, Shijie Wang, Biqing Qi†.

EMNLP 2024 Embodied Agents MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making, Dayuan Fu*, Biqing Qi†, Yihuai Gao, Che Jiang, Guanting Dong, Bowen Zhou.