Welcome to my academic homepage! I am currently pursuing my Ph.D. as a jointly trained doctoral student at Harbin Institute of Technology, Shenzhen and Great Bay University, since September 2024.
Research Interests: Multimodal Large Language Models (MLLMs), Vision-Language-Action (VLA).
Paper Submissions: Multiple papers submitted to top-tier conferences and journals including CVPR, TIP, ICML, AAAI, etc.
🔥 News
- 2026.05: 🎉🎉 One paper accepted by TIP 2026
- 2026.01: 🎉🎉 One paper accepted by ICLR 2026
- 2025.11: 🎉🎉 One paper accepted by AAAI 2026 as Oral
- 2025.07: 🎉🎉 Two papers accepted by ACM MM 2025
- 2024.09: Started my Ph.D. at Harbin Institute of Technology, Shenzhen&&Great Bay University
- 2024.04: 🎉🎉 One paper accepted by IEEE TITS
- 2023.09: 🎉🎉 One paper accepted by IVC
- 2023.03: Started internship at SenseTime as an Algorithm Intern
📝 Publications

UniEmo: Unifying emotional understanding and generation with learnable expert queries
Yijie Zhu, Lingsen Zhang, Zitong Yu, Rui Shao, Tao Tan, Liqiang Nie
IEEE Transactions on Image Processing (CCF A Journal)

CoEmoGen: Towards Semantically-Coherent and Scalable Emotional Image Content Generation
Kaishen Yuan, Yuting Zhang, Shang Gao, Yijie Zhu, Wenshuo Chen, Yutao Yue
ICLR (CCF A Conference)

Yijie Zhu, Rui Shao, Ziyang Liu, Jie He, Jizhihui Liu, Jiuru Wang, Zitong Yu
AAAI Oral (CCF A Conference)

Yijie Zhu, Yibo Lyu, Zitong Yu, Rui Shao, Kaiyang Zhou, Liqiang Nie
ACM MM (CCF A Conference)

Yibo Lyu, Rui Shao, Gongwei Chen, Yijie Zhu, Weili Guan, Liqiang Nie
ACM MM (CCF A Conference)

MENet: Multi-modal mapping enhancement network for 3D object detection in autonomous driving
Moyun Liu, Youping Chen, Jingming Xie†, Yijie Zhu†, Yang Zhang, Lei Yao, Zhenshan Bing, Genghang Zhuang, Kai Huang, Joey Tianyi Zhou
IEEE Transactions on Intelligent Transportation Systems (CCF B Journal)

BF3D: Bi-directional fusion 3D detector with semantic sampling and geometric mapping
Yijie Zhu, Jingming Xie, Moyun Liu, Lei Yao, Youping Chen
Image and Vision Computing (CCF C Journal)
📄 Preprints

DeltaVLA: Prior-Guided Vision-Language-Action Models via World Knowledge Variation
Yijie Zhu, Jie He, Rui Shao, Kaishen Yuan, Tao Tan, Xiaochen Yuan, Zitong Yu
🎓 Educations
-
Harbin Institute of Technology, Shenzhen & Great Bay University, Sep 2024 – Present
Ph.D. Student in Computer Science and Technology -
Huazhong University of Science and Technology, Sep 2021 – Jun 2024
Master’s Degree in Mechanical Engineering -
Donghua University, Sep 2017 – Jun 2021
Bachelor’s Degree in Mechanical Engineering
💼 Internships
-
SenseTime, Feb 2023 – Aug 2023 Algorithm Intern -
Shanghai Hehe Information Technology Co., Ltd., Jul 2022 – Sep 2022 Algorithm Intern
🎖 Honors and Awards
- Outstanding Graduate, Donghua University
- Outstanding Graduate, Huazhong University of Science and Technology
👨🎓 Academic Service
- Reviewer
International Conference on Learning Representations (ICLR)
Proceedings of the IEEE International Conference on Computer Vision (ICCV)
International Conference on Machine Learning (ICML)
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI)
ACM Multimedia (ACM MM)
Pattern Recognition
IEEE Transactions on Intelligent Transportation Systems