Dr. Kaiyang Zhou is an Assistant Professor in the Department of Computer Science at Hong Kong Baptist University, specializing in machine learning and computer vision. He has published over 30 technical papers in leading venues, including CVPR, ICCV, ECCV, NeurIPS, ICLR, ICML, AAAI, and top journals like IEEE TPAMI and IJCV. His work has garnered over 10,000 citations. He serves as an Associate Editor of IJCV, the flagship journal in computer vision, and regularly takes on roles such as Area Chair for major conferences like CVPR, ECCV, NeurIPS, and ICLR. He is also the creator of influential AI software packages such as Torchreid (the No.1 popular person re-identification software on GitHub), Dassl (a multifunctional machine learning framework), and CoOp (a prompt learning toolbox for vision-language models). Before joining HKBU, he was a postdoctoral researcher at Nanyang Technological University, Singapore, working with Prof. Ziwei Liu and Prof. Chen Change Loy. He completed his PhD in Computer Science at the University of Surrey, UK, under the supervision of Prof. Tao Xiang.
Fields of specialization:
- computer vision and pattern recognition
- machine learning and deep learning
- multimodal models (vision + language)
- domain generalization and adaptation
To prospective students:
- I have multiple openings for PhD/RA with topics in LLM & VLM. If you are interested in joining my lab, please email me with your CV, transcripts, etc. You are highly recommended to contact me via email and receive my endorsement before submitting a formal PhD application to the Department. Due to the large volume of emails, I will only reply to shortlisted candidates.
- If you are undergrad at BU and interested in doing research with me, you can contact me via email.
News & Activities
- [2024-10] Call for Papers: IJCV Special Issue on Visual Domain Generalization in Real-World Applications.
- [2024-09] Invited to serve as CVPR 2025 area chair.
- [2024-08] Invited to serve as ICLR 2025 area chair.
- [2024-06] Invited to serve as AAAI 2025 senior program committee.
- [2024-06] We're organizing a workshop on Prompting in Vision at CVPR 2024.
- [2024-05] Invited to serve as BMVC 2024 area chair.
- [2024-04] Call for Papers: ECCV 2024 Workshop on Green Foundation Models.
- [2024-04] Invited to serve as NeurIPS 2024 area chair.
- [2024-02] Call for Papers: CVPR 2024 Workshop on Prompting in Vision.
Older News & Activities
- [2023-12] Invited to serve as ECCV 2024 area chair.
- [2023-10] Gave a talk at VALSE.
- [2023-08] Gave a talk at AIGC-2023 Workshop on Trustworthy Foundation Models under Imperfect Data.
- [2023-08] Gave a talk at IJCAI-2023 Symposium Session on Medical Large Models.
- [2023-08] Invited to serve as CVPR 2024 area chair.
- [2023-08] Invited to join the editorial board of International Journal of Computer Vision as associate editor.
- [2023-07] Gave a talk at the University of Tokyo (slides & video).
- [2023-06] We're organizing a tutorial on Prompting in Vision at CVPR 2023.
- [2023-01] Call for Papers: ICLR 2023 workshop on what do we need for successful domain generalization?
- [2022-09] Call for Papers: IJCV Special Issue on The Promises and Dangers of Large Vision Models.
Professional Services
Mentoring
Teaching
-
Hong Kong Baptist University
COMP7065: Innovative Laboratory
ITS 7010: ITS Doctoral Research Training I
-
Nanyang Technological University
AI6126 Guest Lecture: Open-World Visual Recognition
OpenMMLab Workshop: Object Detection
-
Queen Mary University of London
ECS797: Machine Learning for Visual Data Analytics
ECS708: Machine Learning
Software & Datasets
- Torchreid: A codebase for person re-identification (including documentation and model zoo).
- Dassl: A multifunctional codebase for domain generalization, domain adaptation and semi-supervised learning.
- CoOp: A codebase for developing adaptation methods (e.g., prompt learning) for large-scale vision-language models.
- OpenOOD: A codebase and benchmark for out-of-distribution detection.
- PSG: A dataset for panoptic scene graph generation. (Codebase: OpenPSG.)