Yanyuan Qiao

I am a Postdoctoral Research Fellow working with Asst.Prof. Josie Hughes at École Polytechnique Fédérale de Lausanne (EPFL), Switzerland. Previously, I spent years working with A.P. Qi Wu, at Australian Institute for Machine Learning (AIML), The University of Adelaide, where I completed my Ph.D. in Computer Science under the supervision of A.P. Qi Wu and Dr. Yuankai Qi.

My research interests lie broadly in the field of Vision-and-Language and Embodied AI, especially in Vision-and-Language Navigation.

Email  /  CV  /  Google Scholar  /  Github /  LinkedIn  /  Twitter

profile photo

News

[Jun. 2025] One paper is accepted by ICCV 2025 and two papers are accepted by IROS 2025.

[Apr. 2025] MiniVLN has been selected as ICRA 2025 Best Paper Award Finalist.

[Jan. 2025] One paper is accepted by ICLR 2025 and four papers are accepted by ICRA 2025.

[Dec. 2023] I have been awarded the PhD degree and Dean's Commendation for Doctoral Thesis Excellence.

[Aug. 2023] I have been awarded ICCV 2023 Doctoral Consortium mentored by Prof. Judy Hoffman.

Selected Publications [Full List]


clean-usnob NavBench: Probing Multimodal Large Language Models for Embodied Navigation
Yanyuan Qiao, Haodong Hong, Wenqi Lyu, Dong An, Siqi Zhang, Yutong Xie, Xinyu Wang, Qi Wu
project / arxiv
clean-usnob Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs
Yanyuan Qiao, Wenqi Lyu, Hui Wang, Zixu Wang, Zerui Li, Yuan Zhang, Mingkui Tan, Qi Wu
International Conference on Robotics and Automation (ICRA), 2025
project / arxiv
clean-usnob MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation
Junyou Zhu, Yanyuan Qiao, Siqi Zhang, Xingjian He, Qi Wu, Jing Liu
International Conference on Robotics and Automation (ICRA), 2025
arxiv
clean-usnob General Scene Adaptation for Vision-and-Language Navigation
Haodong Hong, Yanyuan Qiao, Sen Wang, Jiajun Liu, Qi Wu
International Conference on Learning Representations (ICLR), 2025
paper
clean-usnob VL-Mamba: Exploring State Space Models for Multimodal Learning
Yanyuan Qiao, Zheng Yu, Zijia Zhao, Sihan Chen, Mingzhen Sun, Longteng Guo, Qi Wu, Jing Liu
NeurIPS Workshop on Efficient Natural Language and Speech Processing, 2024
project / arxiv / code
clean-usnob LLM as Copilot for Coarse-grained Vision-and-Language Navigation
Yanyuan Qiao, Qianyi Liu, Jiajun Liu, Jing Liu, Qi Wu
European Conference on Computer Vision (ECCV), 2024
paper

clean-usnob
March in Chat: Interactive Prompting for Remote Embodied Referring Expression
Yanyuan Qiao, Yuankai Qi, Zheng Yu, Jing Liu, Qi Wu
International Conference on Computer Vision (ICCV), 2023
paper / arxiv / code
clean-usnob VLN-PETL: Parameter-Efficient Transfer Learning for Vision-and-Language Navigation
Yanyuan Qiao, Zheng Yu, Qi Wu
International Conference on Computer Vision (ICCV), 2023
paper / arxiv / code
clean-usnob HOP+: History-Enhanced and Order-Aware Pre-Training for Vision-and-Language Navigation
Yanyuan Qiao, Yuankai Qi, Yicong Hong, Zheng Yu, Peng Wang, Qi Wu
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
paper
clean-usnob HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation
Yanyuan Qiao, Yuankai Qi, Yicong Hong, Zheng Yu, Peng Wang, Qi Wu
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022
paper / arxiv / code