About Me
I am currently a Research Assistant at the Singapore University of Technology and Design (SUTD), supervised by Prof. Ngai-Man (Man) Cheung. Previously, I was an AI researcher at FPT Software AI Center Lab, working with Prof. Anh Nguyen. In 2021, I received my honors bachelor’s degree in computer science from the University of Information Technology-Vietnam National University Ho Chi Minh City (VNU-HCM), where I was supervised by Prof. Le Dinh Duy. (Honors programs offer special curricula, privileges, scholarships, and recognition for exceptional undergraduate students)
I have multiple years of experience working in the industry as an AI Research Engineer at renowned AI companies in Vietnam, including MoMo M-Service JSC (a unicorn), FPT Software, and VinBrain (Acquired by NVIDIA).
My research interest lies in Multimodal Learning and Domain Generalization, including their applications in robotics, computer vision, OOD detection/generalization and medical imaging.
News
- 2024.9: One co-first authored paper accepted at NeurIPS 2024.
- 2024.6: One first-authored paper accepted at IROS 2024.
- 2024.1: Two papers (one first-authored and one co-authored) accepted at ICRA 2024.
- 2022.9: One co-first authored paper accepted at BMVC 2022.
Selected Publications
Out-of-Distribution Generalization:
Vision Transformer Neural Architecture Search for Out-of-Distribution Generalization: Benchmark and Insights
Conference on Neural Information Processing Systems (NeurIPS), 2024
Sy-Tuyen Ho *, Tuan Van Vo *, Somayeh Ebrahimkhani *, Ngai-Man Cheung
* Co-first authors, Equal contribution
[NeurIPS]
[Code (Github)]
Multimodal Model for Robotic Vision Task:
Language-driven Grasp Detection with Mask-guided Attention
The IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
Oral presentation
Tuan Van Vo, Minh Nhat Vu, Baoru Huang, An Vuong, Ngan Le, Thieu Vo, Anh Nguyen
[arXiv]
[Code (Coming Soon)]
Medical Imaging:
Dual consistency assisted multi-confident learning for the hepatic vessel segmentation using noisy labels
The British Machine Vision Conference (BMVC), 2022
Nam Phuong Nguyen * , Tuan Van Vo *, Soan TM Duong, Chanh D Tr Nguyen, Trung Bui, Steven QH Truong
* Co-first authors, Equal contribution
[BMVC]
[Code]
CathAction: A Benchmark for Endovascular Intervention Understanding
ArXiv
B. Huang * , Tuan Van Vo *, C. Kongtongvattana, …, Anh Nguyen
* Co-first authors, Equal contribution
[ArXiv]
[Github Page]
Selected Honors
- Rosen Scholarship 2020 & 2021
- Scholarship study 2018 & 2019
- Participated in the ICPC 2019 Vietnam National Programming
Academic Services
- Journal Reviewer: TMM-2024
- Conference Reviewer: ICRA-2025
Experiences
MoMo-Mservice (a Vietnam’s latest unicorn technology company), Vietnam.
Computer Vision Research Engineer, Jun 2022 - Sep 2023.