You Qin

Ph.D. Student in Computer Science @ National University of Singapore

About

I am a Ph.D. student in Computer Science at the National University of Singapore, researching at the Next++ Research Center with Prof. Roger Zimmermann. My work focuses on multimodal understanding: teaching machines to ground language in visual content, generate structured scene representations, and reason across vision and language.

My research interests include video moment retrieval, scene graph generation, temporal grounding, and cross-modal diffusion models.

Multimodal Learning  ·  Video Moment Retrieval  ·  Scene Graph Generation  ·  Temporal Grounding  ·  3D Vision  ·  Diffusion Models

Research Experience

Next++ Sea Joint Lab, National University of Singapore
Research Intern
Sep 2022 — Present
  • Multi-modal Information Retrieval for Panoptic Scene Graph Generation
  • Multi-modal Understanding for Video Moment Retrieval
  • Described Spatial-Temporal Video Detection (DSTVD) benchmark and framework
Intelligent Machine Perception Lab, SUTD
Research Associate
Mar 2024 — Aug 2024
  • Fully Sparse Multi-modal 3D Object Detection with Dynamic Prompting
  • Pretrained Diffusion for Single-view 3D Scene Generation

Publications

ICLR 2025
Generalized Video Moment Retrieval
You Qin, Qilong Wu, Yicong Li, Wei Ji, Li Li, Pengcheng Cai, Lina Wei, Roger Zimmermann
International Conference on Learning Representations
ICCV 2025
Secure On-Device Video OOD Detection Without Backpropagation
Li Li, Peilin Cai, Yuxiao Zhou, Zhiyu Ni, Renjie Liang, You Qin, Yi Nian, Zhengzhong Tu, Xiyang Hu, Yue Zhao
International Conference on Computer Vision
IEEE TMM 2025
Grounding is All You Need? Dual Temporal Grounding for Video Dialog
You Qin, Wei Ji, Xinze Lan, Hao Fei, Xun Yang, Dan Guo, Roger Zimmermann, Lizi Liao
IEEE Transactions on Multimedia
AAAI 2024
Panoptic Scene Graph Generation with Semantics-prototype Learning
Li Li, Wei Ji, Yiming Wu, Mengze Li, You Qin, Lina Wei, Roger Zimmermann
AAAI Conference on Artificial Intelligence
ICASSP 2024
MRTNet: Multi-Resolution Temporal Network for Video Sentence Grounding
Wei Ji, You Qin, Long Chen, Yinwei Wei, Yiming Wu, Fangfang Wang, Roger Zimmermann
IEEE International Conference on Acoustics, Speech and Signal Processing
ICASSP 2024
Domain-wise Invariant Learning for Panoptic Scene Graph Generation
Li Li, You Qin, Wei Ji, Yuxiao Zhou, Roger Zimmermann
IEEE International Conference on Acoustics, Speech and Signal Processing
ACM MM 2023
Biased-Predicate Annotation Identification via Unbiased Visual Predicate Representation
Li Li*, Chenwei Wang*, You Qin, Wei Ji, Renjie Liang  (* Equal Contribution)
ACM International Conference on Multimedia
Preprints & Under Review
CVPR 2026
Contextual Hashing Meets Lightweight Convolution: Accelerating Retrieval and Refining Localization for Video Corpus Moment Retrieval — Under Review
Mingjin Kuai, Jin Peng, Zheqi Lv, You Qin, Zhan Yang, Zhen Zhang, Wei Zhou, Wei Ji
IEEE TMM
Dynamic Graph-enhanced Event Refinement for Temporal Sentence Grounding of Micro-moments — Under Major Revision
Mingjin Kuai*, You Qin*, Xiang Fang, Yiming Wu, Wei Ji, Roger Zimmermann  (* Equal Contribution)
IEEE TGRS
SRDiff: A Cross-Modal Diffusion Model for Satellite-to-Radar Transformation in Precipitation Nowcasting — Under Review
You Qin, Jinming Cao, Ting Wang, Yifang Yin, Li Li, Shili Xiang, Ying Zhang, Roger Zimmermann

Academic Service

Conference Reviewer:   NeurIPS 2025  ·  ICLR 2025  ·  ICCV 2025  ·  ACM Multimedia 2023, 2024 (Outstanding Reviewer)

Education

National University of Singapore
Ph.D. in Computer Science
Aug 2024 — Present
National University of Singapore
Master of Computing, General Track  GPA: 4.50 / 5.0
Aug 2022 — Jan 2024
University of Electronic Science and Technology of China
B.Sc. in Mathematics for Information & Computing Science  GPA: 3.71 / 4.0
Sep 2018 — Jun 2022