publications

(*) denotes equal contribution. Underlined name (You Qin) is the candidate. For a complete list, see my Google Scholar page.

2026

  1. NeurIPS 2026
    Under Review
    SOAR: Self-Correction for Optimal Alignment and Refinement in Diffusion Models
    You Qin, Linqing Wang, Hao Fei, Roger Zimmermann, Liefeng Bo, Qinglin Lu, Chunyu Wang
    Conference on Neural Information Processing Systems, 2026
  2. NeurIPS 2026
    Under Review
    Geometry over Density: Few-Shot Cross-Domain OOD Detection
    Shawn Li*, You Qin*, Jiate Li, Charith Peris, Lisa Bauer, Roger Zimmermann, Yue Zhao
    Conference on Neural Information Processing Systems, 2026
  3. arXiv Survey
    Audio-Visual Intelligence in Large Foundation Models
    You Qin, Kai Liu, Shengqiong Wu, Kai Wang, Shijian Deng, Yapeng Tian, Junbin Xiao, Yazhou Xing, Yinghao Ma, Bobo Li, Roger Zimmermann, Lei Cui, Furu Wei, Jiebo Luo, Hao Fei
    arXiv:2605.04045, 2026 · 56 pages, 16 figures
  4. IEEE TIP
    Under Review
    Contextual Hashing Meets Lightweight Convolution: Accelerating Retrieval and Refining Localization for Video Corpus Moment Retrieval
    Mingjin Kuai, Jin Peng, Zheqi Lv, You Qin, Zhan Yang, Zhen Zhang, Wei Zhou, Wei Ji
    IEEE Transactions on Image Processing
  5. IEEE TGRS 2026
    SRDiff: A Cross-Modal Diffusion Model for Satellite-to-Radar Translation in Precipitation Nowcasting
    You Qin, Jinming Cao, Ting Wang, Yifang Yin, Li Li, Shili Xiang, Ying Zhang, Roger Zimmermann
    IEEE Transactions on Geoscience and Remote Sensing, 2026
  6. IEEE TGRS 2026
    InstaFlow-Pan: One-Step Flow Matching for High-Fidelity Pansharpening
    Qian Liu, You Qin, Zhiyuan Li, Xiangyong Cao, Junmin Liu
    IEEE Transactions on Geoscience and Remote Sensing, 2026
  7. IEEE TMM 2026
    Dynamic Graph-enhanced Event Refinement for Temporal Sentence Grounding of Micro-moments
    Mingjin Kuai*, You Qin*, Xiang Fang, Yiming Wu, Wei Ji, Roger Zimmermann
    IEEE Transactions on Multimedia, 2026
  8. IEEE TMM 2026
    Grounding is All You Need? Dual Temporal Grounding for Video Dialog
    You Qin, Wei Ji, Xinze Lan, Hao Fei, Xun Yang, Dan Guo, Roger Zimmermann, Lizi Liao
    IEEE Transactions on Multimedia, 2026

2025

  1. ICCV 2025
    Secure On-Device Video OOD Detection Without Backpropagation
    Li Li, Peilin Cai, Yuxiao Zhou, Zhiyu Ni, Renjie Liang, You Qin, Yi Nian, Zhengzhong Tu, Xiyang Hu, Yue Zhao
    In International Conference on Computer Vision, 2025
  2. ICLR 2025
    Generalized Video Moment Retrieval
    You Qin, Qilong Wu, Yicong Li, Wei Ji, Li Li, Pengcheng Cai, Lina Wei, Roger Zimmermann
    In International Conference on Learning Representations, 2025
  3. IEEE TGRS 2025
    FSGformer: Frequency Separation and Guidance Transformer for Pansharpening
    Qian Liu, Xiangyu Zhao, You Qin, Lanyu Li, Junmin Liu
    IEEE Transactions on Geoscience and Remote Sensing, 2025

2024

  1. arXiv
    Described Spatial-Temporal Video Detection
    Wei Ji, Xiangyan Liu, Yingfei Sun, Jiajun Deng, You Qin, Ammar Nuwanna, Mengyao Qiu, Lina Wei, Roger Zimmermann
    arXiv:2407.05610, 2024
  2. IEEE GRSL 2024
    Unveiling Causalities in SAR ATR: A Causal Interventional Approach for Limited Data
    Chenwei Wang, Xin Chen, You Qin, Siyi Luo, Yulin Huang, Jifang Pei, Jianyu Yang
    IEEE Geoscience and Remote Sensing Letters, 2024
  3. AAAI 2024
    Panoptic Scene Graph Generation with Semantics-prototype Learning
    Li Li, Wei Ji, Yiming Wu, Mengze Li, You Qin, Lina Wei, Roger Zimmermann
    In AAAI Conference on Artificial Intelligence, 2024
  4. ICASSP 2024
    MRTNet: Multi-Resolution Temporal Network for Video Sentence Grounding
    Wei Ji, You Qin, Long Chen, Yinwei Wei, Yiming Wu, Fangfang Wang, Roger Zimmermann
    In IEEE International Conference on Acoustics, Speech and Signal Processing, 2024
  5. ICASSP 2024
    Domain-wise Invariant Learning for Panoptic Scene Graph Generation
    Li Li, You Qin, Wei Ji, Yuxiao Zhou, Roger Zimmermann
    In IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

2023

  1. ACM MM 2023
    Biased-Predicate Annotation Identification via Unbiased Visual Predicate Representation
    Li Li*, Chenwei Wang*, You Qin, Wei Ji, Renjie Liang
    In ACM International Conference on Multimedia, 2023
  2. arXiv
    Causal SAR ATR with Limited Data via Dual Invariance
    Chenwei Wang, You Qin, Li Li, Siyi Luo, Yulin Huang, Jifang Pei, Ying Zhang, Jianyu Yang
    arXiv:2308.09412, 2023