Qidong Huang (黄启栋)  

Ph.D student, USTC



Information process center, CAS Key Laboratory of Electro-magnetic Space Information

School of Cyber Science and Technology, University of Science and Technology of China

Email: hqd0037[AT]mail.ustc.edu.cn

[Google Scholar] [GitHub] [CV]


News

  • 04/2024: OPERA is selected as Highlight in CVPR 2024!
  • 02/2024: Two papers are accepted by CVPR 2024. See you at Seattle!
  • 02/2024: I have one paper accepted by IEEE TIP 2024.
  • 12/2023: Please check out our new work OPERA for mitigating MLLM's hallucination!
  • 07/2023: I have one paper accepted by ACM MM 2023.
  • 07/2023: I have one paper accepted by ICCV 2023. See you at Paris!
  • 07/2023: I have a new homepage.

About Me

I am currently an Ph.D student at School of Cyber Science and Technology, University of Science and Technology of China , working with Prof. Nenghai Yu and Prof. Weiming Zhang. Prior to that, I received my bachelor degree in School of Information Engineering, University of Science and Technology of China. My current research focuses on large vision-language models.

Biography

  • 2023.08 - Present, Research Intern at Shanghai AI Lab, supervised by Jiaqi Wang and Xiaoyi Dong.
  • 2020.09 - Present, Ph.D. in School of Cyber Science and Technology, University of Science and Technology of China .
  • 2016.09- 2020.06, B.Eng. in School of Information Engineering, University of Science and Technology of China

    Publications

    OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation

    Qidong Huang, Xiaoyi Dong, Pan Zhang, Bin Wang, Conghui He, Jiaqi Wang, Dahua Lin, Weiming Zhang, Nenghai Yu

    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024. (Highlight, 2.8% of submissions)

    [arXiv] [Code]

    PointCAT: Contrastive Adversarial Training for Robust Point Cloud Recognition

    Qidong Huang, Xiaoyi Dong, Dongdong Chen, Hang Zhou, Weiming Zhang, Kui Zhang, Gang Hua, Nenghai Yu

    IEEE Transactions on Image Processing (TIP), 2024.

    [arXiv] [Code]

    Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting

    Qidong Huang, Xiaoyi Dong, Dongdong Chen, Yinpeng Chen, Lu Yuan, Gang Hua, Weiming Zhang, Nenghai Yu

    IEEE/CVF International Conference on Computer Vision (ICCV), 2023.

    [arXiv] [Code]

    Diversity-Aware Meta Visual Prompting

    Qidong Huang, Xiaoyi Dong, Dongdong Chen, Weiming Zhang, Feifei Wang, Gang Hua, Nenghai Yu

    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023.

    [arXiv] [Code]

    Shape-invariant 3D Adversarial Point Clouds

    Qidong Huang, Xiaoyi Dong, Dongdong Chen, Hang Zhou, Weiming Zhang, Nenghai Yu

    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022.

    [arXiv] [Code]

    Initiative Defense against Facial Manipulation

    Qidong Huang*, Jie Zhang*, Wenbo Zhou, Weiming Zhang, Nenghai Yu (*Equal contribution)

    AAAI Conference on Artificial Intelligence (AAAI), 2021.

    [arXiv] [Code]

    SimAC: A Simple Anti-Customization Method against Text-to-Image Synthesis of Diffusion Models

    Feifei Wang, Zhentao Tan, Tianyi Wei, Yue Wu, Qidong Huang*

    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024.

    [arXiv]

    Ada3Diff: Defending against 3D Adversarial Point Clouds via Adaptive Diffusion

    Kui Zhang, Hang Zhou, Jie Zhang, Qidong Huang, Weiming Zhang, Nenghai Yu

    ACM International Conference on Multimedia (MM), 2023.

    [arXiv]

    Towards Precise Flood Prediction via Hierachical Terrain Attention and Multi-Scale Rainfall Guidance

    Feifei Wang, Yong Wang, Bing Li, Qidong Huang, Shaoqing Chen

    IEEE International Conference on Image Processing (ICIP), 2023.

    [arXiv]

    Poison Ink: Robust and Invisible Backdoor Attack

    Jie Zhang, Dongdong Chen, Qidong Huang, Jing Liao, Weiming Zhang, Huamin Feng, Gang Hua, Nenghai Yu

    IEEE Transactions on Image Processing (TIP), 2022.

    [arXiv] [Code]

    Deep Template-based Watermarking

    Han Fang, Dongdong Chen, Qidong Huang, Jie Zhang, Zehua Ma, Weiming Zhang* and Nenghai Yu

    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2020.

    [Paper]

    Awards

  • 2021 National Scholarship for Graduate Students

  • Services

    Invited Reviewer for:
  • IEEE Transactions on Neural Networks and Learning Systems (TNNLS), Pattern Recognition (PR)
  • CVPR, ICCV, ECCV, ICPR