Minghang Zheng's homepage
Minghang Zheng's homepage
Home
Publications
Contact
Light
Dark
Automatic
Publications
Type
Conference paper
Date
2025
2024
2023
2022
2021
Hierarchical Event Memory for Accurate and Low-latency Online Video Temporal Grounding
In this paper, we tackle the task of online video temporal grounding (OnVTG), which requires the model to locate events related to a …
Minghang Zheng
,
Puxin Peng
,
Benyuan Sun
,
Yi Yang
,
Yang Liu
July 2025
Proceedings of the IEEE/CVF International Conference on Computer Vision
ICCV
PDF
Cite
Code
Poster
Video
ResVG: Enhancing Relation and Semantic Understanding in Multiple Instances for Visual Grounding
Visual grounding aims to localize the object referred to in an image based on a natural language query. Although progress has been made …
Minghang Zheng
,
Jiahua Zhang
,
Qingchao Chen
,
Yuxin Peng
,
Yang Liu
September 2024
Proceedings of the 31st ACM International Conference on Multimedia
ACM MM
PDF
Cite
Code
Video
Training-free Video Temporal Grounding usingLarge-scale Pre-trained Models
Video temporal grounding aims to identify video segments within untrimmed videos that are most relevant to a given natural language …
Minghang Zheng
,
Xinhao Cai
,
Qingchao Chen
,
Yuxin Peng
,
Yang Liu
August 2024
Proceedings of the European Conference on Computer Vision
ECCV
PDF
Cite
Code
Video
Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization
Video sentence localization aims to locate moments in an unstructured video according to a given natural language query. A main …
Minghang Zheng
,
Shaogang Gong
,
Hailin Jin
,
Yuxin Peng
,
Yang Liu
July 2023
Association for Computational Linguistics
ACL
PDF
Cite
Code
Poster
Slides
Phrase-Level Temporal Relationship Mining for Temporal Sentence Localization
In this paper, we address the problem of video temporal sentence localization, which aims to localize a target moment from videos …
Minghang Zheng
,
Sizhe Li
,
Qingchao Chen
,
Yuxin Peng
,
Yang Liu
February 2023
The Thirty-Seventh AAAI Conference on Artificial Intelligence
AAAI
oral
PDF
Cite
Code
Poster
Slides
Weakly Supervised Temporal Sentence Grounding with Gaussian-based Contrastive Proposal Learning
Temporal sentence grounding aims to detect the most salient moment corresponding to the natural language query from untrimmed videos. …
Minghang Zheng
,
Yanjie Huang
,
Qingchao Chen
,
Yuxin Peng
,
Yang Liu
Peking University
March 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition
CVPR
PDF
Cite
Code
Weakly Supervised Video Moment Localization with Contrastive Negative Sample Mining
Video moment localization aims at localizing the video segments which are most related to the given free-form natural language query. …
Minghang Zheng
,
Yanjie Huang
,
Qingchao Chen
,
Yang Liu
Peking University
February 2022
The AAAI Conference on Artificial Intelligence
AAAI
PDF
Cite
Code
Poster
End-to-End Object Detection with Adaptive Clustering Transformer
End-to-end Object Detection with Transformer (DETR) performs object detection with Transformer and achieves comparable performance with …
Minghang Zheng
,
Peng Gao
,
Renrui Zhang
,
Kunchang Li
,
Hongsheng Li
,
Hao Dong
November 2021
British Machine Vision Conference
BMVC
oral
PDF
Cite
Code
Supplemental
Fast Convergence of DETR with Spatially Modulated Co-Attention
The recently proposed Detection Transformer (DETR) model successfully applies Transformer to objects detection and achieves comparable …
Peng Gao
,
Minghang Zheng
,
Xiaogang Wang
,
Jifeng Dai
,
Honghsneg Li
October 2021
International Conference on Computer Vision
ICCV
PDF
Cite
Code
Cite
×