Journal Papers
Ziyue Huang, Mingming Zhang, Yuan Gong, Qingjie Liu, and Yunhong Wang:
Generic Knowledge Boosted Pre-training For Remote Sensing Images.
IEEE Transactions on Geoscience and Remote Sensing. (Early Access) 2024
Mingming Zhang, Qingjie Liu, and Yunhong Wang:
HiT: Building Mapping with Hierarchical Transformers.
IEEE Transactions on Geoscience and Remote Sensing. (Early Access) 2024
Junfu Wang, Yuanfang Guo, Liang Yang, and Yunhong Wang:
Binary Graph Convolutional Network with Capacity Exploration.
IEEE Transactions on Pattern Analysis and Machine Intelligence. (Early Access) 2024
Jiankai Li, Yunhong Wang, and Weixin Li:
MHRN: A Multimodal Hierarchical Reasoning Network for Topic Detection.
IEEE Transactions on Multimedia. (Early Access) 2024
Zhen Yang, Yuanfang Guo, Junfu Wang, Di Huang, and Xiuguo Bao:
Towards Video Anomaly Detection in the Real World: A Binarization Embedded Weakly-Supervised Network.
IEEE Transactions on Circuits and Systems for Video Technology. (Early Access) 2023
Duoxuan Pei, Di Huang, Longteng Kong, and Yunhong Wang:
Key Role Guided Transformer for Group Activity Recognition.
IEEE Transactions on Circuits and Systems for Video Technology. (Early Access) 2023
Jiankai Li, Yunhong Wang, and Weixin Li:
Zero-shot Scene Graph Generation via Triplet Calibration and Reduction.
ACM Transactions on Multimedia Computing, Communications, and Applications. 20(1): 5:1-5:21, 2024
Sheng Liu, Annan Li, Jiahao Wang, and Yunhong Wang:
Bidirectional Maximum Entropy Training with Word Co-occurrence for Video Captioning.
IEEE Transactions on Multimedia. 25: 4494-4507, 2023
Jiaqi Zhou, Zehua Fu, Qiuyu Huang, Qingjie Liu, and Yunhong Wang:
LgNet: A Local-global Network for Action Recognition and Beyond.
IEEE Transactions on Multimedia. 25: 5192-5205, 2023
Yongqiang Kong, Yunhong Wang, Annan Li, and Qiuyu Huang:
Self-sufficient Feature Enhancing Networks for Video Salient Object Detection.
IEEE Transactions on Multimedia. 25: 557-571, 2023
Huiqun Wang, Ruijie Yang, Di Huang, and Yunhong Wang:
iDARTS: Improving DARTS by Node Normalization and Decorrelation Discretization.
IEEE Transactions on Neural Networks and Learning Systems. 34(4): 1945-1957, 2023
Xueping Wang, Yunhong Wang, Weixin Li, Zhengyin Du, and Di Huang:
Facial Expression Animation by Landmark Guided Residual Module.
IEEE Transactions on Affective Computing. 14(2): 878-894, 2023
Weixin Li, Xuan Dong, and Yunhong Wang:
Human Emotion Recognition with Relational Region-Level Analysis.
IEEE Transactions on Affective Computing. 14(1): 650-663, 2023
Tianrui Chai, Zhiyuan Chen, Annan Li, Jiaxin Chen, Xinyu Mei, and Yunhong Wang:
Video Person Re-Identification Using Attribute-Enhanced Features.
IEEE Transactions on Circuits and Systems for Video Technology. 32: 7951-7966, 2022
Yongqiang Kong, Yunhong Wang, and Annan Li:
Spatiotemporal Saliency Representation Learning for Video Action Recognition.
IEEE Transactions on Multimedia. 24: 1515-1528, 2022
Guangshuai Gao, Qingjie Liu, Zhenghui Hu, Lu Li, Qi Wen, and Yunhong Wang:
PSGCNet: A Pyramidal Scale and Global Context Guided Network for Dense Object Counting in Remote-Sensing Images.
IEEE Transactions on Geoscience and Remote Sensing. 60: 1-12, 2022
Huanyu Zhou, Qingjie Liu, Dawei Weng, and Yunhong Wang:
Unsupervised Cycle-Consistent Generative Adversarial Networks for Pan Sharpening.
IEEE Transactions on Geoscience and Remote Sensing. 60: 1-14, 2022
Jiahao Wang, Yunhong Wang, Nina Weng, Tianrui Chai, Annan Li, Faxi Zhang, and Sansi Yu:
Will You Ever Become Popular? Learning to Predict Virality of Dance Clips.
ACM Transactions on Multimedia Computing, Communications, and Applications. 18(2): 1-24, 2022
Ran Qin, Qingjie Liu, Guangshuai Gao, Di Huang, and Yunhong Wang:
MRDet: A Multihead Network for Accurate Rotated Object Detection in Aerial Images.
IEEE Transactions on Geoscience and Remote Sensing. 60: 1-12, 2022
Hongyu Yang, Di Huang, Yunhong Wang, and Anil K. Jain:
Learning Continuous Face Age Progression: A Pyramid of GANs.
IEEE Transactions on Pattern Analysis and Machine Intelligence. 43(2): 499-515, 2021
Mengshi Qi, Jie Qin, Yi Yang, Yunhong Wang, and Jiebo Luo:
Semantics-Aware Spatial-Temporal Binaries for Cross-Modal Video
Retrieval. IEEE Transaction on Image Processing. 30: 2989-3004, 2021
Qingjie Liu, Huanyu Zhou, Qizhi Xu, Xiangyu Liu, and Yunhong Wang:
PSGAN: A Generative Adversarial Network for Remote Sensing Image Pan-Sharpening.
IEEE Transactions on Geoscience and Remote Sensing. 14(8): 1-16, 2020
Longteng Kong, Di Huang, Jie Qin, and Yunhong Wang: A Joint Framework for
Athlete Tracking and Action Recognition in Sports Videos. IEEE Transactions on Circuits and Systems for Video Technology. 30(2): 532-548, 2020
Mengshi Qi, Yunhong Wang, Jie Qin, Annan Li, Jiebo Luo, and Luc Van Gool:
StagNet: An Attentive Semantic RNN for Group Activity and Individual Action
Recognition. IEEE Transactions on Circuits and Systems for Video Technology. 30(2):
549-565, 2020
Bin Hou, Qingjie Liu, Heng Wang, and Yunhong Wang: From W-Net to CDGAN:
Bitemporal Change Detection via Deep Learning Techniques. IEEE Trans.
Geoscience and Remote Sensing 58(3): 1790-1802, 2020
Songtao Liu, Di Huang, and Yunhong Wang: Pay Attention to Them: Deep
Reinforcement Learning-Based Cascade Object Detection. IEEE Transactions on Neural Networks and Learning Systems. 31(7): 2544-2556, 2020
Mengshi Qi, Yunhong Wang, Annan Li, and Jiebo Luo: STC-GAN: Spatio-Temporally
Coupled Generative Adversarial Networks for Predictive Scene Parsing.
IEEE Transactions on Image Processing. 29: 5420-5430, 2020
Mengshi Qi, Yunhong Wang, Annan Li, and Jiebo Luo: Sports Video Captioning via
Attentive Motion Representation and Group Relationship Modeling. IEEE
Transactions on Circuits and Systems for Video Technology. 30(8): 2617-2633, 2020
Guangshuai Gao, Wenting Zhao, Qingjie Liu, and Yunhong Wang: Co-Saliency
Detection with Co-Attention Fully Convolutional Network. IEEE Transactions on Circuits and Systems for Video Technology.
Longteng Kong, Di Huang, and Yunhong Wang: Long-Term Action Dependence-Based
Hierarchical Deep Association for Multi-Athlete Tracking in Sports
Videos. IEEE Transactions on Image Processing. 29: 7957-7969, 2020
Huiqun Wang, Di Huang, Kui Jia, and Yunhong Wang:Hierarchical Image Segmentation
Ensemble for Objectness in RGB-D Images. IEEE Transactions on Circuits and Systems for Video Technology. 29(1): 93-103, 2019
Jia Sun, Di Huang, Yunhong Wang, and Liming Chen:Expression Robust 3D Facial
Landmarking via Progressive Coarse-to-Fine Tuning. ACM Transactions on Multimedia Computing, Communications, and Applications. 15(1):
21:1-21:23, 2019
Qingpeng Li, Lichao Mou, Qingjie Liu, and Yunhong Wang, Xiao Xiang Zhu:HSF-Net:
Multiscale Deep Feature Embedding for Ship Detection in Optical Remote
Sensing Imagery. IEEE Transactions on Geoscience and Remote Sensing
56(12): 7147-7161, 2018
Xiaoke Zhu, Xiao-Yuan Jing, Liang Yang, Xinge You, Dan Chen, Guangwei Gao,
and Yunhong Wang:Semi-Supervised Cross-View Projection-Based Dictionary Learning
for Video-Based Person Re-Identification. IEEE Transactions on Circuits and Systems for Video Technology. 28(10): 2599-2611, 2018
Zhengxin Zhang, Qingjie Liu, and Yunhong Wang:Road Extraction by Deep Residual
U-Net. IEEE Geoscience Remote Sensing Letter 15(5): 749-753, 2018
Ying Lu, Liming Chen, Alexandre Saidi, Emmanuel Dellandréa, and Yunhong
Wang:Discriminative Transfer Learning Using Similarities and
Dissimilarities. IEEE Transactions on Neural Networks and Learning Systems. 29(7):
3097-3110, 2018
Yongqiang Yao, Di Huang, Xudong Yang, Yunhong Wang, and Liming Chen:Texture and
Geometry Scattering Representation-Based Facial Expression Recognition in
2D+3D Videos. ACM Transactions on Multimedia Computing, Communications, and Applications. 14(1s): 18:1-18:23, 2018
Xiao-Yuan Jing, Xiaoke Zhu, Fei Wu, Ruimin Hu, Xinge You, Yunhong Wang, Hui
Feng, and Jing-Yu Yang:Super-Resolution Person Re-Identification With
Semi-Coupled Low-Rank Discriminant Dictionary Learning. IEEE Transactions on Image Processing. 26(3): 1363-1378, 2017
Jiaxin Chen, Zhaoxiang Zhang, and Yunhong Wang:Corrections to "Relevance Metric
Learning for Person Re-Identification by Exploiting Listwise
Similarities". IEEE Transactions on Image Processing. 25(1): 494, 2016
Jie Qin, Li Liu, Zhaoxiang Zhang, Yunhong Wang, and Ling Shao:Compressive
Sequential Learning for Action Similarity Labeling. IEEE Transactions on Image Processing. 25(2): 756-769, 2016
Hongyu Yang, Di Huang, Yunhong Wang, Heng Wang, and Yuanyan Tang:Face Aging
Effect Simulation Using Hidden Factor Analysis Joint Sparse
Representation. IEEE Transactions on Image Processing. 25(6): 2493-2507, 2016
Conference Papers
Wenshuai Xu, Zhenghui hu, Yu Lu, Jinzhou Meng, Qingjie Liu, Yunhong Wang:
ActiveDC: Distribution Calibration for Active Finetuning. CVPR 2024
Wenrui Cai, Qingjie Liu, and Yunhong Wang:
HIPTrack: Visual Tracking with Historical Prompts. CVPR 2024
Haoxiang Ma, Modi Shi, Boyang Gao, and Di Huang:
Generalizing 6-DoF Grasp Detection via Domain Prior Knowledge. CVPR 2024
Xiefan Guo, Jinlin Liu, Miaomiao Cui, Jiankai Li, Hongyu Yang, and Di Huang:
InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization. CVPR 2024
Jiankai Li, Yunhong Wang, Xiefan Guo, Ruijie Yang, and Weixin Li:
Leveraging Predicate and Triplet Learning for Scene Graph Generation. CVPR 2024
Guodong Wang, Yunhong Wang, Xiuguo Bao, and Di Huang:
Rotation Has Two Sides: Evaluating Data Augmentation for Deep One-class Classification. ICLR 2024
Ruikui Wang, Yuanfang Guo, and Yunhong Wang:
AGS: Affordable and Generalizable Substitute Training for Transferable Adversarial Attack. AAAI 2024
Bing Li, Jiaxin Chen, Xiuguo Bao, and Di Huang:
Compressed Video Prompt Tuning. NeurIPS 2023
Jiayi Zhang, and Weixin Li:
Multi-Modal and Multi-Scale Temporal Fusion Architecture Search for Audio-Visual Video Parsing. ACM Multimedia 2023: 3328-3336
Liangwei Jiang, Jiaxin Chen, Di Huang, and Yunhong Wang:
MIEP: Channel Pruning with Multi-granular Importance Estimation for Object Detection. ACM Multimedia 2023: 2908-2917
Weilai Xiang, Hongyu Yang, Di Huang, and Yunhong Wang:
Denoising Diffusion Autoencoders are Unified Self-supervised Learners. ICCV 2023: 15802-15812
Jinqing Zhang, Yanan Zhang, Qingjie Liu, and Yunhong Wang:
SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection. ICCV 2023: 3348-3357
Guodong Wang, Yunhong Wang, Jie Qin, Dongming Zhang, Xiuguo Bao, and Di Huang:
Unilaterally Aggregated Contrastive Learning with Hierarchical Augmentation for Anomaly Detection. ICCV 2023: 6888-6897
Nan Zhou, Jiaxin Chen, and Di Huang:
DR-Tune: Improving Fine-tuning of Pretrained Visual Models by Distribution Regularization with Semantic Calibration. ICCV 2023: 1547-1556
Mingwu Zheng, Haiyu Zhang, Hongyu Yang, and Di Huang:
NeuFace: Realistic 3D Neural Face Rendering from Multi-view Images. CVPR 2023: 16868-16877
Chao Zhou, Yanan Zhang, Jiaxin Chen, and Di Huang:
OcTr: Octree-based Transformer for 3D Object Detection. CVPR 2023: 5166-5175
Bowei Du, Yecheng Huang, Jiaxin Chen, and Di Huang:
Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone Images. CVPR 2023: 13435-13444
Huanyu Zhou, Qingjie Liu, and Yunhong Wang:
Learning Discriminative Representations for Skeleton Based Action Recognition. CVPR 2023: 10608-10617
Ruikui Wang, Yuanfang Guo, Yunhong Wang:
Global-Local Characteristic Excited Cross-Modal Attacks from Images to Videos. AAAI 2023: 2635-2643
Lingfeng Tan, Yunhong Wang, Junfu Wang, Liang Yang, Xunxun Chen and Yuanfang Guo:
Deepfake Video Dectection via Facial Action Dependency Estimation. AAAI 2023: 5276-5284
Kaicheng Li, Hongyu Yang, Binghui Chen, Pengyu Li, Biao Wang and Di Huang:
Learning Polysemantic Spoof Trace: A Multi-Modal Disentanglement Network for Face Anti-spoofing. AAAI 2023: 1351-1359
Ye Du, Yujun Shen, Haochen Wang, Jingjing Fei, Wei Li, Liwei Wu, Rui Zhao, Zehua Fu and Qingjie Liu:
Learning from Future: A Novel Self-Training Framework for Semantic Segmentation. NeurIPS 2022: 4749-4761
Haoxiang Ma and Di Huang:
Towards Scale Balanced 6-DoF Grasp Detection in Cluttered Scenes. CoRL 2022: 2004-2013
Guodong Wang, Yunhong Wang, Jie Qin, Dongming Zhang, Xiuguo Bao, Di Huang:
Video Anomaly Detection by Solving Decoupled Spatio-Temporal Jigsaw Puzzles. ECCV 2022: 494-511
Jingcheng Ni, Nan Zhou, Jie Qin, Qian Wu, Junqi Liu, Boxun Li, Di Huang:
Motion Sensitive Contrastive Learning for Self-supervised Video Representation. ECCV 2022: 457-474
Zhiyuan Zhao,Qingjie Liu,Yunhong Wang:
Exploring Efficient Knowledge Transferring for Few-shot Object Detection. ACM Multimedia 2022: 6831-6839
Weilai Xiang, Hongyu Yang, Di Huang, Yunhong Wang:
Multi-view Gait Video Synthesis. ACM Multimedia 2022: 6783-6791
Zhihong Fu, Zehua Fu, Qingjie Liu, Wenrui Cai, Yunhong Wang:
SparseTT: Visual Tracking with Sparse Transformers. IJCAI 2022: 905-912
Jiahao Wang, Jie Qin, Yunhong Wang, Annan Li:
PACE: Predictive and Contrastive Embedding for Unsupervised Action Segmentation. IJCAI 2022: 1423-1429
Bing Li, Jiaxin Chen, Dongming Zhang, Xiuguo Bao, and Di Huang:
Representation Learning for Compressed Video Action Recognition via Attentive Cross-modal Interaction with Motion Enhancement. IJCAI 2022: 1060-1066
Mingwu Zheng, Hongyu Yang, Di Huang, Liming Chen:
ImFace: A Nonlinear 3D Morphable Face Model with Implicit Neural Representations. CVPR 2022: 20343-20352
Yanan Zhang, Jiaxin Chen, Di Huang:
CAT-Det: Contrastively Augmented Transformer for Multi-modal 3D Object Detection. CVPR 2022: 908-917
Ye Du, Zehua Fu, Qingjie Liu:
Weakly Supervised Semantic Segmentation by Pixel-to-prototype Contrast. CVPR 2022: 4320-4329
Tianrui Chai, Annan Li, Shaoxiong Zhang, Zilong Li, Yunhong Wang:
Lagrange Motion Analysis and View Embeddings for Improved Gait Recognition. CVPR 2022: 20249-20258
Jiaxi Wu, Jiaxin Chen, Mengzhe He, Yiru Wang, Bo Li, Bingqi Ma, Weihao Gan, Wei Wu, Yali Wang, Di Huang:
Target-Relevant Knowledge Preservation for Multi-Source Domain Adaptive Object Detection. CVPR 2022: 5301-5310
Jiaxi Wu, Jiaxin Chen, Di Huang:
Entropy-based Active Learning for Object Detection with Progressive Diversity Constraint. CVPR 2022: 9397-9406
Yecheng Huang, Jiaxin Chen, Di Huang:
UFPMP-Det: Toward Accurate and Efficient Object Detection on Drone Imagery. AAAI 2022: 1026-1033
Zichen Yang, Jie Qin, Di Huang:
ACGNet: Action Complement Graph Network for Weakly-supervised Temporal Action Localization. AAAI 2022: 3090-3098
Guangyuan Zhou, Huiqun Wang, Jiaxin Chen, Di Huang:
PR-GCN: A Deep Graph Convolutional Network with Point Refinement for 6D Pose Estimation. ICCV 2021:
2793-2802
Xiefan Guo, Hongyu Yang, Di Huang:
Image Inpainting via Conditional Texture and Structure Dual Generation. ICCV 2021:
14134-14143
Jiahao Wang, Yunhong Wang, Sheng Liu, Annan Li:
Few-Shot Fine-Grained Action Recognition via Bidirectional Attention
and Contrastive Meta-Learning. ACM Multimedia 2021: 582-591
Jingcheng Ni, Jie Qin, Di Huang:
Identity-aware Graph Memory Network for Action Detection. ACM Multimedia 2021: 3437-3445
Junfu Wang, Yunhong Wang, Zhen Yang, Liang Yang, Yuanfang Guo:
Bi-GCN: Binary Graph Convolutional Network. CVPR 2021:
1561-1570
Shaoxiong Zhang, Yunhong Wang, Annan Li: Cross-View Gait Recognition With
Deep Universal Linear Embeddings. CVPR 2021:
9095-9104
Zhihong Fu, Qingjie Liu, Zehua Fu, Yunhong Wang: STMTrack: Template-free
Visual Tracking with Space-time Memory Networks. CVPR 2021:
13774-13783
Yanan Zhang, Di Huang, Yunhong Wang: PC-RGNN: Point Cloud Completion
and Graph Neural Network for 3D Object Detection. AAAI 2021:
3430-3437
Yangtao Zheng, Di Huang, Songtao Liu, Yunhong Wang: Cross-domain Object
Detection through Coarse-to-Fine Feature Adaptation. CVPR 2020:
13763-13772
Mingda Wu, Di Huang, Yuanfang Guo, Yunhong Wang: Distraction-Aware Feature
Learning for Human Attribute Recognition via Coarse-to-Fine Attention
Mechanism. AAAI 2020: 12394-12401
Jiaxi Wu, Songtao Liu, Di Huang, Yunhong Wang: Multi-scale Positive Sample
Refinement for Few-Shot Object Detection. ECCV (16) 2020: 456-472
Xufang Luo, Qi Meng, Di He, Wei Chen, Yunhong Wang: I4R: Promoting Deep
Reinforcement Learning by the Indicator for Expressive Representations.
IJCAI 2020: 2669-2675
Mengshi Qi, Jie Qin, Xiantong Zhen, Di Huang, Yi Yang, Jiebo Luo: Few-Shot
Ensemble Learning for Video Classification with SlowFast Memory
Networks. ACM Multimedia 2020: 3007-3015
Mengshi Qi, Weijian Li, Zhengyuan Yang, Yunhong Wang, Jiebo Luo:Attentive
Relational Networks for Mapping Images to Scene Graphs. CVPR
2019: 3957-3966
Mengshi Qi, Yunhong Wang, Jie Qin, Annan Li:KE-GAN: Knowledge Embedded
Generative Adversarial Networks for Semi-Supervised Scene Parsing.
CVPR 2019: 5237-5246
Guodong Mu, Di Huang, Guosheng Hu, Jia Sun, Yunhong Wang: Led3D: A
Lightweight and Efficient Deep Approach to Recognizing Low-Quality 3D Faces.
CVPR 2019: 5773-5782
Songtao Liu, Di Huang, Yunhong Wang: Adaptive NMS: Refining Pedestrian
Detection in a Crowd. CVPR 2019: 6459-6468
Meijuan Jia, Hongyu Yang, Di Huang, Yunhong Wang: Attacking Gait Recognition
Systems via Silhouette Guided GANs. ACM Multimedia 2019: 638-646
Lian Gao, Di Huang, Yuanfang Guo, Yunhong Wang: Pedestrian Attribute
Recognition via Hierarchical Multi-task Learning and Relationship
Attention. ACM Multimedia 2019: 1340-1348
Hongyu Yang, Di Huang, Yunhong Wang, Anil K. Jain: Learning Face Age
Progression: A Pyramid Architecture of GANs. CVPR 2018: 31-39
Mengshi Qi, Jie Qin, Annan Li, Yunhong Wang, Jiebo Luo, Luc Van Gool: stagNet:
An Attentive Semantic RNN for Group Activity Recognition. ECCV (10)
2018: 104-120
Songtao Liu, Di Huang, Yunhong Wang: Receptive Field Block Net for Accurate
and Fast Object Detection. ECCV (11) 2018: 404-419
Zhixing Chen, Di Huang, Yunhong Wang, Liming Chen: Fast and Light Manifold CNN
based 3D Facial Expression Recognition across Pose Variations. ACM
Multimedia 2018: 229-238
Xufang Luo, Zijia Lin, Yunhong Wang, Zaiqing Nie: CoChat: Enabling Bot and
Human Collaboration for Task Completion. AAAI 2018
Xiaoke Zhu, Xiao-Yuan Jing, Fei Wu, Yunhong Wang, Wangmeng Zuo, Wei-Shi Zheng:
Learning Heterogeneous Dictionary Pair with Feature Projection Matrix for
Pedestrian Video Retrieval via Single Query Image. AAAI 2017
Mengshi Qi, Yunhong Wang, Annan Li: Online Cross-Modal Scene Retrieval by
Binary Representation and Semantic Graph. ACM Multimedia 2017
Jie Qin, Li Liu, Ling Shao, Bingbing Ni, Chen Chen, Fumin Shen and Yunhong
Wang: Binary Coding for Partial Action Analysis with Limited Observation
Ratios,CVPR 2017
Jie Qin, Li Liu, Ling Shao, Fumin Shen, Bingbing Ni, Jiaxin Chen and Yunhong
Wang: Zero-Shot Action Recognition via Error-Correcting Output
Codes,CVPR 2017
Jiaxin Chen, Yunhong Wang, Jie Qin, Li Liu and Ling Shao,Fast Person
Re-identification via Cross-camera Semantic Binary
Transformation, CVPR 2017