Selected Publications

Journal Papers

  • Ziyue Huang, Mingming Zhang, Yuan Gong, Qingjie Liu, and Yunhong Wang: Generic Knowledge Boosted Pre-training For Remote Sensing Images. IEEE Transactions on Geoscience and Remote Sensing. (Early Access) 2024
  • Mingming Zhang, Qingjie Liu, and Yunhong Wang: HiT: Building Mapping with Hierarchical Transformers. IEEE Transactions on Geoscience and Remote Sensing. (Early Access) 2024
  • Junfu Wang, Yuanfang Guo, Liang Yang, and Yunhong Wang: Binary Graph Convolutional Network with Capacity Exploration. IEEE Transactions on Pattern Analysis and Machine Intelligence. (Early Access) 2024
  • Jiankai Li, Yunhong Wang, and Weixin Li: MHRN: A Multimodal Hierarchical Reasoning Network for Topic Detection. IEEE Transactions on Multimedia. (Early Access) 2024
  • Zhen Yang, Yuanfang Guo, Junfu Wang, Di Huang, and Xiuguo Bao: Towards Video Anomaly Detection in the Real World: A Binarization Embedded Weakly-Supervised Network. IEEE Transactions on Circuits and Systems for Video Technology. (Early Access) 2023
  • Duoxuan Pei, Di Huang, Longteng Kong, and Yunhong Wang: Key Role Guided Transformer for Group Activity Recognition. IEEE Transactions on Circuits and Systems for Video Technology. (Early Access) 2023
  • Jiankai Li, Yunhong Wang, and Weixin Li: Zero-shot Scene Graph Generation via Triplet Calibration and Reduction. ACM Transactions on Multimedia Computing, Communications, and Applications. 20(1): 5:1-5:21, 2024
  • Sheng Liu, Annan Li, Jiahao Wang, and Yunhong Wang: Bidirectional Maximum Entropy Training with Word Co-occurrence for Video Captioning. IEEE Transactions on Multimedia. 25: 4494-4507, 2023
  • Jiaqi Zhou, Zehua Fu, Qiuyu Huang, Qingjie Liu, and Yunhong Wang: LgNet: A Local-global Network for Action Recognition and Beyond. IEEE Transactions on Multimedia. 25: 5192-5205, 2023
  • Yongqiang Kong, Yunhong Wang, Annan Li, and Qiuyu Huang: Self-sufficient Feature Enhancing Networks for Video Salient Object Detection. IEEE Transactions on Multimedia. 25: 557-571, 2023
  • Huiqun Wang, Ruijie Yang, Di Huang, and Yunhong Wang: iDARTS: Improving DARTS by Node Normalization and Decorrelation Discretization. IEEE Transactions on Neural Networks and Learning Systems. 34(4): 1945-1957, 2023
  • Xueping Wang, Yunhong Wang, Weixin Li, Zhengyin Du, and Di Huang: Facial Expression Animation by Landmark Guided Residual Module. IEEE Transactions on Affective Computing. 14(2): 878-894, 2023
  • Weixin Li, Xuan Dong, and Yunhong Wang: Human Emotion Recognition with Relational Region-Level Analysis. IEEE Transactions on Affective Computing. 14(1): 650-663, 2023
  • Tianrui Chai, Zhiyuan Chen, Annan Li, Jiaxin Chen, Xinyu Mei, and Yunhong Wang: Video Person Re-Identification Using Attribute-Enhanced Features. IEEE Transactions on Circuits and Systems for Video Technology. 32: 7951-7966, 2022
  • Yongqiang Kong, Yunhong Wang, and Annan Li: Spatiotemporal Saliency Representation Learning for Video Action Recognition. IEEE Transactions on Multimedia. 24: 1515-1528, 2022
  • Guangshuai Gao, Qingjie Liu, Zhenghui Hu, Lu Li, Qi Wen, and Yunhong Wang: PSGCNet: A Pyramidal Scale and Global Context Guided Network for Dense Object Counting in Remote-Sensing Images. IEEE Transactions on Geoscience and Remote Sensing. 60: 1-12, 2022
  • Huanyu Zhou, Qingjie Liu, Dawei Weng, and Yunhong Wang: Unsupervised Cycle-Consistent Generative Adversarial Networks for Pan Sharpening. IEEE Transactions on Geoscience and Remote Sensing. 60: 1-14, 2022
  • Jiahao Wang, Yunhong Wang, Nina Weng, Tianrui Chai, Annan Li, Faxi Zhang, and Sansi Yu: Will You Ever Become Popular? Learning to Predict Virality of Dance Clips. ACM Transactions on Multimedia Computing, Communications, and Applications. 18(2): 1-24, 2022
  • Ran Qin, Qingjie Liu, Guangshuai Gao, Di Huang, and Yunhong Wang: MRDet: A Multihead Network for Accurate Rotated Object Detection in Aerial Images. IEEE Transactions on Geoscience and Remote Sensing. 60: 1-12, 2022
  • Hongyu Yang, Di Huang, Yunhong Wang, and Anil K. Jain: Learning Continuous Face Age Progression: A Pyramid of GANs. IEEE Transactions on Pattern Analysis and Machine Intelligence. 43(2): 499-515, 2021
  • Mengshi Qi, Jie Qin, Yi Yang, Yunhong Wang, and Jiebo Luo: Semantics-Aware Spatial-Temporal Binaries for Cross-Modal Video Retrieval. IEEE Transactions on Image Processing. 30: 2989-3004, 2021
  • Qingjie Liu, Huanyu Zhou, Qizhi Xu, Xiangyu Liu, and Yunhong Wang: PSGAN: A Generative Adversarial Network for Remote Sensing Image Pan-Sharpening. IEEE Transactions on Geoscience and Remote Sensing. 14(8): 1-16, 2020
  • Longteng Kong, Di Huang, Jie Qin, and Yunhong Wang: A Joint Framework for Athlete Tracking and Action Recognition in Sports Videos. IEEE Transactions on Circuits and Systems for Video Technology. 30(2): 532-548, 2020
  • Mengshi Qi, Yunhong Wang, Jie Qin, Annan Li, Jiebo Luo, and Luc Van Gool: StagNet: An Attentive Semantic RNN for Group Activity and Individual Action Recognition. IEEE Transactions on Circuits and Systems for Video Technology. 30(2): 549-565, 2020
  • Bin Hou, Qingjie Liu, Heng Wang, and Yunhong Wang: From W-Net to CDGAN: Bitemporal Change Detection via Deep Learning Techniques. IEEE Transactions on Geoscience and Remote Sensing. 58(3): 1790-1802, 2020
  • Songtao Liu, Di Huang, and Yunhong Wang: Pay Attention to Them: Deep Reinforcement Learning-Based Cascade Object Detection. IEEE Transactions on Neural Networks and Learning Systems. 31(7): 2544-2556, 2020
  • Mengshi Qi, Yunhong Wang, Annan Li, and Jiebo Luo: STC-GAN: Spatio-Temporally Coupled Generative Adversarial Networks for Predictive Scene Parsing. IEEE Transactions on Image Processing. 29: 5420-5430, 2020
  • Mengshi Qi, Yunhong Wang, Annan Li, and Jiebo Luo: Sports Video Captioning via Attentive Motion Representation and Group Relationship Modeling. IEEE Transactions on Circuits and Systems for Video Technology. 30(8): 2617-2633, 2020
  • Guangshuai Gao, Wenting Zhao, Qingjie Liu, and Yunhong Wang: Co-Saliency Detection with Co-Attention Fully Convolutional Network. IEEE Transactions on Circuits and Systems for Video Technology.
  • Longteng Kong, Di Huang, and Yunhong Wang: Long-Term Action Dependence-Based Hierarchical Deep Association for Multi-Athlete Tracking in Sports Videos. IEEE Transactions on Image Processing. 29: 7957-7969, 2020
  • Huiqun Wang, Di Huang, Kui Jia, and Yunhong Wang: Hierarchical Image Segmentation Ensemble for Objectness in RGB-D Images. IEEE Transactions on Circuits and Systems for Video Technology. 29(1): 93-103, 2019
  • Jia Sun, Di Huang, Yunhong Wang, and Liming Chen: Expression Robust 3D Facial Landmarking via Progressive Coarse-to-Fine Tuning. ACM Transactions on Multimedia Computing, Communications, and Applications. 15(1): 21:1-21:23, 2019
  • Qingpeng Li, Lichao Mou, Qingjie Liu, Yunhong Wang, and Xiao Xiang Zhu: HSF-Net: Multiscale Deep Feature Embedding for Ship Detection in Optical Remote Sensing Imagery. IEEE Transactions on Geoscience and Remote Sensing. 56(12): 7147-7161, 2018
  • Xiaoke Zhu, Xiao-Yuan Jing, Liang Yang, Xinge You, Dan Chen, Guangwei Gao, and Yunhong Wang: Semi-Supervised Cross-View Projection-Based Dictionary Learning for Video-Based Person Re-Identification. IEEE Transactions on Circuits and Systems for Video Technology. 28(10): 2599-2611, 2018
  • Zhengxin Zhang, Qingjie Liu, and Yunhong Wang: Road Extraction by Deep Residual U-Net. IEEE Geoscience and Remote Sensing Letters. 15(5): 749-753, 2018
  • Ying Lu, Liming Chen, Alexandre Saidi, Emmanuel Dellandréa, and Yunhong Wang: Discriminative Transfer Learning Using Similarities and Dissimilarities. IEEE Transactions on Neural Networks and Learning Systems. 29(7): 3097-3110, 2018
  • Yongqiang Yao, Di Huang, Xudong Yang, Yunhong Wang, and Liming Chen: Texture and Geometry Scattering Representation-Based Facial Expression Recognition in 2D+3D Videos. ACM Transactions on Multimedia Computing, Communications, and Applications. 14(1s): 18:1-18:23, 2018
  • Xiao-Yuan Jing, Xiaoke Zhu, Fei Wu, Ruimin Hu, Xinge You, Yunhong Wang, Hui Feng, and Jing-Yu Yang: Super-Resolution Person Re-Identification With Semi-Coupled Low-Rank Discriminant Dictionary Learning. IEEE Transactions on Image Processing. 26(3): 1363-1378, 2017
  • Jiaxin Chen, Zhaoxiang Zhang, and Yunhong Wang: Corrections to "Relevance Metric Learning for Person Re-Identification by Exploiting Listwise Similarities". IEEE Transactions on Image Processing. 25(1): 494, 2016
  • Jie Qin, Li Liu, Zhaoxiang Zhang, Yunhong Wang, and Ling Shao: Compressive Sequential Learning for Action Similarity Labeling. IEEE Transactions on Image Processing. 25(2): 756-769, 2016
  • Hongyu Yang, Di Huang, Yunhong Wang, Heng Wang, and Yuanyan Tang: Face Aging Effect Simulation Using Hidden Factor Analysis Joint Sparse Representation. IEEE Transactions on Image Processing. 25(6): 2493-2507, 2016

Conference Papers

  • Wenshuai Xu, Zhenghui Hu, Yu Lu, Jinzhou Meng, Qingjie Liu, Yunhong Wang: ActiveDC: Distribution Calibration for Active Finetuning. CVPR 2024
  • Wenrui Cai, Qingjie Liu, and Yunhong Wang: HIPTrack: Visual Tracking with Historical Prompts. CVPR 2024
  • Haoxiang Ma, Modi Shi, Boyang Gao, and Di Huang: Generalizing 6-DoF Grasp Detection via Domain Prior Knowledge. CVPR 2024
  • Xiefan Guo, Jinlin Liu, Miaomiao Cui, Jiankai Li, Hongyu Yang, and Di Huang: InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization. CVPR 2024
  • Jiankai Li, Yunhong Wang, Xiefan Guo, Ruijie Yang, and Weixin Li: Leveraging Predicate and Triplet Learning for Scene Graph Generation. CVPR 2024
  • Guodong Wang, Yunhong Wang, Xiuguo Bao, and Di Huang: Rotation Has Two Sides: Evaluating Data Augmentation for Deep One-class Classification. ICLR 2024
  • Ruikui Wang, Yuanfang Guo, and Yunhong Wang: AGS: Affordable and Generalizable Substitute Training for Transferable Adversarial Attack. AAAI 2024
  • Bing Li, Jiaxin Chen, Xiuguo Bao, and Di Huang: Compressed Video Prompt Tuning. NeurIPS 2023
  • Jiayi Zhang and Weixin Li: Multi-Modal and Multi-Scale Temporal Fusion Architecture Search for Audio-Visual Video Parsing. ACM Multimedia 2023: 3328-3336
  • Liangwei Jiang, Jiaxin Chen, Di Huang, and Yunhong Wang: MIEP: Channel Pruning with Multi-granular Importance Estimation for Object Detection. ACM Multimedia 2023: 2908-2917
  • Weilai Xiang, Hongyu Yang, Di Huang, and Yunhong Wang: Denoising Diffusion Autoencoders are Unified Self-supervised Learners. ICCV 2023: 15802-15812
  • Jinqing Zhang, Yanan Zhang, Qingjie Liu, and Yunhong Wang: SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection. ICCV 2023: 3348-3357
  • Guodong Wang, Yunhong Wang, Jie Qin, Dongming Zhang, Xiuguo Bao, and Di Huang: Unilaterally Aggregated Contrastive Learning with Hierarchical Augmentation for Anomaly Detection. ICCV 2023: 6888-6897
  • Nan Zhou, Jiaxin Chen, and Di Huang: DR-Tune: Improving Fine-tuning of Pretrained Visual Models by Distribution Regularization with Semantic Calibration. ICCV 2023: 1547-1556
  • Mingwu Zheng, Haiyu Zhang, Hongyu Yang, and Di Huang: NeuFace: Realistic 3D Neural Face Rendering from Multi-view Images. CVPR 2023: 16868-16877
  • Chao Zhou, Yanan Zhang, Jiaxin Chen, and Di Huang: OcTr: Octree-based Transformer for 3D Object Detection. CVPR 2023: 5166-5175
  • Bowei Du, Yecheng Huang, Jiaxin Chen, and Di Huang: Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone Images. CVPR 2023: 13435-13444
  • Huanyu Zhou, Qingjie Liu, and Yunhong Wang: Learning Discriminative Representations for Skeleton Based Action Recognition. CVPR 2023: 10608-10617
  • Ruikui Wang, Yuanfang Guo, Yunhong Wang: Global-Local Characteristic Excited Cross-Modal Attacks from Images to Videos. AAAI 2023: 2635-2643
  • Lingfeng Tan, Yunhong Wang, Junfu Wang, Liang Yang, Xunxun Chen and Yuanfang Guo: Deepfake Video Detection via Facial Action Dependency Estimation. AAAI 2023: 5276-5284
  • Kaicheng Li, Hongyu Yang, Binghui Chen, Pengyu Li, Biao Wang and Di Huang: Learning Polysemantic Spoof Trace: A Multi-Modal Disentanglement Network for Face Anti-spoofing. AAAI 2023: 1351-1359
  • Ye Du, Yujun Shen, Haochen Wang, Jingjing Fei, Wei Li, Liwei Wu, Rui Zhao, Zehua Fu and Qingjie Liu: Learning from Future: A Novel Self-Training Framework for Semantic Segmentation. NeurIPS 2022: 4749-4761
  • Haoxiang Ma and Di Huang: Towards Scale Balanced 6-DoF Grasp Detection in Cluttered Scenes. CoRL 2022: 2004-2013
  • Guodong Wang, Yunhong Wang, Jie Qin, Dongming Zhang, Xiuguo Bao, Di Huang: Video Anomaly Detection by Solving Decoupled Spatio-Temporal Jigsaw Puzzles. ECCV 2022: 494-511
  • Jingcheng Ni, Nan Zhou, Jie Qin, Qian Wu, Junqi Liu, Boxun Li, Di Huang: Motion Sensitive Contrastive Learning for Self-supervised Video Representation. ECCV 2022: 457-474
  • Zhiyuan Zhao, Qingjie Liu, Yunhong Wang: Exploring Efficient Knowledge Transferring for Few-shot Object Detection. ACM Multimedia 2022: 6831-6839
  • Weilai Xiang, Hongyu Yang, Di Huang, Yunhong Wang: Multi-view Gait Video Synthesis. ACM Multimedia 2022: 6783-6791
  • Zhihong Fu, Zehua Fu, Qingjie Liu, Wenrui Cai, Yunhong Wang: SparseTT: Visual Tracking with Sparse Transformers. IJCAI 2022: 905-912
  • Jiahao Wang, Jie Qin, Yunhong Wang, Annan Li: PACE: Predictive and Contrastive Embedding for Unsupervised Action Segmentation. IJCAI 2022: 1423-1429
  • Bing Li, Jiaxin Chen, Dongming Zhang, Xiuguo Bao, and Di Huang: Representation Learning for Compressed Video Action Recognition via Attentive Cross-modal Interaction with Motion Enhancement. IJCAI 2022: 1060-1066
  • Mingwu Zheng, Hongyu Yang, Di Huang, Liming Chen: ImFace: A Nonlinear 3D Morphable Face Model with Implicit Neural Representations. CVPR 2022: 20343-20352
  • Yanan Zhang, Jiaxin Chen, Di Huang: CAT-Det: Contrastively Augmented Transformer for Multi-modal 3D Object Detection. CVPR 2022: 908-917
  • Ye Du, Zehua Fu, Qingjie Liu: Weakly Supervised Semantic Segmentation by Pixel-to-prototype Contrast. CVPR 2022: 4320-4329
  • Tianrui Chai, Annan Li, Shaoxiong Zhang, Zilong Li, Yunhong Wang: Lagrange Motion Analysis and View Embeddings for Improved Gait Recognition. CVPR 2022: 20249-20258
  • Jiaxi Wu, Jiaxin Chen, Mengzhe He, Yiru Wang, Bo Li, Bingqi Ma, Weihao Gan, Wei Wu, Yali Wang, Di Huang: Target-Relevant Knowledge Preservation for Multi-Source Domain Adaptive Object Detection. CVPR 2022: 5301-5310
  • Jiaxi Wu, Jiaxin Chen, Di Huang: Entropy-based Active Learning for Object Detection with Progressive Diversity Constraint. CVPR 2022: 9397-9406
  • Yecheng Huang, Jiaxin Chen, Di Huang: UFPMP-Det: Toward Accurate and Efficient Object Detection on Drone Imagery. AAAI 2022: 1026-1033
  • Zichen Yang, Jie Qin, Di Huang: ACGNet: Action Complement Graph Network for Weakly-supervised Temporal Action Localization. AAAI 2022: 3090-3098
  • Guangyuan Zhou, Huiqun Wang, Jiaxin Chen, Di Huang: PR-GCN: A Deep Graph Convolutional Network with Point Refinement for 6D Pose Estimation. ICCV 2021: 2793-2802
  • Xiefan Guo, Hongyu Yang, Di Huang: Image Inpainting via Conditional Texture and Structure Dual Generation. ICCV 2021: 14134-14143
  • Jiahao Wang, Yunhong Wang, Sheng Liu, Annan Li: Few-Shot Fine-Grained Action Recognition via Bidirectional Attention and Contrastive Meta-Learning. ACM Multimedia 2021: 582-591
  • Jingcheng Ni, Jie Qin, Di Huang: Identity-aware Graph Memory Network for Action Detection. ACM Multimedia 2021: 3437-3445
  • Junfu Wang, Yunhong Wang, Zhen Yang, Liang Yang, Yuanfang Guo: Bi-GCN: Binary Graph Convolutional Network. CVPR 2021: 1561-1570
  • Shaoxiong Zhang, Yunhong Wang, Annan Li: Cross-View Gait Recognition With Deep Universal Linear Embeddings. CVPR 2021: 9095-9104
  • Zhihong Fu, Qingjie Liu, Zehua Fu, Yunhong Wang: STMTrack: Template-free Visual Tracking with Space-time Memory Networks. CVPR 2021: 13774-13783
  • Yanan Zhang, Di Huang, Yunhong Wang: PC-RGNN: Point Cloud Completion and Graph Neural Network for 3D Object Detection. AAAI 2021: 3430-3437
  • Yangtao Zheng, Di Huang, Songtao Liu, Yunhong Wang: Cross-domain Object Detection through Coarse-to-Fine Feature Adaptation. CVPR 2020: 13763-13772
  • Mingda Wu, Di Huang, Yuanfang Guo, Yunhong Wang: Distraction-Aware Feature Learning for Human Attribute Recognition via Coarse-to-Fine Attention Mechanism. AAAI 2020: 12394-12401
  • Jiaxi Wu, Songtao Liu, Di Huang, Yunhong Wang: Multi-scale Positive Sample Refinement for Few-Shot Object Detection. ECCV (16) 2020: 456-472
  • Xufang Luo, Qi Meng, Di He, Wei Chen, Yunhong Wang: I4R: Promoting Deep Reinforcement Learning by the Indicator for Expressive Representations. IJCAI 2020: 2669-2675
  • Mengshi Qi, Jie Qin, Xiantong Zhen, Di Huang, Yi Yang, Jiebo Luo: Few-Shot Ensemble Learning for Video Classification with SlowFast Memory Networks. ACM Multimedia 2020: 3007-3015
  • Mengshi Qi, Weijian Li, Zhengyuan Yang, Yunhong Wang, Jiebo Luo: Attentive Relational Networks for Mapping Images to Scene Graphs. CVPR 2019: 3957-3966
  • Mengshi Qi, Yunhong Wang, Jie Qin, Annan Li: KE-GAN: Knowledge Embedded Generative Adversarial Networks for Semi-Supervised Scene Parsing. CVPR 2019: 5237-5246
  • Guodong Mu, Di Huang, Guosheng Hu, Jia Sun, Yunhong Wang: Led3D: A Lightweight and Efficient Deep Approach to Recognizing Low-Quality 3D Faces. CVPR 2019: 5773-5782
  • Songtao Liu, Di Huang, Yunhong Wang: Adaptive NMS: Refining Pedestrian Detection in a Crowd. CVPR 2019: 6459-6468
  • Meijuan Jia, Hongyu Yang, Di Huang, Yunhong Wang: Attacking Gait Recognition Systems via Silhouette Guided GANs. ACM Multimedia 2019: 638-646
  • Lian Gao, Di Huang, Yuanfang Guo, Yunhong Wang: Pedestrian Attribute Recognition via Hierarchical Multi-task Learning and Relationship Attention. ACM Multimedia 2019: 1340-1348
  • Hongyu Yang, Di Huang, Yunhong Wang, Anil K. Jain: Learning Face Age Progression: A Pyramid Architecture of GANs. CVPR 2018: 31-39
  • Mengshi Qi, Jie Qin, Annan Li, Yunhong Wang, Jiebo Luo, Luc Van Gool: stagNet: An Attentive Semantic RNN for Group Activity Recognition. ECCV (10) 2018: 104-120
  • Songtao Liu, Di Huang, Yunhong Wang: Receptive Field Block Net for Accurate and Fast Object Detection. ECCV (11) 2018: 404-419
  • Zhixing Chen, Di Huang, Yunhong Wang, Liming Chen: Fast and Light Manifold CNN based 3D Facial Expression Recognition across Pose Variations. ACM Multimedia 2018: 229-238
  • Xufang Luo, Zijia Lin, Yunhong Wang, Zaiqing Nie: CoChat: Enabling Bot and Human Collaboration for Task Completion. AAAI 2018
  • Xiaoke Zhu, Xiao-Yuan Jing, Fei Wu, Yunhong Wang, Wangmeng Zuo, Wei-Shi Zheng: Learning Heterogeneous Dictionary Pair with Feature Projection Matrix for Pedestrian Video Retrieval via Single Query Image. AAAI 2017
  • Mengshi Qi, Yunhong Wang, Annan Li: Online Cross-Modal Scene Retrieval by Binary Representation and Semantic Graph. ACM Multimedia 2017
  • Jie Qin, Li Liu, Ling Shao, Bingbing Ni, Chen Chen, Fumin Shen and Yunhong Wang: Binary Coding for Partial Action Analysis with Limited Observation Ratios. CVPR 2017
  • Jie Qin, Li Liu, Ling Shao, Fumin Shen, Bingbing Ni, Jiaxin Chen and Yunhong Wang: Zero-Shot Action Recognition via Error-Correcting Output Codes. CVPR 2017
  • Jiaxin Chen, Yunhong Wang, Jie Qin, Li Liu and Ling Shao: Fast Person Re-identification via Cross-camera Semantic Binary Transformation. CVPR 2017