论文著作

Publications

Technical Reports

Y. Xiong, Y. Zhao, L. Wang, D. Lin, and X. Tang
A Pursuit of Temporal Accuracy in General Activity Detection
in arXiv 1703.03329
L. Wang, S. Guo, W. Huang, and Y. Qiao
Places205-VGGNet Models for Scene Recognition
in arXiv 1508.01667.
L. Wang, Y. Xiong, Z. Wang, and Y. Qiao
Towards good practices for very deep two-stream ConvNets
in arXiv 1507.02159.

Journal Papers

D. Du, L. Wang, Z. Li, G. Wu
Cross-Modal Pyramid Translation for RGB-D Scene Recognition
Journal extension of TRecgNet with pyramid translation extension.
in International Journal of Computer Vision (IJCV), Volume 12, Issue 8, Pages 2309-2327, 2021.
[ Paper ] [ Code ]
Z. Ruan, C. Zou, L. Wu, G. Wu, and L. Wang
SADRNet: Self-Aligned Dual Face Regression Networks for Robust 3D Face Alignment and Reconstruction
in IEEE Transactions on Image Processing (TIP), Volume 30, Pages 5739-5806, 2021.
[ Paper ] [ Code ]
Y. Zheng, Z. Liu, Tong Lu, and L. Wang
Dynamic Sampling Networks for Efficient Action Recognition in Videos
A dynamic version of TSN for efficient action recognition
in IEEE Transactions on Image Processing (TIP), Volume 29, Pages 7970-7983, 2020.
[ Paper ] [ BibTex ]
Y. Zhao, Y. Xiong, L. Wang, Z. Wu, X. Tang, and D. Lin
Temporal Action Detection with Structured Segment Networks
Journal extension of SSN with more extensive study
in International Journal of Computer Vision (IJCV), Volume 128, Issue 1, Pages 74-95, 2020.
[ Paper ] [ BibTex ] [ Code ]
L. Wang, Y. Xiong, Z. Wang, Y. Qiao, D. Lin, X. Tang, and L. Van Gool
Temporal Segment Networks for Action Recognition in Videos
More extensive study on TSN and adding performance of I3D+TSN on Kinetics
in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Volume 41, Issue 11, Pages 2740-2755, 2019.
[ Paper ] [ BibTex ] [ Code ]
B. Zhang, L. Wang, Z. Wang, Y. Qiao, and H. Wang
Real-Time Action Recognition with Deeply-Transferred Motion Vector CNNs
in IEEE Transactions on Image Processing (TIP), Volume 27, Issue 5, Pages 2326-2339, 2018.
[ Paper ] [ BibTex ] [ Code ]
L. Wang, Z. Wang, Y. Qiao, and L. Van Gool
Transferring Deep Object and Scene Representations for Event Recognition in Still Images
rank 1st place in cultural event recognition at ChaLearn LAP challenge CVPR 2015
in International Journal of Computer Vision (IJCV), Volume 126, Issue 2-4, Pages 390-409, 2018.
[ Paper ] [ BibTex ] [ Code ]
L. Wang, S. Guo, W, Huang, Y. Xiong, and Y. Qiao
Knowledge Guided Disambiguation for Large-Scale Scene Classification with Multi-Resolution CNNs
rank 1st place at LSUN challenge 2016 and 2nd place at Places challenge 2015
in IEEE Transactions on Image Processing (TIP), Volume 26, Issue 4, Pages 2055-2068, 2017.
[ Paper ] [ BibTex ] [ Code ]
Z. Wang, L. Wang, Y. Wang, B. Zhang, and Y. Qiao
Weakly Supervised PatchNets: Describing and Aggregating Local Patches for Scene Recognition
in IEEE Transactions on Image Processing (TIP), Volume 26, Issue 4, Pages 2018-2041, 2017.
[ Paper ] [ BibTex ] [ Code ]
S. Guo, W. Huang, L. Wang, and Y. Qiao
Locally Supervised Deep Hybrid Model for Scene Recognition
in IEEE Transactions on Image Processing (TIP), Volume 26, Issue 2, Pages 808-820, 2017.
[ Paper ] [ BibTex ]
Z. Yuan, H. Wang, L. Wang, T. Lu, P. Shivakumara, and C. L. Tan
Modeling Spatial Layout for Scene Image Understanding via a Novel Multiscale Sum-Product Network
in Expert Systems With Applications (ESWA), Volume 63, Pages 231-240, 2016.
[ Paper ] [ BibTex ]
X. Peng, L. Wang, X. Wang, and Y. Qiao
Bag of Visual Words and Fusion Methods for Action Recognition: Comprehensive Study and Good Practice
in Computer Vision and Image Understanding (CVIU), Volume 150, Pages 109-125, 2016.
[ Paper ] [ BibTex ]
L. Wang, Y. Qiao, and X. Tang
MoFAP: A Multi-Level Representation for Action Recognition
in International Journal of Computer Vision (IJCV), Volume 119, Issue 3, Pages 254-271, 2016.
[ Paper ] [ BibTex ]
L. Wang, Y. Qiao, and X. Tang
Latent Hierarchical Model of Temporal Structure for Complex Activity Classification
in IEEE Transactions on Image Processing (TIP), Volume 23, Issue 2, Pages 810-822, 2014.
[ Paper ] [ BibTex ]

CVPR/ICCV/ECCV/ICLR Papers

Y. Cui, C. Jiang, L. Wang, G. Wu
MixFormer: End-to-End Tracking with Iterative Mixed Attention
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
[ Paper ] [ Code ]
Z. Gao, L. Wang, B. Han, S. Guo
AdaMixer: A Simple and Accurate Query-based Object Detector
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
[ Paper ] [ Code ]
L. Zhao, L. Wang
Decoupling Classification and Localization for Domain Adaptive Object Detection
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
[ Paper ] [ Code ]
Y. Teng, L. Wang
Structured Sparse R-CNN for Direct Scene Graph Generation
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
[ Paper ] [ Code ]
J. Lin, H. Duan, K. Chen, D. Lin, L. Wang
OCSampler: Compressing Videos to One Clip with Single-step Sampling
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
[ Paper ] [ Code ]
J. Tang, Z. Liu, C. Qian, W. Wu, L. Wang
Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
[ Paper ] [ Code ]
S. Guo, Z. Xiong, Y. Zhong, L. Wang, X. Guo, B. Han, W. Huang
Cross-Architecture Self-supervised Video Representation Learning
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
[ Paper ] [ Code ]
Y. Li, L. Chen, R. He, Z. Wang, G. Wu, L. Wang
MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions
in IEEE International Conference on Computer Vision (ICCV), 2021
A high-quality and fine-grained action detection benchmark
[ Paper ] [ Data ] [ Code ] [ Challenge ]
T. Li, L. Wang, G. Wu
Self Supervision to Distillation for Long-Tailed Visual Recognition
in IEEE International Conference on Computer Vision (ICCV), 2021
[ Paper ] [ Code (soon) ]
Z. Gao, L. Wang, G. Wu
Mutual Supervision for Dense Object Detection
in IEEE International Conference on Computer Vision (ICCV), 2021
[ Paper ] [ Code (soon) ]
Y. Teng, L. Wang, Z. Li, G. Wu
Target Adaptive Context Aggregation for Video Scene Graph Generation
in IEEE International Conference on Computer Vision (ICCV), 2021
[ Paper ] [ Code (soon) ]
Z. Liu, L. Wang, W. Wu, C. Qian, T. Lu
TAM: Temporal Adaptive Module for Video Recognition
in IEEE International Conference on Computer Vision (ICCV), 2021
[ Paper ] [ Code ]
J. Tan, J. Tang, L. Wang, G. Wu
Relaxed Transformer Decoders for Direct Action Proposal Generation
in IEEE International Conference on Computer Vision (ICCV), 2021
[ Paper ] [ Code ]
Y. Zhi, Z. Tong, L. Wang, G. Wu
MGSampler: An Explainable Sampling Strategy for Video Action Recognition
in IEEE International Conference on Computer Vision (ICCV), 2021
[ Paper ] [ Code (soon) ]
H. Zhang, Y. Tian, X. Zhou, W. Ouyang, Y. Liu, L. Wang, Z. Sun
3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop ( oral presentation )
in IEEE International Conference on Computer Vision (ICCV), 2021
[ Paper ] [ Code ]
T. Lu, L. Wang, G. Wu
CGA-Net: Category Guided Aggregation for Point Cloud Semantic Segmentation
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
[ Paper ] [ Code (soon) ]
L. Wang, Z. Tong, B. Ji, G. Wu
TDN: Temporal Difference Networks for Efficient Action Recognition
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
[ Paper ] [ Code ]
Z. Wang, Z. Gao, L. Wang, Z. Li, G. Wu
Boundary-Aware Cascade Networks for Temporal Action Segmentation
in European Conference on Computer Vision (ECCV), 2020
[ Paper ] [ Code ]
J. Wu, Z. Kuang, L. Wang, W. Zhang, G. Wu
Context-Aware RCNN: a Baseline for Action Detection in Videos
in European Conference on Computer Vision (ECCV), 2020
[ Paper ] [ Code ]
Y. Li, Z. Wang, L. Wang, G. Wu
Actions as Moving Points
in European Conference on Computer Vision (ECCV), 2020
[ Paper ] [ Code ]
C. Gao, Q. Liu, Q. Xu, L. Wang, J. Liu, C. Zou
SketchyCOCO: Image Generation from Freehand Scene Sketches ( oral presentation )
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020
[ Paper ] [ Code ]
Y. Li, B. Ji, X. Shi, J. Zhang, B. Kang, L. Wang
TEA: Temporal Excitation and Aggregation for Action Recognition
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020
[ Paper ] [ Code ]
S. Zhang, S. Guo, W. Huang, M. Scott, L. Wang
V4D: 4D Convolutional Neural Networks for Video-Level Representation Learning
in International Conference on Learning Representations (ICLR), 2020
[ Paper ] [ Code ]
Z. Gao, L. Wang, and G. Wu
LIP: Local Importance-based Pooling
in IEEE International Conference on Computer Vision (ICCV), 2019.
[ Paper ] [ BibTex ] [ Code ]
J. Wu, L. Wang, L. Wang, J. Guo, and G. Wu
Learning Actor Relation Graphs for Group Activity Recognition
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
[ Paper ] [ BibTex ] [ Code ]
D. Du, L. Wang, H. Wang, K. Zhao, G. Wu
Translate-to-Recognize Networks for RGB-D Scene Recognition
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
[ Paper ] [ BibTex ] [ Code ] [ Project Page ]
J. Guo, Z. Zhou, and L. Wang
Single Image Highlight Removal with a Sparse and Low-Rank Reflection Model
in European Conference on Computer Vision (ECCV), 2018.
[ Paper ]
L. Wang, W. Li, W. Li, and L. Van Gool
Appearance-and-Relation Networks for Video Classification
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.
[ Paper ] [ Code ]
Y. Zhao, Y. Xiong, L. Wang, Z. Wu, X. Tang, and D. Lin
Temporal Action Detection with Structured Segment Networks
in IEEE International Conference on Computer Vision (ICCV), 2017.
[ Paper ] [ BibTex ] [ Project Page ] [ Code ]
L. Wang, Y. Xiong, D. Lin, and L. Van Gool
UntrimmedNets for Weakly Supervised Action Recognition and Detection
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017.
[ Paper ] [ BibTex ] [ Poster ] [ Code ]
J. Song, L. Wang, L. Van Gool, and O. Hilliges
Thin-Slicing Network: A Deep Structural Model for Human Pose Estimation in Videos ( oral presentation )
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017.
[ Paper ] [ BibTex ] [ Project Page ]
L. Wang, Y. Xiong, Z. Wang, Y. Qiao, D. Lin, X. Tang, and L. Van Gool
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition
in European Conference on Computer Vision (ECCV), 2016.
major contribution to the winner solution of ActivityNet challenge 2016
[ Paper ] [ BibTex ] [ Poster ] [ Code ]
L. Wang, Y. Qiao, X. Tang, and L. Van Gool
Actionness Estimation Using Hybrid Fully Convolutional Networks
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
[ Paper ] [ BibTex ] [ Poster ] [ Project Page ] [ Code ]
B. Zhang, L. Wang, Z. Wang, Y. Qiao, and H. Wang
Real-time Action Recognition with Enhanced Motion Vector CNNs
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
[ Paper ] [ BibTex ] [ Poster ] [ Project Page ] [ Code ]
L. Wang, Y. Qiao, and X. Tang
Action Recognition with Trajectory-Pooled Deep-Convolutional Descriptors
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
[ Paper ] [ BibTex ] [ Poster ] [ Extended Abstract ] [ Project Page ] [ Code ]
L. Wang, Y. Qiao, and X. Tang
Video Action Detection with Relational Dynamic-Poselets
in European Conference on Computer Vision (ECCV), 2014.
[ Paper ] [ BibTex ] [ Poster ] [ Spotlight ] [ Code ]
Z. Cai, L. Wang, X. Peng, and Y. Qiao
Multi-View Super Vector for Action Recognition ( oral presentation )
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014.
[ Paper ] [ BibTex ] [ Video Spotlight ] [ Oral Presentation ] [ Poster ] [ Supplement ] [ Code ]
X. Peng*, L. Wang*, Y. Qiao, and Q. Peng (* indicates equal contribution)
Boosting VLAD with Supervised Dictionary Learning and High-Order Statistics
in European Conference on Computer Vision (ECCV), 2014.
[ Paper ] [ BibTex ]
L. Wang, Y. Qiao, and X. Tang
Mining Motion Atoms and Phrases for Complex Action Recognition
in IEEE International Conference on Computer Vision (ICCV), 2013.
[ Paper ] [ BibTex ] [ Poster ] [ Spotlight ] [ Project Page ]
L. Wang, Y. Qiao, and X. Tang
Motionlets: Mid-Level 3D Parts for Human Motion Recognition
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013.
[ Paper ] [ BibTex ] [ Poster ] [ Spotlight ] [ Project Page ]

Other Conference Papers

G. Chen, Y. Zhen, L. Wang, and T. Lu
DCAN: Improving Temporal Action Detection via Dual Context Aggregation
in AAAI Conference on Artificial Intelligence (AAAI), 2022
[ Paper ] [ Code ]
Z. Wang, L. Wang, T. Wu, T. Li, and G. Wu
Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding
in AAAI Conference on Artificial Intelligence (AAAI), 2022
[ Paper ] [ Code ]
Z. Zhu, L. Wang, S. Guo, and G. Wu
A Closer Look at Few-Shot Video Classification: A New Baseline and Benchmark
in British Machine Vision Conference (BMVC), 2021.
[ Paper ] [ Code ]
Z. Liu, D. Luo, Y. Wang, L. Wang, Y. Tai, C. Wang, J. Li, F. Huang, T. Lu
TEINet: Towards an Efficient Architecture for Video Recognition
in AAAI Conference on Artificial Intelligence (AAAI), 2020
[ Paper ] [ BibTex ]
S. Zhang, S. Guo, L. Wang, W. Huang, M. Scott
Knowledge Integration Networks for Action Recognition
in AAAI Conference on Artificial Intelligence (AAAI), 2020
[ Paper ] [ BibTex ]
Y. Li, W. Lin, T. Wang, J. See, R. Qian, N. Xu, L. Wang, S. Xu
Finding Action Tubes with a Sparse-to-Dense Framework
in AAAI Conference on Artificial Intelligence (AAAI), 2020
[ Paper ] [ BibTex ]
Y. Yao, Z. Sun, F. Shen, L. Liu, L. Wang, F. Zhu, L. Ding, G. Wu, L. Shao
Dynamically Visual Disambiguation of Keyword-based Image Search
in International Joint Conference on Artificial Intelligence (IJCAI), 2019
[ Paper ] [ BibTex ]
D. He, Z. Zhou, C. Gan, F. Li, X. Liu, Y. Li, L. Wang, S. Wen
StNet: Local and Global Spatial-Temporal Modeling for Action Recognition
in AAAI Conference on Artificial Intelligence (AAAI), 2019
[ Paper ] [ BibTex ]
Z. Wang, X. Liu, L. Chen, L. Wang, Y. Qiao, X. Xie, and C. Fowlkes
Structed Triplets Learning with Pos-tag Guided Attention for Visual Question Answering
in IEEE Winter Conference on Applications of Computer Vision (WACV), 2018.
[ Paper ] [ BibTex ]
Y. Wang, J. Song, L. Wang, L. Van Gool, and O. Hilliges
Two-Stream SR-CNNs for Action Recognition in Videos
in British Machine Vision Conference (BMVC), 2016.
[ Paper ] [ BibTex ]
Z. Wang, Y. Wang, L. Wang, and Y. Qiao
Codebook Enhancement of VLAD Representation for Visual Recognition
in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016.
[ Paper ] [ BibTex ]
X. Peng, L. Wang, Y. Qiao, and Q. Peng
A Joint Evaluation of Dictionary Learning and Feature Encoding for Action Recognition
in International Conference on Pattern Recognition (ICPR), 2014.
[ Paper ] [ BibTex ]
X. Wang, L. Wang, and Y. Qiao
A Comparative Study of Encoding, Pooling and Normalization Methods for Action Recognition
in Asian Conference on Computer Vision (ACCV), 2012.
[ Paper ] [ BibTex ] [ Poster ] [ Spotlight ]
L. Wang, Y. Wu, T. Lu, and K. Chen
Multiclass Object Detection by Combining Local Appearances and Context
in ACM International Conference on Multimedia (ACM MM), 2011.
[ Paper ] [ BibTex ]
L. Wang, Y. Wu, Z. Tian, Z. Sun, and T. Lu
A Novel Approach for Robust Surveillance Video Content Abstraction
in Pacific-Rim Conference on Multimedia (PCM), 2010.
[ Paper ] [ BibTex ]

Workshop and Notebook Papers

Y. Xiong, L. Wang, Z. Wang, B. Zhang, H. Song, W. Li, D. Lin, Y. Qiao, L. Van Gool, and X. Tang
CUHK & ETHZ & SIAT Submission to ActivityNet Challenge 2016 ( rank 1st place )
in ActivityNet Large Scale Activity Recognition Challenge, CVPR, 2016.
[ Paper ] [ BibTex] [ Presentation ]
L. Wang, Z. Wang, S. Guo, and Y. Qiao
Better Exploiting OS-CNNs for Better Event Recognition in Images
in ChaLearn Looking at People (LAP) Challenge, ICCV, 2015.
[ Paper ] [ BibTex ] [ Presentation ] [ Project Page ]
L. Wang, Z. Wang, Y. Xiong, and Y. Qiao
CUHK&SIAT Submission for THUMOS15 Action Recognition Challenge
in THUMOS'15 Action Recognition Challenge, CVPR, 2015.
[ Paper ] [ BibTex ] [ Presentation ]
L. Wang, Z. Wang, W. Du, and Y. Qiao
Object-Scene Convolutional Neural Networks for Event Recognition in Images ( rank 1st place )
in ChaLearn Looking at People (LAP) Challenge, CVPR, 2015.
[ Paper ] [ BibTex ] [ Presentation ] [ Project Page ]
Z. Wang, L. Wang, W. Du, and Y. Qiao
Exploring Fisher Vector and Deep Networks for Action Spotting ( rank 1st place )
in ChaLearn Looking at People (LAP) Challenge, CVPR, 2015.
[ Paper ] [ BibTex ] [ Presentation ]
L. Wang, Y. Qiao, and X. Tang
Action Recognition and Detection by Combining Motion and Appearance Features,
in THUMOS'14 Action Recognition Challenge, ECCV, 2014.
[ Paper ] [ BibTex ] [ Presentation ]
X. Peng, L. Wang, Z. Cai, and Y. Qiao
Action and Gesture Temporal Spotting with Super Vector Representation ( rank 1st place )
in ChaLearn Looking at People (LAP) Challenge, ECCV, 2014.
[ Paper ] [ BibTex ] [ Presentation ]
X. Peng, L. Wang, Z. Cai, and Y. Qiao,
Hybrid Super Vector with Improved Dense Trajectories for Action Recognition,
in THUMOS'13 Action Recognition Challenge, ICCV, 2013.
[ Paper ] [ BibTex ]

Hu Zhuhua (胡祝华)

Technical Reports

Journal Papers

CVPR/ICCV/ECCV/ICLR Papers

Other Conference Papers

Workshop and Notebook Papers