Publications Technical Reports Y. Xiong, Y. Zhao, L. Wang , D. Lin, and X. Tang A Pursuit of Temporal Accuracy in General Activity Detection in arXiv 1703.03329 L. Wang , S. Guo, W. Huang, and Y. Qiao Places205-VGGNet Models for Scene Recognition in arXiv 1508.01667. L. Wang , Y. Xiong, Z. Wang, and Y. Qiao Towards good practices for very deep two-stream ConvNets in arXiv 1507.02159. Journal Papers D. Du, L. Wang , Z. Li, G. Wu Cross-Modal Pyramid Translation for RGB-D Scene Recognition Journal extension of TRecgNet with pyramid translation extension. in International Journal of Computer Vision (IJCV ), Volume 12, Issue 8, Pages 2309-2327, 2021. [ Paper ] [ Code ] Z. Ruan, C. Zou, L. Wu, G. Wu, and L. Wang SADRNet: Self-Aligned Dual Face Regression Networks for Robust 3D Face Alignment and Reconstruction in IEEE Transactions on Image Processing (TIP ), Volume 30, Pages 5739-5806, 2021. [ Paper ] [ Code ] Y. Zheng, Z. Liu, Tong Lu, and L. Wang Dynamic Sampling Networks for Efficient Action Recognition in Videos A dynamic version of TSN for efficient action recognition in IEEE Transactions on Image Processing (TIP ), Volume 29, Pages 7970-7983, 2020. [ Paper ] [ BibTex ] Y. Zhao, Y. Xiong, L. Wang , Z. Wu, X. Tang, and D. Lin Temporal Action Detection with Structured Segment Networks Journal extension of SSN with more extensive study in International Journal of Computer Vision (IJCV ), Volume 128, Issue 1, Pages 74-95, 2020. [ Paper ] [ BibTex ] [ Code ] L. Wang , Y. Xiong, Z. Wang, Y. Qiao, D. Lin, X. Tang, and L. Van Gool Temporal Segment Networks for Action Recognition in Videos More extensive study on TSN and adding performance of I3D+TSN on Kinetics in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI ), Volume 41, Issue 11, Pages 2740-2755, 2019. [ Paper ] [ BibTex ] [ Code ] B. Zhang, L. Wang , Z. Wang, Y. Qiao, and H. Wang Real-Time Action Recognition with Deeply-Transferred Motion Vector CNNs in IEEE Transactions on Image Processing (TIP ), Volume 27, Issue 5, Pages 2326-2339, 2018. [ Paper ] [ BibTex ] [ Code ] L. Wang , Z. Wang, Y. Qiao, and L. Van Gool Transferring Deep Object and Scene Representations for Event Recognition in Still Images rank 1st place in cultural event recognition at ChaLearn LAP challenge CVPR 2015 in International Journal of Computer Vision (IJCV ), Volume 126, Issue 2-4, Pages 390-409, 2018. [ Paper ] [ BibTex ] [ Code ] L. Wang , S. Guo, W, Huang, Y. Xiong, and Y. Qiao Knowledge Guided Disambiguation for Large-Scale Scene Classification with Multi-Resolution CNNs rank 1st place at LSUN challenge 2016 and 2nd place at Places challenge 2015 in IEEE Transactions on Image Processing (TIP ), Volume 26, Issue 4, Pages 2055-2068, 2017. [ Paper ] [ BibTex ] [ Code ] Z. Wang, L. Wang , Y. Wang, B. Zhang, and Y. Qiao Weakly Supervised PatchNets: Describing and Aggregating Local Patches for Scene Recognition in IEEE Transactions on Image Processing (TIP ), Volume 26, Issue 4, Pages 2018-2041, 2017. [ Paper ] [ BibTex ] [ Code ] S. Guo, W. Huang, L. Wang , and Y. Qiao Locally Supervised Deep Hybrid Model for Scene Recognition in IEEE Transactions on Image Processing (TIP ), Volume 26, Issue 2, Pages 808-820, 2017. [ Paper ] [ BibTex ] Z. Yuan, H. Wang, L. Wang , T. Lu, P. Shivakumara, and C. L. Tan Modeling Spatial Layout for Scene Image Understanding via a Novel Multiscale Sum-Product Network in Expert Systems With Applications (ESWA ), Volume 63, Pages 231-240, 2016. [ Paper ] [ BibTex ] X. Peng, L. Wang , X. Wang, and Y. Qiao Bag of Visual Words and Fusion Methods for Action Recognition: Comprehensive Study and Good Practice in Computer Vision and Image Understanding (CVIU ), Volume 150, Pages 109-125, 2016. [ Paper ] [ BibTex ] L. Wang , Y. Qiao, and X. Tang MoFAP: A Multi-Level Representation for Action Recognition in International Journal of Computer Vision (IJCV ), Volume 119, Issue 3, Pages 254-271, 2016. [ Paper ] [ BibTex ] L. Wang , Y. Qiao, and X. Tang Latent Hierarchical Model of Temporal Structure for Complex Activity Classification in IEEE Transactions on Image Processing (TIP ), Volume 23, Issue 2, Pages 810-822, 2014. [ Paper ] [ BibTex ] CVPR/ICCV/ECCV/ICLR Papers Y. Cui, C. Jiang, L. Wang , G. Wu MixFormer: End-to-End Tracking with Iterative Mixed Attention in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2022 [ Paper ] [ Code ] Z. Gao, L. Wang , B. Han, S. Guo AdaMixer: A Simple and Accurate Query-based Object Detector in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2022 [ Paper ] [ Code ] L. Zhao, L. Wang Decoupling Classification and Localization for Domain Adaptive Object Detection in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2022 [ Paper ] [ Code ] Y. Teng, L. Wang Structured Sparse R-CNN for Direct Scene Graph Generation in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2022 [ Paper ] [ Code ] J. Lin, H. Duan, K. Chen, D. Lin, L. Wang OCSampler: Compressing Videos to One Clip with Single-step Sampling in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2022 [ Paper ] [ Code ] J. Tang, Z. Liu, C. Qian, W. Wu, L. Wang Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2022 [ Paper ] [ Code ] S. Guo, Z. Xiong, Y. Zhong, L. Wang , X. Guo, B. Han, W. Huang Cross-Architecture Self-supervised Video Representation Learning in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2022 [ Paper ] [ Code ] Y. Li, L. Chen, R. He, Z. Wang, G. Wu, L. Wang MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions in IEEE International Conference on Computer Vision (ICCV ), 2021 A high-quality and fine-grained action detection benchmark [ Paper ] [ Data ] [ Code ] [ Challenge ] T. Li, L. Wang , G. Wu Self Supervision to Distillation for Long-Tailed Visual Recognition in IEEE International Conference on Computer Vision (ICCV ), 2021 [ Paper ] [ Code (soon) ] Z. Gao, L. Wang , G. Wu Mutual Supervision for Dense Object Detection in IEEE International Conference on Computer Vision (ICCV ), 2021 [ Paper ] [ Code (soon) ] Y. Teng, L. Wang , Z. Li, G. Wu Target Adaptive Context Aggregation for Video Scene Graph Generation in IEEE International Conference on Computer Vision (ICCV ), 2021 [ Paper ] [ Code (soon) ] Z. Liu, L. Wang , W. Wu, C. Qian, T. Lu TAM: Temporal Adaptive Module for Video Recognition in IEEE International Conference on Computer Vision (ICCV ), 2021 [ Paper ] [ Code ] J. Tan, J. Tang, L. Wang , G. Wu Relaxed Transformer Decoders for Direct Action Proposal Generation in IEEE International Conference on Computer Vision (ICCV ), 2021 [ Paper ] [ Code ] Y. Zhi, Z. Tong, L. Wang , G. Wu MGSampler: An Explainable Sampling Strategy for Video Action Recognition in IEEE International Conference on Computer Vision (ICCV ), 2021 [ Paper ] [ Code (soon) ] H. Zhang, Y. Tian, X. Zhou, W. Ouyang, Y. Liu, L. Wang , Z. Sun 3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop ( oral presentation ) in IEEE International Conference on Computer Vision (ICCV ), 2021 [ Paper ] [ Code ] T. Lu, L. Wang , G. Wu CGA-Net: Category Guided Aggregation for Point Cloud Semantic Segmentation in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2021 [ Paper ] [ Code (soon) ] L. Wang , Z. Tong, B. Ji, G. Wu TDN: Temporal Difference Networks for Efficient Action Recognition in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2021 [ Paper ] [ Code ] Z. Wang, Z. Gao, L. Wang , Z. Li, G. Wu Boundary-Aware Cascade Networks for Temporal Action Segmentation in European Conference on Computer Vision (ECCV ), 2020 [ Paper ] [ Code ] J. Wu, Z. Kuang, L. Wang , W. Zhang, G. Wu Context-Aware RCNN: a Baseline for Action Detection in Videos in European Conference on Computer Vision (ECCV ), 2020 [ Paper ] [ Code ] Y. Li, Z. Wang, L. Wang , G. Wu Actions as Moving Points in European Conference on Computer Vision (ECCV ), 2020 [ Paper ] [ Code ] C. Gao, Q. Liu, Q. Xu, L. Wang , J. Liu, C. Zou SketchyCOCO: Image Generation from Freehand Scene Sketches ( oral presentation ) in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2020 [ Paper ] [ Code ] Y. Li, B. Ji, X. Shi, J. Zhang, B. Kang, L. Wang TEA: Temporal Excitation and Aggregation for Action Recognition in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2020 [ Paper ] [ Code ] S. Zhang, S. Guo, W. Huang, M. Scott, L. Wang V4D: 4D Convolutional Neural Networks for Video-Level Representation Learning in International Conference on Learning Representations (ICLR ), 2020 [ Paper ] [ Code ] Z. Gao, L. Wang , and G. Wu LIP: Local Importance-based Pooling in IEEE International Conference on Computer Vision (ICCV ), 2019. [ Paper ] [ BibTex ] [ Code ] J. Wu, L. Wang , L. Wang, J. Guo, and G. Wu Learning Actor Relation Graphs for Group Activity Recognition in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2019. [ Paper ] [ BibTex ] [ Code ] D. Du, L. Wang , H. Wang, K. Zhao, G. Wu Translate-to-Recognize Networks for RGB-D Scene Recognition in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2019. [ Paper ] [ BibTex ] [ Code ] [ Project Page ] J. Guo, Z. Zhou, and L. Wang Single Image Highlight Removal with a Sparse and Low-Rank Reflection Model in European Conference on Computer Vision (ECCV ), 2018. [ Paper ] L. Wang , W. Li, W. Li, and L. Van Gool Appearance-and-Relation Networks for Video Classification in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2018. [ Paper ] [ Code ] Y. Zhao, Y. Xiong, L. Wang , Z. Wu, X. Tang, and D. Lin Temporal Action Detection with Structured Segment Networks in IEEE International Conference on Computer Vision (ICCV ), 2017. [ Paper ] [ BibTex ] [ Project Page ] [ Code ] L. Wang , Y. Xiong, D. Lin, and L. Van Gool UntrimmedNets for Weakly Supervised Action Recognition and Detection in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2017. [ Paper ] [ BibTex ] [ Poster ] [ Code ] J. Song, L. Wang , L. Van Gool, and O. Hilliges Thin-Slicing Network: A Deep Structural Model for Human Pose Estimation in Videos ( oral presentation ) in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2017. [ Paper ] [ BibTex ] [ Project Page ] L. Wang , Y. Xiong, Z. Wang, Y. Qiao, D. Lin, X. Tang, and L. Van Gool Temporal Segment Networks: Towards Good Practices for Deep Action Recognition in European Conference on Computer Vision (ECCV ), 2016. major contribution to the winner solution of ActivityNet challenge 2016 [ Paper ] [ BibTex ] [ Poster ] [ Code ] L. Wang , Y. Qiao, X. Tang, and L. Van Gool Actionness Estimation Using Hybrid Fully Convolutional Networks in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2016. [ Paper ] [ BibTex ] [ Poster ] [ Project Page ] [ Code ] B. Zhang, L. Wang , Z. Wang, Y. Qiao, and H. Wang Real-time Action Recognition with Enhanced Motion Vector CNNs in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2016. [ Paper ] [ BibTex ] [ Poster ] [ Project Page ] [ Code ] L. Wang , Y. Qiao, and X. Tang Action Recognition with Trajectory-Pooled Deep-Convolutional Descriptors in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2015. [ Paper ] [ BibTex ] [ Poster ] [ Extended Abstract ] [ Project Page ] [ Code ] L. Wang , Y. Qiao, and X. Tang Video Action Detection with Relational Dynamic-Poselets in European Conference on Computer Vision (ECCV ), 2014. [ Paper ] [ BibTex ] [ Poster ] [ Spotlight ] [ Code ] Z. Cai, L. Wang , X. Peng, and Y. Qiao Multi-View Super Vector for Action Recognition ( oral presentation ) in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2014. [ Paper ] [ BibTex ] [ Video Spotlight ] [ Oral Presentation ] [ Poster ] [ Supplement ] [ Code ] X. Peng*, L. Wang *, Y. Qiao, and Q. Peng (* indicates equal contribution) Boosting VLAD with Supervised Dictionary Learning and High-Order Statistics in European Conference on Computer Vision (ECCV ), 2014. [ Paper ] [ BibTex ] L. Wang , Y. Qiao, and X. Tang Mining Motion Atoms and Phrases for Complex Action Recognition in IEEE International Conference on Computer Vision (ICCV ), 2013. [ Paper ] [ BibTex ] [ Poster ] [ Spotlight ] [ Project Page ] L. Wang , Y. Qiao, and X. Tang Motionlets: Mid-Level 3D Parts for Human Motion Recognition in IEEE Conference on Computer Vision and Pattern Recognition (CVPR ), 2013. [ Paper ] [ BibTex ] [ Poster ] [ Spotlight ] [ Project Page ] Other Conference Papers G. Chen, Y. Zhen, L. Wang , and T. Lu DCAN: Improving Temporal Action Detection via Dual Context Aggregation in AAAI Conference on Artificial Intelligence (AAAI ), 2022 [ Paper ] [ Code ] Z. Wang, L. Wang , T. Wu, T. Li, and G. Wu Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding in AAAI Conference on Artificial Intelligence (AAAI ), 2022 [ Paper ] [ Code ] Z. Zhu, L. Wang , S. Guo, and G. Wu A Closer Look at Few-Shot Video Classification: A New Baseline and Benchmark in British Machine Vision Conference (BMVC ), 2021. [ Paper ] [ Code ] Z. Liu, D. Luo, Y. Wang, L. Wang , Y. Tai, C. Wang, J. Li, F. Huang, T. Lu TEINet: Towards an Efficient Architecture for Video Recognition in AAAI Conference on Artificial Intelligence (AAAI ), 2020 [ Paper ] [ BibTex ] S. Zhang, S. Guo, L. Wang , W. Huang, M. Scott Knowledge Integration Networks for Action Recognition in AAAI Conference on Artificial Intelligence (AAAI ), 2020 [ Paper ] [ BibTex ] Y. Li, W. Lin, T. Wang, J. See, R. Qian, N. Xu, L. Wang , S. Xu Finding Action Tubes with a Sparse-to-Dense Framework in AAAI Conference on Artificial Intelligence (AAAI ), 2020 [ Paper ] [ BibTex ] Y. Yao, Z. Sun, F. Shen, L. Liu, L. Wang , F. Zhu, L. Ding, G. Wu, L. Shao Dynamically Visual Disambiguation of Keyword-based Image Search in International Joint Conference on Artificial Intelligence (IJCAI ), 2019 [ Paper ] [ BibTex ] D. He, Z. Zhou, C. Gan, F. Li, X. Liu, Y. Li, L. Wang , S. Wen StNet: Local and Global Spatial-Temporal Modeling for Action Recognition in AAAI Conference on Artificial Intelligence (AAAI ), 2019 [ Paper ] [ BibTex ] Z. Wang, X. Liu, L. Chen, L. Wang , Y. Qiao, X. Xie, and C. Fowlkes Structed Triplets Learning with Pos-tag Guided Attention for Visual Question Answering in IEEE Winter Conference on Applications of Computer Vision (WACV ), 2018. [ Paper ] [ BibTex ] Y. Wang, J. Song, L. Wang , L. Van Gool, and O. Hilliges Two-Stream SR-CNNs for Action Recognition in Videos in British Machine Vision Conference (BMVC ), 2016. [ Paper ] [ BibTex ] Z. Wang, Y. Wang, L. Wang , and Y. Qiao Codebook Enhancement of VLAD Representation for Visual Recognition in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP ), 2016. [ Paper ] [ BibTex ] X. Peng, L. Wang , Y. Qiao, and Q. Peng A Joint Evaluation of Dictionary Learning and Feature Encoding for Action Recognition in International Conference on Pattern Recognition (ICPR ), 2014. [ Paper ] [ BibTex ] X. Wang, L. Wang , and Y. Qiao A Comparative Study of Encoding, Pooling and Normalization Methods for Action Recognition in Asian Conference on Computer Vision (ACCV ), 2012. [ Paper ] [ BibTex ] [ Poster ] [ Spotlight ] L. Wang , Y. Wu, T. Lu, and K. Chen Multiclass Object Detection by Combining Local Appearances and Context in ACM International Conference on Multimedia (ACM MM ), 2011. [ Paper ] [ BibTex ] L. Wang , Y. Wu, Z. Tian, Z. Sun, and T. Lu A Novel Approach for Robust Surveillance Video Content Abstraction in Pacific-Rim Conference on Multimedia (PCM ), 2010. [ Paper ] [ BibTex ] Workshop and Notebook Papers Y. Xiong, L. Wang , Z. Wang, B. Zhang, H. Song, W. Li, D. Lin, Y. Qiao, L. Van Gool, and X. Tang CUHK & ETHZ & SIAT Submission to ActivityNet Challenge 2016 ( rank 1st place ) in ActivityNet Large Scale Activity Recognition Challenge, CVPR , 2016. [ Paper ] [ BibTex] [ Presentation ] L. Wang , Z. Wang, S. Guo, and Y. Qiao Better Exploiting OS-CNNs for Better Event Recognition in Images in ChaLearn Looking at People (LAP ) Challenge, ICCV , 2015. [ Paper ] [ BibTex ] [ Presentation ] [ Project Page ] L. Wang , Z. Wang, Y. Xiong, and Y. Qiao CUHK&SIAT Submission for THUMOS15 Action Recognition Challenge in THUMOS'15 Action Recognition Challenge, CVPR , 2015. [ Paper ] [ BibTex ] [ Presentation ] L. Wang , Z. Wang, W. Du, and Y. Qiao Object-Scene Convolutional Neural Networks for Event Recognition in Images ( rank 1st place ) in ChaLearn Looking at People (LAP ) Challenge, CVPR , 2015. [ Paper ] [ BibTex ] [ Presentation ] [ Project Page ] Z. Wang, L. Wang , W. Du, and Y. Qiao Exploring Fisher Vector and Deep Networks for Action Spotting ( rank 1st place ) in ChaLearn Looking at People (LAP ) Challenge, CVPR , 2015. [ Paper ] [ BibTex ] [ Presentation ] L. Wang , Y. Qiao, and X. Tang Action Recognition and Detection by Combining Motion and Appearance Features, in THUMOS'14 Action Recognition Challenge, ECCV , 2014. [ Paper ] [ BibTex ] [ Presentation ] X. Peng, L. Wang , Z. Cai, and Y. Qiao Action and Gesture Temporal Spotting with Super Vector Representation ( rank 1st place ) in ChaLearn Looking at People (LAP ) Challenge, ECCV , 2014. [ Paper ] [ BibTex ] [ Presentation ] X. Peng, L. Wang , Z. Cai, and Y. Qiao, Hybrid Super Vector with Improved Dense Trajectories for Action Recognition, in THUMOS'13 Action Recognition Challenge, ICCV , 2013. [ Paper ] [ BibTex ]