Tianyu Yang

I am a Researcher at Meituan. Prior to that, I worked at Alibaba DAMO Academy, IDEA, and Tencent AI Lab. I received my PhD from City University of Hong Kong, advised by Prof. Antoni B. Chan.

My research interests lie in visual generative modeling, 3D vision, representation learning, and video understanding. Recently, I have been working on video generation.

Recent Publications
WildActor: Unconstrained Identity-Preserving Video Generation
Qin Guo, Tianyu Yang, Xuanhua He, Fei Shen, Yong Zhang, Zhuoliang Kang, Xiaoming Wei, Dan Xu
arXiv, 2026
arXiv  /  project  /  code
Infinite-World: Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory
Ruiqi Wu, Xuanhua He, Meng Cheng, Tianyu Yang, Yong Zhang, Zhuoliang Kang, Xunliang Cai, Xiaoming Wei, Chunle Guo, Chongyi Li, Ming-Ming Cheng
arXiv, 2026
arXiv  /  project  /  code
Active Intelligence in Video Avatars via Closed-loop World Modeling
Xuanhua He, Tianyu Yang, Ke Cao, Ruiqi Wu, Cheng Meng, Yong Zhang, Zhuoliang Kang, Xiaoming Wei, Qifeng Chen
Conference on Computer Vision and Pattern Recognition (CVPR), 2026
arXiv  /  project  /  code
ReEx-SQL: Reasoning with Execution-Aware Reinforcement Learning for Text-to-SQL
Yaxun Dai, Wenxuan Xie, Xialie Zhuang, Tianyu Yang, Yiying Yang, Haiqin Yang, Yuhang Zhao, Pingfu Chao, Wenhao Jiang
Annual Meeting of the Association for Computational Linguistics (ACL), 2026
arXiv
SCOPE: Scale-Consistent One-Pass Estimation of 3D Geometry
Zheng Zhang, Lihe Yang, Tianyu Yang, Chaohui Yu, Yixing Lao, Xiaoyang Guo, Biao Gong, Fan Wang, Hengshuang Zhao
ACM SIGGRAPH Conference Papers (SIGGRAPH), 2026
project
LongCat-Video-Avatar Technical Report
Meituan LongCat Team
Technical Report, 2025
project  /  code  /  hugging face  /  tech report
StableDepth: Scene-Consistent and Scale-Invariant Monocular Depth
Zheng Zhang, Lihe Yang, Tianyu Yang, Chaohui Yu, Xiaoyang Guo, Yixing Lao, Hengshuang Zhao
International Conference on Computer Vision (ICCV), 2025 (Highlight)
arXiv  /  project
Compressed3D: a Compressed Latent Space for 3D Generation from a Single Image
Bowen Zhang, Tianyu Yang, Yu Li, Lei Zhang, Xi Zhao
European Conference on Computer Vision (ECCV), 2024
arXiv  /  project
OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models
Zhe Kong, Yong Zhang, Tianyu Yang, Tao Wang, Kaihao Zhang, Bizhu Wu, Guanying Chen, Wei Liu, Wenhan Luo
European Conference on Computer Vision (ECCV), 2024
arXiv  /  project
AddMe: Zero-shot Group-photo Synthesis by Inserting People into Scenes
Dongxu Yue, Maomao Li, Yunfei Liu, Qin Guo, Ailing Zeng, Tianyu Yang, Yu Li
European Conference on Computer Vision (ECCV), 2024
arXiv  /  project
A Video is Worth 256 bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing
Maomao Li, Yu Li, Tianyu Yang, Yunfei Liu, Dongxu Yue, Zhihui Lin, Dong Xu
Conference on Computer Vision and Pattern Recognition (CVPR), 2024
arXiv  /  project
Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts
Xinhua Cheng, Tianyu Yang, Jianan Wang, Yu Li, Lei Zhang, Jian Zhang, Li Yuan
International Conference on Learning Representations (ICLR), 2024
arXiv  /  project
Symbol as Points: Panoptic Symbol Spotting via Point-based Representation
Wenlong Liu, Tianyu Yang, Yuhan Wang, Qizhi Yu, Lei Zhang
International Conference on Learning Representations (ICLR), 2024
arXiv  /  code
GPAvatar: Generalizable and Precise Head Avatar from Image(s)
Xuangeng Chu, Yu Li, Ailing Zeng, Tianyu Yang, Lijian Lin, Yunfei Liu, Tatsuya Harada
International Conference on Learning Representations (ICLR), 2024
arXiv  /  code
TOSS: High-quality Text-guided Novel View Synthesis from a Single Image
Yukai Shi, Jianan Wang, He Cao, Boshi Tang, Xianbiao Qi, Tianyu Yang, Yukun Huang, Shilong Liu, Lei Zhang, Heung-Yeung Shum
International Conference on Learning Representations (ICLR), 2024
arXiv  /  project
Consistent123: Improve Consistency for One Image to 3D Object Synthesis
Haohan Weng, Tianyu Yang, Jianan Wang, Yu Li, Tong Zhang, C. L. Philip Chen, Lei Zhang
arXiv, 2023
arXiv  /  project
Latent Video Diffusion Models for High-Fidelity Long Video Generation
Yingqing He, Tianyu Yang, Yong Zhang, Ying Shan, Qifeng Chen
arXiv, 2023
arXiv  /  project
Scalable Video Object Segmentation with Simplified Framework
Qiangqiang Wu, Tianyu Yang, Wei Wu, Antoni B. Chan
International Conference on Computer Vision (ICCV), 2023
arXiv  /  code
DropMAE: Masked Autoencoders with Spatial-Attention Dropout for Tracking Tasks
Qiangqiang Wu, Tianyu Yang, Ziquan Liu, Baoyuan Wu, Ying Shan, Antoni B. Chan
Conference on Computer Vision and Pattern Recognition (CVPR), 2023
arXiv  /  code
LocVTP: Video-Text Pre-training for Temporal Localization
Meng Cao, Tianyu Yang, Junwu Weng, Can Zhang, Jue Wang and Yuexian Zou
European Conference on Computer Vision (ECCV), 2022
arXiv  /  code  /  supp
Unsupervised Pre-training for Temporal Action Localization Tasks
Can Zhang, Tianyu Yang, Junwu Weng, Meng Cao, Jue Wang and Yuexian Zou
Conference on Computer Vision and Pattern Recognition (CVPR), 2022
arXiv  /  code  /  supp
Exploring Denoised Cross-video Contrast for Weakly-supervised Temporal Action Localization
Jingjing Li, Tianyu Yang, Wei Ji, Jue Wang and Li Cheng
Conference on Computer Vision and Pattern Recognition (CVPR), 2022
arXiv  /  code
SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-Maximization
Zhihui Lin, Tianyu Yang, Maomao Li, Ziyu Wang, Chun Yuan, Wenhao Jiang, Wei Liu
Conference on Computer Vision and Pattern Recognition (CVPR), 2022
arXiv  /  code  /  supp
Motion-aware Contrastive Video Representation Learning via Foreground-background Merging
Shuangrui Ding, Maomao Li, Tianyu Yang, Rui Qian, Haohang Xu, Qingyi Chen, Jue Wang and Hongkai Xiong
Conference on Computer Vision and Pattern Recognition (CVPR), 2022
arXiv  /  code  /  supp
VideoMoCo: Contrastive Video Representation Learning with Temporally Adversarial Examples
Tian Pan, Yibing Song, Tianyu Yang, Wenhao Jiang, and Wei Liu
Conference on Computer Vision and Pattern Recognition (CVPR), 2021
arXiv  /  code
ROAM: Recurrently Optimizing Tracking Model
Tianyu Yang, Pengfei Xu, Runbo Hu, Hua Chai, and Antoni B. Chan
Conference on Computer Vision and Pattern Recognition (CVPR), 2020
code  /  otb100-results  /  lasot-results
Visual Tracking via Dynamic Memory Networks
Tianyu Yang, Antoni B. Chan
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019
code  /  otb100-results  /  vot2015-results  /  vot2016-results  /  vot2017-results
Learning Dynamic Memory Networks for Object Tracking
Tianyu Yang, Antoni B. Chan
European Conference on Computer Vision (ECCV), 2018
code  /  otb100-results  /  vot2016-results
Density-Preserving Hierarchical EM Algorithm: Simplifying Gaussian Mixture Models for Approximate Inference
Lei Yu, Tianyu Yang, Antoni B. Chan
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2018
Recurrent Filter Learning for Visual Tracking
Tianyu Yang, Antoni B. Chan
Workshop on Visual Object Tracking (VOT) Challenge, ICCV, 2017
code  /  otb100-results  /  vot2016-results
Approximate Inference for Generic Likelihoods via Density-Preserving GMM Simplification
Lei Yu, Tianyu Yang, Antoni B. Chan
Workshop on Advances in Approximate Bayesian Inference, NeurIPS, 2016
Robust Object Tracking With Reacquisition Ability Using Online Learned Detector
Tianyu Yang, Baopu Li, Max Q.-H. Meng
IEEE Transactions on Cybernetics, 2014
Adaptive Visual Tracking with Reacquisition Ability for Arbitrary Objects
Tianyu Yang, Baopu Li, Chao Hu, Max Q.-H. Meng
International Conference on Robotics and Automation (ICRA), 2013
Academic Services

Area Chair: ICLR 2026, NeurIPS 2026

Reviewer: CVPR, ICCV, ECCV, NeurIPS, ICML, ICLR, TPAMI, IJCV, TIP