Selected Publications
OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation
Neural Information Processing Systems (NeurIPS D&B), 2025
Introducing OpenS2V-Nexus which consists of: (i) OpenS2V-Eval, a fine-grained benchmark, and (ii) OpenS2V-5M, a million-scale dataset.
Identity-Preserving Text-to-Video Generation by Frequency Decomposition
IEEE Conference on Computer Vision and Pattern Recognition (CVPR Highlight), 2025
We present ConsisID, an identity-preserving text-to-video generation model, which can keep human-identity consistent in the generated video.
ChronoMagic-Bench : A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
Neural Information Processing Systems (NeurIPS D&B Spotlight), 2024
We present ChronoMagic-Bench, a benchmark for metamorphic evaluation of text-to-time-lapse video generation, can reflect the physical prior capacity of the T2V model.
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
We are thrilled to present MagicTime, a metamorphic time-lapse video generation model and a new dataset ChronoMagic, support U-Net or DiT-based T2V frameworks.
Selected Projects
Patents
An acceleration system and method for deconvolution calculation in neural networks
CN202210582998.5 / CN114821262A
A cleaning device for exterior windows of high-rise buildings
CN201821848303.9 / CN209678371U
FPGA-based mixed-precision data frequency domain convolution acceleration method and system
CN201821848303.9 / CN209678371U
Awards
- National Scholarship Award, PRC (2025)
- Pacemaker to Merit Student, Peking University (2025)
- National Scholarship Award, PRC (2023)
- National Scholarship Award, PRC (2022)
- National Scholarship Award, PRC (2021)
Academic Service
- ICCV 2025, NeurIPS 2025, CVPR 2025, SIGGRAPH 2025, ACM MM 2024, ACM MM 2023