2026

  • P2Voxel: Pyramid Pivot Voxelization for 3D Mesh Tokenization Manuscript First
    Zhenhong Sun, Haozhe Liu, Yifu Wang, Xibin Song, Steve Wang, Huadong Mo, Daoyi Dong, Hongdong Li, Pan Ji
    Submitted to a Top-Tier Machine Learning Conference, 2026.
  • Hi-TOPS: Hierarchical Topology-aware Scoring Prior for 3D Part Decomposition Manuscript Co-First
    Ruoyu Wu, Zhenhong Sun, Xiaoming Gong, Yuxin Xian, Zhi Wang, Yawen Chen, Huadong Mo, Daoyi Dong
    Submitted to a Top-Tier Graphics Conference, 2026.
  • Look-Before-Move: Narrative-Grounded World Visual Attention in Dynamic 3D Story Worlds Manuscript Leader
    Jiaming Bian, Bingliang Li, Yuehao Wu, Pichao Wang, Zhi Wang, Hailan Ma, Huadong Mo, Zhenhong Sun
    Under Review, arXiv preprint arXiv:2606.26964, 2026.
  • XTalker: Turn, Smile, and Speak in Controllable Talking Portrait Animation Manuscript First
    Zhenhong Sun, Beier Wang, Zhicheng Zhang, Zhongju Wang, Yu Zhang, Hailan Ma, Daoyi Dong, Huadong Mo, Ming Lin
    Submitted to IEEE Transactions on Cybernetics, 2026.
  • Social Structure Matters in 3D Human-Human Interaction Generation Manuscript Leader
    Zhongju Wang, Beier Wang, Yatao Bian, Pichao Wang, Zhi Wang, Daoyi Dong, Hongdong Li, Huadong Mo, Zhenhong Sun
    Under Review, arXiv preprint arXiv:2606.24255, 2026.
  • Escaping Confidence Trap: Evolutionary Decoding for Mathematical Reasoning in Diffusion LLMs Manuscript First
    Zhenhong Sun, Hanqing Zhao, Yatao Bian, Rong-Cheng Tu, Liuyue Xie, Xu Zhang, Jue Wang, Davide Modolo, Daoyi Dong, Dacheng Tao
    Submitted to a Top-Tier Machine Learning Conference, 2026.
  • From Conflict to Consensus: Boosting Medical Reasoning via Multi-Round Agentic RAG Conference
    Wenhao Wu, Zhentao Tang, Yafu Li, Shixiong Kai, Mingxuan Yuan, Zhenhong Sun, Chunlin Chen, Zhi Wang
    International Conference on Machine Learning (ICML 2026).
  • Scalable In-Context Q-Learning Conference
    Jinmei Liu, Fuhong Liu, Zhenhong Sun, Jianye Hao, Huaxiong Li, Bo Wang, Daoyi Dong, Chunlin Chen, Zhi Wang
    International Conference on Learning Representations (ICLR 2026).
  • StoryBlender: Inter-Shot Consistent and Editable 3D Storyboard with Spatial-temporal Dynamics Preprint Co-First Leader
    Bingliang Li, Zhenhong Sun, Jiaming Bian, Yuehao Wu, Yifu Wang, Hongdong Li, Yatao Bian, Huadong Mo, Daoyi Dong
    arXiv preprint arXiv:2604.03315, 2026.
  • 3DXTalker: Unifying Identity, Lip Sync, Emotion, and Spatial Dynamics in Expressive 3D Talking Avatars Preprint Co-First Leader
    Zhongju Wang, Zhenhong Sun, Beier Wang, Yifu Wang, Daoyi Dong, Huadong Mo, Hongdong Li
    arXiv preprint arXiv:2602.10516, 2026.
  • Learning Hierarchical Time-Frequency Representation for Long-Term Time Series Forecasting Journal
    Zhongju Wang, Zhenhong Sun, Yatao Bian, Huadong Mo, Daoyi Dong
    Information Processing & Management, 63(2):104358, 2026.
  • Beyond the Dirac Delta: Mitigating Diversity Collapse in Reinforcement Fine-Tuning for Versatile Image Generation Preprint
    Jinmei Liu, Haoru Li, Zhenhong Sun, Chaofeng Chen, Yatao Bian, Bo Wang, Daoyi Dong, Chunlin Chen, Zhi Wang
    arXiv preprint arXiv:2601.12401, 2026.
  • T3-S2S: Training-free Triplet Tuning for Sketch to Scene Generation Journal First
    Zhenhong Sun, Yifu Wang, Yonhon Ng, Yunfei Duan, Daoyi Dong, Hongdong Li, Pan Ji
    Transactions on Machine Learning Research (TMLR), 2026.

2025

  • Text-to-Decision Agent: Offline Meta-Reinforcement Learning from Natural Language Supervision Conference Co-Corr
    Shilin Zhang, Zican Hu, Wenhao Wu, Xinyi Xie, Jianxiang Tang, Chunlin Chen, Daoyi Dong, Yu Cheng, Zhenhong Sun, Zhi Wang
    Conference on Neural Information Processing Systems (NeurIPS 2025), 2025.
  • Hierarchical and Step-Layer-Wise Tuning of Attention Specialty for Multi-Instance Synthesis in Diffusion Transformers Preprint Co-First Leader
    Chunyang Zhang, Zhenhong Sun, Zhicheng Zhang, Junyan Wang, Yu Zhang, Dong Gong, Huadong Mo, Daoyi Dong
    arXiv preprint arXiv:2504.10148, published April 14, 2025.
  • Learning Informative Latent Representation for Quantum State Tomography Journal Co-Corr
    Hailan Ma, Zhenhong Sun, Daoyi Dong, Dong Gong
    IEEE Transactions on Emerging Topics in Computational Intelligence, 2025.
  • Tomography of Quantum States From Structured Measurements via Quantum-Aware Transformer Journal Co-Corr
    Hailan Ma, Zhenhong Sun, Daoyi Dong, Chunlin Chen, Herschel Rabitz
    IEEE Transactions on Cybernetics, 55(6):2571-2582, 2025.

2024

  • EGGen: Image Generation with Multi-Entity Prior Learning through Entity Guidance Conference First
    Zhenhong Sun, Junyan Wang, Zhiyu Tan, Daoyi Dong, Hailan Ma, Hao Li, Dong Gong
    ACM International Conference on Multimedia (ACM MM 2024), 2024.
  • Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User's Casual Sketches Preprint
    Yongzhi Xu, Yonhon Ng, Yifu Wang, Inkyu Sa, Yunfei Duan, Zhenhong Sun, Yang Li, Pan Ji, Hongdong Li
    arXiv preprint arXiv:2408.04567, 2024.
  • Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation Conference
    Junyan Wang, Zhenhong Sun, Zhiyu Tan, Xuanbai Chen, Weihua Chen, Hao Li, Cheng Zhang, Yang Song
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024), 2024.

2023

  • Estimation of Quantum Channels Using Neural Networks Conference
    Hailan Ma, Zhenhong Sun, Shuixin Xiao, Daoyi Dong, Ian R. Petersen
    62nd IEEE Conference on Decision and Control (CDC 2023), 1195-1200, 2023.
  • Maximizing Spatio-Temporal Entropy of Deep 3D CNNs for Efficient Video Recognition Conference Co-First
    Junyan Wang, Zhenhong Sun, Yichen Qian, Dong Gong, Xiuyu Sun, Ming Lin, Maurice Pagnucco, Yang Song
    International Conference on Learning Representations (ICLR 2023, Equal First), 2023.

2022

  • MAE-DET: Revisiting Maximum Entropy Principle in Zero-Shot NAS for Efficient Object Detection Conference First
    Zhenhong Sun, Ming Lin, Xiuyu Sun, Zhiyu Tan, Hao Li, Rong Jin
    International Conference on Machine Learning (ICML 2022), 20810-20826, 2022.
  • Entropy-Driven Mixed-Precision Quantization for Deep Network Design Conference First
    Zhenhong Sun, Ce Ge, Junyan Wang, Ming Lin, Hesen Chen, Hao Li, Xiuyu Sun
    Conference on Neural Information Processing Systems (NeurIPS 2022), 2022.
  • Jmpnet: Joint Motion Prediction for Learning-Based Video Compression Conference
    Dongyang Li, Zhenhong Sun, Zhiyu Tan, Xiuyu Sun, Fangyi Zhang, Yichen Qian, Hao Li
    International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022), 2022.

2021

  • Zen-NAS: A Zero-Shot NAS for High-Performance Image Recognition Conference
    Ming Lin, Pichao Wang, Zhenhong Sun, Hesen Chen, Xiuyu Sun, Qi Qian, Hao Li, Rong Jin
    International Conference on Computer Vision (ICCV 2021), 347-356, 2021.
  • Learning Accurate Entropy Model with Global Reference for Image Compression Conference
    Yichen Qian, Zhiyu Tan, Xiuyu Sun, Ming Lin, Dongyang Li, Zhenhong Sun, Hao Li, Rong Jin
    International Conference on Learning Representations (ICLR 2021), 2021.
  • Interpolation Variable Rate Image Compression Conference First
    Zhenhong Sun, Zhiyu Tan, Xiuyu Sun, Fangyi Zhang, Yichen Qian, Dongyang Li, Hao Li
    ACM International Conference on Multimedia (ACM MM 2021), 5574-5582, 2021.
  • Spatiotemporal Entropy Model Is All You Need for Learned Video Compression Preprint
    Zhenhong Sun, Zhiyu Tan, Xiuyu Sun, Fangyi Zhang, Dongyang Li, Yichen Qian, Hao Li
    arXiv preprint arXiv:2104.06083, 2021.

2019

  • End-to-end Optimized Image Compression with Attention Mechanism Workshop
    Lei Zhou, Zhenhong Sun, Xiangji Wu, Junmin Wu
    Winner of 4 tracks in the Challenge on Learned Image Compression at CVPR 2019, 2019.