Publications
Publication list selected from the public Google Scholar profile.
2026
-
P2Voxel: Pyramid Pivot Voxelization for 3D Mesh Tokenization Manuscript FirstSubmitted to a Top-Tier Machine Learning Conference, 2026.
-
Hi-TOPS: Hierarchical Topology-aware Scoring Prior for 3D Part Decomposition Manuscript Co-FirstSubmitted to a Top-Tier Graphics Conference, 2026.
-
Look-Before-Move: Narrative-Grounded World Visual Attention in Dynamic 3D Story Worlds Manuscript LeaderUnder Review, arXiv preprint arXiv:2606.26964, 2026.
-
XTalker: Turn, Smile, and Speak in Controllable Talking Portrait Animation Manuscript FirstSubmitted to IEEE Transactions on Cybernetics, 2026.
-
Social Structure Matters in 3D Human-Human Interaction Generation Manuscript LeaderUnder Review, arXiv preprint arXiv:2606.24255, 2026.
-
Escaping Confidence Trap: Evolutionary Decoding for Mathematical Reasoning in Diffusion LLMs Manuscript FirstSubmitted to a Top-Tier Machine Learning Conference, 2026.
-
From Conflict to Consensus: Boosting Medical Reasoning via Multi-Round Agentic RAG ConferenceInternational Conference on Machine Learning (ICML 2026).
-
Scalable In-Context Q-Learning ConferenceInternational Conference on Learning Representations (ICLR 2026).
-
StoryBlender: Inter-Shot Consistent and Editable 3D Storyboard with Spatial-temporal Dynamics Preprint Co-First LeaderarXiv preprint arXiv:2604.03315, 2026.
-
3DXTalker: Unifying Identity, Lip Sync, Emotion, and Spatial Dynamics in Expressive 3D Talking Avatars Preprint Co-First LeaderarXiv preprint arXiv:2602.10516, 2026.
-
Learning Hierarchical Time-Frequency Representation for Long-Term Time Series Forecasting JournalInformation Processing & Management, 63(2):104358, 2026.
-
Beyond the Dirac Delta: Mitigating Diversity Collapse in Reinforcement Fine-Tuning for Versatile Image Generation PreprintarXiv preprint arXiv:2601.12401, 2026.
-
T3-S2S: Training-free Triplet Tuning for Sketch to Scene Generation Journal FirstTransactions on Machine Learning Research (TMLR), 2026.
2025
-
Text-to-Decision Agent: Offline Meta-Reinforcement Learning from Natural Language Supervision Conference Co-CorrConference on Neural Information Processing Systems (NeurIPS 2025), 2025.
-
Hierarchical and Step-Layer-Wise Tuning of Attention Specialty for Multi-Instance Synthesis in Diffusion Transformers Preprint Co-First LeaderarXiv preprint arXiv:2504.10148, published April 14, 2025.
-
Learning Informative Latent Representation for Quantum State Tomography Journal Co-CorrIEEE Transactions on Emerging Topics in Computational Intelligence, 2025.
-
Tomography of Quantum States From Structured Measurements via Quantum-Aware Transformer Journal Co-CorrIEEE Transactions on Cybernetics, volume 55, issue 6, pages 2571-2582, 2025.
2024
-
EGGen: Image Generation with Multi-Entity Prior Learning through Entity Guidance Conference FirstACM Multimedia 2024, 2024.
-
Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User's Casual Sketches PreprintarXiv preprint arXiv:2408.04567, 2024.
-
Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation ConferenceIEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024), 2024.
2023
-
Estimation of Quantum Channels Using Neural Networks Conference62nd IEEE Conference on Decision and Control (CDC 2023), 1195-1200, 2023.
-
Maximizing Spatio-Temporal Entropy of Deep 3D CNNs for Efficient Video Recognition Conference Co-FirstInternational Conference on Learning Representations (ICLR 2023, Equal First), 2023.
2022
-
MAE-DET: Revisiting Maximum Entropy Principle in Zero-Shot NAS for Efficient Object Detection Conference FirstInternational Conference on Machine Learning (ICML 2022), 20810-20826, 2022.
-
Entropy-Driven Mixed-Precision Quantization for Deep Network Design Conference FirstConference on Neural Information Processing Systems (NeurIPS 2022), 2022.
-
Jmpnet: Joint Motion Prediction for Learning-Based Video Compression ConferenceInternational Conference on Acoustics, Speech and Signal Processing (ICASSP 2022), 2022.
2021
-
Zen-NAS: A Zero-Shot NAS for High-Performance Image Recognition ConferenceInternational Conference on Computer Vision (ICCV 2021), 347-356, 2021.
-
Learning Accurate Entropy Model with Global Reference for Image Compression ConferenceInternational Conference on Learning Representations (ICLR 2021), 2020.
-
Interpolation Variable Rate Image Compression Conference FirstACM International Conference on Multimedia (ACM MM 2021), 5574-5582, 2021.
-
Spatiotemporal Entropy Model Is All You Need for Learned Video Compression PreprintarXiv preprint arXiv:2104.06083, 2021.
2019
-
End-to-end Optimized Image Compression with Attention Mechanism Workshop4 Tracks Winner of the Challenge on Learned Image Compression on CVPR 2019, 2019.