Hanhui Wang

hhwang_original.png

Room 472, West Village H

440 Huntington Avenue

Boston, Massachusetts

United States of America

Hanhui Wang (王翰辉) is a first-year Ph.D. student at the Visual Intelligence Lab at Northeastern University (NEU), where he is supervised by Prof. Huaizu Jiang. His research centers on generation and reasoning in AI systems, with current interests in controllable video generation, multimodal learning, and 3D/4D vision. His long-term goal is to bridge generative modeling and scene understanding toward building world models capable of causal reasoning and high-fidelity simulation of complex human-scene interactions.

Prior to joining NEU, he received his bachelor’s degree from Huazhong University of Science and Technology (HUST) and his master’s degree from the University of Southern California (USC). He has had the honor of collaborating with Prof. Xianzhi Li, Prof. Zhengzhong Tu, and Prof. Huaizu Jiang on research spanning diverse areas of computer vision. He also gained valuable industry experience through internships at leading technology companies such as iFLYTEK.

Research Keywords: Video Generation, Controllable Generation, World Models, Multimodal Learning, 3D/4D Vision, Generative AI.

News

Sep 18, 2025 :tada: Our paper Struct2D: A Perception-Guided Framework for Spatial Reasoning in Large Multimodal Models is accepted to NeurIPS 2025!
Jun 15, 2025 :tada: First conference, first poster, first time in Tennessee — I had an amazing time at CVPR 2025 in Nashville, the Music City. Grateful to everyone who made it so memorable. I’ll be returning, hungry for more!
Jun 7, 2025 :hammer_and_wrench: After putting it off for way too long, I finally rebuilt my personal website. It’s cleaner, more up to date, and a little closer to how I want to present my work (for now).
Jun 2, 2025 :tada: Our paper Struct2D: A Perception-Guided Framework for Spatial Reasoning in Large Multimodal Models was selected as a spotlight presentation at the CVPR 2025 Workshop on Computer Vision in the Wild (CVinW)! I will be presenting it in Nashville on June 11th!
Feb 26, 2025 :tada: Our paper Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing is accepted to CVPR 2025!
Feb 6, 2025 :tada: I’m excited to share that I have accepted a Ph.D. offer at Northeastern University, where I will be joining the Visual Intelligence Lab under the supervision of Prof. Huaizu Jiang. Looking forward to this new chapter in Boston!

First-Authored Publications

See a full publication list at here.

  1. NeurIPS’25
    Struct2D: A Perception-Guided Framework for Spatial Reasoning in Large Multimodal Models
    Fangrui Zhu*, Hanhui Wang*, Yiming Xie, Jing Gu, Tianye Ding, Jianwei Yang, and Huaizu Jiang
    In The 39th Annual Conference on Neural Information Processing Systems 2025
  2. CVPR’25
    Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing
    Hanhui Wang*, Yihua Zhang*, Ruizheng Bai, Yue Zhao, Sijia Liu, and Zhengzhong Tu
    In The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025
  3. arXiv’2024
    Leveraging SAM for Single-Source Domain Generalization in Medical Image Segmentation
    Hanhui Wang, Huaize Ye, Yi Xia, and Xueyan Zhang
    In arXiv 2024