My research lies at the intersection of generative models, computer vision, and multimodal learning.
I develop efficient architectures, representations, and inference-time algorithms for visual perception and reasoning, drawing on insights from mathematical modeling and human visual perception. More broadly, I aim to build perceptually grounded visual systems that generalize robustly and learn efficiently.
Some papers are highlighted.

We introduce derivative representation alignment (dREPA) for image-to-video generation and show that it improves subject consistency and generalization across artistic styles.

We show that a pixel-space video diffusion model trained from scratch accurately estimates shape and material from short videos, and produces diverse shape and material samples for ambiguous input images.


We present a bottom-up, patch-based diffusion model for monocular shape from shading that produces multimodal outputs, similar to multistable perception in humans.

We present new theoretical insight into the equivalence of multi-task and single-task learning for stationary kernels, and develop MPHD for model pre-training on heterogeneous domains.


We present a neural model that infers a curvature field from shading images and is invariant to lighting and texture variations, drawing on perceptual insights and mathematical derivations.


Auxiliary objectives and instruction augmentation improve spatial reasoning in the 'blocks world' task, especially when training data is limited.


We introduce ForkGAN, a task-agnostic image translation model that effectively disentangles domain-specific and domain-invariant information.

Outside of research, I enjoy visiting art museums, watching movies, and reading about philosophy and psychology.