Sherwin Bahmani

I am a Computer Science PhD student at the University of Toronto, supervised by David Lindell and Andrea Tagliasacchi. I am also a research intern at the NVIDIA Toronto AI Lab led by Sanja Fidler.

I was born and grew up in Germany. I graduated from TU Darmstadt studying Computational Engineering, advised by Stefan Roth. I was a research intern at Snap Inc. working in the Creative Vision Group of Sergey Tulyakov.

Email / CV / Google Scholar / Twitter / Github

Research

I am interested in video, 3D, and 4D generation.

	AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers Sherwin Bahmani, Ivan Skorokhodov, Guocheng Qian, Aliaksandr Siarohin, Willi Menapace, Andrea Tagliasacchi, David B. Lindell, Sergey Tulyakov CVPR 2025 arXiv / Project Page / Code
	VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control Sherwin Bahmani, Ivan Skorokhodov, Aliaksandr Siarohin, Willi Menapace, Guocheng Qian, Michael Vasilkovsky, Hsin-Ying Lee, Chaoyang Wang, Jiaxu Zou, Andrea Tagliasacchi, David B. Lindell, Sergey Tulyakov ICLR 2025 arXiv / Project Page / Code
	SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation Koichi Namekata, Sherwin Bahmani, Ziyi Wu, Yash Kant, Igor Gilitschenski, David B. Lindell ICLR 2025 arXiv / Project Page / Code
	TC4D: Trajectory-Conditioned Text-to-4D Generation Sherwin Bahmani, Xian Liu, Wang Yifan, Ivan Skorokhodov, Victor Rong, Ziwei Liu, Xihui Liu, Jeong Joon Park, Sergey Tulyakov, Gordon Wetzstein, Andrea Tagliasacchi, David B. Lindell ECCV 2024* arXiv / Project Page / Code
	4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling Sherwin Bahmani, Ivan Skorokhodov, Victor Rong, Gordon Wetzstein, Leonidas Guibas, Peter Wonka, Sergey Tulyakov, Jeong Joon Park, Andrea Tagliasacchi, David B. Lindell CVPR 2024 arXiv / Project Page / Code
	CC3D: Layout-Conditioned Generation of Compositional 3D Scenes Sherwin Bahmani, Jeong Joon Park, Despoina Paschalidou, Xingguang Yan, Gordon Wetzstein, Leonidas Guibas, Andrea Tagliasacchi ICCV 2023 arXiv / Project Page / Code
	3D-Aware Video Generation Sherwin Bahmani, Jeong Joon Park, Despoina Paschalidou, Hao Tang, Gordon Wetzstein, Leonidas Guibas, Luc Van Gool, Radu Timofte TMLR 2023 arXiv / Project Page / Code
	Semantic Self-adaptation: Enhancing Generalization with a Single Sample Sherwin Bahmani, Oliver Hahn, Eduard Zamfir, Nikita Araslanov, Daniel Cremers, Stefan Roth TMLR 2023* arXiv / Code
	Towards Robust and Adaptive Motion Forecasting: A Causal Representation Perspective Yuejiang Liu, Riccardo Cadei, Jonas Schweizer, Sherwin Bahmani, Alexandre Alahi CVPR 2022 arXiv / Code

Academic Service

Reviewer: CVPR, ICCV, ECCV, SIGGRAPH, SIGGRAPH Asia, NeurIPS, ICLR, ICML, TPAMI, TVCG, Eurographics, Pacific Graphics, IEEE MultiMedia

Source code based on Jon Barron's website.