default search action

combined dblp search
author search
venue search
publication search

ask others

Shoubin Yu

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/WangYSYCBB25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/WangYSYCBB25
Ziyang Wang, Shoubin Yu, Elias Stengel-Eskin, Jaehong Yoon, Feng Cheng, Gedas Bertasius, Mohit Bansal:
VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos. CVPR 2025: 3272-3283
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/DengCYYSTMBC25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/DengCYYSTMBC25
Andong Deng, Tongjia Chen, Shoubin Yu, Taojiannan Yang, Lincoln Spencer, Yapeng Tian, Ajmal Saeed Mian, Mohit Bansal, Chen Chen:
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level. CVPR 2025: 8625-8636
[c8]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/WangLHL0Y000B025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/WangLHL0Y000B025
Zun Wang, Jialu Li, Yicong Hong, Songze Li, Kunchang Li, Shoubin Yu, Yi Wang, Yu Qiao, Yali Wang, Mohit Bansal, Limin Wang:
Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel. ICLR 2025
[c7]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/YoonYPYB25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/YoonYPYB25
Jaehong Yoon, Shoubin Yu, Vaidehi Patil, Huaxiu Yao, Mohit Bansal:
SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation. ICLR 2025
[c6]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/YuYB25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/YuYB25
Shoubin Yu, Jaehong Yoon, Mohit Bansal:
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion. ICLR 2025
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icmi/SivakumaranYZYH25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmi/SivakumaranYZYH25
Nithin Sivakumaran, Chia-Yu Yang, Abhay Zala, Shoubin Yu, Daeun Hong, Xiaotian Zou, Elias Stengel-Eskin, Dan Carpenter, Wookhee Min, Cindy E. Hmelo-Silver, Jonathan P. Rowe, James C. Lester, Mohit Bansal:
A Multimodal Classroom Video Question-Answering Framework for Automated Understanding of Collaborative Learning. ICMI 2025: 516-525
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-14350
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-14350
Shoubin Yu, Difan Liu, Ziqiao Ma, Yicong Hong, Yang Zhou, Hao Tan, Joyce Chai, Mohit Bansal:
VEGGIE: Instructional Editing and Reasoning of Video Concepts with Grounded Generation. CoRR abs/2503.14350 (2025)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-08641
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-08641
Jialu Li, Shoubin Yu, Han Lin, Jaemin Cho, Jaehong Yoon, Mohit Bansal:
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization. CoRR abs/2504.08641 (2025)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-06275
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-06275
Emmanouil Zaranis, António Farinhas, Saul José Rodrigues dos Santos, Beatriz Canaverde, Miguel Moura Ramos, Aditya K. Surikuchi, André Viveiros, Baohao Liao, Elena Belén Bueno-Benito, Nithin Sivakumaran, Pavlo Vasylenko, Shoubin Yu, Sonal Sannigrahi, Wafaa Mohammed, Ben Peters, Danae Sánchez Villegas, Elias Stengel-Eskin, Giuseppe Attanasio, Jaehong Yoon, Stella Frank, Alessandro Suglia, Chrysoula Zerva, Desmond Elliott, Mariella Dimiccoli, Mohit Bansal, Oswald Lanz, Raffaella Bernardi, Raquel Fernández, Sandro Pezzelle, Vlad Niculae, André F. T. Martins:
Movie Facts and Fibs (MF²): A Benchmark for Long Movie Understanding. CoRR abs/2506.06275 (2025)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-17113
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-17113
Shoubin Yu, Yue Zhang, Ziyang Wang, Jaehong Yoon, Mohit Bansal:
MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation. CoRR abs/2506.17113 (2025)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-18890
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-18890
Ziqiao Ma, Xuweiyi Chen, Shoubin Yu, Sai Bi, Kai Zhang, Ziwen Chen, Sihan Xu, Jianing Yang, Zexiang Xu, Kalyan Sunkavalli, Mohit Bansal, Joyce Chai, Hao Tan:
4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time. CoRR abs/2506.18890 (2025)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-06485
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-06485
Ziyang Wang, Jaehong Yoon, Shoubin Yu, Md Mohaiminul Islam, Gedas Bertasius, Mohit Bansal:
Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning. CoRR abs/2507.06485 (2025)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-08559
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-08559
Andong Deng, Taojiannan Yang, Shoubin Yu, Lincoln Spencer, Mohit Bansal, Chen Chen, Serena Yeung-Levy, Xiaohan Wang:
SciVideoBench: Benchmarking Scientific Video Reasoning in Large Multimodal Models. CoRR abs/2510.08559 (2025)
2024
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/tcsv/YuZFDSWGLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tcsv/YuZFDSWGLW24
Shoubin Yu, Zhongyin Zhao, Haoshu Fang, Andong Deng, Haisheng Su, Dongliang Wang, Weihao Gan, Cewu Lu, Wei Wu:
Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection. IEEE Trans. Circuits Syst. Video Technol. 34(8): 6661-6673 (2024)
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/0010LIWYBB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/0010LIWYBB24
Ce Zhang, Taixi Lu, Md Mohaiminul Islam, Ziyang Wang, Shoubin Yu, Mohit Bansal, Gedas Bertasius:
A Simple LLM Framework for Long-Range Video Question-Answering. EMNLP 2024: 21715-21737
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YuFZSOPB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YuFZSOPB24
Shoubin Yu, Jacob Zhiyuan Fang, Jian Zheng, Gunnar A. Sigurdsson, Vicente Ordonez, Robinson Piramuthu, Mohit Bansal:
Zero-Shot Controllable Image-to-Video Animation via Motion Decomposition. ACM Multimedia 2024: 3332-3341
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-05889
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-05889
Shoubin Yu, Jaehong Yoon, Mohit Bansal:
CREMA: Multimodal Compositional Video Reasoning via Efficient Modular Adaptation and Fusion. CoRR abs/2402.05889 (2024)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-09711
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-09711
Bo Wu, Shoubin Yu, Zhenfang Chen, Joshua B. Tenenbaum, Chuang Gan:
STAR: A Benchmark for Situated Reasoning in Real-World Videos. CoRR abs/2405.09711 (2024)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-18406
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-18406
Jaehong Yoon, Shoubin Yu, Mohit Bansal:
RACCooN: Remove, Add, and Change Video Content with Auto-Generated Narratives. CoRR abs/2405.18406 (2024)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-19209
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-19209
Ziyang Wang, Shoubin Yu, Elias Stengel-Eskin, Jaehong Yoon, Feng Cheng, Gedas Bertasius, Mohit Bansal:
VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos. CoRR abs/2405.19209 (2024)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-12761
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-12761
Jaehong Yoon, Shoubin Yu, Vaidehi Patil, Huaxiu Yao, Mohit Bansal:
SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation. CoRR abs/2410.12761 (2024)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-09921
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-09921
Andong Deng, Tongjia Chen, Shoubin Yu, Taojiannan Yang, Lincoln Spencer, Yapeng Tian, Ajmal Saeed Mian, Mohit Bansal, Chen Chen:
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level. CoRR abs/2411.09921 (2024)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-08467
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-08467
Zun Wang, Jialu Li, Yicong Hong, Songze Li, Kunchang Li, Shoubin Yu, Yi Wang, Yu Qiao, Yali Wang, Mohit Bansal, Limin Wang:
Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel. CoRR abs/2412.08467 (2024)
2023
[c2]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/Yu0YB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Yu0YB23
Shoubin Yu, Jaemin Cho, Prateek Yadav, Mohit Bansal:
Self-Chained Image-Language Model for Video Localization and Question Answering. NeurIPS 2023
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-06988
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-06988
Shoubin Yu, Jaemin Cho, Prateek Yadav, Mohit Bansal:
Self-Chained Image-Language Model for Video Localization and Question Answering. CoRR abs/2305.06988 (2023)
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-17235
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-17235
Ce Zhang, Taixi Lu, Md Mohaiminul Islam, Ziyang Wang, Shoubin Yu, Mohit Bansal, Gedas Bertasius:
A Simple LLM Framework for Long-Range Video Question-Answering. CoRR abs/2312.17235 (2023)
2021
[c1]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/WuYC0G21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/WuYC0G21
Bo Wu, Shoubin Yu, Zhenfang Chen, Josh Tenenbaum, Chuang Gan:
STAR: A Benchmark for Situated Reasoning in Real-World Videos. NeurIPS Datasets and Benchmarks 2021
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-03649
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-03649
Shoubin Yu, Zhongyin Zhao, Haoshu Fang, Andong Deng, Haisheng Su, Dongliang Wang, Weihao Gan, Cewu Lu, Wei Wu:
Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection. CoRR abs/2112.03649 (2021)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.