Остановите войну!
for scientists:
default search action
Mohit Bansal
- > Home > Persons > Mohit Bansal
Publications
- 2024
- [i216]David Wan, Jaemin Cho, Elias Stengel-Eskin, Mohit Bansal:
Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training. CoRR abs/2403.02325 (2024) - [i215]Jialu Li, Jaemin Cho, Yi-Lin Sung, Jaehong Yoon, Mohit Bansal:
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data. CoRR abs/2403.06952 (2024) - [i213]Abhay Zala, Jaemin Cho, Han Lin, Jaehong Yoon, Mohit Bansal:
EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents. CoRR abs/2403.12014 (2024) - 2023
- [c213]Abhay Zala, Jaemin Cho, Satwik Kottur, Xilun Chen, Barlas Oguz, Yashar Mehdad, Mohit Bansal:
Hierarchical Video-Moment Retrieval and Step-Captioning. CVPR 2023: 23056-23065 - [c199]Jaemin Cho, Abhay Zala, Mohit Bansal:
DALL-EVAL: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models. ICCV 2023: 3020-3031 - [c195]Jaemin Cho, Abhay Zala, Mohit Bansal:
Visual Programming for Step-by-Step Text-to-Image Generation and Evaluation. NeurIPS 2023 - [c190]Zhenhailong Wang, Ansel Blume, Sha Li, Genglin Liu, Jaemin Cho, Zineng Tang, Mohit Bansal, Heng Ji:
Paxion: Patching Action Knowledge in Video-Language Foundation Models. NeurIPS 2023 - [c187]Shoubin Yu, Jaemin Cho, Prateek Yadav, Mohit Bansal:
Self-Chained Image-Language Model for Video Localization and Question Answering. NeurIPS 2023 - [c185]Zineng Tang, Jaemin Cho, Jie Lei, Mohit Bansal:
PERCEIVER-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention. WACV 2023: 4399-4409 - [i209]Abhay Zala, Jaemin Cho, Satwik Kottur, Xilun Chen, Barlas Oguz, Yashar Mehdad, Mohit Bansal:
Hierarchical Video-Moment Retrieval and Step-Captioning. CoRR abs/2303.16406 (2023) - [i207]Jaemin Cho, Linjie Li, Zhengyuan Yang, Zhe Gan, Lijuan Wang, Mohit Bansal:
Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation. CoRR abs/2304.06671 (2023) - [i203]Shoubin Yu, Jaemin Cho, Prateek Yadav, Mohit Bansal:
Self-Chained Image-Language Model for Video Localization and Question Answering. CoRR abs/2305.06988 (2023) - [i202]Zhenhailong Wang, Ansel Blume, Sha Li, Genglin Liu, Jaemin Cho, Zineng Tang, Mohit Bansal, Heng Ji:
Paxion: Patching Action Knowledge in Video-Language Foundation Models. CoRR abs/2305.10683 (2023) - [i200]Jaemin Cho, Abhay Zala, Mohit Bansal:
Visual Programming for Text-to-Image Generation and Evaluation. CoRR abs/2305.15328 (2023) - [i188]Han Lin, Abhay Zala, Jaemin Cho, Mohit Bansal:
VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning. CoRR abs/2309.15091 (2023) - [i180]Abhay Zala, Han Lin, Jaemin Cho, Mohit Bansal:
DiagrammerGPT: Generating Open-Domain, Open-Platform Diagrams via LLM Planning. CoRR abs/2310.12128 (2023) - [i178]Jaemin Cho, Yushi Hu, Roopal Garg, Peter Anderson, Ranjay Krishna, Jason Baldridge, Mohit Bansal, Jordi Pont-Tuset, Su Wang:
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation. CoRR abs/2310.18235 (2023) - 2022
- [c183]Revanth Gangi Reddy, Xilin Rui, Manling Li, Xudong Lin, Haoyang Wen, Jaemin Cho, Lifu Huang, Mohit Bansal, Avirup Sil, Shih-Fu Chang, Alexander G. Schwing, Heng Ji:
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding. AAAI 2022: 11200-11208 - [c174]Yi-Lin Sung, Jaemin Cho, Mohit Bansal:
VL-ADAPTER: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks. CVPR 2022: 5217-5227 - [c163]Jaemin Cho, Seunghyun Yoon, Ajinkya Kale, Franck Dernoncourt, Trung Bui, Mohit Bansal:
Fine-grained Image Captioning with CLIP Reward. NAACL-HLT (Findings) 2022: 517-527 - [c150]Yi-Lin Sung, Jaemin Cho, Mohit Bansal:
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning. NeurIPS 2022 - [c149]Zineng Tang, Jaemin Cho, Yixin Nie, Mohit Bansal:
TVLT: Textless Vision-Language Transformer. NeurIPS 2022 - [i169]Jaemin Cho, Abhay Zala, Mohit Bansal:
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers. CoRR abs/2202.04053 (2022) - [i154]Jaemin Cho, Seunghyun Yoon, Ajinkya Kale, Franck Dernoncourt, Trung Bui, Mohit Bansal:
Fine-grained Image Captioning with CLIP Reward. CoRR abs/2205.13115 (2022) - [i152]Yi-Lin Sung, Jaemin Cho, Mohit Bansal:
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning. CoRR abs/2206.06522 (2022) - [i141]Zineng Tang, Jaemin Cho, Yixin Nie, Mohit Bansal:
TVLT: Textless Vision-Language Transformer. CoRR abs/2209.14156 (2022) - [i136]Zineng Tang, Jaemin Cho, Jie Lei, Mohit Bansal:
Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention. CoRR abs/2211.11701 (2022) - 2021
- [c120]Jaemin Cho, Jie Lei, Hao Tan, Mohit Bansal:
Unifying Vision-and-Language Tasks via Text Generation. ICML 2021: 1931-1942 - [c107]Zineng Tang, Jaemin Cho, Hao Tan, Mohit Bansal:
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer. NeurIPS 2021: 24468-24481 - [i128]Jaemin Cho, Jie Lei, Hao Tan, Mohit Bansal:
Unifying Vision-and-Language Tasks via Text Generation. CoRR abs/2102.02779 (2021) - [i108]Zineng Tang, Jaemin Cho, Hao Tan, Mohit Bansal:
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer. CoRR abs/2107.02681 (2021) - [i95]Yi-Lin Sung, Jaemin Cho, Mohit Bansal:
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks. CoRR abs/2112.06825 (2021) - [i91]Revanth Gangi Reddy, Xilin Rui, Manling Li, Xudong Lin, Haoyang Wen, Jaemin Cho, Lifu Huang, Mohit Bansal, Avirup Sil, Shih-Fu Chang, Alexander G. Schwing, Heng Ji:
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding. CoRR abs/2112.10728 (2021)
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-04-18 20:34 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint