Mohit Bansal


Joint publications with Jaemin Cho 0001

Publications

2024
[i216]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-02325
David Wan, Jaemin Cho, Elias Stengel-Eskin, Mohit Bansal:
Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training. CoRR abs/2403.02325 (2024)
[i215]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-06952
Jialu Li, Jaemin Cho, Yi-Lin Sung, Jaehong Yoon, Mohit Bansal:
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data. CoRR abs/2403.06952 (2024)
[i213]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-12014
Abhay Zala, Jaemin Cho, Han Lin, Jaehong Yoon, Mohit Bansal:
EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents. CoRR abs/2403.12014 (2024)
2023
[c213]
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/Zala0K0OMB23
Abhay Zala, Jaemin Cho, Satwik Kottur, Xilun Chen, Barlas Oguz, Yashar Mehdad, Mohit Bansal:
Hierarchical Video-Moment Retrieval and Step-Captioning. CVPR 2023: 23056-23065
[c199]
  persistent URL:
  - https://dblp.org/rec/conf/iccv/0001ZB23
Jaemin Cho, Abhay Zala, Mohit Bansal:
DALL-EVAL: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models. ICCV 2023: 3020-3031
[c195]
  persistent URL:
  - https://dblp.org/rec/conf/nips/0001ZB23
Jaemin Cho, Abhay Zala, Mohit Bansal:
Visual Programming for Step-by-Step Text-to-Image Generation and Evaluation. NeurIPS 2023
[c190]
  persistent URL:
  - https://dblp.org/rec/conf/nips/WangBLL0TBJ23
Zhenhailong Wang, Ansel Blume, Sha Li, Genglin Liu, Jaemin Cho, Zineng Tang, Mohit Bansal, Heng Ji:
Paxion: Patching Action Knowledge in Video-Language Foundation Models. NeurIPS 2023
[c187]
  persistent URL:
  - https://dblp.org/rec/conf/nips/Yu0YB23
Shoubin Yu, Jaemin Cho, Prateek Yadav, Mohit Bansal:
Self-Chained Image-Language Model for Video Localization and Question Answering. NeurIPS 2023
[c185]
  persistent URL:
  - https://dblp.org/rec/conf/wacv/TangCLB23
Zineng Tang, Jaemin Cho, Jie Lei, Mohit Bansal:
PERCEIVER-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention. WACV 2023: 4399-4409
[i209]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-16406
Abhay Zala, Jaemin Cho, Satwik Kottur, Xilun Chen, Barlas Oguz, Yashar Mehdad, Mohit Bansal:
Hierarchical Video-Moment Retrieval and Step-Captioning. CoRR abs/2303.16406 (2023)
[i207]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-06671
Jaemin Cho, Linjie Li, Zhengyuan Yang, Zhe Gan, Lijuan Wang, Mohit Bansal:
Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation. CoRR abs/2304.06671 (2023)
[i203]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-06988
Shoubin Yu, Jaemin Cho, Prateek Yadav, Mohit Bansal:
Self-Chained Image-Language Model for Video Localization and Question Answering. CoRR abs/2305.06988 (2023)
[i202]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-10683
Zhenhailong Wang, Ansel Blume, Sha Li, Genglin Liu, Jaemin Cho, Zineng Tang, Mohit Bansal, Heng Ji:
Paxion: Patching Action Knowledge in Video-Language Foundation Models. CoRR abs/2305.10683 (2023)
[i200]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-15328
Jaemin Cho, Abhay Zala, Mohit Bansal:
Visual Programming for Text-to-Image Generation and Evaluation. CoRR abs/2305.15328 (2023)
[i188]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-15091
Han Lin, Abhay Zala, Jaemin Cho, Mohit Bansal:
VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning. CoRR abs/2309.15091 (2023)
[i180]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-12128
Abhay Zala, Han Lin, Jaemin Cho, Mohit Bansal:
DiagrammerGPT: Generating Open-Domain, Open-Platform Diagrams via LLM Planning. CoRR abs/2310.12128 (2023)
[i178]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-18235
Jaemin Cho, Yushi Hu, Roopal Garg, Peter Anderson, Ranjay Krishna, Jason Baldridge, Mohit Bansal, Jordi Pont-Tuset, Su Wang:
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation. CoRR abs/2310.18235 (2023)
2022
[c183]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ReddyRL0W0HBSCS22
Revanth Gangi Reddy, Xilin Rui, Manling Li, Xudong Lin, Haoyang Wen, Jaemin Cho, Lifu Huang, Mohit Bansal, Avirup Sil, Shih-Fu Chang, Alexander G. Schwing, Heng Ji:
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding. AAAI 2022: 11200-11208
[c174]
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/Sung0B22
Yi-Lin Sung, Jaemin Cho, Mohit Bansal:
VL-ADAPTER: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks. CVPR 2022: 5217-5227
[c163]
  persistent URL:
  - https://dblp.org/rec/conf/naacl/00010KDBB22
Jaemin Cho, Seunghyun Yoon, Ajinkya Kale, Franck Dernoncourt, Trung Bui, Mohit Bansal:
Fine-grained Image Captioning with CLIP Reward. NAACL-HLT (Findings) 2022: 517-527
[c150]
  persistent URL:
  - https://dblp.org/rec/conf/nips/Sung0B22
Yi-Lin Sung, Jaemin Cho, Mohit Bansal:
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning. NeurIPS 2022
[c149]
  persistent URL:
  - https://dblp.org/rec/conf/nips/Tang0NB22
Zineng Tang, Jaemin Cho, Yixin Nie, Mohit Bansal:
TVLT: Textless Vision-Language Transformer. NeurIPS 2022
[i169]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-04053
Jaemin Cho, Abhay Zala, Mohit Bansal:
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers. CoRR abs/2202.04053 (2022)
[i154]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-13115
Jaemin Cho, Seunghyun Yoon, Ajinkya Kale, Franck Dernoncourt, Trung Bui, Mohit Bansal:
Fine-grained Image Captioning with CLIP Reward. CoRR abs/2205.13115 (2022)
[i152]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-06522
Yi-Lin Sung, Jaemin Cho, Mohit Bansal:
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning. CoRR abs/2206.06522 (2022)
[i141]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-14156
Zineng Tang, Jaemin Cho, Yixin Nie, Mohit Bansal:
TVLT: Textless Vision-Language Transformer. CoRR abs/2209.14156 (2022)
[i136]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-11701
Zineng Tang, Jaemin Cho, Jie Lei, Mohit Bansal:
Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention. CoRR abs/2211.11701 (2022)
2021
[c120]
  persistent URL:
  - https://dblp.org/rec/conf/icml/ChoLTB21
Jaemin Cho, Jie Lei, Hao Tan, Mohit Bansal:
Unifying Vision-and-Language Tasks via Text Generation. ICML 2021: 1931-1942
[c107]
  persistent URL:
  - https://dblp.org/rec/conf/nips/TangCTB21
Zineng Tang, Jaemin Cho, Hao Tan, Mohit Bansal:
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer. NeurIPS 2021: 24468-24481
[i128]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-02779
Jaemin Cho, Jie Lei, Hao Tan, Mohit Bansal:
Unifying Vision-and-Language Tasks via Text Generation. CoRR abs/2102.02779 (2021)
[i108]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-02681
Zineng Tang, Jaemin Cho, Hao Tan, Mohit Bansal:
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer. CoRR abs/2107.02681 (2021)
[i95]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-06825
Yi-Lin Sung, Jaemin Cho, Mohit Bansal:
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks. CoRR abs/2112.06825 (2021)
[i91]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-10728
Revanth Gangi Reddy, Xilin Rui, Manling Li, Xudong Lin, Haoyang Wen, Jaemin Cho, Lifu Huang, Mohit Bansal, Avirup Sil, Shih-Fu Chang, Alexander G. Schwing, Heng Ji:
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding. CoRR abs/2112.10728 (2021)
