default search action
Ranjay Krishna
Ranjay A. Krishna
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [c56]Cheng-Yu Hsieh, Yung-Sung Chuang, Chun-Liang Li, Zifeng Wang, Long T. Le, Abhishek Kumar, James R. Glass, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister:
Found in the middle: Calibrating Positional Attention Bias Improves Long Context Utilization. ACL (Findings) 2024: 14982-14995 - [c55]Kalyani Marathe, Mahtab Bigverdi, Nishat Khan, Tuhin Kundu, Patrick Howe, Sharan Ranjit S, Anand Bhattad, Aniruddha Kembhavi, Linda G. Shapiro, Ranjay Krishna:
MIMIC: Masked Image Modeling with Image Correspondences. CVPR Workshops 2024: 718-727 - [c54]Yushi Hu, Otilia Stretcu, Chun-Ta Lu, Krishnamurthy Viswanathan, Kenji Hata, Enming Luo, Ranjay Krishna, Ariel Fuxman:
Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models. CVPR 2024: 9590-9601 - [c53]Mehmet Saygin Seyfioglu, Wisdom Oluchi Ikezogwo, Fatemeh Ghezloo, Ranjay Krishna, Linda G. Shapiro:
Quilt-LLaVA: Visual Instruction Tuning by Extracting Localized Narratives from Open-Source Histopathology Videos. CVPR 2024: 13183-13192 - [c52]Chenhao Zheng, Jieyu Zhang, Aniruddha Kembhavi, Ranjay Krishna:
Iterated Learning Improves Compositionality in Large Vision-Language Models. CVPR 2024: 13785-13795 - [c51]Kiana Ehsani, Tanmay Gupta, Rose Hendrix, Jordi Salvador, Luca Weihs, Kuo-Hao Zeng, Kunal Pratap Singh, Yejin Kim, Winson Han, Alvaro Herrasti, Ranjay Krishna, Dustin Schwenk, Eli VanderBilt, Aniruddha Kembhavi:
SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World. CVPR 2024: 16238-16250 - [c50]Yue Yang, Fan-Yun Sun, Luca Weihs, Eli VanderBilt, Alvaro Herrasti, Winson Han, Jiajun Wu, Nick Haber, Ranjay Krishna, Lingjie Liu, Chris Callison-Burch, Mark Yatskar, Aniruddha Kembhavi, Christopher Clark:
Holodeck: Language Guided Generation of 3D Embodied AI Environments. CVPR 2024: 16277-16287 - [c49]Imad Eddine Toubal, Aditya Avinash, Neil Gordon Alldrin, Jan Dlabal, Wenlei Zhou, Enming Luo, Otilia Stretcu, Hao Xiong, Chun-Ta Lu, Howard Zhou, Ranjay Krishna, Ariel Fuxman, Tom Duerig:
Modeling Collaborator: Enabling Subjective Vision Classification with Minimal Human Effort via LLM Tool-Use. CVPR 2024: 17553-17563 - [c48]Zixian Ma, Weikai Huang, Jieyu Zhang, Tanmay Gupta, Ranjay Krishna:
m &m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks. ECCV (10) 2024: 18-34 - [c47]Zuyan Liu, Benlin Liu, Jiahui Wang, Yuhao Dong, Guangyi Chen, Yongming Rao, Ranjay Krishna, Jiwen Lu:
Efficient Inference of Vision Instruction-Following Models with Elastic Cache. ECCV (17) 2024: 54-69 - [c46]Xingyu Fu, Yushi Hu, Bangzheng Li, Yu Feng, Haoyu Wang, Xudong Lin, Dan Roth, Noah A. Smith, Wei-Chiu Ma, Ranjay Krishna:
BLINK: Multimodal Large Language Models Can See but Not Perceive. ECCV (23) 2024: 148-166 - [c45]Roopal Garg, Andrea Burns, Burcu Karagol Ayan, Yonatan Bitton, Ceslee Montgomery, Yasumasa Onoe, Andrew Bunner, Ranjay Krishna, Jason Baldridge, Radu Soricut:
ImageInWords: Unlocking Hyper-Detailed Image Descriptions. EMNLP 2024: 93-127 - [c44]Yung-Sung Chuang, Linlu Qiu, Cheng-Yu Hsieh, Ranjay Krishna, Yoon Kim, James R. Glass:
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps. EMNLP 2024: 1419-1436 - [c43]Abhinav Bandari, Lu Yin, Cheng-Yu Hsieh, Ajay Jaiswal, Tianlong Chen, Li Shen, Ranjay Krishna, Shiwei Liu:
Is C4 Dataset Optimal for Pruning? An Investigation of Calibration Data for LLM Pruning. EMNLP 2024: 18089-18099 - [c42]Jaemin Cho, Yushi Hu, Jason M. Baldridge, Roopal Garg, Peter Anderson, Ranjay Krishna, Mohit Bansal, Jordi Pont-Tuset, Su Wang:
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation. ICLR 2024 - [c41]Ainaz Eftekhar, Kuo-Hao Zeng, Jiafei Duan, Ali Farhadi, Aniruddha Kembhavi, Ranjay Krishna:
Selective Visual Representations Improve Convergence and Generalization for Embodied AI. ICLR 2024 - [c40]Shaokun Zhang, Jieyu Zhang, Jiale Liu, Linxin Song, Chi Wang, Ranjay Krishna, Qingyun Wu:
Offline Training of Language Model Agents with Functions as Learnable Weights. ICML 2024 - [c39]Jun Wang, Chun-Cheng Chang, Jiafei Duan, Dieter Fox, Ranjay Krishna:
EVE: Enabling Anyone to Train Robots using Augmented Reality. UIST 2024: 34:1-34:13 - [c38]Wei Qiao, Tushar Dogra, Otilia Stretcu, Yu-Han Lyu, Tiantian Fang, Dongjin Kwon, Chun-Ta Lu, Enming Luo, Yuan Wang, Chih-Chun Chia, Ariel Fuxman, Fangzhou Wang, Ranjay Krishna, Mehmet Tek:
Scaling Up LLM Reviews for Google Ads Content Moderation. WSDM 2024: 1174-1175 - [i82]Wilbert Pumacay, Ishika Singh, Jiafei Duan, Ranjay Krishna, Jesse Thomason, Dieter Fox:
THE COLOSSEUM: A Benchmark for Evaluating Generalization for Robotic Manipulation. CoRR abs/2402.08191 (2024) - [i81]Shaokun Zhang, Jieyu Zhang, Jiale Liu, Linxin Song, Chi Wang, Ranjay Krishna, Qingyun Wu:
Training Language Model Agents without Modifying Language Models. CoRR abs/2402.11359 (2024) - [i80]Wei Qiao, Tushar Dogra, Otilia Stretcu, Yu-Han Lyu, Tiantian Fang, Dongjin Kwon, Chun-Ta Lu, Enming Luo, Yuan Wang, Chih-Chun Chia, Ariel Fuxman, Fangzhou Wang, Ranjay Krishna, Mehmet Tek:
Scaling Up LLM Reviews for Google Ads Content Moderation. CoRR abs/2402.14590 (2024) - [i79]Imad Eddine Toubal, Aditya Avinash, Neil Gordon Alldrin, Jan Dlabal, Wenlei Zhou, Enming Luo, Otilia Stretcu, Hao Xiong, Chun-Ta Lu, Howard Zhou, Ranjay Krishna, Ariel Fuxman, Tom Duerig:
Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use. CoRR abs/2403.02626 (2024) - [i78]Zixian Ma, Weikai Huang, Jieyu Zhang, Tanmay Gupta, Ranjay Krishna:
m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks. CoRR abs/2403.11085 (2024) - [i77]Xiang Fan, Anand Bhattad, Ranjay Krishna:
Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion. CoRR abs/2403.14617 (2024) - [i76]Chenhao Zheng, Jieyu Zhang, Aniruddha Kembhavi, Ranjay Krishna:
Iterated Learning Improves Compositionality in Large Vision-Language Models. CoRR abs/2404.02145 (2024) - [i75]Jun Wang, Chun-Cheng Chang, Jiafei Duan, Dieter Fox, Ranjay Krishna:
EVE: Enabling Anyone to Train Robot using Augmented Reality. CoRR abs/2404.06089 (2024) - [i74]Xingyu Fu, Yushi Hu, Bangzheng Li, Yu Feng, Haoyu Wang, Xudong Lin, Dan Roth, Noah A. Smith, Wei-Chiu Ma, Ranjay Krishna:
BLINK: Multimodal Large Language Models Can See but Not Perceive. CoRR abs/2404.12390 (2024) - [i73]Ankit Vani, Bac Nguyen, Samuel Lavoie, Ranjay Krishna, Aaron C. Courville:
SPARO: Selective Attention for Robust and Compositional Transformer Encodings for Vision. CoRR abs/2404.15721 (2024) - [i72]Roopal Garg, Andrea Burns, Burcu Karagol Ayan, Yonatan Bitton, Ceslee Montgomery, Yasumasa Onoe, Andrew Bunner, Ranjay Krishna, Jason Baldridge, Radu Soricut:
ImageInWords: Unlocking Hyper-Detailed Image Descriptions. CoRR abs/2405.02793 (2024) - [i71]Thao Nguyen, Matthew Wallingford, Sebastin Santy, Wei-Chiu Ma, Sewoong Oh, Ludwig Schmidt, Pang Wei Koh, Ranjay Krishna:
Multilingual Diversity Improves Vision-Language Representations. CoRR abs/2405.16915 (2024) - [i70]Ethan Shen, Alan Fan, Sarah M. Pratt, Jae Sung Park, Matthew Wallingford, Sham M. Kakade, Ari Holtzman, Ranjay Krishna, Ali Farhadi, Aditya Kusupati:
Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass. CoRR abs/2405.18400 (2024) - [i69]Scott Geng, Cheng-Yu Hsieh, Vivek Ramanujan, Matthew Wallingford, Chun-Liang Li, Pang Wei Koh, Ranjay Krishna:
The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better. CoRR abs/2406.05184 (2024) - [i68]Yushi Hu, Weijia Shi, Xingyu Fu, Dan Roth, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Ranjay Krishna:
Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models. CoRR abs/2406.09403 (2024) - [i67]Wentao Yuan, Jiafei Duan, Valts Blukis, Wilbert Pumacay, Ranjay Krishna, Adithyavairavan Murali, Arsalan Mousavian, Dieter Fox:
RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics. CoRR abs/2406.10721 (2024) - [i66]Jieyu Zhang, Weikai Huang, Zixian Ma, Oscar Michel, Dong He, Tanmay Gupta, Wei-Chiu Ma, Ali Farhadi, Aniruddha Kembhavi, Ranjay Krishna:
Task Me Anything. CoRR abs/2406.11775 (2024) - [i65]Cheng-Yu Hsieh, Yung-Sung Chuang, Chun-Liang Li, Zifeng Wang, Long T. Le, Abhishek Kumar, James R. Glass, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister:
Found in the Middle: Calibrating Positional Attention Bias Improves Long Context Utilization. CoRR abs/2406.16008 (2024) - [i64]Jiafei Duan, Wentao Yuan, Wilbert Pumacay, Yi Ru Wang, Kiana Ehsani, Dieter Fox, Ranjay Krishna:
Manipulate-Anything: Automating Real-World Robots using Vision-Language Models. CoRR abs/2406.18915 (2024) - [i63]Yu-Guan Hsieh, Cheng-Yu Hsieh, Shih-Ying Yeh, Louis Béthune, Hadipour Ansari, Pavan Kumar Anasosalu Vasu, Chun-Liang Li, Ranjay Krishna, Oncel Tuzel, Marco Cuturi:
Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions. CoRR abs/2407.06723 (2024) - [i62]Yung-Sung Chuang, Linlu Qiu, Cheng-Yu Hsieh, Ranjay Krishna, Yoon Kim, James R. Glass:
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps. CoRR abs/2407.07071 (2024) - [i61]Zuyan Liu, Benlin Liu, Jiahui Wang, Yuhao Dong, Guangyi Chen, Yongming Rao, Ranjay Krishna, Jiwen Lu:
Efficient Inference of Vision Instruction-Following Models with Elastic Cache. CoRR abs/2407.18121 (2024) - [i60]Benlin Liu, Yuhao Dong, Yiqin Wang, Yongming Rao, Yansong Tang, Wei-Chiu Ma, Ranjay Krishna:
Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model. CoRR abs/2408.00754 (2024) - [i59]Enhao Zhang, Nicole Sullivan, Brandon Haynes, Ranjay Krishna, Magdalena Balazinska:
Self-Enhancing Video Data Management System for Compositional Events with Large Language Models [Technical Report]. CoRR abs/2408.02243 (2024) - [i58]Matt Deitke, Christopher Clark, Sangho Lee, Rohun Tripathi, Yue Yang, Jae Sung Park, Mohammadreza Salehi, Niklas Muennighoff, Kyle Lo, Luca Soldaini, Jiasen Lu, Taira Anderson, Erin Bransom, Kiana Ehsani, Huong Ngo, Yen-Sung Chen, Ajay Patel, Mark Yatskar, Chris Callison-Burch, Andrew Head, Rose Hendrix, Favyen Bastani, Eli VanderBilt, Nathan Lambert, Yvonne Chou, Arnavi Chheda, Jenna Sparks, Sam Skjonsberg, Michael Schmitz, Aaron Sarnat, Byron Bischoff, Pete Walsh, Chris Newell, Piper Wolters, Tanmay Gupta, Kuo-Hao Zeng, Jon Borchardt, Dirk Groeneveld, Jen Dumas, Crystal Nam, Sophie Lebrecht, Caitlin Wittlif, Carissa Schoenick, Oscar Michel, Ranjay Krishna, Luca Weihs, Noah A. Smith, Hannaneh Hajishirzi, Ross B. Girshick, Ali Farhadi, Aniruddha Kembhavi:
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models. CoRR abs/2409.17146 (2024) - [i57]Jiafei Duan, Wilbert Pumacay, Nishanth Kumar, Yi Ru Wang, Shulin Tian, Wentao Yuan, Ranjay Krishna, Dieter Fox, Ajay Mandlekar, Yijie Guo:
AHA: A Vision-Language-Model for Detecting and Reasoning Over Failures in Robotic Manipulation. CoRR abs/2410.00371 (2024) - [i56]Mohammadreza Salehi, Jae Sung Park, Tanush Yadav, Aditya Kusupati, Ranjay Krishna, Yejin Choi, Hannaneh Hajishirzi, Ali Farhadi:
ActionAtlas: A VideoQA Benchmark for Domain-specialized Action Recognition. CoRR abs/2410.05774 (2024) - [i55]Abhinav Bandari, Lu Yin, Cheng-Yu Hsieh, Ajay Kumar Jaiswal, Tianlong Chen, Li Shen, Ranjay Krishna, Shiwei Liu:
Is C4 Dataset Optimal for Pruning? An Investigation of Calibration Data for LLM Pruning. CoRR abs/2410.07461 (2024) - [i54]Zhengyu Hu, Jieyu Zhang, Zhihan Xiong, Alexander Ratner, Hui Xiong, Ranjay Krishna:
Language Model Preference Evaluation with Multiple Weak Evaluators. CoRR abs/2410.12869 (2024) - [i53]Baiqi Li, Zhiqiu Lin, Wenxuan Peng, Jean de Dieu Nyandwi, Daniel Jiang, Zixian Ma, Simran Khanuja, Ranjay Krishna, Graham Neubig, Deva Ramanan:
NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples. CoRR abs/2410.14669 (2024) - 2023
- [j7]Helena Vasconcelos, Matthew Jörke, Madeleine Grunde-McLaughlin, Tobias Gerstenberg, Michael S. Bernstein, Ranjay Krishna:
Explanations Can Reduce Overreliance on AI Systems During Decision-Making. Proc. ACM Hum. Comput. Interact. 7(CSCW1): 1-38 (2023) - [j6]Song Bai, Philip H. S. Torr, Ranjay Krishna, Li Fei-Fei, Abhinav Gupta, Song-Chun Zhu:
Guest Editorial: Introduction to the Special Section on Graphs in Vision and Pattern Analysis. IEEE Trans. Pattern Anal. Mach. Intell. 45(6): 6867-6869 (2023) - [j5]Enhao Zhang, Maureen Daum, Dong He, Brandon Haynes, Ranjay Krishna, Magdalena Balazinska:
EQUI-VOCAL: Synthesizing Queries for Compositional Video Events from Limited User Interactions. Proc. VLDB Endow. 16(11): 2714-2727 (2023) - [j4]Enhao Zhang, Maureen Daum, Dong He, Manasi Ganti, Brandon Haynes, Ranjay Krishna, Magdalena Balazinska:
EQUI-VOCAL Demonstration: Synthesizing Video Queries from User Interactions. Proc. VLDB Endow. 16(12): 3978-3981 (2023) - [j3]Maureen Daum, Enhao Zhang, Dong He, Stephen Mussmann, Brandon Haynes, Ranjay Krishna, Magdalena Balazinska:
VOCALExplore: Pay-as-You-Go Video Data Exploration and Model Building. Proc. VLDB Endow. 16(13): 4188-4201 (2023) - [c37]Cheng-Yu Hsieh, Chun-Liang Li, Chih-Kuan Yeh, Hootan Nakhost, Yasuhisa Fujii, Alex Ratner, Ranjay Krishna, Chen-Yu Lee, Tomas Pfister:
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes. ACL (Findings) 2023: 8003-8017 - [c36]Jiafei Duan, Yi Ru Wang, Mohit Shridhar, Dieter Fox, Ranjay Krishna:
AR2-D2: Training a Robot Without a Robot. CoRL 2023: 2838-2848 - [c35]Zixian Ma, Jerry Hong, Mustafa Omer Gul, Mona Gandhi, Irena Gao, Ranjay Krishna:
@ CREPE: Can Vision-Language Foundation Models Reason Compositionally? CVPR 2023: 10910-10921 - [c34]Yushi Hu, Benlin Liu, Jungo Kasai, Yizhong Wang, Mari Ostendorf, Ranjay Krishna, Noah A. Smith:
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering. ICCV 2023: 20349-20360 - [c33]Otilia Stretcu, Edward Vendrow, Kenji Hata, Krishnamurthy Viswanathan, Vittorio Ferrari, Sasan Tavakkol, Wenlei Zhou, Aditya Avinash, Enming Luo, Neil Gordon Alldrin, MohammadHossein Bateni, Gabriel Berger, Andrew Bunner, Chun-Ta Lu, Javier A Rey, Giulia DeSalvo, Ranjay Krishna, Ariel Fuxman:
Agile Modeling: From Concept to Classifier in Minutes. ICCV 2023: 22266-22277 - [c32]Samir Yitzhak Gadre, Gabriel Ilharco, Alex Fang, Jonathan Hayase, Georgios Smyrnis, Thao Nguyen, Ryan Marten, Mitchell Wortsman, Dhruba Ghosh, Jieyu Zhang, Eyal Orgad, Rahim Entezari, Giannis Daras, Sarah M. Pratt, Vivek Ramanujan, Yonatan Bitton, Kalyani Marathe, Stephen Mussmann, Richard Vencu, Mehdi Cherti, Ranjay Krishna, Pang Wei Koh, Olga Saukh, Alexander J. Ratner, Shuran Song, Hannaneh Hajishirzi, Ali Farhadi, Romain Beaumont, Sewoong Oh, Alex Dimakis, Jenia Jitsev, Yair Carmon, Vaishaal Shankar, Ludwig Schmidt:
DataComp: In search of the next generation of multimodal datasets. NeurIPS 2023 - [c31]Cheng-Yu Hsieh, Jieyu Zhang, Zixian Ma, Aniruddha Kembhavi, Ranjay Krishna:
SugarCrepe: Fixing Hackable Benchmarks for Vision-Language Compositionality. NeurIPS 2023 - [c30]Wisdom Oluchi Ikezogwo, Mehmet Saygin Seyfioglu, Fatemeh Ghezloo, Dylan Stefan Chan Geva, Fatwir Sheikh Mohammed, Pavan Kumar Anand, Ranjay Krishna, Linda G. Shapiro:
Quilt-1M: One Million Image-Text Pairs for Histopathology. NeurIPS 2023 - [c29]Oscar Michel, Anand Bhattad, Eli VanderBilt, Ranjay Krishna, Aniruddha Kembhavi, Tanmay Gupta:
OBJECT 3DIT: Language-guided 3D-aware Image Editing. NeurIPS 2023 - [c28]Arijit Ray, Filip Radenovic, Abhimanyu Dubey, Bryan A. Plummer, Ranjay Krishna, Kate Saenko:
Cola: A Benchmark for Compositional Text-to-image Retrieval. NeurIPS 2023 - [c27]Yue Yu, Yuchen Zhuang, Jieyu Zhang, Yu Meng, Alexander J. Ratner, Ranjay Krishna, Jiaming Shen, Chao Zhang:
Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias. NeurIPS 2023 - [i52]Enhao Zhang, Maureen Daum, Dong He, Magdalena Balazinska, Brandon Haynes, Ranjay Krishna:
EQUI-VOCAL: Synthesizing Queries for Compositional Video Events from Limited User Interactions [Technical Report]. CoRR abs/2301.00929 (2023) - [i51]Maureen Daum, Enhao Zhang, Dong He, Stephen Mussmann, Brandon Haynes, Ranjay Krishna, Magdalena Balazinska:
VOCALExplore: Pay-as-You-Go Video Data Exploration and Model Building. CoRR abs/2303.04068 (2023) - [i50]Yushi Hu, Benlin Liu, Jungo Kasai, Yizhong Wang, Mari Ostendorf, Ranjay Krishna, Noah A. Smith:
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering. CoRR abs/2303.11897 (2023) - [i49]Samir Yitzhak Gadre, Gabriel Ilharco, Alex Fang, Jonathan Hayase, Georgios Smyrnis, Thao Nguyen, Ryan Marten, Mitchell Wortsman, Dhruba Ghosh, Jieyu Zhang, Eyal Orgad, Rahim Entezari, Giannis Daras, Sarah M. Pratt, Vivek Ramanujan, Yonatan Bitton, Kalyani Marathe, Stephen Mussmann, Richard Vencu, Mehdi Cherti, Ranjay Krishna, Pang Wei Koh, Olga Saukh, Alexander Ratner, Shuran Song, Hannaneh Hajishirzi, Ali Farhadi, Romain Beaumont, Sewoong Oh, Alex Dimakis, Jenia Jitsev, Yair Carmon, Vaishaal Shankar, Ludwig Schmidt:
DataComp: In search of the next generation of multimodal datasets. CoRR abs/2304.14108 (2023) - [i48]Cheng-Yu Hsieh, Chun-Liang Li, Chih-Kuan Yeh, Hootan Nakhost, Yasuhisa Fujii, Alexander Ratner, Ranjay Krishna, Chen-Yu Lee, Tomas Pfister:
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes. CoRR abs/2305.02301 (2023) - [i47]Arijit Ray, Filip Radenovic, Abhimanyu Dubey, Bryan A. Plummer, Ranjay Krishna, Kate Saenko:
COLA: How to adapt vision-language models to Compose Objects Localized with Attributes? CoRR abs/2305.03689 (2023) - [i46]Wisdom Oluchi Ikezogwo, Mehmet Saygin Seyfioglu, Fatemeh Ghezloo, Dylan Stefan Chan Geva, Fatwir Sheikh Mohammed, Pavan Kumar Anand, Ranjay Krishna, Linda G. Shapiro:
Quilt-1M: One Million Image-Text Pairs for Histopathology. CoRR abs/2306.11207 (2023) - [i45]Jiafei Duan, Yi Ru Wang, Mohit Shridhar, Dieter Fox, Ranjay Krishna:
AR2-D2: Training a Robot Without a Robot. CoRR abs/2306.13818 (2023) - [i44]Cheng-Yu Hsieh, Jieyu Zhang, Zixian Ma, Aniruddha Kembhavi, Ranjay Krishna:
SugarCrepe: Fixing Hackable Benchmarks for Vision-Language Compositionality. CoRR abs/2306.14610 (2023) - [i43]Kalyani Marathe, Mahtab Bigverdi, Nishat Khan, Tuhin Kundu, Aniruddha Kembhavi, Linda G. Shapiro, Ranjay Krishna:
MIMIC: Masked Image Modeling with Image Correspondences. CoRR abs/2306.15128 (2023) - [i42]Yue Yu, Yuchen Zhuang, Jieyu Zhang, Yu Meng, Alexander Ratner, Ranjay Krishna, Jiaming Shen, Chao Zhang:
Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias. CoRR abs/2306.15895 (2023) - [i41]Oscar Michel, Anand Bhattad, Eli VanderBilt, Ranjay Krishna, Aniruddha Kembhavi, Tanmay Gupta:
OBJECT 3DIT: Language-guided 3D-aware Image Editing. CoRR abs/2307.11073 (2023) - [i40]Cheng-Yu Hsieh, Si-An Chen, Chun-Liang Li, Yasuhisa Fujii, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister:
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models. CoRR abs/2308.00675 (2023) - [i39]Jieyu Zhang, Ranjay Krishna, Ahmed Hassan Awadallah, Chi Wang:
EcoAssistant: Using LLM Assistant More Affordably and Accurately. CoRR abs/2310.03046 (2023) - [i38]Andre Ye, Sebastin Santy, Jena D. Hwang, Amy X. Zhang, Ranjay Krishna:
Cultural and Linguistic Diversity Improves Visual Representations. CoRR abs/2310.14356 (2023) - [i37]Jaemin Cho, Yushi Hu, Roopal Garg, Peter Anderson, Ranjay Krishna, Jason Baldridge, Mohit Bansal, Jordi Pont-Tuset, Su Wang:
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation. CoRR abs/2310.18235 (2023) - [i36]Ryan Liu, Howard Yen, Raja Marjieh, Thomas L. Griffiths, Ranjay Krishna:
Improving Interpersonal Communication by Simulating Audiences with Language Models. CoRR abs/2311.00687 (2023) - [i35]Ainaz Eftekhar, Kuo-Hao Zeng, Jiafei Duan, Ali Farhadi, Aniruddha Kembhavi, Ranjay Krishna:
Selective Visual Representations Improve Convergence and Generalization for Embodied AI. CoRR abs/2311.04193 (2023) - [i34]Jiao Sun, Deqing Fu, Yushi Hu, Su Wang, Royi Rassin, Da-Cheng Juan, Dana Alon, Charles Herrmann, Sjoerd van Steenkiste, Ranjay Krishna, Cyrus Rashtchian:
DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback. CoRR abs/2311.17946 (2023) - [i33]Dina Bashkirova, Arijit Ray, Rupayan Mallick, Sarah Adel Bargal, Jianming Zhang, Ranjay Krishna, Kate Saenko:
Lasagna: Layered Score Distillation for Disentangled Object Relighting. CoRR abs/2312.00833 (2023) - [i32]Kiana Ehsani, Tanmay Gupta, Rose Hendrix, Jordi Salvador, Luca Weihs, Kuo-Hao Zeng, Kunal Pratap Singh, Yejin Kim, Winson Han, Alvaro Herrasti, Ranjay Krishna, Dustin Schwenk, Eli VanderBilt, Aniruddha Kembhavi:
Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World. CoRR abs/2312.02976 (2023) - [i31]Yushi Hu, Otilia Stretcu, Chun-Ta Lu, Krishnamurthy Viswanathan, Kenji Hata, Enming Luo, Ranjay Krishna, Ariel Fuxman:
Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models. CoRR abs/2312.03052 (2023) - [i30]Mehmet Saygin Seyfioglu, Wisdom Oluchi Ikezogwo, Fatemeh Ghezloo, Ranjay Krishna, Linda G. Shapiro:
Quilt-LLaVA: Visual Instruction Tuning by Extracting Localized Narratives from Open-Source Histopathology Videos. CoRR abs/2312.04746 (2023) - [i29]Yue Yang, Fan-Yun Sun, Luca Weihs, Eli VanderBilt, Alvaro Herrasti, Winson Han, Jiajun Wu, Nick Haber, Ranjay Krishna, Lingjie Liu, Chris Callison-Burch, Mark Yatskar, Aniruddha Kembhavi, Christopher Clark:
Holodeck: Language Guided Generation of 3D Embodied AI Environments. CoRR abs/2312.09067 (2023) - [i28]Madeleine Grunde-McLaughlin, Michelle S. Lam, Ranjay Krishna, Daniel S. Weld, Jeffrey Heer:
Designing LLM Chains by Adapting Techniques from Crowdsourcing Workflows. CoRR abs/2312.11681 (2023) - 2022
- [c26]Maureen Daum, Enhao Zhang, Dong He, Magdalena Balazinska, Brandon Haynes, Ranjay Krishna, Apryle Craig, Aaron Wirsing:
VOCAL: Video Organization and Interactive Compositional AnaLytics. CIDR 2022 - [c25]Mona Gandhi, Mustafa Omer Gul, Eva Prakash, Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala:
Measuring Compositional Consistency for Video Question Answering. CVPR 2022: 5036-5045 - [c24]Zixian Ma, Rose E. Wang, Fei-Fei Li, Michael S. Bernstein, Ranjay Krishna:
ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward. NeurIPS 2022 - [i27]Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala:
AGQA 2.0: An Updated Benchmark for Compositional Spatio-Temporal Reasoning. CoRR abs/2204.06105 (2022) - [i26]Mona Gandhi, Mustafa Omer Gul, Eva Prakash, Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala:
Measuring Compositional Consistency for Video Question Answering. CoRR abs/2204.07190 (2022) - [i25]Zixian Ma, Rose E. Wang, Li Fei-Fei, Michael S. Bernstein, Ranjay Krishna:
ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward. CoRR abs/2210.04365 (2022) - [i24]Helena Vasconcelos, Matthew Jörke, Madeleine Grunde-McLaughlin, Tobias Gerstenberg, Michael S. Bernstein, Ranjay Krishna:
Explanations Can Reduce Overreliance on AI Systems During Decision-Making. CoRR abs/2212.06823 (2022) - [i23]Zixian Ma, Jerry Hong, Mustafa Omer Gul, Mona Gandhi, Irena Gao, Ranjay Krishna:
CREPE: Can Vision-Language Foundation Models Reason Compositionally? CoRR abs/2212.07796 (2022) - 2021
- [b1]Ranjay Krishna:
Visual intelligence through human learning. Stanford University, USA, 2021 - [c23]Siddharth Karamcheti, Ranjay Krishna, Li Fei-Fei, Christopher D. Manning:
Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering. ACL/IJCNLP (1) 2021: 7265-7281 - [c22]Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala:
AGQA: A Benchmark for Compositional Spatio-Temporal Reasoning. CVPR 2021: 11287-11297 - [i22]Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala:
AGQA: A Benchmark for Compositional Spatio-Temporal Reasoning. CoRR abs/2103.16002 (2021) - [i21]Siddharth Karamcheti, Ranjay Krishna, Li Fei-Fei, Christopher D. Manning:
Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering. CoRR abs/2107.02331 (2021) - [i20]Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ B. Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, Erik Brynjolfsson, Shyamal Buch, Dallas Card, Rodrigo Castellon, Niladri S. Chatterji, Annie S. Chen, Kathleen Creel, Jared Quincy Davis, Dorottya Demszky, Chris Donahue, Moussa Doumbouya, Esin Durmus, Stefano Ermon, John Etchemendy, Kawin Ethayarajh, Li Fei-Fei, Chelsea Finn, Trevor Gale, Lauren E. Gillespie, Karan Goel, Noah D. Goodman, Shelby Grossman, Neel Guha, Tatsunori Hashimoto, Peter Henderson, John Hewitt, Daniel E. Ho, Jenny Hong, Kyle Hsu, Jing Huang, Thomas Icard, Saahil Jain, Dan Jurafsky, Pratyusha Kalluri, Siddharth Karamcheti, Geoff Keeling, Fereshte Khani, Omar Khattab, Pang Wei Koh, Mark S. Krass, Ranjay Krishna, Rohith Kuditipudi, et al.:
On the Opportunities and Risks of Foundation Models. CoRR abs/2108.07258 (2021) - [i19]Ranjay Krishna, Mitchell L. Gordon, Li Fei-Fei, Michael S. Bernstein:
Visual Intelligence through Human Interaction. CoRR abs/2111.06913 (2021) - 2020
- [j2]Pranav Khadpe, Ranjay Krishna, Li Fei-Fei, Jeffrey T. Hancock, Michael S. Bernstein:
Conceptual Metaphors Impact Perceptions of Human-AI Collaboration. Proc. ACM Hum. Comput. Interact. 4(CSCW2): 163:1-163:26 (2020) - [c21]Rachel Gardner, Maya Varma, Clare Zhu, Ranjay Krishna:
Determining Question-Answer Plausibility in Crowdsourced Datasets Using Multi-Task Learning. W-NUT@EMNLP 2020: 22-27 - [c20]Jingwei Ji, Ranjay Krishna, Li Fei-Fei, Juan Carlos Niebles:
Action Genome: Actions As Compositions of Spatio-Temporal Scene Graphs. CVPR 2020: 10233-10244 - [i18]