Mitchell Wortsman

2020 – today

2025
[c24]
persistent URL: https://dblp.org/rec/conf/iclr/GadreSSGWSMFLKX25
Samir Yitzhak Gadre, Georgios Smyrnis, Vaishaal Shankar, Suchin Gururangan, Mitchell Wortsman, Rulin Shao, Jean Mercat, Alex Fang, Jeffrey Li, Sedrick Keh, Rui Xin, Marianna Nezhurina, Igor Vasiljevic, Luca Soldaini, Jenia Jitsev, Alex Dimakis, Gabriel Ilharco, Pang Wei Koh, Shuran Song, Thomas Kollar, et al.:
Language models scale reliably with over-training and on downstream tasks. ICLR 2025
2024
[b1]
persistent URL: https://dblp.org/rec/phd/us/Wortsman24
Mitchell Wortsman:
Robust and reliable large-scale transfer learning. University of Washington, USA, 2024
[j2]
persistent URL: https://dblp.org/rec/journals/npjdm/ChanNWDSGM24
Justin Chan, Solomon Nsumba, Mitchell Wortsman, Achal Dave, Ludwig Schmidt, Shyamnath Gollakota, Kelly E. Michaelsen:
Detecting clinical medication errors with AI enabled wearable cameras. npj Digit. Medicine 7(1) (2024)
[c23]
persistent URL: https://dblp.org/rec/conf/acl/GroeneveldBWBKT24
Dirk Groeneveld, Iz Beltagy, Evan Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Raghavi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, Will Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi:
OLMo: Accelerating the Science of Language Models. ACL (1) 2024: 15789-15809
[c22]
persistent URL: https://dblp.org/rec/conf/iclr/WortsmanLXEAACG24
Mitchell Wortsman, Peter J. Liu, Lechao Xiao, Katie E. Everett, Alexander A. Alemi, Ben Adlam, John D. Co-Reyes, Izzeddin Gur, Abhishek Kumar, Roman Novak, Jeffrey Pennington, Jascha Sohl-Dickstein, Kelvin Xu, Jaehoon Lee, Justin Gilmer, Simon Kornblith:
Small-scale proxies for large-scale Transformer training instabilities. ICLR 2024
[c21]
persistent URL: https://dblp.org/rec/conf/icml/EverettXWANLGSK24
Katie E. Everett, Lechao Xiao, Mitchell Wortsman, Alexander A. Alemi, Roman Novak, Peter J. Liu, Izzeddin Gur, Jascha Sohl-Dickstein, Leslie Pack Kaelbling, Jaehoon Lee, Jeffrey Pennington:
Scaling Exponents Across Parameterizations and Optimizers. ICML 2024
[c20]
persistent URL: https://dblp.org/rec/conf/nips/LiFSIJGBGKAGXMH24
Jeffrey Li, Alex Fang, Georgios Smyrnis, Maor Ivgi, Matt Jordan, Samir Yitzhak Gadre, Hritik Bansal, Etash Guha, Sedrick Scott Keh, Kushal Arora, Saurabh Garg, Rui Xin, Niklas Muennighoff, Reinhard Heckel, Jean Mercat, Mayee F. Chen, Suchin Gururangan, Mitchell Wortsman, Alon Albalak, Yonatan Bitton, Marianna Nezhurina, Amro Abbas, Cheng-Yu Hsieh, Dhruba Ghosh, Josh Gardner, Maciej Kilian, Hanlin Zhang, Rulin Shao, Sarah M. Pratt, Sunny Sanyal, Gabriel Ilharco, Giannis Daras, Kalyani Marathe, Aaron Gokaslan, Jieyu Zhang, Khyathi Raghavi Chandu, Thao Nguyen, Igor Vasiljevic, Sham M. Kakade, Shuran Song, Sujay Sanghavi, Fartash Faghri, Sewoong Oh, Luke Zettlemoyer, Kyle Lo, Alaaeldin El-Nouby, Hadi Pouransari, Alexander Toshev, Stephanie Wang, Dirk Groeneveld, Luca Soldaini, Pang Wei Koh, Jenia Jitsev, Thomas Kollar, Alex Dimakis, Yair Carmon, Achal Dave, Ludwig Schmidt, Vaishaal Shankar:
DataComp-LM: In search of the next generation of training sets for language models. NeurIPS 2024
[c19]
persistent URL: https://dblp.org/rec/conf/nips/PorianWJSC24
Tomer Porian, Mitchell Wortsman, Jenia Jitsev, Ludwig Schmidt, Yair Carmon:
Resolving Discrepancies in Compute-Optimal Scaling of Language Models. NeurIPS 2024
[i29]
persistent URL: https://dblp.org/rec/journals/corr/abs-2402-00838
Dirk Groeneveld, Iz Beltagy, Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Raghavi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, Will Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi:
OLMo: Accelerating the Science of Language Models. CoRR abs/2402.00838 (2024)
[i28]
persistent URL: https://dblp.org/rec/journals/corr/abs-2403-08540
Samir Yitzhak Gadre, Georgios Smyrnis, Vaishaal Shankar, Suchin Gururangan, Mitchell Wortsman, Rulin Shao, Jean Mercat, Alex Fang, Jeffrey Li, Sedrick Keh, Rui Xin, Marianna Nezhurina, Igor Vasiljevic, Jenia Jitsev, Alexandros G. Dimakis, Gabriel Ilharco, Shuran Song, Thomas Kollar, Yair Carmon, Achal Dave, Reinhard Heckel, Niklas Muennighoff, Ludwig Schmidt:
Language models scale reliably with over-training and on downstream tasks. CoRR abs/2403.08540 (2024)
[i27]
persistent URL: https://dblp.org/rec/journals/corr/abs-2406-11794
Jeffrey Li, Alex Fang, Georgios Smyrnis, Maor Ivgi, Matt Jordan, Samir Yitzhak Gadre, Hritik Bansal, Etash Kumar Guha, Sedrick Keh, Kushal Arora, Saurabh Garg, Rui Xin, Niklas Muennighoff, Reinhard Heckel, Jean Mercat, Mayee F. Chen, Suchin Gururangan, Mitchell Wortsman, Alon Albalak, Yonatan Bitton, Marianna Nezhurina, Amro Abbas, Cheng-Yu Hsieh, Dhruba Ghosh, Josh Gardner, Maciej Kilian, Hanlin Zhang, Rulin Shao, Sarah M. Pratt, Sunny Sanyal, Gabriel Ilharco, Giannis Daras, Kalyani Marathe, Aaron Gokaslan, Jieyu Zhang, Khyathi Raghavi Chandu, Thao Nguyen, Igor Vasiljevic, Sham M. Kakade, Shuran Song, Sujay Sanghavi, Fartash Faghri, Sewoong Oh, Luke Zettlemoyer, Kyle Lo, Alaaeldin El-Nouby, Hadi Pouransari, Alexander Toshev, Stephanie Wang, Dirk Groeneveld, Luca Soldaini, Pang Wei Koh, Jenia Jitsev, Thomas Kollar, Alexandros G. Dimakis, Yair Carmon, Achal Dave, Ludwig Schmidt, Vaishaal Shankar:
DataComp-LM: In search of the next generation of training sets for language models. CoRR abs/2406.11794 (2024)
[i26]
persistent URL: https://dblp.org/rec/journals/corr/abs-2406-19146
Tomer Porian, Mitchell Wortsman, Jenia Jitsev, Ludwig Schmidt, Yair Carmon:
Resolving Discrepancies in Compute-Optimal Scaling of Language Models. CoRR abs/2406.19146 (2024)
[i25]
persistent URL: https://dblp.org/rec/journals/corr/abs-2407-05872
Katie Everett, Lechao Xiao, Mitchell Wortsman, Alexander A. Alemi, Roman Novak, Peter J. Liu, Izzeddin Gur, Jascha Sohl-Dickstein, Leslie Pack Kaelbling, Jaehoon Lee, Jeffrey Pennington:
Scaling Exponents Across Parameterizations and Optimizers. CoRR abs/2407.05872 (2024)
2023
[j1]
persistent URL: https://dblp.org/rec/journals/tmlr/WortsmanGLFSRM23
Mitchell Wortsman, Suchin Gururangan, Shen Li, Ali Farhadi, Ludwig Schmidt, Michael G. Rabbat, Ari S. Morcos:
lo-fi: distributed fine-tuning without communication. Trans. Mach. Learn. Res. 2023 (2023)
[c18]
persistent URL: https://dblp.org/rec/conf/cvpr/ChertiBWWIGSSJ23
Mehdi Cherti, Romain Beaumont, Ross Wightman, Mitchell Wortsman, Gabriel Ilharco, Cade Gordon, Christoph Schuhmann, Ludwig Schmidt, Jenia Jitsev:
Reproducible Scaling Laws for Contrastive Language-Image Learning. CVPR 2023: 2818-2829
[c17]
persistent URL: https://dblp.org/rec/conf/cvpr/GadreWISS23
Samir Yitzhak Gadre, Mitchell Wortsman, Gabriel Ilharco, Ludwig Schmidt, Shuran Song:
CoWs on Pasture: Baselines and Benchmarks for Language-Driven Zero-Shot Object Navigation. CVPR 2023: 23171-23181
[c16]
persistent URL: https://dblp.org/rec/conf/iclr/IlharcoRWSHF23
Gabriel Ilharco, Marco Túlio Ribeiro, Mitchell Wortsman, Ludwig Schmidt, Hannaneh Hajishirzi, Ali Farhadi:
Editing models with task arithmetic. ICLR 2023
[c15]
persistent URL: https://dblp.org/rec/conf/nips/GadreIFHSNMWGZO23
Samir Yitzhak Gadre, Gabriel Ilharco, Alex Fang, Jonathan Hayase, Georgios Smyrnis, Thao Nguyen, Ryan Marten, Mitchell Wortsman, Dhruba Ghosh, Jieyu Zhang, Eyal Orgad, Rahim Entezari, Giannis Daras, Sarah M. Pratt, Vivek Ramanujan, Yonatan Bitton, Kalyani Marathe, Stephen Mussmann, Richard Vencu, Mehdi Cherti, Ranjay Krishna, Pang Wei Koh, Olga Saukh, Alexander J. Ratner, Shuran Song, Hannaneh Hajishirzi, Ali Farhadi, Romain Beaumont, Sewoong Oh, Alex Dimakis, Jenia Jitsev, Yair Carmon, Vaishaal Shankar, Ludwig Schmidt:
DataComp: In search of the next generation of multimodal datasets. NeurIPS 2023
[c14]
persistent URL: https://dblp.org/rec/conf/nips/WortsmanDZMFS23
Mitchell Wortsman, Tim Dettmers, Luke Zettlemoyer, Ari Morcos, Ali Farhadi, Ludwig Schmidt:
Stable and low-precision training for large-scale vision-language models. NeurIPS 2023
[i24]
persistent URL: https://dblp.org/rec/journals/corr/abs-2302-13602
Rahim Entezari, Mitchell Wortsman, Olga Saukh, Moein Shariatnia, Hanie Sedghi, Ludwig Schmidt:
The Role of Pre-training Data in Transfer Learning. CoRR abs/2302.13602 (2023)
[i23]
persistent URL: https://dblp.org/rec/journals/corr/abs-2304-13013
Mitchell Wortsman, Tim Dettmers, Luke Zettlemoyer, Ari Morcos, Ali Farhadi, Ludwig Schmidt:
Stable and low-precision training for large-scale vision-language models. CoRR abs/2304.13013 (2023)
[i22]
persistent URL: https://dblp.org/rec/journals/corr/abs-2304-14108
Samir Yitzhak Gadre, Gabriel Ilharco, Alex Fang, Jonathan Hayase, Georgios Smyrnis, Thao Nguyen, Ryan Marten, Mitchell Wortsman, Dhruba Ghosh, Jieyu Zhang, Eyal Orgad, Rahim Entezari, Giannis Daras, Sarah M. Pratt, Vivek Ramanujan, Yonatan Bitton, Kalyani Marathe, Stephen Mussmann, Richard Vencu, Mehdi Cherti, Ranjay Krishna, Pang Wei Koh, Olga Saukh, Alexander Ratner, Shuran Song, Hannaneh Hajishirzi, Ali Farhadi, Romain Beaumont, Sewoong Oh, Alex Dimakis, Jenia Jitsev, Yair Carmon, Vaishaal Shankar, Ludwig Schmidt:
DataComp: In search of the next generation of multimodal datasets. CoRR abs/2304.14108 (2023)
[i21]
persistent URL: https://dblp.org/rec/journals/corr/abs-2308-01390
Anas Awadalla, Irena Gao, Josh Gardner, Jack Hessel, Yusuf Hanafy, Wanrong Zhu, Kalyani Marathe, Yonatan Bitton, Samir Yitzhak Gadre, Shiori Sagawa, Jenia Jitsev, Simon Kornblith, Pang Wei Koh, Gabriel Ilharco, Mitchell Wortsman, Ludwig Schmidt:
OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models. CoRR abs/2308.01390 (2023)
[i20]
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-08586
Mitchell Wortsman, Jaehoon Lee, Justin Gilmer, Simon Kornblith:
Replacing softmax with ReLU in Vision Transformers. CoRR abs/2309.08586 (2023)
[i19]
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-14322
Mitchell Wortsman, Peter J. Liu, Lechao Xiao, Katie Everett, Alex Alemi, Ben Adlam, John D. Co-Reyes, Izzeddin Gur, Abhishek Kumar, Roman Novak, Jeffrey Pennington, Jascha Sohl-Dickstein, Kelvin Xu, Jaehoon Lee, Justin Gilmer, Simon Kornblith:
Small-scale proxies for large-scale Transformer training instabilities. CoRR abs/2309.14322 (2023)
2022
[c13]
persistent URL: https://dblp.org/rec/conf/cvpr/WortsmanIKLKRLH22
Mitchell Wortsman, Gabriel Ilharco, Jong Wook Kim, Mike Li, Simon Kornblith, Rebecca Roelofs, Raphael Gontijo Lopes, Hannaneh Hajishirzi, Ali Farhadi, Hongseok Namkoong, Ludwig Schmidt:
Robust fine-tuning of zero-shot models. CVPR 2022: 7949-7961
[c12]
persistent URL: https://dblp.org/rec/conf/emnlp/AwadallaWIMMHS22
Anas Awadalla, Mitchell Wortsman, Gabriel Ilharco, Sewon Min, Ian Magnusson, Hannaneh Hajishirzi, Ludwig Schmidt:
Exploring The Landscape of Distributional Robustness for Question Answering Models. EMNLP (Findings) 2022: 5971-5987
[c11]
persistent URL: https://dblp.org/rec/conf/icml/FangIWWSDS22
Alex Fang, Gabriel Ilharco, Mitchell Wortsman, Yuhao Wan, Vaishaal Shankar, Achal Dave, Ludwig Schmidt:
Data Determines Distributional Robustness in Contrastive Language Image Pre-training (CLIP). ICML 2022: 6216-6234
[c10]
persistent URL: https://dblp.org/rec/conf/icml/WortsmanIGRLMNF22
Mitchell Wortsman, Gabriel Ilharco, Samir Yitzhak Gadre, Rebecca Roelofs, Raphael Gontijo Lopes, Ari S. Morcos, Hongseok Namkoong, Ali Farhadi, Yair Carmon, Simon Kornblith, Ludwig Schmidt:
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time. ICML 2022: 23965-23998
[c9]
persistent URL: https://dblp.org/rec/conf/nips/IlharcoWGSHKFS22
Gabriel Ilharco, Mitchell Wortsman, Samir Yitzhak Gadre, Shuran Song, Hannaneh Hajishirzi, Simon Kornblith, Ali Farhadi, Ludwig Schmidt:
Patching open-vocabulary models by interpolating weights. NeurIPS 2022
[c8]
persistent URL: https://dblp.org/rec/conf/nips/NguyenIWOS22
Thao Nguyen, Gabriel Ilharco, Mitchell Wortsman, Sewoong Oh, Ludwig Schmidt:
Quality Not Quantity: On the Interaction between Dataset Design and Robustness of CLIP. NeurIPS 2022
[c7]
persistent URL: https://dblp.org/rec/conf/nips/SchuhmannBVGWCC22
Christoph Schuhmann, Romain Beaumont, Richard Vencu, Cade Gordon, Ross Wightman, Mehdi Cherti, Theo Coombes, Aarush Katta, Clayton Mullis, Mitchell Wortsman, Patrick Schramowski, Srivatsa Kundurthy, Katherine Crowson, Ludwig Schmidt, Robert Kaczmarczyk, Jenia Jitsev:
LAION-5B: An open large-scale dataset for training next generation image-text models. NeurIPS 2022
[i18]
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-05482
Mitchell Wortsman, Gabriel Ilharco, Samir Yitzhak Gadre, Rebecca Roelofs, Raphael Gontijo Lopes, Ari S. Morcos, Hongseok Namkoong, Ali Farhadi, Yair Carmon, Simon Kornblith, Ludwig Schmidt:
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time. CoRR abs/2203.05482 (2022)
[i17]
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-10421
Samir Yitzhak Gadre, Mitchell Wortsman, Gabriel Ilharco, Ludwig Schmidt, Shuran Song:
CLIP on Wheels: Zero-Shot Object Navigation as Object Localization and Exploration. CoRR abs/2203.10421 (2022)
[i16]
persistent URL: https://dblp.org/rec/journals/corr/abs-2205-01397
Alex Fang, Gabriel Ilharco, Mitchell Wortsman, Yuhao Wan, Vaishaal Shankar, Achal Dave, Ludwig Schmidt:
Data Determines Distributional Robustness in Contrastive Language Image Pre-training (CLIP). CoRR abs/2205.01397 (2022)
[i15]
persistent URL: https://dblp.org/rec/journals/corr/abs-2208-05516
Thao Nguyen, Gabriel Ilharco, Mitchell Wortsman, Sewoong Oh, Ludwig Schmidt:
Quality Not Quantity: On the Interaction between Dataset Design and Robustness of CLIP. CoRR abs/2208.05516 (2022)
[i14]
persistent URL: https://dblp.org/rec/journals/corr/abs-2208-05592
Gabriel Ilharco, Mitchell Wortsman, Samir Yitzhak Gadre, Shuran Song, Hannaneh Hajishirzi, Simon Kornblith, Ali Farhadi, Ludwig Schmidt:
Patching open-vocabulary models by interpolating weights. CoRR abs/2208.05592 (2022)
[i13]
persistent URL: https://dblp.org/rec/journals/corr/abs-2210-08402
Christoph Schuhmann, Romain Beaumont, Richard Vencu, Cade Gordon, Ross Wightman, Mehdi Cherti, Theo Coombes, Aarush Katta, Clayton Mullis, Mitchell Wortsman, Patrick Schramowski, Srivatsa Kundurthy, Katherine Crowson, Ludwig Schmidt, Robert Kaczmarczyk, Jenia Jitsev:
LAION-5B: An open large-scale dataset for training next generation image-text models. CoRR abs/2210.08402 (2022)
[i12]
persistent URL: https://dblp.org/rec/journals/corr/abs-2210-11948
Mitchell Wortsman, Suchin Gururangan, Shen Li, Ali Farhadi, Ludwig Schmidt, Michael G. Rabbat, Ari S. Morcos:
lo-fi: distributed fine-tuning without communication. CoRR abs/2210.11948 (2022)
[i11]
persistent URL: https://dblp.org/rec/journals/corr/abs-2210-12517
Anas Awadalla, Mitchell Wortsman, Gabriel Ilharco, Sewon Min, Ian Magnusson, Hannaneh Hajishirzi, Ludwig Schmidt:
Exploring The Landscape of Distributional Robustness for Question Answering Models. CoRR abs/2210.12517 (2022)
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-2212-04089
Gabriel Ilharco, Marco Túlio Ribeiro, Mitchell Wortsman, Suchin Gururangan, Ludwig Schmidt, Hannaneh Hajishirzi, Ali Farhadi:
Editing Models with Task Arithmetic. CoRR abs/2212.04089 (2022)
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-2212-07143
Mehdi Cherti, Romain Beaumont, Ross Wightman, Mitchell Wortsman, Gabriel Ilharco, Cade Gordon, Christoph Schuhmann, Ludwig Schmidt, Jenia Jitsev:
Reproducible scaling laws for contrastive language-image learning. CoRR abs/2212.07143 (2022)
2021
[c6]
persistent URL: https://dblp.org/rec/conf/icml/WortsmanHGFR21
Mitchell Wortsman, Maxwell Horton, Carlos Guestrin, Ali Farhadi, Mohammad Rastegari:
Learning Neural Network Subspaces. ICML 2021: 11217-11227
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-2102-10472
Mitchell Wortsman, Maxwell Horton, Carlos Guestrin, Ali Farhadi, Mohammad Rastegari:
Learning Neural Network Subspaces. CoRR abs/2102.10472 (2021)
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-2109-01903
Mitchell Wortsman, Gabriel Ilharco, Mike Li, Jong Wook Kim, Hannaneh Hajishirzi, Ali Farhadi, Hongseok Namkoong, Ludwig Schmidt:
Robust fine-tuning of zero-shot models. CoRR abs/2109.01903 (2021)
2020
[c5]
persistent URL: https://dblp.org/rec/conf/cvpr/RamanujanWKFR20
Vivek Ramanujan, Mitchell Wortsman, Aniruddha Kembhavi, Ali Farhadi, Mohammad Rastegari:
What's Hidden in a Randomly Weighted Neural Network? CVPR 2020: 11890-11899
[c4]
persistent URL: https://dblp.org/rec/conf/icml/KusupatiRSW0KF20
Aditya Kusupati, Vivek Ramanujan, Raghav Somani, Mitchell Wortsman, Prateek Jain, Sham M. Kakade, Ali Farhadi:
Soft Threshold Weight Reparameterization for Learnable Sparsity. ICML 2020: 5544-5555
[c3]
persistent URL: https://dblp.org/rec/conf/nips/WortsmanRLKRYF20
Mitchell Wortsman, Vivek Ramanujan, Rosanne Liu, Aniruddha Kembhavi, Mohammad Rastegari, Jason Yosinski, Ali Farhadi:
Supermasks in Superposition. NeurIPS 2020
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-2002-03231
Aditya Kusupati, Vivek Ramanujan, Raghav Somani, Mitchell Wortsman, Prateek Jain, Sham M. Kakade, Ali Farhadi:
Soft Threshold Weight Reparameterization for Learnable Sparsity. CoRR abs/2002.03231 (2020)
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-2006-14769
Mitchell Wortsman, Vivek Ramanujan, Rosanne Liu, Aniruddha Kembhavi, Mohammad Rastegari, Jason Yosinski, Ali Farhadi:
Supermasks in Superposition. CoRR abs/2006.14769 (2020)
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-2012-00172
Maxwell Van Gelder, Mitchell Wortsman, Kiana Ehsani:
Deconstructing the Structure of Sparse Neural Networks. CoRR abs/2012.00172 (2020)

2010 – 2019

2019
[c2]
persistent URL: https://dblp.org/rec/conf/cvpr/WortsmanERFM19
Mitchell Wortsman, Kiana Ehsani, Mohammad Rastegari, Ali Farhadi, Roozbeh Mottaghi:
Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning. CVPR 2019: 6750-6759
[c1]
persistent URL: https://dblp.org/rec/conf/nips/WortsmanFR19
Mitchell Wortsman, Ali Farhadi, Mohammad Rastegari:
Discovering Neural Wirings. NeurIPS 2019: 2680-2690
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-1906-00586
Mitchell Wortsman, Ali Farhadi, Mohammad Rastegari:
Discovering Neural Wirings. CoRR abs/1906.00586 (2019)
[i2]
persistent URL: https://dblp.org/rec/journals/corr/abs-1911-13299
Vivek Ramanujan, Mitchell Wortsman, Aniruddha Kembhavi, Ali Farhadi, Mohammad Rastegari:
What's Hidden in a Randomly Weighted Neural Network? CoRR abs/1911.13299 (2019)
2018
[i1]
persistent URL: https://dblp.org/rec/journals/corr/abs-1812-00971
Mitchell Wortsman, Kiana Ehsani, Mohammad Rastegari, Ali Farhadi, Roozbeh Mottaghi:
Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning. CoRR abs/1812.00971 (2018)
