Manish Nagireddy
2020 – today
- 2024
- [c6] Manish Nagireddy, Lamogha Chiazor, Moninder Singh, Ioana Baldini: SocialStigmaQA: A Benchmark to Uncover Stigma Amplification in Generative Language Models. AAAI 2024: 21454-21462
- [c5] Inkit Padhi, Karthikeyan Natesan Ramamurthy, Prasanna Sattigeri, Manish Nagireddy, Pierre L. Dognin, Kush R. Varshney: Value Alignment from Unstructured Text. EMNLP (Industry Track) 2024: 1083-1095
- [c4] Erik Miehling, Manish Nagireddy, Prasanna Sattigeri, Elizabeth Daly, David Piorkowski, John T. Richards: Language Models in Dialogue: Conversational Maxims for Human-AI Interactions. EMNLP (Findings) 2024: 14420-14437
- [c3] Inkit Padhi, Pierre L. Dognin, Jesus Rios, Ronny Luss, Swapnaja Achintalwar, Matthew Riemer, Miao Liu, Prasanna Sattigeri, Manish Nagireddy, Kush R. Varshney, Djallel Bouneffouf: ComVas: Contextual Moral Values Alignment System. IJCAI 2024: 8759-8762
- [c2] Manish Nagireddy, Bernat Guillen Pegueroles, Ioana Baldini: DARE to Diversify: DAta Driven and Diverse LLM REd Teaming. KDD 2024: 6420-6421
- [i13] Swapnaja Achintalwar, Adriana Alvarado Garcia, Ateret Anaby-Tavor, Ioana Baldini, Sara E. Berger, Bishwaranjan Bhattacharjee, Djallel Bouneffouf, Subhajit Chaudhury, Pin-Yu Chen, Lamogha Chiazor, Elizabeth M. Daly, Rogério Abreu de Paula, Pierre L. Dognin, Eitan Farchi, Soumya Ghosh, Michael Hind, Raya Horesh, George Kour, Ja Young Lee, Erik Miehling, Keerthiram Murugesan, Manish Nagireddy, Inkit Padhi, David Piorkowski, Ambrish Rawat, Orna Raz, Prasanna Sattigeri, Hendrik Strobelt, Sarathkrishna Swaminathan, Christoph Tillmann, Aashka Trivedi, Kush R. Varshney, Dennis Wei, Shalisha Witherspoon, Marcel Zalmanovici: Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations. CoRR abs/2403.06009 (2024)
- [i12] Swapnaja Achintalwar, Ioana Baldini, Djallel Bouneffouf, Joan Byamugisha, Maria Chang, Pierre L. Dognin, Eitan Farchi, Ndivhuwo Makondo, Aleksandra Mojsilovic, Manish Nagireddy, Karthikeyan Natesan Ramamurthy, Inkit Padhi, Orna Raz, Jesus Rios, Prasanna Sattigeri, Moninder Singh, Siphiwe Thwala, Rosario A. Uceda-Sosa, Kush R. Varshney: Alignment Studio: Aligning Large Language Models to Particular Contextual Regulations. CoRR abs/2403.09704 (2024)
- [i11] Pierre L. Dognin, Jesus Rios, Ronny Luss, Inkit Padhi, Matthew D. Riemer, Miao Liu, Prasanna Sattigeri, Manish Nagireddy, Kush R. Varshney, Djallel Bouneffouf: Contextual Moral Value Alignment Through Context-Based Aggregation. CoRR abs/2403.12805 (2024)
- [i10] Lucas Monteiro Paes, Dennis Wei, Hyo Jin Do, Hendrik Strobelt, Ronny Luss, Amit Dhurandhar, Manish Nagireddy, Karthikeyan Natesan Ramamurthy, Prasanna Sattigeri, Werner Geyer, Soumya Ghosh: Multi-Level Explanations for Generative Language Models. CoRR abs/2403.14459 (2024)
- [i9] Erik Miehling, Manish Nagireddy, Prasanna Sattigeri, Elizabeth M. Daly, David Piorkowski, John T. Richards: Language Models in Dialogue: Conversational Maxims for Human-AI Interactions. CoRR abs/2403.15115 (2024)
- [i8] Hussein Mozannar, Valerie Chen, Mohammed Alsobay, Subhro Das, Sebastian Zhao, Dennis Wei, Manish Nagireddy, Prasanna Sattigeri, Ameet Talwalkar, David A. Sontag: The RealHumanEval: Evaluating Large Language Models' Abilities to Support Programmers. CoRR abs/2404.02806 (2024)
- [i7] Manish Nagireddy, Inkit Padhi, Soumya Ghosh, Prasanna Sattigeri: When in Doubt, Cascade: Towards Building Efficient and Capable Guardrails. CoRR abs/2407.06323 (2024)
- [i6] Inkit Padhi, Karthikeyan Natesan Ramamurthy, Prasanna Sattigeri, Manish Nagireddy, Pierre L. Dognin, Kush R. Varshney: Value Alignment from Unstructured Text. CoRR abs/2408.10392 (2024)
- [i5] Bruce W. Lee, Inkit Padhi, Karthikeyan Natesan Ramamurthy, Erik Miehling, Pierre L. Dognin, Manish Nagireddy, Amit Dhurandhar: Programming Refusal with Conditional Activation Steering. CoRR abs/2409.05907 (2024)
- 2023
- [i4] Manish Nagireddy, Moninder Singh, Samuel C. Hoffman, Evaline Ju, Karthikeyan Natesan Ramamurthy, Kush R. Varshney: Function Composition in Trustworthy Machine Learning: Implementation Choices, Insights, and Questions. CoRR abs/2302.09190 (2023)
- [i3] Manish Nagireddy, Lamogha Chiazor, Moninder Singh, Ioana Baldini: SocialStigmaQA: A Benchmark to Uncover Stigma Amplification in Generative Language Models. CoRR abs/2312.07492 (2023)
- 2022
- [c1] Wesley Hanwen Deng, Manish Nagireddy, Michelle Seng Ah Lee, Jatinder Singh, Zhiwei Steven Wu, Kenneth Holstein, Haiyi Zhu: Exploring How Machine Learning Practitioners (Try To) Use Fairness Toolkits. FAccT 2022: 473-484
- [i2] Nil-Jana Akpinar, Manish Nagireddy, Logan Stapleton, Hao Fei Cheng, Haiyi Zhu, Zhiwei Steven Wu, Hoda Heidari: A Sandbox Tool to Bias(Stress)-Test Fairness Algorithms. CoRR abs/2204.10233 (2022)
- [i1] Wesley Hanwen Deng, Manish Nagireddy, Michelle Seng Ah Lee, Jatinder Singh, Zhiwei Steven Wu, Kenneth Holstein, Haiyi Zhu: Exploring How Machine Learning Practitioners (Try To) Use Fairness Toolkits. CoRR abs/2205.06922 (2022)
last updated on 2024-11-22 19:46 CET by the dblp team
all metadata released as open data under CC0 1.0 license