Caiming Xiong
2020 – today
- 2024
- [j13]Ansong Ni, Pengcheng Yin, Yilun Zhao, Martin Riddell, Troy Feng, Rui Shen, Stephen Yin, Ye Liu, Semih Yavuz, Caiming Xiong, Shafiq Joty, Yingbo Zhou, Dragomir Radev, Arman Cohan:
L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models. Trans. Assoc. Comput. Linguistics 12: 1311-1329 (2024) - [c215]Congying Xia, Chen Xing, Jiangshu Du, Xinyi Yang, Yihao Feng, Ran Xu, Wenpeng Yin, Caiming Xiong:
FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability. ACL (1) 2024: 680-699 - [c214]Itai Feigenbaum, Devansh Arpit, Shelby Heinecke, Juan Carlos Niebles, Weiran Yao, Caiming Xiong, Silvio Savarese, Huan Wang:
Causal Layering via Conditional Entropy. CLeaR 2024: 1176-1191 - [c213]Bram Wallace, Meihua Dang, Rafael Rafailov, Linqi Zhou, Aaron Lou, Senthil Purushwalkam, Stefano Ermon, Caiming Xiong, Shafiq Joty, Nikhil Naik:
Diffusion Model Alignment Using Direct Preference Optimization. CVPR 2024: 8228-8238 - [c212]Shu Zhang, Xinyi Yang, Yihao Feng, Can Qin, Chia-Chih Chen, Ning Yu, Zeyuan Chen, Huan Wang, Silvio Savarese, Stefano Ermon, Caiming Xiong, Ran Xu:
HIVE: Harnessing Human Feedback for Instructional Visual Editing. CVPR 2024: 9026-9036 - [c211]Le Xue, Ning Yu, Shu Zhang, Artemis Panagopoulou, Junnan Li, Roberto Martín-Martín, Jiajun Wu, Caiming Xiong, Ran Xu, Juan Carlos Niebles, Silvio Savarese:
ULIP-2: Towards Scalable Multimodal Pre-Training for 3D Understanding. CVPR 2024: 27081-27091 - [c210]Lifu Tu, Jin Qu, Semih Yavuz, Shafiq Joty, Wenhao Liu, Caiming Xiong, Yingbo Zhou:
Efficiently Aligned Cross-Lingual Transfer Learning for Conversational Tasks using Prompt-Tuning. EACL (Findings) 2024: 1278-1294 - [c209]Jianguo Zhang, Kun Qian, Zhiwei Liu, Shelby Heinecke, Rui Meng, Ye Liu, Zhou Yu, Huan Wang, Silvio Savarese, Caiming Xiong:
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI. EACL (Findings) 2024: 2299-2315 - [c208]Ning Yu, Chia-Chih Chen, Zeyuan Chen, Rui Meng, Gang Wu, Paul Josel, Juan Carlos Niebles, Caiming Xiong, Ran Xu:
LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer. ECCV (20) 2024: 169-187 - [c207]Artemis Panagopoulou, Le Xue, Ning Yu, Junnan Li, Dongxu Li, Shafiq Joty, Ran Xu, Silvio Savarese, Caiming Xiong, Juan Carlos Niebles:
X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-Modal Reasoning. ECCV (45) 2024: 177-197 - [c206]Philippe Laban, Alexander R. Fabbri, Caiming Xiong, Chien-Sheng Wu:
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems. EMNLP 2024: 9885-9903 - [c205]Lifu Tu, Semih Yavuz, Jin Qu, Jiacheng Xu, Rui Meng, Caiming Xiong, Yingbo Zhou:
Unlocking Anticipatory Text Generation: A Constrained Approach for Large Language Models Decoding. EMNLP 2024: 15532-15548 - [c204]Simeng Han, Aaron Yu, Rui Shen, Zhenting Qi, Martin Riddell, Wenfei Zhou, Yujie Qiao, Yilun Zhao, Semih Yavuz, Ye Liu, Shafiq Joty, Yingbo Zhou, Caiming Xiong, Dragomir Radev, Rex Ying, Arman Cohan:
P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains. EMNLP (Findings) 2024: 16553-16565 - [c203]Simeng Han, Hailey Schoelkopf, Yilun Zhao, Zhenting Qi, Martin Riddell, Wenfei Zhou, James Coady, David Peng, Yujie Qiao, Luke Benson, Lucy Sun, Alexander Wardle-Solano, Hannah Szabó, Ekaterina Zubova, Matthew Burtell, Jonathan Fan, Yixin Liu, Brian Wong, Malcolm Sailor, Ansong Ni, Linyong Nan, Jungo Kasai, Tao Yu, Rui Zhang, Alexander R. Fabbri, Wojciech Kryscinski, Semih Yavuz, Ye Liu, Xi Victoria Lin, Shafiq Joty, Yingbo Zhou, Caiming Xiong, Rex Ying, Arman Cohan, Dragomir Radev:
FOLIO: Natural Language Reasoning with First-Order Logic. EMNLP 2024: 22017-22031 - [c202]Tianyu Guo, Wei Hu, Song Mei, Huan Wang, Caiming Xiong, Silvio Savarese, Yu Bai:
How Do Transformers Learn In-Context Beyond Simple Functions? A Case Study on Learning with Representations. ICLR 2024 - [c201]Jiacheng Guo, Minshuo Chen, Huan Wang, Caiming Xiong, Mengdi Wang, Yu Bai:
Sample-Efficient Learning of POMDPs with Multiple Observations In Hindsight. ICLR 2024 - [c200]Yiheng Xu, Hongjin Su, Chen Xing, Boyu Mi, Qian Liu, Weijia Shi, Binyuan Hui, Fan Zhou, Yitao Liu, Tianbao Xie, Zhoujun Cheng, Siheng Zhao, Lingpeng Kong, Bailin Wang, Caiming Xiong, Tao Yu:
Lemur: Harmonizing Natural Language and Code for Language Agents. ICLR 2024 - [c199]Weiran Yao, Shelby Heinecke, Juan Carlos Niebles, Zhiwei Liu, Yihao Feng, Le Xue, Rithesh R. N., Zeyuan Chen, Jianguo Zhang, Devansh Arpit, Ran Xu, Phil Mui, Huan Wang, Caiming Xiong, Silvio Savarese:
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization. ICLR 2024 - [c198]Yue Huang, Lichao Sun, Haoran Wang, Siyuan Wu, Qihui Zhang, Yuan Li, Chujie Gao, Yixin Huang, Wenhan Lyu, Yixuan Zhang, Xiner Li, Hanchi Sun, Zhengliang Liu, Yixin Liu, Yijue Wang, Zhikun Zhang, Bertie Vidgen, Bhavya Kailkhura, Caiming Xiong, Chaowei Xiao, Chunyuan Li, Eric P. Xing, Furong Huang, Hao Liu, Heng Ji, Hongyi Wang, Huan Zhang, Huaxiu Yao, Manolis Kellis, Marinka Zitnik, Meng Jiang, Mohit Bansal, James Zou, Jian Pei, Jian Liu, Jianfeng Gao, Jiawei Han, Jieyu Zhao, Jiliang Tang, Jindong Wang, Joaquin Vanschoren, John C. Mitchell, Kai Shu, Kaidi Xu, Kai-Wei Chang, Lifang He, Lifu Huang, Michael Backes, Neil Zhenqiang Gong, Philip S. Yu, Pin-Yu Chen, Quanquan Gu, Ran Xu, Rex Ying, Shuiwang Ji, Suman Jana, Tianlong Chen, Tianming Liu, Tianyi Zhou, William Wang, Xiang Li, Xiangliang Zhang, Xiao Wang, Xing Xie, Xun Chen, Xuyu Wang, Yan Liu, Yanfang Ye, Yinzhi Cao, Yong Chen, Yue Zhao:
Position: TrustLLM: Trustworthiness in Large Language Models. ICML 2024 - [c197]Gerald Woo, Chenghao Liu, Akshat Kumar, Caiming Xiong, Silvio Savarese, Doyen Sahoo:
Unified Training of Universal Time Series Forecasting Transformers. ICML 2024 - [c196]Manli Shu, Le Xue, Ning Yu, Roberto Martín-Martín, Caiming Xiong, Tom Goldstein, Juan Carlos Niebles, Ran Xu:
Hierarchical Point Attention for Indoor 3D Object Detection. ICRA 2024: 4245-4251 - [c195]Kung-Hsiang Huang, Philippe Laban, Alexander R. Fabbri, Prafulla Kumar Choubey, Shafiq Joty, Caiming Xiong, Chien-Sheng Wu:
Embrace Divergence for Richer Insights: A Multi-document Summarization Benchmark and a Case Study on Summarizing Diverse Information from News Articles. NAACL-HLT 2024: 570-593 - [c194]Yusen Zhang, Nan Zhang, Yixin Liu, Alexander R. Fabbri, Junru Liu, Ryo Kamoi, Xiaoxin Lu, Caiming Xiong, Jieyu Zhao, Dragomir Radev, Kathleen R. McKeown, Rui Zhang:
Fair Abstractive Summarization of Diverse Perspectives. NAACL-HLT 2024: 3404-3426 - [c193]Anthony Meng Huat Tiong, Junqi Zhao, Boyang Li, Junnan Li, Steven C. H. Hoi, Caiming Xiong:
What Are We Measuring When We Evaluate Large Vision-Language Models? An Analysis of Latent Factors and Biases. NAACL-HLT 2024: 3427-3454 - [c192]Bo Pang, Caiming Xiong, Yingbo Zhou:
ARM: Alignment with Residual Energy-Based Model. NAACL-HLT 2024: 8225-8236 - [c191]Philippe Laban, Jesse Vig, Marti A. Hearst, Caiming Xiong, Chien-Sheng Wu:
Beyond the Chat: Executable and Verifiable Text-Editing with LLMs. UIST 2024: 20:1-20:23 - [i280]David Junhao Zhang, Dongxu Li, Hung Le, Mike Zheng Shou, Caiming Xiong, Doyen Sahoo:
Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions. CoRR abs/2401.01827 (2024) - [i279]Lichao Sun, Yue Huang, Haoran Wang, Siyuan Wu, Qihui Zhang, Chujie Gao, Yixin Huang, Wenhan Lyu, Yixuan Zhang, Xiner Li, Zhengliang Liu, Yixin Liu, Yijue Wang, Zhikun Zhang, Bhavya Kailkhura, Caiming Xiong, Chaowei Xiao, Chunyuan Li, Eric P. Xing, Furong Huang, Hao Liu, Heng Ji, Hongyi Wang, Huan Zhang, Huaxiu Yao, Manolis Kellis, Marinka Zitnik, Meng Jiang, Mohit Bansal, James Zou, Jian Pei, Jian Liu, Jianfeng Gao, Jiawei Han, Jieyu Zhao, Jiliang Tang, Jindong Wang, John C. Mitchell, Kai Shu, Kaidi Xu, Kai-Wei Chang, Lifang He, Lifu Huang, Michael Backes, Neil Zhenqiang Gong, Philip S. Yu, Pin-Yu Chen, Quanquan Gu, Ran Xu, Rex Ying, Shuiwang Ji, Suman Jana, Tianlong Chen, Tianming Liu, Tianyi Zhou, William Wang, Xiang Li, Xiangliang Zhang, Xiao Wang, Xing Xie, Xun Chen, Xuyu Wang, Yan Liu, Yanfang Ye, Yinzhi Cao, Yue Zhao:
TrustLLM: Trustworthiness in Large Language Models. CoRR abs/2401.05561 (2024) - [i278]Tong Niu, Caiming Xiong, Semih Yavuz, Yingbo Zhou:
Parameter-Efficient Detoxification with Contrastive Decoding. CoRR abs/2401.06947 (2024) - [i277]Itai Feigenbaum, Devansh Arpit, Huan Wang, Shelby Heinecke, Juan Carlos Niebles, Weiran Yao, Caiming Xiong, Silvio Savarese:
Editing Arbitrary Propositions in LLMs without Subject Labels. CoRR abs/2401.07526 (2024) - [i276]Itai Feigenbaum, Devansh Arpit, Huan Wang, Shelby Heinecke, Juan Carlos Niebles, Weiran Yao, Caiming Xiong, Silvio Savarese:
Causal Layering via Conditional Entropy. CoRR abs/2401.10495 (2024) - [i275]Gerald Woo, Chenghao Liu, Akshat Kumar, Caiming Xiong, Silvio Savarese, Doyen Sahoo:
Unified Training of Universal Time Series Forecasting Transformers. CoRR abs/2402.02592 (2024) - [i274]Shiyu Wang, Yihao Feng, Tian Lan, Ning Yu, Yu Bai, Ran Xu, Huan Wang, Caiming Xiong, Silvio Savarese:
Text2Data: Low-Resource Data Generation with Textual Control. CoRR abs/2402.10941 (2024) - [i273]Jianguo Zhang, Tian Lan, Rithesh Murthy, Zhiwei Liu, Weiran Yao, Juntao Tan, Thai Hoang, Liangwei Yang, Yihao Feng, Zuxin Liu, Tulika Manoj Awalgaonkar, Juan Carlos Niebles, Silvio Savarese, Shelby Heinecke, Huan Wang, Caiming Xiong:
AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning. CoRR abs/2402.15506 (2024) - [i272]Zhiwei Liu, Weiran Yao, Jianguo Zhang, Liangwei Yang, Zuxin Liu, Juntao Tan, Prafulla Kumar Choubey, Tian Lan, Jason Wu, Huan Wang, Shelby Heinecke, Caiming Xiong, Silvio Savarese:
AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System. CoRR abs/2402.15538 (2024) - [i271]Congying Xia, Chen Xing, Jiangshu Du, Xinyi Yang, Yihao Feng, Ran Xu, Wenpeng Yin, Caiming Xiong:
FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability. CoRR abs/2402.18667 (2024) - [i270]Mathieu Ravaut, Bosheng Ding, Fangkai Jiao, Hailin Chen, Xingxuan Li, Ruochen Zhao, Chengwei Qin, Caiming Xiong, Shafiq Joty:
How Much are LLMs Contaminated? A Comprehensive Survey and the LLMSanitize Library. CoRR abs/2404.00699 (2024) - [i269]Anthony Meng Huat Tiong, Junqi Zhao, Boyang Li, Junnan Li, Steven C. H. Hoi, Caiming Xiong:
What Are We Measuring When We Evaluate Large Vision-Language Models? An Analysis of Latent Factors and Biases. CoRR abs/2404.02415 (2024) - [i268]Tianbao Xie, Danyang Zhang, Jixuan Chen, Xiaochuan Li, Siheng Zhao, Ruisheng Cao, Toh Jing Hua, Zhoujun Cheng, Dongchan Shin, Fangyu Lei, Yitao Liu, Yiheng Xu, Shuyan Zhou, Silvio Savarese, Caiming Xiong, Victor Zhong, Tao Yu:
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments. CoRR abs/2404.07972 (2024) - [i267]Divyansh Agarwal, Alexander R. Fabbri, Philippe Laban, Ben Risher, Shafiq Joty, Caiming Xiong, Chien-Sheng Wu:
Investigating the prompt leakage effect and black-box defenses for multi-turn LLM interactions. CoRR abs/2404.16251 (2024) - [i266]Hanze Dong, Wei Xiong, Bo Pang, Haoxiang Wang, Han Zhao, Yingbo Zhou, Nan Jiang, Doyen Sahoo, Caiming Xiong, Tong Zhang:
RLHF Workflow: From Reward Modeling to Online RLHF. CoRR abs/2405.07863 (2024) - [i265]Juncheng Liu, Chenghao Liu, Gerald Woo, Yiwei Wang, Bryan Hooi, Caiming Xiong, Doyen Sahoo:
UniTST: Effectively Modeling Inter-Series and Intra-Series Dependencies for Multivariate Time Series Forecasting. CoRR abs/2406.04975 (2024) - [i264]Rithesh Murthy, Liangwei Yang, Juntao Tan, Tulika Manoj Awalgaonkar, Yilun Zhou, Shelby Heinecke, Sachin Desai, Jason Wu, Ran Xu, Sarah Tan, Jianguo Zhang, Zhiwei Liu, Shirley Kokane, Zuxin Liu, Ming Zhu, Huan Wang, Caiming Xiong, Silvio Savarese:
MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases. CoRR abs/2406.10290 (2024) - [i263]Anas Awadalla, Le Xue, Oscar Lo, Manli Shu, Hannah Lee, Etash Kumar Guha, Matt Jordan, Sheng Shen, Mohamed Awadalla, Silvio Savarese, Caiming Xiong, Ran Xu, Yejin Choi, Ludwig Schmidt:
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens. CoRR abs/2406.11271 (2024) - [i262]Zuxin Liu, Thai Hoang, Jianguo Zhang, Ming Zhu, Tian Lan, Shirley Kokane, Juntao Tan, Weiran Yao, Zhiwei Liu, Yihao Feng, Rithesh Murthy, Liangwei Yang, Silvio Savarese, Juan Carlos Niebles, Huan Wang, Shelby Heinecke, Caiming Xiong:
APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets. CoRR abs/2406.18518 (2024) - [i261]Philippe Laban, Alexander R. Fabbri, Caiming Xiong, Chien-Sheng Wu:
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems. CoRR abs/2407.01370 (2024) - [i260]Hung Le, Yingbo Zhou, Caiming Xiong, Silvio Savarese, Doyen Sahoo:
INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness. CoRR abs/2407.02518 (2024) - [i259]Ruisheng Cao, Fangyu Lei, Haoyuan Wu, Jixuan Chen, Yeqiao Fu, Hongcheng Gao, Xinzhuang Xiong, Hanchong Zhang, Yuchen Mao, Wenjing Hu, Tianbao Xie, Hongshen Xu, Danyang Zhang, Sida Wang, Ruoxi Sun, Pengcheng Yin, Caiming Xiong, Ansong Ni, Qian Liu, Victor Zhong, Lu Chen, Kai Yu, Tao Yu:
Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? CoRR abs/2407.10956 (2024) - [i258]Shayne Longpre, Robert Mahari, Ariel Lee, Campbell Lund, Hamidah Oderinwale, William Brannon, Nayan Saxena, Naana Obeng-Marnu, Tobin South, Cole Hunter, Kevin Klyman, Christopher Klamm, Hailey Schoelkopf, Nikhil Singh, Manuel Cherep, Ahmad Anis, An Dinh, Caroline Chitongo, Da Yin, Damien Sileo, Deividas Mataciunas, Diganta Misra, Emad A. Alghamdi, Enrico Shippole, Jianguo Zhang, Joanna Materzynska, Kun Qian, Kush Tiwary, Lester James V. Miranda, Manan Dey, Minnie Liang, Mohammed Hamdy, Niklas Muennighoff, Seonghyeon Ye, Seungone Kim, Shrestha Mohanty, Vipul Gupta, Vivek Sharma, Vu Minh Chien, Xuhui Zhou, Yizhi Li, Caiming Xiong, Luis Villa, Stella Biderman, Hanlin Li, Daphne Ippolito, Sara Hooker, Jad Kabbara, Sandy Pentland:
Consent in Crisis: The Rapid Decline of the AI Data Commons. CoRR abs/2407.14933 (2024) - [i257]Yilun Zhou, Caiming Xiong, Silvio Savarese, Chien-Sheng Wu:
Shared Imagination: LLMs Hallucinate Alike. CoRR abs/2407.16604 (2024) - [i256]Yuhui Xu, Zhanming Jie, Hanze Dong, Lei Wang, Xudong Lu, Aojun Zhou, Amrita Saha, Caiming Xiong, Doyen Sahoo:
ThinK: Thinner Key Cache by Query-Driven Pruning. CoRR abs/2407.21018 (2024) - [i255]Liangwei Yang, Zhiwei Liu, Jianguo Zhang, Rithesh Murthy, Shelby Heinecke, Huan Wang, Caiming Xiong, Philip S. Yu:
Personalized Multi-task Training for Recommender System. CoRR abs/2407.21364 (2024) - [i254]Tian Lan, Huan Wang, Caiming Xiong, Silvio Savarese:
Enabling High Data Throughput Reinforcement Learning on GPUs: A Domain Agnostic Framework for Data-Driven Scientific Research. CoRR abs/2408.00930 (2024) - [i253]Kexun Zhang, Weiran Yao, Zuxin Liu, Yihao Feng, Zhiwei Liu, Rithesh Murthy, Tian Lan, Lei Li, Renze Lou, Jiacheng Xu, Bo Pang, Yingbo Zhou, Shelby Heinecke, Silvio Savarese, Huan Wang, Caiming Xiong:
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents. CoRR abs/2408.07060 (2024) - [i252]Le Xue, Manli Shu, Anas Awadalla, Jun Wang, An Yan, Senthil Purushwalkam, Honglu Zhou, Viraj Prabhu, Yutong Dai, Michael S. Ryoo, Shrikant Kendre, Jieyu Zhang, Can Qin, Shu Zhang, Chia-Chih Chen, Ning Yu, Juntao Tan, Tulika Manoj Awalgaonkar, Shelby Heinecke, Huan Wang, Yejin Choi, Ludwig Schmidt, Zeyuan Chen, Silvio Savarese, Juan Carlos Niebles, Caiming Xiong, Ran Xu:
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models. CoRR abs/2408.08872 (2024) - [i251]Can Qin, Congying Xia, Krithika Ramakrishnan, Michael S. Ryoo, Lifu Tu, Yihao Feng, Manli Shu, Honglu Zhou, Anas Awadalla, Jun Wang, Senthil Purushwalkam, Le Xue, Yingbo Zhou, Huan Wang, Silvio Savarese, Juan Carlos Niebles, Zeyuan Chen, Ran Xu, Caiming Xiong:
xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations. CoRR abs/2408.12590 (2024) - [i250]Jianguo Zhang, Tian Lan, Ming Zhu, Zuxin Liu, Thai Hoang, Shirley Kokane, Weiran Yao, Juntao Tan, Akshara Prabhakar, Haolin Chen, Zhiwei Liu, Yihao Feng, Tulika Manoj Awalgaonkar, Rithesh Murthy, Eric Hu, Zeyuan Chen, Ran Xu, Juan Carlos Niebles, Shelby Heinecke, Huan Wang, Silvio Savarese, Caiming Xiong:
xLAM: A Family of Large Action Models to Empower AI Agent Systems. CoRR abs/2409.03215 (2024) - [i249]Peifeng Wang, Austin Xu, Yilun Zhou, Caiming Xiong, Shafiq Joty:
Direct Judgement Preference Optimization. CoRR abs/2409.14664 (2024) - [i248]Xiangyu Peng, Congying Xia, Xinyi Yang, Caiming Xiong, Chien-Sheng Wu, Chen Xing:
ReGenesis: LLMs can Grow into Reasoning Generalists via Self-Improvement. CoRR abs/2410.02108 (2024) - [i247]Yifei Ming, Senthil Purushwalkam, Shrey Pandit, Zixuan Ke, Xuan-Phi Nguyen, Caiming Xiong, Shafiq Joty:
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows". CoRR abs/2410.03727 (2024) - [i246]Lei Wang, Shan Dong, Yuhui Xu, Hanze Dong, Yalu Wang, Amrita Saha, Ee-Peng Lim, Caiming Xiong, Doyen Sahoo:
MathHay: An Automated Benchmark for Long-Context Mathematical Reasoning in LLMs. CoRR abs/2410.04698 (2024) - [i245]Zirui Zhao, Hanze Dong, Amrita Saha, Caiming Xiong, Doyen Sahoo:
Automatic Curriculum Expert Iteration for Reliable LLM Reasoning. CoRR abs/2410.07627 (2024) - [i244]Simeng Han, Aaron Yu, Rui Shen, Zhenting Qi, Martin Riddell, Wenfei Zhou, Yujie Qiao, Yilun Zhao, Semih Yavuz, Ye Liu, Shafiq Joty, Yingbo Zhou, Caiming Xiong, Dragomir Radev, Rex Ying, Arman Cohan:
P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains. CoRR abs/2410.09207 (2024) - [i243]Taha Aksu, Gerald Woo, Juncheng Liu, Xu Liu, Chenghao Liu, Silvio Savarese, Caiming Xiong, Doyen Sahoo:
GIFT-Eval: A Benchmark For General Time Series Forecasting Model Evaluation. CoRR abs/2410.10393 (2024) - [i242]Xu Liu, Juncheng Liu, Gerald Woo, Taha Aksu, Yuxuan Liang, Roger Zimmermann, Chenghao Liu, Silvio Savarese, Caiming Xiong, Doyen Sahoo:
Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts. CoRR abs/2410.10469 (2024) - [i241]Viraj Prabhu, Senthil Purushwalkam, An Yan, Caiming Xiong, Ran Xu:
Trust but Verify: Programmatic VLM Evaluation in the Wild. CoRR abs/2410.13121 (2024) - [i240]Taha Aksu, Chenghao Liu, Amrita Saha, Sarah Tan, Caiming Xiong, Doyen Sahoo:
XForecast: Evaluating Natural Language Explanations for Time Series Forecasting. CoRR abs/2410.14180 (2024) - [i239]Kaige Xie, Philippe Laban, Prafulla Kumar Choubey, Caiming Xiong, Chien-Sheng Wu:
Do RAG Systems Cover What Matters? Evaluating and Optimizing Responses with Sub-Question Coverage. CoRR abs/2410.15531 (2024) - [i238]Michael S. Ryoo, Honglu Zhou, Shrikant Kendre, Can Qin, Le Xue, Manli Shu, Silvio Savarese, Ran Xu, Caiming Xiong, Juan Carlos Niebles:
xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs. CoRR abs/2410.16267 (2024) - [i237]Prafulla Kumar Choubey, Xin Su, Man Luo, Xiangyu Peng, Caiming Xiong, Tiep Le, Shachar Rosenman, Vasudev Lal, Phil Mui, Ricky Ho Yin Chan, Phillip Howard, Chien-Sheng Wu:
Distill-SynthKG: Distilling Knowledge Graph Synthesis Workflow for Improved Coverage and Efficiency. CoRR abs/2410.16597 (2024) - [i236]Zhiwei Liu, Weiran Yao, Jianguo Zhang, Rithesh Murthy, Liangwei Yang, Zuxin Liu, Tian Lan, Ming Zhu, Juntao Tan, Shirley Kokane, Thai Hoang, Juan Carlos Niebles, Shelby Heinecke, Huan Wang, Silvio Savarese, Caiming Xiong:
PRACT: Optimizing Principled Reasoning and Acting of LLM Agent. CoRR abs/2410.18528 (2024) - [i235]Antonio A. Ginart, Naveen Kodali, Jason Lee, Caiming Xiong, Silvio Savarese, John Emmons:
Asynchronous Tool Usage for Real-Time Agents. CoRR abs/2410.21620 (2024) - [i234]Tong Niu, Shafiq Joty, Ye Liu, Caiming Xiong, Yingbo Zhou, Semih Yavuz:
JudgeRank: Leveraging Large Language Models for Reasoning-Intensive Reranking. CoRR abs/2411.00142 (2024)
- 2023
- [j12]Aadyot Bhatnagar, Paul Kassianik, Chenghao Liu, Tian Lan, Wenzhuo Yang, Rowan Cassius, Doyen Sahoo, Devansh Arpit, Sri Subramanian, Gerald Woo, Amrita Saha, Arun Kumar Jagota, Gokulakrishnan Gopalakrishnan, Manpreet Singh, K. C. Krithika, Sukumar Maddineni, Dae-ki Cho, Bo Zong, Yingbo Zhou, Caiming Xiong, Silvio Savarese, Steven C. H. Hoi, Huan Wang:
Merlion: End-to-End Machine Learning for Time Series. J. Mach. Learn. Res. 24: 226:1-226:6 (2023) - [j11]Anthony Meng Huat Tiong, Junnan Li, Guosheng Lin, Boyang Li, Caiming Xiong, Steven C. H. Hoi:
Improving Tail-Class Representation with Centroid Contrastive Learning. Pattern Recognit. Lett. 168: 123-130 (2023) - [j10]Xiang 'Anthony' Chen, Chien-Sheng Wu, Lidiya Murakhovs'ka, Philippe Laban, Tong Niu, Wenhao Liu, Caiming Xiong:
Marvista: Exploring the Design of a Human-AI Collaborative News Reading Tool. ACM Trans. Comput. Hum. Interact. 30(6): 92:1-92:27 (2023) - [c190]Eric Zhao, Alexander R. Trott, Caiming Xiong, Stephan Zheng:
Learning to Play General-Sum Games against Multiple Boundedly Rational Agents. AAAI 2023: 11781-11789 - [c189]Zahra Fatemi, Chen Xing, Wenhao Liu, Caiming Xiong:
Improving Gender Fairness of Pre-Trained Language Models without Catastrophic Forgetting. ACL (2) 2023: 1249-1262 - [c188]Fan Yin, Jesse Vig, Philippe Laban, Shafiq Joty, Caiming Xiong, Chien-Sheng Wu:
Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learning. ACL (1) 2023: 3063-3079 - [c187]Yixin Liu, Alexander R. Fabbri, Pengfei Liu, Yilun Zhao, Linyong Nan, Ruilin Han, Simeng Han, Shafiq Joty, Chien-Sheng Wu, Caiming Xiong, Dragomir Radev:
Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation. ACL (1) 2023: 4140-4170 - [c186]Philippe Laban, Jesse Vig, Wojciech Kryscinski, Shafiq Joty, Caiming Xiong, Chien-Sheng Wu:
SWiPE: A Dataset for Document-Level Simplification of Wikipedia Pages. ACL (1) 2023: 10674-10695 - [c185]Jiacheng Xu, Caiming Xiong, Silvio Savarese, Yingbo Zhou:
Best-k Search Algorithm for Neural Text Generation. ACL (1) 2023: 12385-12401 - [c184]Philippe Laban, Chien-Sheng Wu, Lidiya Murakhovs'ka, Xiang 'Anthony' Chen, Caiming Xiong:
Designing and Evaluating Interfaces that Highlight News Coverage Diversity Using Discord Questions. CHI 2023: 104:1-104:21 - [c183]Ziwei Fan, Zhiwei Liu, Shelby Heinecke, Jianguo Zhang, Huan Wang, Caiming Xiong, Philip S. Yu:
Zero-shot Item-based Recommendation via Multi-task Product Knowledge Graph Pre-Training. CIKM 2023: 483-493 - [c182]Le Xue, Mingfei Gao, Chen Xing, Roberto Martín-Martín, Jiajun Wu, Caiming Xiong, Ran Xu, Juan Carlos Niebles, Silvio Savarese:
ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding. CVPR 2023: 1179-1189 - [c181]Hiroaki Hayashi, Wojciech Kryscinski, Bryan McCann, Nazneen Rajani, Caiming Xiong:
What's New? Summarizing Contributions in Scientific Literature. EACL 2023: 1019-1031 - [c180]Bo Pang, Semih Yavuz, Caiming Xiong, Yingbo Zhou:
SharPT: Shared Latent Space Prompt Tuning. EACL (Findings) 2023: 1214-1220 - [c179]Bo Pang, Erik Nijkamp, Wojciech Kryscinski, Silvio Savarese, Yingbo Zhou, Caiming Xiong:
Long Document Summarization with Top-down and Bottom-up Inference. EACL (Findings) 2023: 1237-1254 - [c178]Ye Liu, Semih Yavuz, Rui Meng, Dragomir Radev, Caiming Xiong, Shafiq Joty, Yingbo Zhou:
HPE: Answering Complex Questions over Text by Hybrid Question Parsing and Execution. EMNLP (Findings) 2023: 4437-4451 - [c177]Prafulla Kumar Choubey, Alexander R. Fabbri, Caiming Xiong, Chien-Sheng Wu:
Lexical Repetitions Lead to Rote Learning: Unveiling the Impact of Lexical Overlap in Train and Test Reference Summaries. EMNLP (Findings) 2023: 7269-7283 - [c176]Philippe Laban, Wojciech Kryscinski, Divyansh Agarwal, Alexander R. Fabbri, Caiming Xiong, Shafiq Joty, Chien-Sheng Wu:
SummEdits: Measuring LLM Ability at Factual Reasoning Through The Lens of Summarization. EMNLP 2023: 9662-9676 - [c175]Lidiya Murakhovs'ka, Philippe Laban, Tian Xie, Caiming Xiong, Chien-Sheng Wu:
Salespeople vs SalesBot: Exploring the Role of Educational Value in Conversational Recommender Systems. EMNLP (Findings) 2023: 9823-9838 - [c174]Yixin Liu, Alexander R. Fabbri, Yilun Zhao, Pengfei Liu, Shafiq Joty, Chien-Sheng Wu, Caiming Xiong, Dragomir Radev:
Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation. EMNLP 2023: 16360-16368 - [c173]Can Qin, Ning Yu, Chen Xing, Shu Zhang, Zeyuan Chen, Stefano Ermon, Yun Fu, Caiming Xiong, Ran Xu:
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation. ICCV 2023: 23028-23039 - [c172]Le Xue, Mingfei Gao, Zeyuan Chen, Caiming Xiong, Ran Xu:
Robustness Evaluation of Transformer-Based Form Field Extractors via Form Attacks. ICDAR (2) 2023: 167-184 - [c171]Zhoujun Cheng, Tianbao Xie, Peng Shi, Chengzu Li, Rahul Nadkarni, Yushi Hu, Caiming Xiong, Dragomir Radev, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu:
Binding Language Models in Symbolic Languages. ICLR 2023 - [c170]Yihao Feng, Shentao Yang, Shujian Zhang, Jianguo Zhang, Caiming Xiong, Mingyuan Zhou, Huan Wang:
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems. ICLR 2023 - [c169]Erik Nijkamp, Bo Pang, Hiroaki Hayashi, Lifu Tu, Huan Wang, Yingbo Zhou, Silvio Savarese, Caiming Xiong:
CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis. ICLR 2023 - [c168]Xiangyu Peng, Chen Xing, Prafulla Kumar Choubey, Chien-Sheng Wu, Caiming Xiong:
Model ensemble instead of prompt fusion: a sample-specific knowledge transfer method for few-shot prompt tuning. ICLR 2023 - [c167]Aadyot Bhatnagar, Huan Wang, Caiming Xiong, Yu Bai:
Improved Online Conformal Prediction via Strongly Adaptive Online Learning. ICML 2023: 2337-2363 - [c166]Fan Chen, Huan Wang, Caiming Xiong, Song Mei, Yu Bai:
Lower Bounds for Learning in Revealing POMDPs. ICML 2023: 5104-5161 - [c165]Yu Bai, Fan Chen, Huan Wang, Caiming Xiong, Song Mei:
Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection. NeurIPS 2023 - [c164]Can Qin, Shu Zhang, Ning Yu, Yihao Feng, Xinyi Yang, Yingbo Zhou, Huan Wang, Juan Carlos Niebles, Caiming Xiong, Silvio Savarese, Stefano Ermon, Yun Fu, Ran Xu:
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild. NeurIPS 2023 - [c163]Shentao Yang, Shujian Zhang, Congying Xia, Yihao Feng, Caiming Xiong, Mingyuan Zhou:
Preference-grounded Token-level Guidance for Language Model Fine-tuning. NeurIPS 2023 - [c162]