Dahua Lin
Person information
- affiliation: Chinese University of Hong Kong, Department of Information Engineering, CUHK - SenseTime Joint Lab, Hong Kong
- affiliation: Toyota Technological Institute at Chicago, IL, USA
- affiliation (PhD 2012): Massachusetts Institute of Technology, Cambridge, MA, USA
2020 – today
- 2024
- [j22]Hongyang Li, Chonghao Sima, Jifeng Dai, Wenhai Wang, Lewei Lu, Huijie Wang, Jia Zeng, Zhiqi Li, Jiazhi Yang, Hanming Deng, Hao Tian, Enze Xie, Jiangwei Xie, Li Chen, Tianyu Li, Yang Li, Yulu Gao, Xiaosong Jia, Si Liu, Jianping Shi, Dahua Lin, Yu Qiao:
Delving Into the Devils of Bird's-Eye-View Perception: A Review, Evaluation and Recipe. IEEE Trans. Pattern Anal. Mach. Intell. 46(4): 2151-2170 (2024) - [j21]Weijia Li, Zhenghao Hu, Lingxuan Meng, Jinwang Wang, Juepeng Zheng, Runmin Dong, Conghui He, Gui-Song Xia, Haohuan Fu, Dahua Lin:
Weakly Supervised 3-D Building Reconstruction From Monocular Remote Sensing Images. IEEE Trans. Geosci. Remote. Sens. 62: 1-15 (2024) - [j20]Yuan Liu, Songyang Zhang, Jiacheng Chen, Kai Chen, Dahua Lin:
PixMIM: Rethinking Pixel Reconstruction in Masked Image Modeling. Trans. Mach. Learn. Res. 2024 (2024) - [j19]Jiangfei Duan, Xiuhong Li, Ping Xu, Xingcheng Zhang, Shengen Yan, Yun Liang, Dahua Lin:
Proteus: Simulating the Performance of Distributed DNN Training. IEEE Trans. Parallel Distributed Syst. 35(10): 1867-1878 (2024) - [c214]Lijun Li, Bowen Dong, Ruohui Wang, Xuhao Hu, Wangmeng Zuo, Dahua Lin, Yu Qiao, Jing Shao:
SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models. ACL (Findings) 2024: 3923-3954 - [c213]Chenyu Shi, Xiao Wang, Qiming Ge, Songyang Gao, Xianjun Yang, Tao Gui, Qi Zhang, Xuanjing Huang, Xun Zhao, Dahua Lin:
Navigating the OverKill in Large Language Models. ACL (1) 2024: 4602-4614 - [c212]Hongwei Liu, Zilong Zheng, Yuxuan Qiao, Haodong Duan, Zhiwei Fei, Fengzhe Zhou, Wenwei Zhang, Songyang Zhang, Dahua Lin, Kai Chen:
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark. ACL (Findings) 2024: 6884-6915 - [c211]Jie Ren, Qipeng Guo, Hang Yan, Dongrui Liu, Quanshi Zhang, Xipeng Qiu, Dahua Lin:
Identifying Semantic Induction Heads to Understand In-Context Learning. ACL (Findings) 2024: 6916-6932 - [c210]Ziwei Ji, Yuzhe Gu, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen:
ANAH: Analytical Annotation of Hallucinations in Large Language Models. ACL (1) 2024: 8135-8158 - [c209]Yu Sun, Keyu Chen, Shujie Wang, Peiji Li, Qipeng Guo, Hang Yan, Xipeng Qiu, Xuanjing Huang, Dahua Lin:
F-Eval: Assessing Fundamental Abilities with Refined Evaluation Methods. ACL (1) 2024: 9348-9369 - [c208]Zehui Chen, Kuikun Liu, Qiuchen Wang, Wenwei Zhang, Jiangning Liu, Dahua Lin, Kai Chen, Feng Zhao:
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models. ACL (Findings) 2024: 9354-9366 - [c207]Zehui Chen, Weihua Du, Wenwei Zhang, Kuikun Liu, Jiangning Liu, Miao Zheng, Jingming Zhuo, Songyang Zhang, Dahua Lin, Kai Chen, Feng Zhao:
T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step. ACL (1) 2024: 9510-9529 - [c206]Yikun Wang, Rui Zheng, Liang Ding, Qi Zhang, Dahua Lin, Dacheng Tao:
Uncertainty Aware Learning for Language Model Alignment. ACL (1) 2024: 11087-11099 - [c205]Demin Song, Honglin Guo, Yunhua Zhou, Shuhao Xing, Yudong Wang, Zifan Song, Wenwei Zhang, Qipeng Guo, Hang Yan, Xipeng Qiu, Dahua Lin:
Code Needs Comments: Enhancing Code LLMs with Comment Augmentation. ACL (Findings) 2024: 13640-13656 - [c204]Yunfan Shao, Linyang Li, Zhaoye Fei, Hang Yan, Dahua Lin, Xipeng Qiu:
Balanced Data Sampling for Language Model Training with Clustering. ACL (Findings) 2024: 14012-14023 - [c203]Can Tan, Honglin Fang, Junye Zhang, Dahua Lin, Liyan Zhang, Yong Zhang, Peng Yu:
A Knowledge-driven Self-healing Dual-loop and Validation for Autonomous Networks. APNet 2024: 185-186 - [c202]Xupeng Miao, Chunan Shi, Jiangfei Duan, Xiaoli Xi, Dahua Lin, Bin Cui, Zhihao Jia:
SpotServe: Serving Generative Large Language Models on Preemptible Instances. ASPLOS (2) 2024: 1112-1127 - [c201]Xian Liu, Xiaohang Zhan, Jiaxiang Tang, Ying Shan, Gang Zeng, Dahua Lin, Xihui Liu, Ziwei Liu:
HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting. CVPR 2024: 6646-6657 - [c200]Yuming Jiang, Tianxing Wu, Shuai Yang, Chenyang Si, Dahua Lin, Yu Qiao, Chen Change Loy, Ziwei Liu:
VideoBooth: Diffusion-based Video Generation with Image Prompts. CVPR 2024: 6689-6700 - [c199]Xuekun Jiang, Anyi Rao, Jingbo Wang, Dahua Lin, Bo Dai:
Cinematic Behavior Transfer via NeRF-based Differentiable Filming. CVPR 2024: 6723-6732 - [c198]Qihang Zhang, Chaoyang Wang, Aliaksandr Siarohin, Peiye Zhuang, Yinghao Xu, Ceyuan Yang, Dahua Lin, Bolei Zhou, Sergey Tulyakov, Hsin-Ying Lee:
Towards Text-guided 3D Scene Composition. CVPR 2024: 6829-6838 - [c197]Zeyi Sun, Ye Fang, Tong Wu, Pan Zhang, Yuhang Zang, Shu Kong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
Alpha-CLIP: A CLIP Model Focusing on Wherever you Want. CVPR 2024: 13019-13029 - [c196]Qidong Huang, Xiaoyi Dong, Pan Zhang, Bin Wang, Conghui He, Jiaqi Wang, Dahua Lin, Weiming Zhang, Nenghai Yu:
OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation. CVPR 2024: 13418-13427 - [c195]Tai Wang, Xiaohan Mao, Chenming Zhu, Runsen Xu, Ruiyuan Lyu, Peisen Li, Xiao Chen, Wenwei Zhang, Kai Chen, Tianfan Xue, Xihui Liu, Cewu Lu, Dahua Lin, Jiangmiao Pang:
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI. CVPR 2024: 19757-19767 - [c194]Tao Lu, Mulin Yu, Linning Xu, Yuanbo Xiangli, Limin Wang, Dahua Lin, Bo Dai:
Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering. CVPR 2024: 20654-20664 - [c193]Ziqi Huang, Yinan He, Jiashuo Yu, Fan Zhang, Chenyang Si, Yuming Jiang, Yuanhan Zhang, Tianxing Wu, Qingyang Jin, Nattapol Chanpaisit, Yaohui Wang, Xinyuan Chen, Limin Wang, Dahua Lin, Yu Qiao, Ziwei Liu:
VBench: Comprehensive Benchmark Suite for Video Generative Models. CVPR 2024: 21807-21818 - [c192]Tong Wu, Guandao Yang, Zhibing Li, Kai Zhang, Ziwei Liu, Leonidas J. Guibas, Dahua Lin, Gordon Wetzstein:
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation. CVPR 2024: 22227-22238 - [c191]Zhangyang Qi, Ye Fang, Zeyi Sun, Xiaoyang Wu, Tong Wu, Jiaqi Wang, Dahua Lin, Hengshuang Zhao:
GPT4Point: A Unified Framework for Point-Language Understanding and Generation. CVPR 2024: 26407-26417 - [c190]Jiaming Han, Kaixiong Gong, Yiyuan Zhang, Jiaqi Wang, Kaipeng Zhang, Dahua Lin, Yu Qiao, Peng Gao, Xiangyu Yue:
OneLLM: One Framework to Align All Modalities with Language. CVPR 2024: 26574-26585 - [c189]Rongjie Li, Songyang Zhang, Dahua Lin, Kai Chen, Xuming He:
From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models. CVPR 2024: 28076-28086 - [c188]Yuan Liu, Haodong Duan, Yuanhan Zhang, Bo Li, Songyang Zhang, Wangbo Zhao, Yike Yuan, Jiaqi Wang, Conghui He, Ziwei Liu, Kai Chen, Dahua Lin:
MMBench: Is Your Multi-modal Model an All-Around Player? ECCV (6) 2024: 216-233 - [c187]Xiao Fu, Wei Yin, Mu Hu, Kaixuan Wang, Yuexin Ma, Ping Tan, Shaojie Shen, Dahua Lin, Xiaoxiao Long:
GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image. ECCV (22) 2024: 241-258 - [c186]Rui Qian, Shuangrui Ding, Dahua Lin:
Rethinking Image-to-Video Adaptation: An Object-Centric Perspective. ECCV (43) 2024: 329-348 - [c185]Yuwei Guo, Ceyuan Yang, Anyi Rao, Maneesh Agrawala, Dahua Lin, Bo Dai:
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models. ECCV (42) 2024: 330-348 - [c184]Yuwei Guo, Ceyuan Yang, Anyi Rao, Zhengyang Liang, Yaohui Wang, Yu Qiao, Maneesh Agrawala, Dahua Lin, Bo Dai:
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning. ICLR 2024 - [c183]Xinyuan Chen, Yaohui Wang, Lingjun Zhang, Shaobin Zhuang, Xin Ma, Jiashuo Yu, Yali Wang, Dahua Lin, Yu Qiao, Ziwei Liu:
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction. ICLR 2024 - [c182]Xiaoran Liu, Hang Yan, Chenxin An, Xipeng Qiu, Dahua Lin:
Scaling Laws of RoPE-based Extrapolation. ICLR 2024 - [c181]Xian Liu, Jian Ren, Aliaksandr Siarohin, Ivan Skorokhodov, Yanyu Li, Dahua Lin, Xihui Liu, Ziwei Liu, Sergey Tulyakov:
HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion. ICLR 2024 - [c180]Zeqi Xiao, Tai Wang, Jingbo Wang, Jinkun Cao, Wenwei Zhang, Bo Dai, Dahua Lin, Jiangmiao Pang:
Unified Human-Scene Interaction via Prompted Chain-of-Contacts. ICLR 2024 - [c179]Jiangfei Duan, Runyu Lu, Haojie Duanmu, Xiuhong Li, Xingcheng Zhang, Dahua Lin, Ion Stoica, Hao Zhang:
MuxServe: Flexible Spatial-Temporal Multiplexing for Multiple LLM Serving. ICML 2024 - [c178]Songyang Gao, Qiming Ge, Wei Shen, Shihan Dou, Junjie Ye, Xiao Wang, Rui Zheng, Yicheng Zou, Zhi Chen, Hang Yan, Qi Zhang, Dahua Lin:
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback. ICML 2024 - [c177]Haodong Duan, Jueqi Wei, Chonghua Wang, Hongwei Liu, Yixiao Fang, Songyang Zhang, Dahua Lin, Kai Chen:
BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues. NAACL-HLT (Findings) 2024: 3184-3200 - [c176]Chonghua Wang, Haodong Duan, Songyang Zhang, Dahua Lin, Kai Chen:
Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks. NAACL-HLT 2024: 3712-3724 - [c175]Kexin Huang, Xiangyang Liu, Qianyu Guo, Tianxiang Sun, Jiawei Sun, Yaru Wang, Zeyang Zhou, Yixu Wang, Yan Teng, Xipeng Qiu, Yingchun Wang, Dahua Lin:
Flames: Benchmarking Value Alignment of LLMs in Chinese. NAACL-HLT 2024: 4551-4591 - [c174]Jiangfei Duan, Ziang Song, Xupeng Miao, Xiaoli Xi, Dahua Lin, Harry Xu, Minjia Zhang, Zhihao Jia:
Parcae: Proactive, Liveput-Optimized DNN Training on Preemptible Instances. NSDI 2024 - [c173]Qinghao Hu, Zhisheng Ye, Zerui Wang, Guoteng Wang, Meng Zhang, Qiaoling Chen, Peng Sun, Dahua Lin, Xiaolin Wang, Yingwei Luo, Yonggang Wen, Tianwei Zhang:
Characterization of Large Language Model Development in the Datacenter. NSDI 2024: 709-729 - [i267]Tong Wu, Guandao Yang, Zhibing Li, Kai Zhang, Ziwei Liu, Leonidas J. Guibas, Dahua Lin, Gordon Wetzstein:
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation. CoRR abs/2401.04092 (2024) - [i266]Mingxin Huang, Dezhi Peng, Hongliang Li, Zhenghao Peng, Chongyu Liu, Dahua Lin, Yuliang Liu, Xiang Bai, Lianwen Jin:
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting. CoRR abs/2401.07641 (2024) - [i265]Songyang Gao, Qiming Ge, Wei Shen, Shihan Dou, Junjie Ye, Xiao Wang, Rui Zheng, Yicheng Zou, Zhi Chen, Hang Yan, Qi Zhang, Dahua Lin:
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback. CoRR abs/2401.11458 (2024) - [i264]Zhaoye Fei, Yunfan Shao, Linyang Li, Zhiyuan Zeng, Hang Yan, Xipeng Qiu, Dahua Lin:
Query of CC: Unearthing Large Scale Domain-Specific Knowledge from Public Corpora. CoRR abs/2401.14624 (2024) - [i263]Yu Sun, Keyu Chen, Shujie Wang, Qipeng Guo, Hang Yan, Xipeng Qiu, Xuanjing Huang, Dahua Lin:
F-Eval: Assessing Fundamental Abilities with Refined Evaluation Methods. CoRR abs/2401.14869 (2024) - [i262]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Xilin Wei, Songyang Zhang, Haodong Duan, Maosong Cao, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model. CoRR abs/2401.16420 (2024) - [i261]Chenyu Shi, Xiao Wang, Qiming Ge, Songyang Gao, Xianjun Yang, Tao Gui, Qi Zhang, Xuanjing Huang, Xun Zhao, Dahua Lin:
Navigating the OverKill in Large Language Models. CoRR abs/2401.17633 (2024) - [i260]Lijun Li, Bowen Dong, Ruohui Wang, Xuhao Hu, Wangmeng Zuo, Dahua Lin, Yu Qiao, Jing Shao:
SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models. CoRR abs/2402.05044 (2024) - [i259]Tao Yuan, Xuefei Ning, Dong Zhou, Zhijie Yang, Shiyao Li, Minghui Zhuang, Zheyue Tan, Zhuyu Yao, Dahua Lin, Boxun Li, Guohao Dai, Shengen Yan, Yu Wang:
LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K. CoRR abs/2402.05136 (2024) - [i258]Huaiyuan Ying, Shuo Zhang, Linyang Li, Zhejian Zhou, Yunfan Shao, Zhaoye Fei, Yichuan Ma, Jiawei Hong, Kuikun Liu, Ziyi Wang, Yudong Wang, Zijian Wu, Shuaibin Li, Fengzhe Zhou, Hongwei Liu, Songyang Zhang, Wenwei Zhang, Hang Yan, Xipeng Qiu, Jiayu Wang, Kai Chen, Dahua Lin:
InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning. CoRR abs/2402.06332 (2024) - [i257]Ying Jin, Jiaqi Wang, Dahua Lin:
SepRep-Net: Multi-source Free Domain Adaptation via Model Separation And Reparameterization. CoRR abs/2402.08249 (2024) - [i256]Jiahe Chen, Jinkun Cao, Dahua Lin, Kris Kitani, Jiangmiao Pang:
Mixed Gaussian Flow for Diverse Trajectory Prediction. CoRR abs/2402.12238 (2024) - [i255]Zhiyuan Zeng, Qipeng Guo, Zhaoye Fei, Zhangyue Yin, Yunhua Zhou, Linyang Li, Tianxiang Sun, Hang Yan, Dahua Lin, Xipeng Qiu:
Turn Waste into Worth: Rectifying Top-k Router of MoE. CoRR abs/2402.12399 (2024) - [i254]Demin Song, Honglin Guo, Yunhua Zhou, Shuhao Xing, Yudong Wang, Zifan Song, Wenwei Zhang, Qipeng Guo, Hang Yan, Xipeng Qiu, Dahua Lin:
Code Needs Comments: Enhancing Code LLMs with Comment Augmentation. CoRR abs/2402.13013 (2024) - [i253]Jie Ren, Qipeng Guo, Hang Yan, Dongrui Liu, Xipeng Qiu, Dahua Lin:
Identifying Semantic Induction Heads to Understand In-Context Learning. CoRR abs/2402.13055 (2024) - [i252]Kai Lv, Xiaoran Liu, Qipeng Guo, Hang Yan, Conghui He, Xipeng Qiu, Dahua Lin:
LongWanjuan: Towards Systematic Measurement for Long Text Quality. CoRR abs/2402.13583 (2024) - [i251]Tian Lan, Wenwei Zhang, Chen Xu, Heyan Huang, Dahua Lin, Kai Chen, Xianling Mao:
CriticBench: Evaluating Large Language Models as Critic. CoRR abs/2402.13764 (2024) - [i250]Yunfan Shao, Linyang Li, Zhaoye Fei, Hang Yan, Dahua Lin, Xipeng Qiu:
Balanced Data Sampling for Language Model Training with Clustering. CoRR abs/2402.14526 (2024) - [i249]Yuhang Cao, Pan Zhang, Xiaoyi Dong, Dahua Lin, Jiaqi Wang:
DualFocus: Integrating Macro and Micro Perspectives in Multi-modal Large Language Models. CoRR abs/2402.14767 (2024) - [i248]Runyu Peng, Yunhua Zhou, Qipeng Guo, Yang Gao, Hang Yan, Xipeng Qiu, Dahua Lin:
Data-free Weight Compress and Denoise for Large Language Models. CoRR abs/2402.16319 (2024) - [i247]Shuangrui Ding, Zihan Liu, Xiaoyi Dong, Pan Zhang, Rui Qian, Conghui He, Dahua Lin, Jiaqi Wang:
SongComposer: A Large Language Model for Lyric and Melody Composition in Song Generation. CoRR abs/2402.17645 (2024) - [i246]Jiantao Qiu, Haijun Lv, Zhenjiang Jin, Rui Wang, Wenchang Ning, Jia Yu, ChaoBin Zhang, Zhenxiang Li, Pei Chu, Yuan Qu, Jin Shi, Lindong Lu, Runyu Peng, Zhiyuan Zeng, Huanze Tang, Zhikai Lei, Jiawei Hong, Keyu Chen, Zhaoye Fei, Ruiliang Xu, Wei Li, Zhongying Tu, Dahua Lin, Yu Qiao, Hang Yan, Conghui He:
WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset. CoRR abs/2402.19282 (2024) - [i245]Fangzhou Hong, Jiaxiang Tang, Ziang Cao, Min Shi, Tong Wu, Zhaoxi Chen, Tengfei Wang, Liang Pan, Dahua Lin, Ziwei Liu:
3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors. CoRR abs/2403.02234 (2024) - [i244]Qinghao Hu, Zhisheng Ye, Zerui Wang, Guoteng Wang, Meng Zhang, Qiaoling Chen, Peng Sun, Dahua Lin, Xiaolin Wang, Yingwei Luo, Yonggang Wen, Tianwei Zhang:
Characterization of Large Language Model Development in the Datacenter. CoRR abs/2403.07648 (2024) - [i243]Bowen Li, Wenhan Wu, Ziwei Tang, Lin Shi, John Yang, Jinyang Li, Shunyu Yao, Chen Qian, Binyuan Hui, Qicheng Zhang, Zhiyin Yu, He Du, Ping Yang, Dahua Lin, Chao Peng, Kai Chen:
DevBench: A Comprehensive Benchmark for Software Development. CoRR abs/2403.08604 (2024) - [i242]Xiao Fu, Wei Yin, Mu Hu, Kaixuan Wang, Yuexin Ma, Ping Tan, Shaojie Shen, Dahua Lin, Xiaoxiao Long:
GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image. CoRR abs/2403.12013 (2024) - [i241]Zehui Chen, Kuikun Liu, Qiuchen Wang, Wenwei Zhang, Jiangning Liu, Dahua Lin, Kai Chen, Feng Zhao:
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models. CoRR abs/2403.12881 (2024) - [i240]Ziyu Liu, Zeyi Sun, Yuhang Zang, Wei Li, Pan Zhang, Xiaoyi Dong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition. CoRR abs/2403.13805 (2024) - [i239]Jiangfei Duan, Ziang Song, Xupeng Miao, Xiaoli Xi, Dahua Lin, Harry Xu, Minjia Zhang, Zhihao Jia:
Parcae: Proactive, Liveput-Optimized DNN Training on Preemptible Instances. CoRR abs/2403.14097 (2024) - [i238]Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Jiaqi Wang, Yu Qiao, Dahua Lin, Feng Zhao:
Are We on the Right Way for Evaluating Large Vision-Language Models? CoRR abs/2403.20330 (2024) - [i237]Rongjie Li, Songyang Zhang, Dahua Lin, Kai Chen, Xuming He:
From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models. CoRR abs/2404.00906 (2024) - [i236]Jiangfei Duan, Runyu Lu, Haojie Duanmu, Xiuhong Li, Xingcheng Zhang, Dahua Lin, Ion Stoica, Hao Zhang:
MuxServe: Flexible Multiplexing for Efficient Multiple LLM Serving. CoRR abs/2404.02015 (2024) - [i235]Chonghua Wang, Haodong Duan, Songyang Zhang, Dahua Lin, Kai Chen:
Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks. CoRR abs/2404.06480 (2024) - [i234]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD. CoRR abs/2404.06512 (2024) - [i233]Xiao Wang, Tianze Chen, Xianjun Yang, Qi Zhang, Xun Zhao, Dahua Lin:
Unveiling the Misuse Potential of Base Large Language Models via In-Context Learning. CoRR abs/2404.10552 (2024) - [i232]Junfeng Long, Wenye Yu, Quanyi Li, Zirui Wang, Dahua Lin, Jiangmiao Pang:
Learning H-Infinity Locomotion Control. CoRR abs/2404.14405 (2024) - [i231]Zhe Chen, Weiyun Wang, Hao Tian, Shenglong Ye, Zhangwei Gao, Erfei Cui, Wenwen Tong, Kongzhi Hu, Jiapeng Luo, Zheng Ma, Ji Ma, Jiaqi Wang, Xiaoyi Dong, Hang Yan, Hewei Guo, Conghui He, Botian Shi, Zhenjiang Jin, Chao Xu, Bin Wang, Xingjian Wei, Wei Li, Wenjian Zhang, Bo Zhang, Pinlong Cai, Licheng Wen, Xiangchao Yan, Min Dou, Lewei Lu, Xizhou Zhu, Tong Lu, Dahua Lin, Yu Qiao, Jifeng Dai, Wenhai Wang:
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites. CoRR abs/2404.16821 (2024) - [i230]Ye Fang, Zeyi Sun, Tong Wu, Jiaqi Wang, Ziwei Liu, Gordon Wetzstein, Dahua Lin:
Make-it-Real: Unleashing Large Multimodal Model's Ability for Painting 3D Objects with Realistic Materials. CoRR abs/2404.16829 (2024) - [i229]Wei Li, Ren Ma, Jiang Wu, Chenya Gu, Jiahui Peng, Jinyang Len, Songyang Zhang, Hang Yan, Dahua Lin, Conghui He:
FoundaBench: Evaluating Chinese Fundamental Knowledge Capabilities of Large Language Models. CoRR abs/2404.18359 (2024) - [i228]Haojie Duanmu, Zhihang Yuan, Xiuhong Li, Jiangfei Duan, Xingcheng Zhang, Dahua Lin:
SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models. CoRR abs/2405.06219 (2024) - [i227]Yilun Chen, Shuai Yang, Haifeng Huang, Tai Wang, Ruiyuan Lyu, Runsen Xu, Dahua Lin, Jiangmiao Pang:
Grounded 3D-LLM with Referent Tokens. CoRR abs/2405.10370 (2024) - [i226]Ying Jin, Pengyang Ling, Xiaoyi Dong, Pan Zhang, Jiaqi Wang, Dahua Lin:
ReasonPix2Pix: Instruction Reasoning Dataset for Advanced Image Editing. CoRR abs/2405.11190 (2024) - [i225]Hongwei Liu, Zilong Zheng, Yuxuan Qiao, Haodong Duan, Zhiwei Fei, Fengzhe Zhou, Wenwei Zhang, Songyang Zhang, Dahua Lin, Kai Chen:
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark. CoRR abs/2405.12209 (2024) - [i224]Rui Qian, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Shuangrui Ding, Dahua Lin, Jiaqi Wang:
Streaming Long Video Understanding with Large Language Models. CoRR abs/2405.16009 (2024) - [i223]Bin Wang, Linke Ouyang, Fan Wu, Wenchang Ning, Xiao Han, Zhiyuan Zhao, Jiahui Peng, Yiying Jiang, Dahua Lin, Conghui He:
DSDL: Data Set Description Language for Bridging Modalities and Tasks in AI Data. CoRR abs/2405.18315 (2024) - [i222]Zifan Song, Yudong Wang, Wenwei Zhang, Kuikun Liu, Chengqi Lyu, Demin Song, Qipeng Guo, Hang Yan, Dahua Lin, Kai Chen, Cairong Zhao:
AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data. CoRR abs/2405.19265 (2024) - [i221]Ziwei Ji, Yuzhe Gu, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen:
ANAH: Analytical Annotation of Hallucinations in Large Language Models. CoRR abs/2405.20315 (2024) - [i220]Zeyi Sun, Tong Wu, Pan Zhang, Yuhang Zang, Xiaoyi Dong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang:
Bootstrap3D: Improving 3D Content Creation with Synthetic Data. CoRR abs/2406.00093 (2024) - [i219]Huaiyuan Ying, Zijian Wu, Yihan Geng, Jiayu Wang, Dahua Lin, Kai Chen:
Lean Workbook: A large-scale Lean problem set formalized from natural language math problems. CoRR abs/2406.03847 (2024) - [i218]Lin Chen, Xilin Wei, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Bin Lin, Zhenyu Tang, Li Yuan, Yu Qiao, Dahua Lin, Feng Zhao, Jiaqi Wang:
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions. CoRR abs/2406.04325 (2024) - [i217]Yikun Wang, Rui Zheng, Liang Ding, Qi Zhang, Dahua Lin, Dacheng Tao:
Uncertainty Aware Learning for Language Model Alignment. CoRR abs/2406.04854 (2024) - [i216]Qingyun Li, Zhe Chen, Weiyun Wang, Wenhai Wang, Shenglong Ye, Zhenjiang Jin, Guanzhou Chen, Yinan He, Zhangwei Gao, Erfei Cui, Jiashuo Yu, Hao Tian, Jiasheng Zhou, Chao Xu, Bin Wang, Xingjian Wei, Wei Li, Wenjian Zhang, Bo Zhang, Pinlong Cai, Licheng Wen, Xiangchao Yan, Zhenxiang Li, Pei Chu, Yi Wang, Min Dou, Changyao Tian, Xizhou Zhu, Lewei Lu, Yushi Chen, Junjun He, Zhongying Tu, Tong Lu, Yali Wang, Limin Wang, Dahua Lin, Yu Qiao, Botian Shi, Conghui He, Jifeng Dai:
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text. CoRR abs/2406.08418 (2024) - [i215]Ruiyuan Lyu, Tai Wang, Jingli Lin, Shuai Yang, Xiaohan Mao, Yilun Chen, Runsen Xu, Haifeng Huang, Chenming Zhu, Dahua Lin, Jiangmiao Pang:
MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations. CoRR abs/2406.09401 (2024) - [i214]Jiaqi Wang, Yuhang Zang, Pan Zhang, Tao Chu, Yuhang Cao, Zeyi Sun, Ziyu Liu, Xiaoyi Dong, Tong Wu, Dahua Lin, Zeming Chen, Zhi Wang, Lingchen Meng, Wenhao Yao, Jianwei Yang, Sihong Wu, Zhineng Chen, Zuxuan Wu, Yu-Gang Jiang, Peixi Wu, Bosong Chai, Xuan Nie, Longquan Yan, Zeyu Wang, Qifan Zhou, Boning Wang, Jiaqi Huang, Zunnan Xu, Xiu Li, Kehong Yuan, Yanyan Zu, Jiayao Ha, Qiong Gao, Licheng Jiao:
V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results. CoRR abs/2406.11739 (2024) - [i213]Ziyu Liu, Tao Chu, Yuhang Zang, Xilin Wei, Xiaoyi Dong, Pan Zhang, Zijian Liang, Yuanjun Xiong, Yu Qiao, Dahua Lin, Jiaqi Wang:
MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs. CoRR abs/2406.11833 (2024) - [i212]Zhen Huang, Zengzhi Wang, Shijie Xia, Xuefeng Li, Haoyang Zou, Ruijie Xu, Run-Ze Fan, Lyumanshan Ye, Ethan Chern, Yixin Ye, Yikai Zhang, Yuqing Yang, Ting Wu, Binjie Wang, Shichao Sun, Yang Xiao, Yiyuan Li, Fan Zhou, Steffi Chern, Yiwei Qin, Yan Ma, Jiadi Su, Yixiu Liu, Yuxiang Zheng, Shaoting Zhang, Dahua Lin, Yu Qiao, Pengfei Liu:
OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI. CoRR abs/2406.12753 (2024) - [i211]Xinyu Fang, Kangrui Mao, Haodong Duan, Xiangyu Zhao, Yining Li, Dahua Lin, Kai Chen:
MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding. CoRR abs/2406.14515 (2024) - [i210]Yuxuan Qiao, Haodong Duan, Xinyu Fang, Junming Yang, Lin Chen, Songyang Zhang, Jiaqi Wang, Dahua Lin, Kai Chen:
Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs. CoRR abs/2406.14544 (2024) - [i209]Zhiwei Fei, Songyang Zhang, Xiaoyu Shen, Dawei Zhu, Xiao Wang, Maosong Cao, Fengzhe Zhou, Yining Li, Wenwei Zhang, Dahua Lin, Kai Chen, Jidong Ge:
InternLM-Law: An Open Source Chinese Legal Large Language Model. CoRR abs/2406.14887 (2024) - [i208]