


Xiaodan Liang
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
2020 – today
- 2024
- [j67]Xiaojun Wang, Zichen Lou, Xiaodan Liang:
Optimal operation of integrated electricity and gas networks with risk analysis using downside risk constraints method. Comput. Chem. Eng. 184: 108641 (2024) - [j66]Linfeng Li, Weixing Su, Fang Liu, Maowei He, Xiaodan Liang:
Multi-scale adaptive networks for efficient inference. Int. J. Mach. Learn. Cybern. 15(2): 267-282 (2024) - [j65]Guangrun Wang, Changlin Li, Liuchun Yuan, Jiefeng Peng, Xiaoyu Xian, Xiaodan Liang, Xiaojun Chang, Liang Lin:
DNA Family: Boosting Weight-Sharing NAS With Block-Wise Supervisions. IEEE Trans. Pattern Anal. Mach. Intell. 46(5): 2722-2740 (2024) - [j64]Bingqian Lin, Yunshuang Nie, Ziming Wei, Yi Zhu, Hang Xu, Shikui Ma, Jianzhuang Liu, Xiaodan Liang:
Correctable Landmark Discovery via Large Models for Vision-Language Navigation. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 8534-8548 (2024) - [j63]Hanlin Zhang, Shuai Lin, Weiyang Liu, Pan Zhou, Jian Tang, Xiaodan Liang, Eric P. Xing:
Iterative Graph Self-Distillation. IEEE Trans. Knowl. Data Eng. 36(3): 1161-1169 (2024) - [j62]Shuai Lin, Chen Liu, Pan Zhou, Zi-Yuan Hu, Shuojia Wang, Ruihui Zhao, Yefeng Zheng, Liang Lin, Eric P. Xing, Xiaodan Liang:
Prototypical Graph Contrastive Learning. IEEE Trans. Neural Networks Learn. Syst. 35(2): 2747-2758 (2024) - [j61]Jinghui Qin, Zhicheng Yang, Jiaqi Chen, Xiaodan Liang, Liang Lin:
Template-Based Contrastive Distillation Pretraining for Math Word Problem Solving. IEEE Trans. Neural Networks Learn. Syst. 35(9): 12823-12835 (2024) - [j60]Yanxin Long, Jianhua Han, Runhui Huang, Hang Xu, Yi Zhu, Chunjing Xu, Xiaodan Liang:
Fine-Grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection. IEEE Trans. Neural Networks Learn. Syst. 35(11): 16277-16287 (2024) - [c244]Xuan Huang, Hanhui Li, Zejun Yang, Zhisheng Wang, Xiaodan Liang:
3D Visibility-Aware Generalizable Neural Radiance Fields for Interacting Hands. AAAI 2024: 2400-2408 - [c243]Hanhui Li, Xiaojian Lin, Xuan Huang, Zejun Yang, Zhisheng Wang, Xiaodan Liang:
Monocular 3D Hand Mesh Recovery via Dual Noise Estimation. AAAI 2024: 3046-3054 - [c242]Luoyang Lin, Zutao Jiang, Xiaodan Liang, Liqian Ma, Michael C. Kampffmeyer, Xiaochun Cao:
PTUS: Photo-Realistic Talking Upper-Body Synthesis via 3D-Aware Motion Decomposition Warping. AAAI 2024: 3441-3449 - [c241]Zhenyu Xie, Yang Wu, Xuehao Gao, Zhongqian Sun, Wei Yang, Xiaodan Liang:
Towards Detailed Text-to-Motion Synthesis via Basic-to-Advanced Hierarchical Diffusion Model. AAAI 2024: 6252-6260 - [c240]Meng Cao, Haoran Tang, Jinfa Huang, Peng Jin, Can Zhang, Ruyang Liu, Long Chen, Xiaodan Liang, Li Yuan, Ge Li:
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter. ACL (Findings) 2024: 7160-7174 - [c239]Jiaqi Chen, Bingqian Lin, Ran Xu, Zhenhua Chai, Xiaodan Liang, Kwan-Yee Kenneth Wong:
MapGPT: Map-Guided Prompting with Adaptive Path Planning for Vision-and-Language Navigation. ACL (1) 2024: 9796-9810 - [c238]Yinya Huang, Ruixin Hong, Hongming Zhang, Wei Shao, Zhicheng Yang, Dong Yu, Changshui Zhang, Xiaodan Liang, Linqi Song:
CLOMO: Counterfactual Logical Modification with Large Language Models. ACL (1) 2024: 11012-11034 - [c237]Qingxing Cao, Junhao Cheng, Xiaodan Liang, Liang Lin:
VisDiaHalBench: A Visual Dialogue Benchmark For Diagnosing Hallucination in Large Vision-Language Models. ACL (1) 2024: 12161-12176 - [c236]Xiwen Liang, Liang Ma, Shanshan Guo, Jianhua Han, Hang Xu, Shikui Ma, Xiaodan Liang:
CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot Vision-and-Language Navigation. ACL (Findings) 2024: 12538-12559 - [c235]Yuxuan Hu, Minghuan Tan, Chenwei Zhang, Zixuan Li, Xiaodan Liang, Min Yang, Chengming Li, Xiping Hu:
APTNESS: Incorporating Appraisal Theory and Emotion Support Strategies for Empathetic Response Generation. CIKM 2024: 900-909 - [c234]Lewei Yao, Renjie Pi, Jianhua Han, Xiaodan Liang, Hang Xu, Wei Zhang, Zhenguo Li, Dan Xu:
DetCLIPv3: Towards Versatile Generative Open-Vocabulary Object Detection. CVPR 2024: 5610-5619 - [c233]Xinpeng Ding, Jianhua Han, Hang Xu, Xiaodan Liang, Wei Zhang, Xiaomeng Li:
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models. CVPR 2024: 13668-13677 - [c232]Sihao Lin, Pumeng Lyu, Dongrui Liu, Tao Tang, Xiaodan Liang, Andy Song, Xiaojun Chang:
MLP Can Be a Good Transformer Learner. CVPR 2024: 19489-19498 - [c231]Tang Tao, Guangrun Wang, Yixing Lao, Peng Chen, Jie Liu, Liang Lin, Kaicheng Yu, Xiaodan Liang:
AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis. CVPR 2024: 21230-21240 - [c230]Zhijian Huang, Tao Tang, Shaoxiang Chen, Sihao Lin, Zequn Jie, Lin Ma, Guangrun Wang, Xiaodan Liang:
Making Large Language Models Better Planners with Reasoning-Decision Alignment. ECCV (36) 2024: 73-90 - [c229]Runhui Huang, Kaixin Cai, Jianhua Han, Xiaodan Liang, Renjing Pei, Guansong Lu, Songcen Xu, Wei Zhang, Hang Xu:
LayerDiff: Exploring Text-Guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model. ECCV (76) 2024: 144-160 - [c228]Shiyue Zhang, Zheng Chong, Xujie Zhang, Hanhui Li, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang:
GarmentAligner: Text-to-Garment Generation via Retrieval-Augmented Multi-level Corrections. ECCV (25) 2024: 148-164 - [c227]Mingjie Li, Haokun Lin, Liang Qiu, Xiaodan Liang, Ling Chen, Abdulmotaleb Elsaddik, Xiaojun Chang:
Contrastive Learning with Counterfactual Explanations for Radiology Report Generation. ECCV (43) 2024: 162-180 - [c226]Guian Fang, Wenbiao Yan, Yuanfan Guo, Jianhua Han, Zutao Jiang, Hang Xu, Shengcai Liao, Xiaodan Liang:
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-Fine Pose-Reversible Guidance. ECCV (32) 2024: 201-217 - [c225]Zhicheng Yang, Yinya Huang, Jing Xiong, Liang Feng, Xiaodan Liang, Yiwei Wang, Jing Tang:
AlignedCoT: Prompting Large Language Models via Native-Speaking Demonstrations. EMNLP (Findings) 2024: 2857-2896 - [c224]Yinya Huang, Xiaohan Lin, Zhengying Liu, Qingxing Cao, Huajian Xin, Haiming Wang, Zhenguo Li, Linqi Song, Xiaodan Liang:
MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data. ICLR 2024 - [c223]Renjie Pi, Lewei Yao, Jianhua Han, Xiaodan Liang, Wei Zhang, Hang Xu:
Ins-DetCLIP: Aligning Detection Model to Follow Human-Language Instruction. ICLR 2024 - [c222]Haiming Wang, Huajian Xin, Chuanyang Zheng, Zhengying Liu, Qingxing Cao, Yinya Huang, Jing Xiong, Han Shi, Enze Xie, Jian Yin, Zhenguo Li, Xiaodan Liang:
LEGO-Prover: Neural Theorem Proving with Growing Libraries. ICLR 2024 - [c221]Jing Xiong, Zixuan Li, Chuanyang Zheng, Zhijiang Guo, Yichun Yin, Enze Xie, Zhicheng Yang, Qingxing Cao, Haiming Wang, Xiongwei Han, Jing Tang, Chengming Li, Xiaodan Liang:
DQ-LoRe: Dual Queries with Low Rank Approximation Re-ranking for In-Context Learning. ICLR 2024 - [c220]Yuxuan Hu, Chenwei Zhang, Min Yang, Xiaodan Liang, Chengming Li, Xiping Hu:
Learning to Generalize Unseen Domains via Multi-source Meta Learning for Text Classification. ICPR (19) 2024: 412-428 - [c219]Tang Tao, Longfei Gao, Guangrun Wang, Yixing Lao, Peng Chen, Hengshuang Zhao, Dayang Hao, Xiaodan Liang, Mathieu Salzmann, Kaicheng Yu:
LiDAR-NeRF: Novel LiDAR View Synthesis via Neural Radiance Fields. ACM Multimedia 2024: 390-398 - [c218]Zhenyu Xie, Haoye Dong, Yufei Gao, Zehua Ma, Xiaodan Liang:
DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models. ACM Multimedia 2024: 10784-10793 - [c217]Xiaohan Lin, Qingxing Cao, Yinya Huang, Zhicheng Yang, Zhengying Liu, Zhenguo Li, Xiaodan Liang:
ATG: Benchmarking Automated Theorem Generation for Generative Language Models. NAACL-HLT (Findings) 2024: 4465-4480 - [c216]Xuan Huang, Hanhui Li, Wanquan Liu, Xiaodan Liang, Yiqiang Yan, Yuhao Cheng, Chenqiang Gao:
Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars. NeurIPS 2024 - [c215]Xiaohan Lin, Qingxing Cao, Yinya Huang, Haiming Wang, Jianqiao Lu, Zhengying Liu, Linqi Song, Xiaodan Liang:
FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving. NeurIPS 2024 - [c214]Youpeng Wen, Junfan Lin, Yi Zhu, Jianhua Han, Hang Xu, Shen Zhao, Xiaodan Liang:
VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation. NeurIPS 2024 - [c213]Kaidong Zhang, Pengzhen Ren, Bingqian Lin, Junfan Lin, Shikui Ma, Hang Xu, Xiaodan Liang:
PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation. NeurIPS 2024 - [i289]Xuan Huang, Hanhui Li, Zejun Yang, Zhisheng Wang, Xiaodan Liang:
3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands. CoRR abs/2401.00979 (2024) - [i288]Xinpeng Ding, Jianhua Han, Hang Xu, Xiaodan Liang, Wei Zhang, Xiaomeng Li:
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models. CoRR abs/2401.00988 (2024) - [i287]Jiaqi Chen, Bingqian Lin, Ran Xu, Zhenhua Chai, Xiaodan Liang, Kwan-Yee K. Wong:
MapGPT: Map-Guided Prompting for Unified Vision-and-Language Navigation. CoRR abs/2401.07314 (2024) - [i286]Yinya Huang, Xiaohan Lin, Zhengying Liu, Qingxing Cao, Huajian Xin, Haiming Wang, Zhenguo Li, Linqi Song, Xiaodan Liang:
MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data. CoRR abs/2402.08957 (2024) - [i285]Tao Tang, Guangrun Wang, Yixing Lao, Peng Chen, Jie Liu, Liang Lin, Kaicheng Yu, Xiaodan Liang:
AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis. CoRR abs/2402.17483 (2024) - [i284]Guangrun Wang, Changlin Li, Liuchun Yuan, Jiefeng Peng, Xiaoyu Xian, Xiaodan Liang, Xiaojun Chang, Liang Lin:
DNA Family: Boosting Weight-Sharing NAS with Block-Wise Supervisions. CoRR abs/2403.01326 (2024) - [i283]Bingqian Lin, Yanxin Long, Yi Zhu, Fengda Zhu, Xiaodan Liang, Qixiang Ye, Liang Lin:
Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning. CoRR abs/2403.05770 (2024) - [i282]Bingqian Lin, Yunshuang Nie, Ziming Wei, Jiaqi Chen, Shikui Ma, Jianhua Han, Hang Xu, Xiaojun Chang, Xiaodan Liang:
NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning. CoRR abs/2403.07376 (2024) - [i281]Zicheng Zhang, Tong Zhang, Yi Zhu, Jianzhuang Liu, Xiaodan Liang, Qixiang Ye, Wei Ke:
Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation. CoRR abs/2403.08426 (2024) - [i280]Minbin Huang, Yanxin Long, Xinchi Deng, Ruihang Chu, Jiangfeng Xiong, Xiaodan Liang, Hong Cheng, Qinglin Lu, Wei Liu:
DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation. CoRR abs/2403.08857 (2024) - [i279]Runhui Huang, Kaixin Cai, Jianhua Han, Xiaodan Liang, Renjing Pei, Guansong Lu, Songcen Xu, Wei Zhang, Hang Xu:
LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model. CoRR abs/2403.11929 (2024) - [i278]Sihao Lin, Pumeng Lyu, Dongrui Liu, Tao Tang, Xiaodan Liang, Andy Song, Xiaojun Chang:
MLP Can Be A Good Transformer Learner. CoRR abs/2404.05657 (2024) - [i277]Lewei Yao, Renjie Pi, Jianhua Han, Xiaodan Liang, Hang Xu, Wei Zhang, Zhenguo Li, Dan Xu:
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection. CoRR abs/2404.09216 (2024) - [i276]Jiehui Huang, Xiao Dong, Wenhui Song, Hanhui Li, Jun Zhou, Yuhao Cheng, Shutao Liao, Long Chen, Yiqiang Yan, Shengcai Liao, Xiaodan Liang:
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving. CoRR abs/2404.16771 (2024) - [i275]Junhao Cheng, Baiqiao Yin, Kaixin Cai, Minbin Huang, Hanhui Li, Yuxin He, Xi Lu, Yue Li, Yifei Li, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang:
TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation. CoRR abs/2404.18919 (2024) - [i274]Xujie Zhang, Ente Lin, Xiu Li, Yuxuan Luo, Michael Kampffmeyer, Xin Dong, Xiaodan Liang:
MMTryon: Multi-Modal Multi-Reference Control for High-Quality Fashion Generation. CoRR abs/2405.00448 (2024) - [i273]Xiaohan Lin, Qingxing Cao, Yinya Huang, Zhicheng Yang, Zhengying Liu, Zhenguo Li, Xiaodan Liang:
ATG: Benchmarking Automated Theorem Generation for Generative Language Models. CoRR abs/2405.06677 (2024) - [i272]Siyu Lou, Yuntian Chen, Xiaodan Liang, Liang Lin, Quanshi Zhang:
Quantifying In-Context Reasoning Effects and Memorization Effects in LLMs. CoRR abs/2405.11880 (2024) - [i271]Huajian Xin, Daya Guo, Zhihong Shao, Zhizhou Ren, Qihao Zhu, Bo Liu, Chong Ruan, Wenda Li, Xiaodan Liang:
DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data. CoRR abs/2405.14333 (2024) - [i270]Haiming Wang, Huajian Xin, Zhengying Liu, Wenda Li, Yinya Huang, Jianqiao Lu, Zhicheng Yang, Jing Tang, Jian Yin, Zhenguo Li, Xiaodan Liang:
Proving Theorems Recursively. CoRR abs/2405.14414 (2024) - [i269]Jian Zhao, Lei Jin, Jianshu Li, Zheng Zhu, Yinglei Teng, Jiaojiao Zhao, Sadaf Gulshad, Zheng Wang, Bo Zhao, Xiangbo Shu, Yunchao Wei, Xuecheng Nie, Xiaojie Jin, Xiaodan Liang, Shin'ichi Satoh, Yandong Guo, Cewu Lu, Junliang Xing, Jane Shengmei Shen:
The SkatingVerse Workshop & Challenge: Methods and Results. CoRR abs/2405.17188 (2024) - [i268]Jun Zheng, Fuwei Zhao, Youjiang Xu, Xin Dong, Xiaodan Liang:
VITON-DiT: Learning In-the-Wild Video Try-On from Human Dance Videos via Diffusion Transformers. CoRR abs/2405.18326 (2024) - [i267]Bingqian Lin, Yunshuang Nie, Ziming Wei, Yi Zhu, Hang Xu, Shikui Ma, Jianzhuang Liu, Xiaodan Liang:
Correctable Landmark Discovery via Large Models for Vision-Language Navigation. CoRR abs/2405.18721 (2024) - [i266]Meng Cao, Haoran Tang, Jinfa Huang, Peng Jin, Can Zhang, Ruyang Liu, Long Chen, Xiaodan Liang, Li Yuan, Ge Li:
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter. CoRR abs/2405.19465 (2024) - [i265]Junhao Cheng, Xi Lu, Hanhui Li, Khun Loun Zai, Baiqiao Yin, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang:
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation. CoRR abs/2406.01388 (2024) - [i264]Lijun Zhou, Tao Tang, Pengkun Hao, Zihang He, Kalok Ho, Shuo Gu, Wenbo Hou, Zhihui Hao, Haiyang Sun, Kun Zhan, Peng Jia, Xianpeng Lang, Xiaodan Liang:
UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking. CoRR abs/2406.02147 (2024) - [i263]Gexin Huang, Chenfei Wu, Mingjie Li, Xiaojun Chang, Ling Chen, Ying Sun, Shen Zhao, Xiaodan Liang, Liang Lin:
Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification. CoRR abs/2406.02990 (2024) - [i262]Xiaohan Lin, Qingxing Cao, Yinya Huang, Haiming Wang, Jianqiao Lu, Zhengying Liu, Linqi Song, Xiaodan Liang:
FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving. CoRR abs/2406.14408 (2024) - [i261]Sukmin Yun, Haokun Lin, Rusiru Thushara, Mohammad Qazim Bhat, Yongxin Wang, Zutao Jiang, Mingkai Deng, Jinhong Wang, Tianhua Tao, Junbo Li, Haonan Li, Preslav Nakov, Timothy Baldwin, Zhengzhong Liu, Eric P. Xing, Xiaodan Liang, Zhiqiang Shen:
Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs. CoRR abs/2406.20098 (2024) - [i260]Jiaqi Chen, Bingqian Lin, Xinmin Liu, Xiaodan Liang, Kwan-Yee K. Wong:
Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation. CoRR abs/2407.05890 (2024) - [i259]Guian Fang, Wenbiao Yan, Yuanfan Guo, Jianhua Han, Zutao Jiang, Hang Xu, Shengcai Liao, Xiaodan Liang:
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance. CoRR abs/2407.06937 (2024) - [i258]Hao Wang, Pengzhen Ren, Zequn Jie, Xiao Dong, Chengjian Feng, Yinlong Qian, Lin Ma, Dongmei Jiang, Yaowei Wang, Xiangyuan Lan, Xiaodan Liang:
OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion. CoRR abs/2407.07844 (2024) - [i257]Runhui Huang, Xinpeng Ding, Chunwei Wang, Jianhua Han, Yulong Liu, Hengshuang Zhao, Hang Xu, Lu Hou, Wei Zhang, Xiaodan Liang:
HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models. CoRR abs/2407.08706 (2024) - [i256]Zhicheng Yang, Yinya Huang, Wei Shi, Liang Feng, Linqi Song, Yiwei Wang, Xiaodan Liang, Jing Tang:
Benchmarking LLMs for Optimization Modeling and Enhancing Reasoning via Reverse Socratic Synthesis. CoRR abs/2407.09887 (2024) - [i255]Mingjie Li, Haokun Lin, Liang Qiu, Xiaodan Liang, Ling Chen, Abdulmotaleb Elsaddik, Xiaojun Chang:
Contrastive Learning with Counterfactual Explanations for Radiology Report Generation. CoRR abs/2407.14474 (2024) - [i254]Zheng Chong, Xiao Dong, Haoxiang Li, Shiyue Zhang, Wenqing Zhang, Xujie Zhang, Hanqing Zhao, Xiaodan Liang:
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models. CoRR abs/2407.15886 (2024) - [i253]Zhenyu Xie, Haoye Dong, Yufei Gao, Zehua Ma, Xiaodan Liang:
DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models. CoRR abs/2407.16511 (2024) - [i252]Yuxuan Hu, Minghuan Tan, Chenwei Zhang, Zixuan Li, Xiaodan Liang, Min Yang, Chengming Li, Xiping Hu:
APTNESS: Incorporating Appraisal Theory and Emotion Support Strategies for Empathetic Response Generation. CoRR abs/2407.21048 (2024) - [i251]Jiasong Feng, Ao Ma, Jing Wang, Bo Cheng, Xiaodan Liang, Dawei Leng, Yuhui Yin:
FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance. CoRR abs/2408.08189 (2024) - [i250]Haoran Tang, Meng Cao, Jinfa Huang, Ruyang Liu, Peng Jin, Ge Li, Xiaodan Liang:
MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval. CoRR abs/2408.10575 (2024) - [i249]Zhiqiang Wang, Hao Zheng, Yunshuang Nie, Wenjun Xu, Qingwei Wang, Hua Ye, Zhe Li, Kaidong Zhang, Xuewen Cheng, Wanxi Dong, Chang Cai, Liang Lin, Feng Zheng, Xiaodan Liang:
All Robots in One: A New Standard and Unified Dataset for Versatile, General-Purpose Embodied Agents. CoRR abs/2408.10899 (2024) - [i248]Shiyue Zhang, Zheng Chong, Xujie Zhang, Hanhui Li, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang:
GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections. CoRR abs/2408.12352 (2024) - [i247]Cong Wang, Jiaxi Gu, Panwen Hu, Haoyu Zhao, Yuanfan Guo, Jianhua Han, Hang Xu, Xiaodan Liang:
EasyControl: Transfer ControlNet to Video Diffusion for Controllable Generation and Interpolation. CoRR abs/2408.13005 (2024) - [i246]Zhijian Huang, Tao Tang, Shaoxiang Chen, Sihao Lin, Zequn Jie, Lin Ma, Guangrun Wang, Xiaodan Liang:
Making Large Language Models Better Planners with Reasoning-Decision Alignment. CoRR abs/2408.13890 (2024) - [i245]Jing Wang, Ao Ma, Jiasong Feng, Dawei Leng, Yuhui Yin, Xiaodan Liang:
Qihoo-T2X: An Efficiency-Focused Diffusion Transformer via Proxy Tokens for Text-to-Any-Task. CoRR abs/2409.04005 (2024) - [i244]Sanoojan Baliah, Qinliang Lin, Shengcai Liao, Xiaodan Liang, Muhammad Haris Khan:
Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models. CoRR abs/2409.07269 (2024) - [i243]Yuxuan Hu, Chenwei Zhang, Min Yang, Xiaodan Liang, Chengming Li, Xiping Hu:
Learning to Generalize Unseen Domains via Multi-Source Meta Learning for Text Classification. CoRR abs/2409.13787 (2024) - [i242]Kai Chen, Yunhao Gou, Runhui Huang, Zhili Liu, Daxin Tan, Jing Xu, Chunwei Wang, Yi Zhu, Yihan Zeng, Kuo Yang, Dingdong Wang, Kun Xiang, Haoyuan Li, Haoli Bai, Jianhua Han, Xiaohui Li, Weike Jin, Nian Xie, Yu Zhang, James T. Kwok, Hengshuang Zhao, Xiaodan Liang, Dit-Yan Yeung, Xiao Chen, Zhenguo Li, Wei Zhang, Qun Liu, Jun Yao, Lanqing Hong, Lu Hou, Hang Xu:
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions. CoRR abs/2409.18042 (2024) - [i241]Changlin Li, Jiawei Zhang, Sihao Lin, Zongxin Yang, Junwei Liang, Xiaodan Liang, Xiaojun Chang:
Efficient Training of Large Vision Models via Advanced Automated Progressive Learning. CoRR abs/2410.00350 (2024) - [i240]Zixuan Li, Jing Xiong, Fanghua Ye, Chuanyang Zheng, Xun Wu, Jianqiao Lu, Zhongwei Wan, Xiaodan Liang, Chengming Li, Zhenan Sun, Lingpeng Kong, Ngai Wong:
UncertaintyRAG: Span-Level Uncertainty Enhanced Long-Context Modeling for Retrieval-Augmented Generation. CoRR abs/2410.02719 (2024) - [i239]Xuan Huang, Hanhui Li, Wanquan Liu, Xiaodan Liang, Yiqiang Yan, Yuhao Cheng, Chenqiang Gao:
Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars. CoRR abs/2410.08840 (2024) - [i238]Kaidong Zhang, Pengzhen Ren, Bingqian Lin, Junfan Lin, Shikui Ma, Hang Xu, Xiaodan Liang:
PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation. CoRR abs/2410.10394 (2024) - [i237]Jianqi Chen, Panwen Hu, Xiaojun Chang, Zhenwei Shi, Michael Christian Kampffmeyer, Xiaodan Liang:
Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes. CoRR abs/2410.10790 (2024) - [i236]Meng Cao, Yuyang Liu, Yingfei Liu, Tiancai Wang, Jiahua Dong, Henghui Ding, Xiangyu Zhang, Ian Reid, Xiaodan Liang:
Continual LLaVA: Continual Instruction Tuning in Large Vision-Language Models. CoRR abs/2411.02564 (2024) - [i235]Panwen Hu, Jin Jiang, Jianqi Chen, Mingfei Han, Shengcai Liao, Xiaojun Chang, Xiaodan Liang:
StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration. CoRR abs/2411.04925 (2024) - [i234]Youpeng Wen, Junfan Lin, Yi Zhu, Jianhua Han, Hang Xu, Shen Zhao, Xiaodan Liang:
VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation. CoRR abs/2411.09153 (2024) - [i233]Yu Yan, Rongtao Xu, Jiazhao Zhang, Peiyang Li, Xiaodan Liang, Jianqin Yin:
InstruGen: Automatic Instruction Generation for Vision-and-Language Navigation Via Large Multimodal Models. CoRR abs/2411.11394 (2024) - [i232]Kun Xiang, Zhili Liu, Zihao Jiang, Yunshuang Nie, Runhui Huang, Haoxiang Fan, Hanhui Li, Weiran Huang, Yihan Zeng, Jianhua Han, Lanqing Hong, Hang Xu, Xiaodan Liang:
AtomThink: A Slow Thinking Framework for Multimodal Mathematical Reasoning. CoRR abs/2411.11930 (2024) - [i231]Meng Cao, Haoran Tang, Haoze Zhao, Hangyu Guo, Jiaheng Liu, Ge Zhang, Ruyang Liu, Qiang Sun, Ian Reid, Xiaodan Liang:
PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos. CoRR abs/2412.01800 (2024) - [i230]Yongxin Wang, Meng Cao, Haokun Lin, Mingfei Han, Liang Ma, Jin Jiang, Yuhao Cheng, Xiaodan Liang:
EACO: Enhancing Alignment in Multimodal LLMs via Critical Observation. CoRR abs/2412.04903 (2024) - [i229]Pengzhen Ren, Min Li, Zhen Luo, Xinshuai Song, Ziwei Chen, Weijia Liufu, Yixuan Yang, Hao Zheng, Rongtao Xu, Zitong Huang, Tongsheng Ding, Luyang Xie, Kaidong Zhang, Changfei Fu, Yang Liu, Liang Lin, Feng Zheng, Xiaodan Liang:
InfiniteWorld: A Unified Scalable Simulation Framework for General Visual-Language Robot Interaction. CoRR abs/2412.05789 (2024) - [i228]Zhijian Huang, Chengjian Feng, Feng Yan, Baihui Xiao, Zequn Jie, Yujie Zhong, Xiaodan Liang, Lin Ma:
DriveMM: All-in-One Large Multimodal Model for Autonomous Driving. CoRR abs/2412.07689 (2024) - [i227]Mingfei Han, Liang Ma, Kamila Zhumakhanova, Ekaterina Radionova, Jingyi Zhang, Xiaojun Chang, Xiaodan Liang, Ivan Laptev:
RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation. CoRR abs/2412.08591 (2024) - [i226]Jun Zheng, Jing Wang, Fuwei Zhao, Xujie Zhang, Xiaodan Liang:
Dynamic Try-On: Taming Video Virtual Try-on with Dynamic Attention Mechanism. CoRR abs/2412.09822 (2024) - [i225]Ente Lin, Xujie Zhang, Fuwei Zhao, Yuxuan Luo, Xin Dong, Long Zeng, Xiaodan Liang:
DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder. CoRR abs/2412.17644 (2024)
- 2023
- [j59]Qiuyan Wang, Xiaodan Liang, Rize Jin, Yang Yan:
Applications of Strongly Regular Cayley Graphs to Codebooks. IEEE Access 11: 106980-106986 (2023) - [j58]Siyi Hu, Yifan Zhong, Minquan Gao, Weixun Wang, Hao Dong, Xiaodan Liang, Zhihui Li, Xiaojun Chang, Yaodong Yang:
MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library. J. Mach. Learn. Res. 24: 315:1-315:23 (2023) - [j57]Hang Chen, Bowei Cao, Jiangcun Yang, He Ren, Xingqiu Xia, Xiaowen Zhang, Wei Yan, Xiaodan Liang, Chen Li:
Construction and effect evaluation of prediction model for red blood cell transfusion requirement in cesarean section based on artificial intelligence. BMC Medical Informatics Decis. Mak. 23(1): 213 (2023) - [j56]