default search action
Xupeng Miao
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j13]Yingxia Shao, Hongzheng Li, Xizhi Gu, Hongbo Yin, Yawen Li, Xupeng Miao, Wentao Zhang, Bin Cui, Lei Chen:
Distributed Graph Neural Network Training: A Survey. ACM Comput. Surv. 56(8): 191:1-191:39 (2024) - [c30]Zihao Yu, Haoyang Li, Fangcheng Fu, Xupeng Miao, Bin Cui:
Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion Inference. AAAI 2024: 16605-16613 - [c29]Zhengxin Zhang, Dan Zhao, Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Qing Li, Yong Jiang, Zhihao Jia:
Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models. ACL (1) 2024: 1-17 - [c28]Muyan Hu, Ashwin Venkatram, Shreyashri Biswas, Balamurugan Marimuthu, Bohan Hou, Gabriele Oliaro, Haojie Wang, Liyan Zheng, Xupeng Miao, Jidong Zhai, Zhihao Jia:
Optimal Kernel Orchestration for Tensor Programs with Korch. ASPLOS (3) 2024: 755-769 - [c27]Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Xinhao Cheng, Zeyu Wang, Zhengxin Zhang, Rae Ying Yee Wong, Alan Zhu, Lijie Yang, Xiaoxiang Shi, Chunan Shi, Zhuoming Chen, Daiyaan Arfeen, Reyna Abhyankar, Zhihao Jia:
SpecInfer: Accelerating Large Language Model Serving with Tree-based Speculative Inference and Verification. ASPLOS (3) 2024: 932-949 - [c26]Xupeng Miao, Chunan Shi, Jiangfei Duan, Xiaoli Xi, Dahua Lin, Bin Cui, Zhihao Jia:
SpotServe: Serving Generative Large Language Models on Preemptible Instances. ASPLOS (2) 2024: 1112-1127 - [c25]Peiwen Yuan, Xinglin Wang, Shaoxiong Feng, Boyuan Pan, Yiwei Li, Heda Wang, Xupeng Miao, Kan Li:
Generative Dense Retrieval: Memory Can Be a Burden. EACL (1) 2024: 2835-2845 - [c24]Zhuo Chang, Xinyi Zhang, Yang Li, Xupeng Miao, Yanzhao Qin, Bin Cui:
MFIX: An Efficient and Reliable Index Advisor via Multi-Fidelity Bayesian Optimization. ICDE 2024: 4343-4356 - [c23]Xupeng Miao, Shenhan Zhu, Fangcheng Fu, Ziyu Guo, Zhi Yang, Yaofeng Tu, Zhihao Jia, Bin Cui:
X-former Elucidator: Reviving Efficient Attention for Long Context Language Modeling. IJCAI 2024: 8179-8187 - [c22]Jiangfei Duan, Ziang Song, Xupeng Miao, Xiaoli Xi, Dahua Lin, Harry Xu, Minjia Zhang, Zhihao Jia:
Parcae: Proactive, Liveput-Optimized DNN Training on Preemptible Instances. NSDI 2024 - [c21]Xupeng Miao, Zhihao Jia, Bin Cui:
Demystifying Data Management for Large Language Models. SIGMOD Conference Companion 2024: 547-555 - [c20]Hao Ge, Fangcheng Fu, Haoyang Li, Xuanyu Wang, Sheng Lin, Yujie Wang, Xiaonan Nie, Hailin Zhang, Xupeng Miao, Bin Cui:
Enabling Parallelism Hot Switching for Efficient Training of Large Language Models. SOSP 2024: 178-194 - [i30]Zhengxin Zhang, Dan Zhao, Xupeng Miao, Gabriele Oliaro, Qing Li, Yong Jiang, Zhihao Jia:
Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models. CoRR abs/2401.07159 (2024) - [i29]Peiwen Yuan, Xinglin Wang, Shaoxiong Feng, Boyuan Pan, Yiwei Li, Heda Wang, Xupeng Miao, Kan Li:
Generative Dense Retrieval: Memory Can Be a Burden. CoRR abs/2401.10487 (2024) - [i28]Xupeng Miao, Gabriele Oliaro, Xinhao Cheng, Mengdi Wu, Colin Unger, Zhihao Jia:
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning. CoRR abs/2402.18789 (2024) - [i27]Jiangfei Duan, Ziang Song, Xupeng Miao, Xiaoli Xi, Dahua Lin, Harry Xu, Minjia Zhang, Zhihao Jia:
Parcae: Proactive, Liveput-Optimized DNN Training on Preemptible Instances. CoRR abs/2403.14097 (2024) - [i26]Yixuan Mei, Yonghao Zhuang, Xupeng Miao, Juncheng Yang, Zhihao Jia, Rashmi Vinayak:
Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs. CoRR abs/2406.01566 (2024) - [i25]Muyan Hu, Ashwin Venkatram, Shreyashri Biswas, Balamurugan Marimuthu, Bohan Hou, Gabriele Oliaro, Haojie Wang, Liyan Zheng, Xupeng Miao, Jidong Zhai:
Optimal Kernel Orchestration for Tensor Programs with Korch. CoRR abs/2406.09465 (2024) - [i24]Byungsoo Jeon, Mengdi Wu, Shiyi Cao, Sunghyun Kim, Sunghyun Park, Neeraj Aggarwal, Colin Unger, Daiyaan Arfeen, Peiyuan Liao, Xupeng Miao, Mohammad Alizadeh, Gregory R. Ganger, Tianqi Chen, Zhihao Jia:
GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism. CoRR abs/2406.17145 (2024) - [i23]Hailin Zhang, Xiaodong Ji, Yilin Chen, Fangcheng Fu, Xupeng Miao, Xiaonan Nie, Weipeng Chen, Bin Cui:
PQCache: Product Quantization-based KVCache for Long Context LLM Inference. CoRR abs/2407.12820 (2024) - [i22]Mingkuan Xu, Shiyi Cao, Xupeng Miao, Umut A. Acar, Zhihao Jia:
Atlas: Hierarchical Partitioning for Quantum Circuit Simulation on GPUs (Extended Version). CoRR abs/2408.09055 (2024) - [i21]Yujie Wang, Shenhan Zhu, Fangcheng Fu, Xupeng Miao, Jie Zhang, Juan Zhu, Fan Hong, Yong Li, Bin Cui:
Efficient Multi-Task Large Model Training via Data Heterogeneity-aware Model Management. CoRR abs/2409.03365 (2024) - 2023
- [j12]Xupeng Miao, Xiaonan Nie, Hailin Zhang, Tong Zhao, Bin Cui:
Hetu: a highly efficient automatic parallel distributed deep learning system. Sci. China Inf. Sci. 66(1) (2023) - [j11]Xiaonan Nie, Xupeng Miao, Zilong Wang, Zichao Yang, Jilong Xue, Lingxiao Ma, Gang Cao, Bin Cui:
FlexMoE: Scaling Large-scale Sparse Pre-trained Model Training via Dynamic Device Placement. Proc. ACM Manag. Data 1(1): 110:1-110:19 (2023) - [j10]Xupeng Miao, Yining Shi, Zhi Yang, Bin Cui, Zhihao Jia:
SDPipe: A Semi-Decentralized Framework for Heterogeneity-aware Pipeline-parallel Training. Proc. VLDB Endow. 16(9): 2354-2363 (2023) - [j9]Xiaonan Nie, Yi Liu, Fangcheng Fu, Jinbao Xue, Dian Jiao, Xupeng Miao, Yangyu Tao, Bin Cui:
Angel-PTM: A Scalable and Economical Large-scale Pre-training System in Tencent. Proc. VLDB Endow. 16(12): 3781-3794 (2023) - [j8]Hailin Zhang, Penghao Zhao, Xupeng Miao, Yingxia Shao, Zirui Liu, Tong Yang, Bin Cui:
Experimental Analysis of Large-scale Learnable Vector Storage Compression. Proc. VLDB Endow. 17(4): 808-822 (2023) - [j7]Xupeng Miao, Wentao Zhang, Yingxia Shao, Bin Cui, Lei Chen, Ce Zhang, Jiawei Jiang:
Lasagne: A Multi-Layer Graph Convolutional Network Framework via Node-Aware Deep Architecture. IEEE Trans. Knowl. Data Eng. 35(2): 1721-1733 (2023) - [j6]Xupeng Miao, Wentao Zhang, Yuezihan Jiang, Fangcheng Fu, Yingxia Shao, Lei Chen, Yangyu Tao, Gang Cao, Bin Cui:
P2CG: a privacy preserving collaborative graph neural network training framework. VLDB J. 32(4): 717-736 (2023) - [c19]Ziyu Guo, Renrui Zhang, Longtian Qiu, Xianzheng Ma, Xupeng Miao, Xuming He, Bin Cui:
CALIP: Zero-Shot Enhancement of CLIP with Parameter-Free Attention. AAAI 2023: 746-754 - [c18]Youhe Jiang, Fangcheng Fu, Xupeng Miao, Xiaonan Nie, Bin Cui:
OSDP: Optimal Sharded Data Parallel for Distributed Deep Learning. IJCAI 2023: 2142-2150 - [c17]Hailin Zhang, Yujing Wang, Qi Chen, Ruiheng Chang, Ting Zhang, Ziming Miao, Yingyan Hou, Yang Ding, Xupeng Miao, Haonan Wang, Bochen Pang, Yuefeng Zhan, Hao Sun, Weiwei Deng, Qi Zhang, Fan Yang, Xing Xie, Mao Yang, Bin Cui:
Model-enhanced Vector Index. NeurIPS 2023 - [c16]Liyan Zheng, Haojie Wang, Jidong Zhai, Muyan Hu, Zixuan Ma, Tuowei Wang, Shuhong Huang, Xupeng Miao, Shizhi Tang, Kezhao Huang, Zhihao Jia:
EINNET: Optimizing Tensor Programs with Derivation-Based Transformations. OSDI 2023: 739-755 - [i20]Xiaonan Nie, Yi Liu, Fangcheng Fu, Jinbao Xue, Dian Jiao, Xupeng Miao, Yangyu Tao, Bin Cui:
Angel-PTM: A Scalable and Economical Large-scale Pre-training System in Tencent. CoRR abs/2303.02868 (2023) - [i19]Xiaonan Nie, Xupeng Miao, Zilong Wang, Zichao Yang, Jilong Xue, Lingxiao Ma, Gang Cao, Bin Cui:
FlexMoE: Scaling Large-scale Sparse Pre-trained Model Training via Dynamic Device Placement. CoRR abs/2304.03946 (2023) - [i18]Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Xinhao Cheng, Zeyu Wang, Rae Ying Yee Wong, Zhuoming Chen, Daiyaan Arfeen, Reyna Abhyankar, Zhihao Jia:
SpecInfer: Accelerating Generative LLM Serving with Speculative Inference and Token Tree Verification. CoRR abs/2305.09781 (2023) - [i17]Zihao Yu, Haoyang Li, Fangcheng Fu, Xupeng Miao, Bin Cui:
FISEdit: Accelerating Text-to-image Editing via Cache-enabled Sparse Diffusion Inference. CoRR abs/2305.17423 (2023) - [i16]Yujie Wang, Youhe Jiang, Xupeng Miao, Fangcheng Fu, Xiaonan Nie, Bin Cui:
Improving Automatic Parallel Training via Balanced Memory Workload Optimization. CoRR abs/2307.02031 (2023) - [i15]Hailin Zhang, Yujing Wang, Qi Chen, Ruiheng Chang, Ting Zhang, Ziming Miao, Yingyan Hou, Yang Ding, Xupeng Miao, Haonan Wang, Bochen Pang, Yuefeng Zhan, Hao Sun, Weiwei Deng, Qi Zhang, Fan Yang, Xing Xie, Mao Yang, Bin Cui:
Model-enhanced Vector Index. CoRR abs/2309.13335 (2023) - [i14]Xupeng Miao, Chunan Shi, Jiangfei Duan, Xiaoli Xi, Dahua Lin, Bin Cui, Zhihao Jia:
SpotServe: Serving Generative Large Language Models on Preemptible Instances. CoRR abs/2311.15566 (2023) - [i13]Hailin Zhang, Penghao Zhao, Xupeng Miao, Yingxia Shao, Zirui Liu, Tong Yang, Bin Cui:
Experimental Analysis of Large-scale Learnable Vector Storage Compression. CoRR abs/2311.15578 (2023) - [i12]Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Xinhao Cheng, Hongyi Jin, Tianqi Chen, Zhihao Jia:
Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems. CoRR abs/2312.15234 (2023) - 2022
- [j5]Fangcheng Fu, Xupeng Miao, Jiawei Jiang, Huanran Xue, Bin Cui:
Towards Communication-efficient Vertical Federated Learning Training via Cache-enabled Local Update. Proc. VLDB Endow. 15(10): 2111-2120 (2022) - [j4]Xupeng Miao, Yujie Wang, Youhe Jiang, Chunan Shi, Xiaonan Nie, Hailin Zhang, Bin Cui:
Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism. Proc. VLDB Endow. 16(3): 470-479 (2022) - [j3]Xupeng Miao, Lingxiao Ma, Zhi Yang, Yingxia Shao, Bin Cui, Lele Yu, Jiawei Jiang:
CuWide: Towards Efficient Flow-Based Training for Sparse Wide Models on GPUs. IEEE Trans. Knowl. Data Eng. 34(9): 4119-4132 (2022) - [c15]Hongbo Yin, Yingxia Shao, Xupeng Miao, Yawen Li, Bin Cui:
Scalable Graph Sampling on GPUs with Compressed Graph. CIKM 2022: 2383-2392 - [c14]Renrui Zhang, Ziyu Guo, Wei Zhang, Kunchang Li, Xupeng Miao, Bin Cui, Yu Qiao, Peng Gao, Hongsheng Li:
PointCLIP: Point Cloud Understanding by CLIP. CVPR 2022: 8542-8552 - [c13]Xupeng Miao, Wentao Zhang, Yingxia Shao, Bin Cui, Lei Chen, Ce Zhang, Jiawei Jiang:
Lasagne: A Multi-Layer Graph Convolutional Network Framework via Node-aware Deep Architecture (Extended Abstract). ICDE 2022: 1561-1562 - [c12]Sicong Dong, Xupeng Miao, Pengkai Liu, Xin Wang, Bin Cui, Jianxin Li:
HET-KG: Communication-Efficient Knowledge Graph Embedding Training via Hotness-Aware Cache. ICDE 2022: 1754-1766 - [c11]Yuezihan Jiang, Yu Cheng, Hanyu Zhao, Wentao Zhang, Xupeng Miao, Yu He, Liang Wang, Zhi Yang, Bin Cui:
Zoomer: Boosting Retrieval on Web-scale Graphs by Regions of Interest. ICDE 2022: 2224-2236 - [c10]Xiaonan Nie, Xupeng Miao, Zhi Yang, Bin Cui:
TSPLIT: Fine-grained GPU Memory Management for Efficient DNN Training via Tensor Splitting. ICDE 2022: 2615-2628 - [c9]Xupeng Miao, Yining Shi, Hailin Zhang, Xin Zhang, Xiaonan Nie, Zhi Yang, Bin Cui:
HET-GMP: A Graph-based System Approach to Scaling Large Embedding Model Training. SIGMOD Conference 2022: 470-480 - [i11]Yuezihan Jiang, Yu Cheng, Hanyu Zhao, Wentao Zhang, Xupeng Miao, Yu He, Liang Wang, Zhi Yang, Bin Cui:
ZOOMER: Boosting Retrieval on Web-scale Graphs by Regions of Interest. CoRR abs/2203.12596 (2022) - [i10]Xiaonan Nie, Pinxue Zhao, Xupeng Miao, Tong Zhao, Bin Cui:
HetuMoE: An Efficient Trillion-scale Mixture-of-Expert Distributed Training System. CoRR abs/2203.14685 (2022) - [i9]Fangcheng Fu, Xupeng Miao, Jiawei Jiang, Huanran Xue, Bin Cui:
Towards Communication-efficient Vertical Federated Learning Training via Cache-enabled Local Updates. CoRR abs/2207.14628 (2022) - [i8]Youhe Jiang, Xupeng Miao, Xiaonan Nie, Bin Cui:
OSDP: Optimal Sharded Data Parallel for Distributed Deep Learning. CoRR abs/2209.13258 (2022) - [i7]Ziyu Guo, Renrui Zhang, Longtian Qiu, Xianzheng Ma, Xupeng Miao, Xuming He, Bin Cui:
CALIP: Zero-Shot Enhancement of CLIP with Parameter-free Attention. CoRR abs/2209.14169 (2022) - [i6]Yingxia Shao, Hongzheng Li, Xizhi Gu, Hongbo Yin, Yawen Li, Xupeng Miao, Wentao Zhang, Bin Cui, Lei Chen:
Distributed Graph Neural Network Training: A Survey. CoRR abs/2211.00216 (2022) - [i5]Xupeng Miao, Yujie Wang, Youhe Jiang, Chunan Shi, Xiaonan Nie, Hailin Zhang, Bin Cui:
Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism. CoRR abs/2211.13878 (2022) - 2021
- [j2]Xupeng Miao, Hailin Zhang, Yining Shi, Xiaonan Nie, Zhi Yang, Yangyu Tao, Bin Cui:
HET: Scaling out Huge Embedding Model Training via Cache-enabled Distributed Framework. Proc. VLDB Endow. 15(2): 312-320 (2021) - [j1]Yingxia Shao, Shiyue Huang, Yawen Li, Xupeng Miao, Bin Cui, Lei Chen:
Memory-aware framework for fast and scalable second-order random walk over billion-edge natural graphs. VLDB J. 30(5): 769-797 (2021) - [c8]Xupeng Miao, Lingxiao Ma, Zhi Yang, Yingxia Shao, Bin Cui, Lele Yu, Jiawei Jiang:
CuWide: Towards Efficient Flow-based Training for Sparse Wide Models on GPUs (Extended Abstract). ICDE 2021: 2330-2331 - [c7]Xupeng Miao, Nezihe Merve Gürel, Wentao Zhang, Zhichao Han, Bo Li, Wei Min, Susie Xi Rao, Hansheng Ren, Yinan Shan, Yingxia Shao, Yujie Wang, Fan Wu, Hui Xue, Yaming Yang, Zitao Zhang, Yang Zhao, Shuai Zhang, Yujing Wang, Bin Cui, Ce Zhang:
DeGNN: Improving Graph Neural Networks with Graph Decomposition. KDD 2021: 1223-1233 - [c6]Wentao Zhang, Yuezihan Jiang, Yang Li, Zeang Sheng, Yu Shen, Xupeng Miao, Liang Wang, Zhi Yang, Bin Cui:
ROD: Reception-aware Online Distillation for Sparse Graphs. KDD 2021: 2232-2242 - [c5]Xupeng Miao, Xiaonan Nie, Yingxia Shao, Zhi Yang, Jiawei Jiang, Lingxiao Ma, Bin Cui:
Heterogeneity-Aware Distributed Machine Learning Training via Partial Reduce. SIGMOD Conference 2021: 2262-2270 - [i4]Wentao Zhang, Yuezihan Jiang, Yang Li, Zeang Sheng, Yu Shen, Xupeng Miao, Liang Wang, Zhi Yang, Bin Cui:
ROD: Reception-aware Online Distillation for Sparse Graphs. CoRR abs/2107.11789 (2021) - [i3]Renrui Zhang, Ziyu Guo, Wei Zhang, Kunchang Li, Xupeng Miao, Bin Cui, Yu Qiao, Peng Gao, Hongsheng Li:
PointCLIP: Point Cloud Understanding by CLIP. CoRR abs/2112.02413 (2021) - [i2]Xupeng Miao, Hailin Zhang, Yining Shi, Xiaonan Nie, Zhi Yang, Yangyu Tao, Bin Cui:
HET: Scaling out Huge Embedding Model Training via Cache-enabled Distributed Framework. CoRR abs/2112.07221 (2021) - [i1]Xiaonan Nie, Shijie Cao, Xupeng Miao, Lingxiao Ma, Jilong Xue, Youshan Miao, Zichao Yang, Zhi Yang, Bin Cui:
Dense-to-Sparse Gate for Mixture-of-Experts. CoRR abs/2112.14397 (2021) - 2020
- [c4]Jiawei Jiang, Pin Xiao, Lele Yu, Xiaosen Li, Jiefeng Cheng, Xupeng Miao, Zhipeng Zhang, Bin Cui:
PSGraph: How Tencent trains extremely large-scale graphs with Spark? ICDE 2020: 1549-1557 - [c3]Wentao Zhang, Xupeng Miao, Yingxia Shao, Jiawei Jiang, Lei Chen, Olivier Ruas, Bin Cui:
Reliable Data Distillation on Graph Convolutional Network. SIGMOD Conference 2020: 1399-1414 - [c2]Yingxia Shao, Shiyue Huang, Xupeng Miao, Bin Cui, Lei Chen:
Memory-Aware Framework for Efficient Second-Order Random Walk on Large Graphs. SIGMOD Conference 2020: 1797-1812
2010 – 2019
- 2019
- [c1]Zhipeng Zhang, Bin Cui, Yingxia Shao, Lele Yu, Jiawei Jiang, Xupeng Miao:
PS2: Parameter Server on Spark. SIGMOD Conference 2019: 376-388
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-02 21:25 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint