


Shiyi Cao
2020 – today
- 2025
- [c14] Byungsoo Jeon, Mengdi Wu, Shiyi Cao, Sunghyun Kim, Sunghyun Park, Neeraj Aggarwal, Colin Unger, Daiyaan Arfeen, Peiyuan Liao, Xupeng Miao, Mohammad Alizadeh, Gregory R. Ganger, Tianqi Chen, Zhihao Jia:
GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism. ASPLOS (1) 2025: 557-571
- [c13] Shiyi Cao, Shu Liu, Tyler Griggs, Peter Schafhalter, Xiaoxuan Liu, Ying Sheng, Joseph E. Gonzalez, Matei Zaharia, Ion Stoica:
MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs. ASPLOS (1) 2025: 715-730
- [i14] Shiyi Cao, Yichuan Wang, Ziming Mao, Pin-Lun Hsu, Liangsheng Yin, Tian Xia, Dacheng Li, Shu Liu, Yineng Zhang, Yang Zhou, Ying Sheng, Joseph Gonzalez, Ion Stoica:
Locality-aware Fair Scheduling in LLM Serving. CoRR abs/2501.14312 (2025)
- [i13] Dacheng Li, Shiyi Cao, Tyler Griggs, Shu Liu, Xiangxi Mo, Eric Tang, Sumanth Hegde, Kourosh Hakhamaneshi, Shishir G. Patil, Matei Zaharia, Joseph E. Gonzalez, Ion Stoica:
LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters! CoRR abs/2502.07374 (2025)
- [i12] Dacheng Li, Shiyi Cao, Chengkun Cao, Xiuyu Li, Shangyin Tan, Kurt Keutzer, Jiarong Xing, Joseph E. Gonzalez, Ion Stoica:
S*: Test Time Scaling for Code Generation. CoRR abs/2502.14382 (2025)
- [i11] Dacheng Li, Yunhao Fang, Yukang Chen, Shuo Yang, Shiyi Cao, Justin Wong, Michael Luo, Xiaolong Wang, Hongxu Yin, Joseph E. Gonzalez, Ion Stoica, Song Han, Yao Lu:
WorldModelBench: Judging Video Generation Models As World Models. CoRR abs/2502.20694 (2025)
- 2024
- [c12] Ying Sheng, Shiyi Cao, Dacheng Li, Coleman Hooper, Nicholas Lee, Shuo Yang, Christopher Chou, Banghua Zhu, Lianmin Zheng, Kurt Keutzer, Joseph Gonzalez, Ion Stoica:
SLoRA: Scalable Serving of Thousands of LoRA Adapters. MLSys 2024
- [c11] Ling Yang, Zhaochen Yu, Tianjun Zhang, Shiyi Cao, Minkai Xu, Wentao Zhang, Joseph E. Gonzalez, Bin Cui:
Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models. NeurIPS 2024
- [c10] Lianmin Zheng, Liangsheng Yin, Zhiqiang Xie, Chuyue Sun, Jeff Huang, Cody Hao Yu, Shiyi Cao, Christos Kozyrakis, Ion Stoica, Joseph E. Gonzalez, Clark W. Barrett, Ying Sheng:
SGLang: Efficient Execution of Structured Language Model Programs. NeurIPS 2024
- [c9] Ying Sheng, Shiyi Cao, Dacheng Li, Banghua Zhu, Zhuohan Li, Danyang Zhuo, Joseph E. Gonzalez, Ion Stoica:
Fairness in Serving Large Language Models. OSDI 2024: 965-988
- [c8] Mingkuan Xu, Shiyi Cao, Xupeng Miao, Umut A. Acar, Zhihao Jia:
Atlas: Hierarchical Partitioning for Quantum Circuit Simulation on GPUs. SC 2024: 81
- [i10] Ying Sheng, Shiyi Cao, Dacheng Li, Banghua Zhu, Zhuohan Li, Danyang Zhuo, Joseph E. Gonzalez, Ion Stoica:
Fairness in Serving Large Language Models. CoRR abs/2401.00588 (2024)
- [i9] Shu Liu, Asim Biswal, Audrey Cheng, Xiangxi Mo, Shiyi Cao, Joseph E. Gonzalez, Ion Stoica, Matei Zaharia:
Optimizing LLM Queries in Relational Workloads. CoRR abs/2403.05821 (2024)
- [i8] Ling Yang, Zhaochen Yu, Tianjun Zhang, Shiyi Cao, Minkai Xu, Wentao Zhang, Joseph E. Gonzalez, Bin Cui:
Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models. CoRR abs/2406.04271 (2024)
- [i7] Byungsoo Jeon, Mengdi Wu, Shiyi Cao, Sunghyun Kim, Sunghyun Park, Neeraj Aggarwal, Colin Unger, Daiyaan Arfeen, Peiyuan Liao, Xupeng Miao, Mohammad Alizadeh, Gregory R. Ganger, Tianqi Chen, Zhihao Jia:
GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism. CoRR abs/2406.17145 (2024)
- [i6] Mingkuan Xu, Shiyi Cao, Xupeng Miao, Umut A. Acar, Zhihao Jia:
Atlas: Hierarchical Partitioning for Quantum Circuit Simulation on GPUs (Extended Version). CoRR abs/2408.09055 (2024)
- [i5] Xuanlin Jiang, Yang Zhou, Shiyi Cao, Ion Stoica, Minlan Yu:
NEO: Saving GPU Memory Crisis with CPU Offloading for Online LLM Inference. CoRR abs/2411.01142 (2024)
- [i4] Shiyi Cao, Shu Liu, Tyler Griggs, Peter Schafhalter, Xiaoxuan Liu, Ying Sheng, Joseph E. Gonzalez, Matei Zaharia, Ion Stoica:
MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs. CoRR abs/2411.11217 (2024)
- [i3] Zhijian Liu, Ligeng Zhu, Baifeng Shi, Zhuoyang Zhang, Yuming Lou, Shang Yang, Haocheng Xi, Shiyi Cao, Yuxian Gu, Dacheng Li, Xiuyu Li, Yunhao Fang, Yukang Chen, Cheng-Yu Hsieh, De-An Huang, An-Chieh Cheng, Vishwesh Nath, Jinyi Hu, Sifei Liu, Ranjay Krishna, Daguang Xu, Xiaolong Wang, Pavlo Molchanov, Jan Kautz, Hongxu Yin, Song Han, Yao Lu:
NVILA: Efficient Frontier Visual Language Models. CoRR abs/2412.04468 (2024)
- 2023
- [j2] Shiyi Cao, Xijun Hu, Yezi Wang, Cunyou Chen, Dong Xu, Tingting Bai:
Understanding Spatial-Temporal Interactions of Ecosystem Services and Their Drivers in a Multi-Scale Perspective of Miluo Using Multi-Source Remote Sensing Data. Remote. Sens. 15(14): 3479 (2023)
- [i2] Ying Sheng, Shiyi Cao, Dacheng Li, Coleman Hooper, Nicholas Lee, Shuo Yang, Christopher Chou, Banghua Zhu, Lianmin Zheng, Kurt Keutzer, Joseph E. Gonzalez, Ion Stoica:
S-LoRA: Serving Thousands of Concurrent LoRA Adapters. CoRR abs/2311.03285 (2023)
- [i1] Lianmin Zheng, Liangsheng Yin, Zhiqiang Xie, Jeff Huang, Chuyue Sun, Cody Hao Yu, Shiyi Cao, Christos Kozyrakis, Ion Stoica, Joseph E. Gonzalez, Clark W. Barrett, Ying Sheng:
Efficiently Programming Large Language Models using SGLang. CoRR abs/2312.07104 (2023)
- 2022
- [c7] Shiyi Cao, Salvatore Di Girolamo, Torsten Hoefler:
Accelerating Data Serialization/Deserialization Protocols with In-Network Compute. ExaMPI@SC 2022: 22-30
- [c6] Xiuqi Huang, Shiyi Cao, Yuanning Gao, Xiaofeng Gao, Guihai Chen:
LightPro: Lightweight Probabilistic Workload Prediction Framework for Database-as-a-Service. ICWS 2022: 160-169
- [c5] Yanze Wang, Tianyu Gao, Yaping Liu, Tao Xu, Wenbo Yu, Zhiqun Yang, Qiang Guo, Rui Zhou, Shiyi Cao, Xinhua Xiao, Lin Zhang:
Novel Mirror-flipped Mode Permutation Technique for Long-haul Mode-division Multiplexing Transmissions. OFC 2022: 1-3
- [c4] Linbo Yang, Zhiqun Yang, Tao Xu, Lijie Hou, Rui Zhou, Lin Gan, Shiyi Cao, Xinhua Xiao, Lin Zhang:
Low-loss Mode Field Adapter Using Reverse Tapering for Fundamental Mode Transmission over MMFs. OFC 2022: 1-3
2010 – 2019
- 2019
- [c3] Shiyi Cao, Yuanning Gao, Xiaofeng Gao, Guihai Chen:
AdaM: An Adaptive Fine-Grained Scheme for Distributed Metadata Management. ICPP 2019: 37:1-37:10
- 2014
- [c2] Liangjia Zong, Han Zhao, Zhiyong Feng, Shiyi Cao:
Demonstration of ultra-compact contentionless-ROADM based on flexible wavelength router. ECOC 2014: 1-3
- 2013
- [j1] Shiyi Cao, Feng Wang, Wilson Tam, Lap Ah Tse, Jean Hee Kim, Junan Liu, Zuxun Lu:
A hybrid seasonal prediction model for tuberculosis incidence in China. BMC Medical Informatics Decis. Mak. 13: 56 (2013)
- [c1] Bo Wu, Shaofeng Qiu, Zhiyong Feng, Shiyi Cao, Han Zhao, Junling Xiang, Chiwu Ding, Gordon Ning Liu, Ning Deng, Qianjin Xiong:
Green and agile petabit optical sub-wavelength switching prototype for the future OTN multi-chassis switch cluster. OFC/NFOEC 2013: 1-3
last updated on 2025-05-03 00:04 CEST by the dblp team
all metadata released as open data under CC0 1.0 license