


Yizhou Shan
2020 – today
- 2025
[j1]Cunchen Hu, Heyang Huang, Liangliang Xu, Xusheng Chen, Chenxi Wang, Jiang Xu, Shuang Chen, Hao Feng, Sa Wang, Yungang Bao, Ninghui Sun, Yizhou Shan:
ShuffleInfer: Disaggregate LLM Inference for Mixed Downstream Workloads. ACM Trans. Archit. Code Optim. 22(2): 77:1-77:24 (2025)
[c22]Junhao Hu, Wenrui Huang, Weidong Wang, Zhenwen Li, Tiancheng Hu, Zhixia Liu, Xusheng Chen, Tao Xie, Yizhou Shan:
RaaS: Reasoning-Aware Attention Sparsity for Efficient LLM Reasoning. ACL (Findings) 2025: 2577-2590
[c21]Xiurui Pan, Endian Li, Qiao Li, Shengwen Liang, Yizhou Shan, Ke Zhou, Yingwei Luo, Xiaolin Wang, Jie Zhang:
InstAttention: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference. HPCA 2025: 1510-1525
[c20]Junhao Hu, Wenrui Huang, Weidong Wang, Haoyi Wang, Tiancheng Hu, Qin Zhang, Hao Feng, Xusheng Chen, Yizhou Shan, Tao Xie:
EPIC: Efficient Position-Independent Caching for Serving Large Language Models. ICML 2025
[c19]Quanxi Li, Hong Huang, Ying Liu, Yanwen Xia, Jie Zhang, Mosong Zhou, Xiaobing Feng, Huimin Cui, Quan Chen, Yizhou Shan, Chenxi Wang:
Beehive: A Scalable Disaggregated Memory Runtime Exploiting Asynchrony of Multithreaded Programs. NSDI 2025: 167-187
[c18]Dingyan Zhang, Haotian Wang, Yang Liu, Xingda Wei, Yizhou Shan, Rong Chen, Haibo Chen:
BlitzScale: Fast and Live Large Model Autoscaling with O(1) Host Caching. OSDI 2025: 275-293
[c17]Guangda Liu, Chenqi Zhang, Yizhou Shan, Hao Feng, Zeke Wang, Shixuan Sun, Minyi Guo, Jieru Zhao:
DHAP: Towards Efficient OLAP in a Disaggregated and Heterogeneous Environment. SC 2025: 2233-2250
[c16]Junhao Hu, Jiang Xu, Zhixia Liu, Yulong He, Yuetao Chen, Hao Xu, Jiang Liu, Jie Meng, Baoquan Zhang, Shining Wan, Gengyuan Dan, Zhiyu Dong, Zhihao Ren, Changhong Liu, Tao Xie, Dayun Lin, Qin Zhang, Yue Yu, Hao Feng, Xusheng Chen, Yizhou Shan:
DEEPSERVE: Serverless Large Language Model Serving at Scale. USENIX ATC 2025: 57-72
[c15]Suyi Li, Hanfeng Lu, Tianyuan Wu, Minchen Yu, Qizhen Weng, Xusheng Chen, Yizhou Shan, Binhang Yuan, Wei Wang:
Toppings: CPU-Assisted, Rank-Aware Adapter Serving for LLM Inference. USENIX ATC 2025: 613-629
[c14]Xu Zhang, Ke Liu, Yuan Hui, Xiaolong Zheng, Yisong Chang, Yizhou Shan, Guanghui Zhang, Ke Zhang, Yungang Bao, Mingyu Chen, Chenxi Wang:
DRack: A CXL-Disaggregated Rack Architecture to Boost Inter-Rack Communication. USENIX ATC 2025: 1261-1279
[i16]Junhao Hu, Jiang Xu, Zhixia Liu, Yulong He, Yuetao Chen, Hao Xu, Jiang Liu, Baoquan Zhang, Shining Wan, Gengyuan Dan, Zhiyu Dong, Zhihao Ren, Jie Meng, Chao He, Changhong Liu, Tao Xie, Dayun Lin, Qin Zhang, Yue Yu, Hao Feng, Xusheng Chen, Yizhou Shan:
DeepFlow: Serverless Large Language Model Serving at Scale. CoRR abs/2501.14417 (2025)
[i15]Junhao Hu, Wenrui Huang, Weidong Wang, Zhenwen Li, Tiancheng Hu, Zhixia Liu, Xusheng Chen, Tao Xie, Yizhou Shan:
Efficient Long-Decoding Inference with Reasoning-Aware Attention Sparsity. CoRR abs/2502.11147 (2025)
[i14]Hang Zhang, Jiuchen Shi, Yixiao Wang, Quan Chen, Yizhou Shan, Minyi Guo:
Improving the Serving Performance of Multi-LoRA Large Language Models via Efficient LoRA and KV Cache Management. CoRR abs/2505.03756 (2025)
[i13]Heyang Huang, Cunchen Hu, Jiaqi Zhu, Ziyuan Gao, Liangliang Xu, Yizhou Shan, Yungang Bao, Ninghui Sun, Tianwei Zhang, Sa Wang:
DDiT: Dynamic Resource Allocation for Diffusion Transformer Model Serving. CoRR abs/2506.13497 (2025)
[i12]Yifei Liu, Zuo Gan, Zhenghao Gan, Weiye Wang, Chen Chen, Yizhou Shan, Xusheng Chen, Zhenhua Han, Yifei Zhu, Shixuan Sun, Minyi Guo:
Efficient Serving of LLM Applications with Probabilistic Demand Modeling. CoRR abs/2506.14851 (2025)
[i11]Ao Xiao, Bangzheng He, Baoquan Zhang, Baoxing Huai, Bingji Wang, Bo Wang, Bo Xu, Boyi Hou, Chan Yang, Changhong Liu, Cheng Cui, Chenyu Zhu, Cong Feng, Daohui Wang, Dayun Lin, Duo Zhao, Fengshao Zou, Fu Wang, Gangqiang Zhang, Gengyuan Dan, Guanjie Chen, Guodong Guan, Guodong Yang, Haifeng Li, Haipei Zhu, Haley Li, Hao Feng, Hao Huang, Hao Xu, Hengrui Ma, Hengtao Fan, Hui Liu, Jia Li, Jiang Liu, Jiang Xu, Jie Meng, Jinhan Xin, Junhao Hu, Juwei Chen, Lan Yu, Lanxin Miao, Liang Liu, Linan Jing, Lu Zhou, Meina Han, Mingkun Deng, Mingyu Deng, Naitian Deng, Nizhong Lin, Peihan Zhao, Peng Pan, Pengfei Shen, Ping Li, Qi Zhang, Qian Wang, Qin Zhang, Qingrong Xia, Qingyi Zhang, Qunchao Fu, Ren Guo, Ruimin Gao, Shaochun Li, Sheng Long, Shentian Li, Shining Wan, Shuai Shen, Shuangfu Zeng, Shuming Jing, Siqi Yang, Song Zhang, Tao Xu, Tianlin Du, Ting Chen, Wanxu Wu, Wei Jiang, Weinan Tong, Weiwei Chen, Wen Peng, Wenli Zhou, Wenquan Yang, Wenxin Liang, Xiang Liu, Xiaoli Zhou, Xin Jin, Xinyu Duan, Xu Li, Xu Zhang, Xusheng Chen, Yalong Shan, Yang Gan, Yao Lu, Yi Deng, Yi Zheng, Ying Xiong, Yingfei Zheng, Yiyun Zheng, Yizhou Shan, Yong Gao, Yong Zhang, Yongqiang Yang, Yuanjin Gong:
xDeepServe: Model-as-a-Service on Huawei CloudMatrix384. CoRR abs/2508.02520 (2025)
- 2024
[c13]Will Lin, Yizhou Shan, Ryan Kosta, Arvind Krishnamurthy, Yiying Zhang:
SuperNIC: An FPGA-Based, Cloud-Oriented SmartNIC. FPGA 2024: 130-141
[i10]Cunchen Hu, Heyang Huang, Liangliang Xu, Xusheng Chen, Jiang Xu, Shuang Chen, Hao Feng, Chenxi Wang, Sa Wang, Yungang Bao, Ninghui Sun, Yizhou Shan:
Inference without Interference: Disaggregate LLM Inference for Mixed Downstream Workloads. CoRR abs/2401.11181 (2024)
[i9]Suyi Li, Hanfeng Lu, Tianyuan Wu, Minchen Yu, Qizhen Weng, Xusheng Chen, Yizhou Shan, Binhang Yuan, Wei Wang:
CaraServe: CPU-Assisted and Rank-Aware LoRA Serving for Generative LLM Inference. CoRR abs/2401.11240 (2024)
[i8]Pai Zeng, Zhenyu Ning, Jieru Zhao, Weihao Cui, Mengwei Xu, Liwei Guo, Xusheng Chen, Yizhou Shan:
The CAP Principle for LLM Serving: A Survey of Long-Context Large Language Model Serving. CoRR abs/2405.11299 (2024)
[i7]Cunchen Hu, Heyang Huang, Junhao Hu, Jiang Xu, Xusheng Chen, Tao Xie, Chenxi Wang, Sa Wang, Yungang Bao, Ninghui Sun, Yizhou Shan:
MemServe: Context Caching for Disaggregated LLM Serving with Elastic Memory Pool. CoRR abs/2406.17565 (2024)
[i6]Xiurui Pan, Endian Li, Qiao Li, Shengwen Liang, Yizhou Shan, Ke Zhou, Yingwei Luo, Xiaolin Wang, Jie Zhang:
InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference. CoRR abs/2409.04992 (2024)
[i5]Junhao Hu, Wenrui Huang, Haoyi Wang, Weidong Wang, Tiancheng Hu, Qin Zhang, Hao Feng, Xusheng Chen, Yizhou Shan, Tao Xie:
EPIC: Efficient Position-Independent Context Caching for Serving Large Language Models. CoRR abs/2410.15332 (2024)
[i4]Dingyan Zhang, Haotian Wang, Yang Liu, Xingda Wei, Yizhou Shan, Rong Chen, Haibo Chen:
Fast and Live Model Auto Scaling with O(1) Host Caching. CoRR abs/2412.17246 (2024)
- 2023
[c12]Haifeng Li, Ke Liu, Ting Liang, Zuojun Li, Tianyue Lu, Yisong Chang, Hui Yuan, Yinben Xia, Yungang Bao, Mingyu Chen, Yizhou Shan:
MARB: Bridge the Semantic Gap between Operating System and Application Memory Access Behavior. DATE 2023: 1-6
[c11]Cunchen Hu, Chenxi Wang, Sa Wang, Ninghui Sun, Yungang Bao, Jieru Zhao, Sanidhya Kashyap, Pengfei Zuo, Xusheng Chen, Liangliang Xu, Qin Zhang, Hao Feng, Yizhou Shan:
Skadi: Building a Distributed Runtime for Data Systems in Disaggregated Data Centers. HotOS 2023: 94-102
[c10]Haifeng Li, Ke Liu, Ting Liang, Zuojun Li, Tianyue Lu, Hui Yuan, Yinben Xia, Yungang Bao, Mingyu Chen, Yizhou Shan:
HoPP: Hardware-Software Co-Designed Page Prefetching for Disaggregated Memory. HPCA 2023: 1168-1181
[c9]Ziqiao Zhou, Yizhou Shan, Weidong Cui, Xinyang Ge, Marcus Peinado, Andrew Baumann:
Core slicing: closing the gap between leaky confidential VMs and bare-metal cloud. OSDI 2023: 247-267
- 2022
[b1]Yizhou Shan:
Distributing and Disaggregating Hardware Resources in Data Centers. University of California, San Diego, USA, 2022
[c8]Yizhou Shan, Will Lin, Zhiyuan Guo, Yiying Zhang:
Towards a fully disaggregated and programmable data center. APSys 2022: 18-28
[c7]Zhiyuan Guo, Yizhou Shan, Xuhao Luo, Yutong Huang, Yiying Zhang:
Clio: a hardware-software co-designed disaggregated memory system. ASPLOS 2022: 417-433
- 2021
[i3]Zhiyuan Guo, Yizhou Shan, Xuhao Luo, Yutong Huang, Yiying Zhang:
Clio: A Hardware-Software Co-Designed Disaggregated Memory System. CoRR abs/2108.03492 (2021)
[i2]Yizhou Shan, Will Lin, Ryan Kosta, Arvind Krishnamurthy, Yiying Zhang:
Disaggregating and Consolidating Network Functionalities with SuperNIC. CoRR abs/2109.07744 (2021)
- 2020
[c6]Shin-Yeh Tsai, Yizhou Shan, Yiying Zhang:
Disaggregating Persistent Memory and Controlling Them Remotely: An Exploration of Passive Disaggregated Key-Value Stores. USENIX ATC 2020: 33-48
2010 – 2019
- 2019
[c5]Stanko Novakovic, Yizhou Shan, Aasheesh Kolli, Michael Cui, Yiying Zhang, Haggai Eran, Boris Pismenny, Liran Liss, Michael Wei, Dan Tsafrir, Marcos K. Aguilera:
Storm: a fast transactional dataplane for remote data structures. SYSTOR 2019: 97-108
[c4]Yizhou Shan, Yutong Huang, Yilun Chen, Yiying Zhang:
LegoOS: A Disseminated, Distributed OS for Hardware Resource Disaggregation. USENIX ATC 2019
[i1]Stanko Novakovic, Yizhou Shan, Aasheesh Kolli, Michael Cui, Yiying Zhang, Haggai Eran, Liran Liss, Michael Wei, Dan Tsafrir, Marcos K. Aguilera:
Storm: a fast transactional dataplane for remote data structures. CoRR abs/1902.02411 (2019)
- 2018
[c3]Yizhou Shan, Yutong Huang, Yilun Chen, Yiying Zhang:
LegoOS: A Disseminated, Distributed OS for Hardware Resource Disaggregation. OSDI 2018: 69-87
- 2017
[c2]Yizhou Shan, Shin-Yeh Tsai, Yiying Zhang:
Distributed shared persistent memory. SoCC 2017: 323-337
[c1]Yizhou Shan, Sumukh Hallymysore, Yutong Huang, Yilun Chen, Yiying Zhang:
Disaggregated operating system. SoCC 2017: 628