


Yuhan Liu 0004
2020 – today
- 2025
[c6]Jiayi Yao, Hanchen Li, Yuhan Liu, Siddhant Ray, Yihua Cheng, Qizheng Zhang, Kuntai Du, Shan Lu, Junchen Jiang:
CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion. EuroSys 2025: 94-109
[d3]Chengcheng Wan, Shicheng Liu, Sophie Xie, Yuhan Liu, Henry Hoffmann, Michael Maire, Shan Lu:
Artifact for "Keeper: Automated Testing and Fixing of Machine Learning Software". Version 3. Zenodo, 2025 [all versions]
[i11]Hanchen Li, Yuhan Liu, Yihua Cheng, Kuntai Du, Junchen Jiang:
Towards More Economical Context-Augmented LLM Generation by Reusing Stored KV Cache. CoRR abs/2503.14647 (2025)
[i10]Shaoting Feng, Hanchen Li, Kuntai Du, Zhuohan Gu, Yuhan Liu, Jiayi Yao, Siddhant Ray, Samuel Shen, Yihua Cheng, Ganesh Ananthanarayanan, Junchen Jiang:
AdaptCache: KV Cache Native Storage Hierarchy for Low-Delay and High-Quality Language Model Serving. CoRR abs/2509.00105 (2025)
[i9]Xingyu Xiang, Raj Joshi, Yuhan Liu, Jiayi Yao, Chenxingyu Zhao, Junchen Jiang, Yang Zhou, Eddie Kohler, Minlan Yu:
ShadowServe: Interference-Free KV Cache Fetching for Distributed Prefix Caching. CoRR abs/2509.16857 (2025)
[i8]Chen Wang, Xunzhuo Liu, Yuhan Liu, Yue Zhu, Xiangxi Mo, Junchen Jiang, Huamin Chen:
When to Reason: Semantic Router for vLLM. CoRR abs/2510.08731 (2025)
[i7]Yihua Cheng, Yuhan Liu, Jiayi Yao, Yuwei An, Xiaokun Chen, Shaoting Feng, Yuyang Huang, Samuel Shen, Kuntai Du, Junchen Jiang:
LMCache: An Efficient KV Cache Layer for Enterprise-Scale LLM Inference. CoRR abs/2510.09665 (2025)
- 2024
[j2]Chengcheng Wan, Shicheng Liu, Sophie Xie, Yuhan Liu, Henry Hoffmann, Michael Maire, Shan Lu:
Keeper: Automated Testing and Fixing of Machine Learning Software. ACM Trans. Softw. Eng. Methodol. 33(7): 167:1-167:33 (2024)
[c5]Hanchen Li, Yuhan Liu, Yihua Cheng, Siddhant Ray, Kuntai Du, Junchen Jiang:
Eloquent: A More Robust Transmission Scheme for LLM Token Streaming. NAIC 2024: 34-40
[c4]Yihua Cheng, Ziyi Zhang, Hanchen Li, Anton Arapin, Yue Zhang, Qizheng Zhang, Yuhan Liu, Kuntai Du, Xu Zhang, Francis Y. Yan, Amrita Mazumdar, Nick Feamster, Junchen Jiang:
GRACE: Loss-Resilient Real-Time Video through Neural Codecs. NSDI 2024: 509-531
[c3]Yuhan Liu, Chengcheng Wan, Kuntai Du, Henry Hoffmann, Junchen Jiang, Shan Lu, Michael Maire:
ChameleonAPI: Automatic and Efficient Customization of Neural Networks for ML Applications. OSDI 2024: 365-386
[c2]Yuhan Liu, Hanchen Li, Yihua Cheng, Siddhant Ray, Yuyang Huang, Qizheng Zhang, Kuntai Du, Jiayi Yao, Shan Lu, Ganesh Ananthanarayanan, Michael Maire, Henry Hoffmann, Ari Holtzman, Junchen Jiang:
CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving. SIGCOMM 2024: 38-56
[d2]Chengcheng Wan, Shicheng Liu, Sophie Xie, Yuhan Liu, Henry Hoffmann, Michael Maire, Shan Lu:
Artifact for "Keeper: Automated Testing and Fixing of Machine Learning Software". Version 1. Zenodo, 2024 [all versions]
[d1]Chengcheng Wan, Shicheng Liu, Sophie Xie, Yuhan Liu, Henry Hoffmann, Michael Maire, Shan Lu:
Artifact for "Keeper: Automated Testing and Fixing of Machine Learning Software". Version 2. Zenodo, 2024 [all versions]
[i6]Hanchen Li, Yuhan Liu, Yihua Cheng, Siddhant Ray, Kuntai Du, Junchen Jiang:
Chatterbox: Robust Transport for LLM Token Streaming under Unstable Network. CoRR abs/2401.12961 (2024)
[i5]Jiayi Yao, Hanchen Li, Yuhan Liu, Siddhant Ray, Yihua Cheng, Qizheng Zhang, Kuntai Du, Shan Lu, Junchen Jiang:
CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion. CoRR abs/2405.16444 (2024)
[i4]Yuhan Liu, Esha Choukse, Shan Lu, Junchen Jiang, Madan Musuvathi:
DroidSpeak: Enhancing Cross-LLM Communication. CoRR abs/2411.02820 (2024)
- 2023
[j1]Chengcheng Wan, Yuhan Liu, Kuntai Du, Henry Hoffmann, Junchen Jiang, Michael Maire, Shan Lu:
Run-Time Prevention of Software Integration Failures of Machine Learning APIs. Proc. ACM Program. Lang. 7(OOPSLA2): 264-291 (2023)
[c1]Kuntai Du, Yuhan Liu, Yitian Hao, Qizheng Zhang, Haodong Wang, Yuyang Huang, Ganesh Ananthanarayanan, Junchen Jiang:
OneAdapt: Fast Adaptation for Deep Learning Applications via Backpropagation. SoCC 2023: 158-176
[i3]Kuntai Du, Yuhan Liu, Yitian Hao, Qizheng Zhang, Haodong Wang, Yuyang Huang, Ganesh Ananthanarayanan, Junchen Jiang:
OneAdapt: Fast Adaptation for Deep Learning Applications via Backpropagation. CoRR abs/2310.02422 (2023)
[i2]Yuhan Liu, Chengcheng Wan, Kuntai Du, Henry Hoffmann, Junchen Jiang, Shan Lu, Michael Maire:
Automatic and Efficient Customization of Neural Networks for ML Applications. CoRR abs/2310.04685 (2023)
[i1]Yuhan Liu, Hanchen Li, Kuntai Du, Jiayi Yao, Yihua Cheng, Yuyang Huang, Shan Lu, Michael Maire, Henry Hoffmann, Ari Holtzman, Ganesh Ananthanarayanan, Junchen Jiang:
CacheGen: Fast Context Loading for Language Model Applications. CoRR abs/2310.07240 (2023)
last updated on 2026-01-18 21:33 CET by the dblp team
all metadata released as open data under CC0 1.0 license