default search action

combined dblp search
author search
venue search
publication search

ask others

Kuntai Du

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/asplos/DuCONJ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asplos/DuCONJ25
Kuntai Du, Yihua Cheng, Peder A. Olsen, Shadi A. Noghabi, Junchen Jiang:
Earth+: On-Board Satellite Imagery Compression Leveraging Historical Earth Observations. ASPLOS (1) 2025: 361-376
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/eurosys/YaoLLRCZD0J25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eurosys/YaoLLRCZD0J25
Jiayi Yao, Hanchen Li, Yuhan Liu, Siddhant Ray, Yihua Cheng, Qizheng Zhang, Kuntai Du, Shan Lu, Junchen Jiang:
CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion. EuroSys 2025: 94-109
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/sosp/DuWZCLSCYLQSJ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sosp/DuWZCLSCYLQSJ25
Kuntai Du, Bowen Wang, Chen Zhang, Yiming Cheng, Qing Lan, Hejian Sang, Yihua Cheng, Jiayi Yao, Xiaoxuan Liu, Yifan Qiao, Ion Stoica, Junchen Jiang:
PrefillOnly: An Inference Engine for Prefill-only Workloads in Large Language Model Applications. SOSP 2025: 399-414
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/sosp/ZhangDLKMWLYLLZ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sosp/ZhangDLKMWLYLLZ25
Chen Zhang, Kuntai Du, Shu Liu, Woosuk Kwon, Xiangxi Mo, Yufeng Wang, Xiaoxuan Liu, Kaichao You, Zhuohan Li, Mingsheng Long, Jidong Zhai, Joseph Gonzalez, Ion Stoica:
Jenga: Effective Memory Management for Serving LLM with Heterogeneity. SOSP 2025: 446-461
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/sosp/Ray0GDFANJ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sosp/Ray0GDFANJ25
Siddhant Ray, Rui Pan, Zhuohan Gu, Kuntai Du, Shaoting Feng, Ganesh Ananthanarayanan, Ravi Netravali, Junchen Jiang:
METIS: Fast Quality-Aware RAG Systems with Configuration Adaptation. SOSP 2025: 606-622
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-14647
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-14647
Hanchen Li, Yuhan Liu, Yihua Cheng, Kuntai Du, Junchen Jiang:
Towards More Economical Context-Augmented LLM Generation by Reusing Stored KV Cache. CoRR abs/2503.14647 (2025)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-18292
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-18292
Chen Zhang, Kuntai Du, Shu Liu, Woosuk Kwon, Xiangxi Mo, Yufeng Wang, Xiaoxuan Liu, Kaichao You, Zhuohan Li, Mingsheng Long, Jidong Zhai, Joseph Gonzalez, Ion Stoica:
Jenga: Effective Memory Management for Serving LLM with Heterogeneity. CoRR abs/2503.18292 (2025)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-07203
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-07203
Kuntai Du, Bowen Wang, Chen Zhang, Yiming Cheng, Qing Lan, Hejian Sang, Yihua Cheng, Jiayi Yao, Xiaoxuan Liu, Yifan Qiao, Ion Stoica, Junchen Jiang:
PrefillOnly: An Inference Engine for Prefill-only Workloads in Large Language Model Applications. CoRR abs/2505.07203 (2025)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-00105
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-00105
Shaoting Feng, Hanchen Li, Kuntai Du, Zhuohan Gu, Yuhan Liu, Jiayi Yao, Siddhant Ray, Samuel Shen, Yihua Cheng, Ganesh Ananthanarayanan, Junchen Jiang:
AdaptCache: KV Cache Native Storage Hierarchy for Low-Delay and High-Quality Language Model Serving. CoRR abs/2509.00105 (2025)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-09665
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-09665
Yihua Cheng, Yuhan Liu, Jiayi Yao, Yuwei An, Xiaokun Chen, Shaoting Feng, Yuyang Huang, Samuel Shen, Kuntai Du, Junchen Jiang:
LMCache: An Efficient KV Cache Layer for Enterprise-Scale LLM Inference. CoRR abs/2510.09665 (2025)
2024
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/naic/LiLCRDJ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naic/LiLCRDJ24
Hanchen Li, Yuhan Liu, Yihua Cheng, Siddhant Ray, Kuntai Du, Junchen Jiang:
Eloquent: A More Robust Transmission Scheme for LLM Token Streaming. NAIC 2024: 34-40
[c9]
- view
  - electronic edition @ usenix.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/nsdi/ChengZLAZZLDZYM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nsdi/ChengZLAZZLDZYM24
Yihua Cheng, Ziyi Zhang, Hanchen Li, Anton Arapin, Yue Zhang, Qizheng Zhang, Yuhan Liu, Kuntai Du, Xu Zhang, Francis Y. Yan, Amrita Mazumdar, Nick Feamster, Junchen Jiang:
GRACE: Loss-Resilient Real-Time Video through Neural Codecs. NSDI 2024: 509-531
[c8]
- view
  - electronic edition @ usenix.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/osdi/Liu0DHJ0M24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/osdi/Liu0DHJ0M24
Yuhan Liu, Chengcheng Wan, Kuntai Du, Henry Hoffmann, Junchen Jiang, Shan Lu, Michael Maire:
ChameleonAPI: Automatic and Efficient Customization of Neural Networks for ML Applications. OSDI 2024: 365-386
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/sigcomm/LiuLCRHZDY0AMHH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigcomm/LiuLCRHZDY0AMHH24
Yuhan Liu, Hanchen Li, Yihua Cheng, Siddhant Ray, Yuyang Huang, Qizheng Zhang, Kuntai Du, Jiayi Yao, Shan Lu, Ganesh Ananthanarayanan, Michael Maire, Henry Hoffmann, Ari Holtzman, Junchen Jiang:
CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving. SIGCOMM 2024: 38-56
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-12961
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-12961
Hanchen Li, Yuhan Liu, Yihua Cheng, Siddhant Ray, Kuntai Du, Junchen Jiang:
Chatterbox: Robust Transport for LLM Token Streaming under Unstable Network. CoRR abs/2401.12961 (2024)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-11434
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-11434
Kuntai Du, Yihua Cheng, Peder A. Olsen, Shadi A. Noghabi, Ranveer Chandra, Junchen Jiang:
Earth+: on-board satellite imagery compression leveraging historical earth observations. CoRR abs/2403.11434 (2024)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-16444
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-16444
Jiayi Yao, Hanchen Li, Yuhan Liu, Siddhant Ray, Yihua Cheng, Qizheng Zhang, Kuntai Du, Shan Lu, Junchen Jiang:
CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion. CoRR abs/2405.16444 (2024)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-13761
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-13761
Yihua Cheng, Kuntai Du, Jiayi Yao, Junchen Jiang:
Do Large Language Models Need a Content Delivery Network? CoRR abs/2409.13761 (2024)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-13009
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-13009
Zhuohan Gu, Jiayi Yao, Kuntai Du, Junchen Jiang:
LLMSteer: Improving Long-Context LLM Inference by Steering Attention on Reused Contexts. CoRR abs/2411.13009 (2024)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-10543
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-10543
Siddhant Ray, Rui Pan, Zhuohan Gu, Kuntai Du, Ganesh Ananthanarayanan, Ravi Netravali, Junchen Jiang:
RAGServe: Fast Quality-Aware RAG Systems with Configuration Adaptation. CoRR abs/2412.10543 (2024)
2023
[j1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/pacmpl/WanLDHJML23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pacmpl/WanLDHJML23
Chengcheng Wan, Yuhan Liu, Kuntai Du, Henry Hoffmann, Junchen Jiang, Michael Maire, Shan Lu:
Run-Time Prevention of Software Integration Failures of Machine Learning APIs. Proc. ACM Program. Lang. 7(OOPSLA2): 264-291 (2023)
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/cloud/DuLHZWHAJ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cloud/DuLHZWHAJ23
Kuntai Du, Yuhan Liu, Yitian Hao, Qizheng Zhang, Haodong Wang, Yuyang Huang, Ganesh Ananthanarayanan, Junchen Jiang:
OneAdapt: Fast Adaptation for Deep Learning Applications via Backpropagation. SoCC 2023: 158-176
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-02422
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-02422
Kuntai Du, Yuhan Liu, Yitian Hao, Qizheng Zhang, Haodong Wang, Yuyang Huang, Ganesh Ananthanarayanan, Junchen Jiang:
OneAdapt: Fast Adaptation for Deep Learning Applications via Backpropagation. CoRR abs/2310.02422 (2023)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-04685
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-04685
Yuhan Liu, Chengcheng Wan, Kuntai Du, Henry Hoffmann, Junchen Jiang, Shan Lu, Michael Maire:
Automatic and Efficient Customization of Neural Networks for ML Applications. CoRR abs/2310.04685 (2023)
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-07240
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-07240
Yuhan Liu, Hanchen Li, Kuntai Du, Jiayi Yao, Yihua Cheng, Yuyang Huang, Shan Lu, Michael Maire, Henry Hoffmann, Ari Holtzman, Ganesh Ananthanarayanan, Junchen Jiang:
CacheGen: Fast Context Loading for Language Model Applications. CoRR abs/2310.07240 (2023)
2022
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/cloud/WangDJ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cloud/WangDJ22
Haodong Wang, Kuntai Du, Junchen Jiang:
Minimizing packet retransmission for real-time video analytics. SoCC 2022: 340-347
[c4]
- view
  - electronic edition @ mlsys.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/mlsys/DuZAWXJ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsys/DuZAWXJ22
Kuntai Du, Qizheng Zhang, Anton Arapin, Haodong Wang, Zhengxu Xia, Junchen Jiang:
AccMPEG: Optimizing Video Encoding for Accurate Video Analytics. MLSys 2022
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/wmcsa/ZhangDANJ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wmcsa/ZhangDANJ22
Qizheng Zhang, Kuntai Du, Neil Agarwal, Ravi Netravali, Junchen Jiang:
Understanding the potential of server-driven edge video analytics. HotMobile 2022: 8-14
[i1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-12534
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-12534
Kuntai Du, Qizheng Zhang, Anton Arapin, Haodong Wang, Zhengxu Xia, Junchen Jiang:
AccMPEG: Optimizing Video Encoding for Video Analytics. CoRR abs/2204.12534 (2022)
2020
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/mobicom/WangFCXWXSDHL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mobicom/WangFCXWXSDHL20
Purui Wang, Lilei Feng, Guojun Chen, Chenren Xu, Yue Wu, Kenuo Xu, Guobin Shen, Kuntai Du, Gang Huang, Xuanzhe Liu:
Renovating road signs for infrastructure-to-vehicle networking: a visible light backscatter communication and networking approach. MobiCom 2020: 6:1-6:13
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/sigcomm/DuPYCZHJ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigcomm/DuPYCZHJ20
Kuntai Du, Ahsan Pervaiz, Xin Yuan, Aakanksha Chowdhery, Qizheng Zhang, Henry Hoffmann, Junchen Jiang:
Server-Driven Video Streaming for Deep Learning Inference. SIGCOMM 2020: 557-570

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.