


Chiyue Wei
Person information
- affiliation: Duke University, Department of Electrical and Computer Engineering, Durham, NC, USA
- affiliation (former): Tsinghua University, Department of Electronic Engineering, Beijing, China
2020 – today
- 2025
[j1]Yiran Chen, Cong Guo, Yintao He, Mingyuan Ma, Tergel Molom-Ochir, Nicky Ramos, Haoxuan Shan, Chiyue Wei, Hai Li:
Circuits to Systems: Codesigning Efficient AI Hardware. IEEE Des. Test 42(6): 54-62 (2025)
[c7]Chiyue Wei, Cong Guo, Feng Cheng, Shiyu Li, Hao (Frank) Yang, Hai Helen Li, Yiran Chen:
Prosperity: Accelerating Spiking Neural Networks via Product Sparsity. HPCA 2025: 806-820
[c6]Feng Cheng, Cong Guo, Chiyue Wei, Junyao Zhang, Changchun Zhou, Edward Hanson, Jiaqi Zhang, Xiaoxiao Liu, Hai Li, Yiran Chen:
Ecco: Improving Memory Bandwidth and Capacity for LLMs via Entropy-Aware Cache Compression. ISCA 2025: 793-807
[c5]Chiyue Wei, Bowen Duan, Cong Guo, Jingyang Zhang, Qingyue Song, Hai Li, Yiran Chen:
Phi: Leveraging Pattern-based Hierarchical Sparsity for High-Efficiency Spiking Neural Networks. ISCA 2025: 930-943
[c4]Cong Guo, Chiyue Wei, Jiaming Tang, Bowen Duan, Song Han, Hai Li, Yiran Chen:
Transitive Array: An Efficient GEMM Accelerator with Result Reuse. ISCA 2025: 990-1004
[i12]Mark Horton, Tergel Molom-Ochir, Peter Liu, Bhavna Gopal, Chiyue Wei, Cong Guo, Brady Taylor, Deliang Fan, Shan X. Wang, Hai Li, Yiran Chen:
Hamming Attention Distillation: Binarizing Keys and Queries for Efficient Long-Context Transformers. CoRR abs/2502.01770 (2025)
[i11]Chiyue Wei, Cong Guo, Feng Cheng, Shiyu Li, Hao (Frank) Yang, Hai Helen Li, Yiran Chen:
Prosperity: Accelerating Spiking Neural Networks via Product Sparsity. CoRR abs/2503.03379 (2025)
[i10]Cong Guo, Chiyue Wei, Jiaming Tang, Bowen Duan, Song Han, Hai Li, Yiran Chen:
Transitive Array: An Efficient GEMM Accelerator with Result Reuse. CoRR abs/2504.16339 (2025)
[i9]Feng Cheng, Cong Guo, Chiyue Wei, Junyao Zhang, Changchun Zhou, Edward Hanson, Jiaqi Zhang, Xiaoxiao Liu, Hai Helen Li, Yiran Chen:
Ecco: Improving Memory Bandwidth and Capacity for LLMs via Entropy-aware Cache Compression. CoRR abs/2505.06901 (2025)
[i8]Chiyue Wei, Bowen Duan, Cong Guo, Jingyang Zhang, Qingyue Song, Hai Helen Li, Yiran Chen:
Phi: Leveraging Pattern-based Hierarchical Sparsity for High-Efficiency Spiking Neural Networks. CoRR abs/2505.10909 (2025)
[i7]Xinhua Chen, Sitao Huang, Cong Guo, Chiyue Wei, Yintao He, Jianyi Zhang, Hai Helen Li, Yiran Chen:
DPad: Efficient Diffusion Language Models with Suffix Dropout. CoRR abs/2508.14148 (2025)
[i6]Yuzhe Fu, Changchun Zhou, Hancheng Ye, Bowen Duan, Qiyu Huang, Chiyue Wei, Cong Guo, Hai Helen Li, Yiran Chen:
FractalCloud: A Fractal-Inspired Architecture for Efficient Large-Scale Point Cloud Processing. CoRR abs/2511.07665 (2025)
[i5]Tergel Molom-Ochir, Benjamin F. Morris III, Mark Horton, Chiyue Wei, Cong Guo, Brady Taylor, Peter Liu, Shan X. Wang, Deliang Fan, Hai Helen Li, Yiran Chen:
CAMformer: Associative Memory is All You Need. CoRR abs/2511.19740 (2025)
[i4]Haoxuan Shan, Cong Guo, Chiyue Wei, Feng Cheng, Junyao Zhang, Hai Li, Yiran Chen:
Platinum: Path-Adaptable LUT-Based Accelerator Tailored for Low-Bit Weight Matrix Multiplication. CoRR abs/2511.21910 (2025)
[i3]Chiyue Wei, Cong Guo, Junyao Zhang, Haoxuan Shan, Yifan Xu, Ziyue Zhang, Yudong Liu, Qinsi Wang, Changchun Zhou, Hai (Helen) Li, Yiran Chen:
Focus: A Streaming Concentration Architecture for Efficient Vision-Language Models. CoRR abs/2512.14661 (2025)
- 2024
[c3]Tianyu Fu, Chiyue Wei, Yu Wang, Rex Ying:
DeSCo: Towards Generalizable and Scalable Deep Subgraph Counting. WSDM 2024: 218-227
[i2]Cong Guo, Feng Cheng, Zhixu Du, James Kiessling, Jonathan Ku, Shiyu Li, Ziru Li, Mingyuan Ma, Tergel Molom-Ochir, Benjamin Morris, Haoxuan Shan, Jingwei Sun, Yitu Wang, Chiyue Wei, Xueying Wu, Yuhao Wu, Hao (Frank) Yang, Jingyang Zhang, Junyao Zhang, Qilin Zheng, Guanglei Zhou, Hai Li, Yiran Chen:
A Survey: Collaborative Hardware and Software Design in the Era of Large Language Models. CoRR abs/2410.07265 (2024)
- 2023
[c2]Tianyu Fu, Chiyue Wei, Zhenhua Zhu, Shang Yang, Zhongming Yu, Guohao Dai, Huazhong Yang, Yu Wang:
CLAP: Locality Aware and Parallel Triangle Counting with Content Addressable Memory. DATE 2023: 1-6
[i1]Tianyu Fu, Chiyue Wei, Yu Wang, Rex Ying:
DeSCo: Towards Generalizable and Scalable Deep Subgraph Counting. CoRR abs/2308.08198 (2023)
- 2022
[c1]Guohao Dai, Zhenhua Zhu, Tianyu Fu, Chiyue Wei, Bangyan Wang, Xiangyu Li, Yuan Xie, Huazhong Yang, Yu Wang:
DIMMining: pruning-efficient and parallel graph mining on near-memory-computing. ISCA 2022: 130-145
last updated on 2026-02-12 00:18 CET by the dblp team
all metadata released as open data under CC0 1.0 license