


Depei Qian 0002
Person information
Other persons with the same name
- Depei Qian — disambiguation page
- Depei Qian 0001
ORCID: 0000-0002-9301-8394
2020 – today
- 2026
[c15] Kaige Zhang, Hailong Yang, Xin You, Tianyu Feng, Yufan Xu, Zhongzhi Luan, Yi Liu, Depei Qian: Exploiting Efficient Mapping and Pipelined Execution for Accelerating SpMV on Tensor Cores. PPoPP 2026: 245-258
[c14] Siqi Wang, Hailong Yang, Pengbo Wang, Hongliang Cao, Yufan Xu, Xuezhu Wang, Zhongzhi Luan, Yi Liu, Depei Qian: ElasGNN: An Elastic Training Framework for Distributed GNN Training. PPoPP 2026: 551-563
[c13] Yiqing Wang, Hailong Yang, Enze Yu, Qingxiao Sun, Kejie Ma, Kaige Zhang, Chenhao Xie, Depei Qian: APERTURE: Algorithm-System Co-optimization for Temporal Graph Network Inference. PPoPP 2026: 564-576
- 2025
[j20] Kelun Lei, Shaokang Du, Xin You, Hailong Yang, Zhongzhi Luan, Yi Liu, Depei Qian: Exploiting Dynamic Regular Patterns in Irregular Programs for Efficient Vectorization. ACM Trans. Archit. Code Optim. 22(2): 50:1-50:25 (2025)
[j19] Zhibo Xuan, Xin You, Tianyu Feng, Hailong Yang, Zhongzhi Luan, Yi Liu, Depei Qian: SimTrace: Exploiting Spatial and Temporal Sampling for Large-Scale Performance Analysis. ACM Trans. Archit. Code Optim. 22(2): 55:1-55:26 (2025)
[j18] Pengyu Mu, Yi Liu, Rui Wang, Guoxiang Liu, Hangcheng An, Qianhe Zhao, Hailong Yang, Chenhao Xie, Zhongzhi Luan, Chunye Gong, Depei Qian: Deep Learning Operators Performance Tuning for Changeable Sized Input Data on Tensor Accelerate Hardware. IEEE Trans. Computers 74(6): 2101-2113 (2025)
[j17] Qianhe Zhao, Rui Wang, Yi Liu, Hailong Yang, Zhongzhi Luan, Depei Qian: Sifter: An Efficient Operator Auto-Tuner With Speculative Design Space Exploration for Deep Learning Compiler. IEEE Trans. Computers 74(10): 3251-3262 (2025)
[j16] Zhibo Xuan, Xin Sun, Xin You, Hailong Yang, Zhongzhi Luan, Yi Liu, Depei Qian: Identifying Performance Inefficiencies of Parallel Program With Spatial and Temporal Trace Analysis. IEEE Trans. Parallel Distributed Syst. 36(7): 1387-1400 (2025)
[j15] Yufei Yang, Chenhao Xie, Liansheng Liu, Xiyuan Peng, Yu Peng, Hailong Yang, Depei Qian: PreTrans: Enabling Efficient CGRA Multi-Task Context Switch Through Config Pre-Mapping and Data Transceiving. IEEE Trans. Parallel Distributed Syst. 36(11): 2214-2228 (2025)
[c12] Xing Cong, FuKai Sun, YiFan Chen, Chenhao Xie, Yi Liu, Depei Qian: CB-SpMV: A Data Aggregating and Balance Algorithm for Cache-Friendly Block-Based SpMV on GPUs. ICS 2025: 149-160
[c11] Shanghao Liu, Hailong Yang, Xin You, Zhongzhi Luan, Yi Liu, Depei Qian: Efficient Locality-aware Instruction Stream Scheduling for Stencil Computation on ARM Processors. ICS 2025: 250-264
[c10] Siqi Wang, Hailong Yang, Pengbo Wang, Shaokang Du, Yufan Xu, Qingxiao Sun, Xiaoyan Liu, Xuezhu Wang, Xuning Liang, Zhongzhi Luan, Yi Liu, Depei Qian: Accelerating Complex Stencil Computations with Adaptive Fusion Strategy. ICS 2025: 265-278
[c9] Shaokang Du, Kelun Lei, Xin You, Hailong Yang, Yufan Xu, Zhongzhi Luan, Yi Liu, Depei Qian: Zero-Value Code Specialization via Profile-Guided Control Data Flow Analysis. SC 2025: 316-330
[c8] Siqi Wang, Hailong Yang, Xuezhu Wang, Tongxuan Liu, Pengbo Wang, Yufan Xu, Xuning Liang, Kejie Ma, Tianyu Feng, Xin You, Ruihao Gong, Rui Wang, Zhongzhi Luan, Yi Liu, Depei Qian: Towards Efficient LLM Inference via Collective and Adaptive Speculative Decoding. SC 2025: 973-990
- 2024
[j14] Jiaxing Qi, Zhongzhi Luan, Shaohan Huang, Carol J. Fung, Hailong Yang, Depei Qian: SpikeLog: Log-Based Anomaly Detection via Potential-Assisted Spiking Neuron Network. IEEE Trans. Knowl. Data Eng. 36(12): 9322-9335 (2024)
[j13] Siqi Wang, Tianyu Feng, Hailong Yang, Xin You, Bangduo Chen, Tongxuan Liu, Zhongzhi Luan, Depei Qian: AtRec: Accelerating Recommendation Model Training on CPUs. IEEE Trans. Parallel Distributed Syst. 35(6): 750-763 (2024)
[j12] Jiaxing Qi, Wencong Xiao, Mingzhen Li, Chaojie Yang, Yong Li, Wei Lin, Hailong Yang, Zhongzhi Luan, Depei Qian: ElasticBatch: A Learning-Augmented Elastic Scheduling System for Batch Inference on MIG. IEEE Trans. Parallel Distributed Syst. 35(10): 1708-1720 (2024)
[c7] Siyu Wu, Hailong Yang, Xin You, Ruihao Gong, Yi Liu, Zhongzhi Luan, Depei Qian: PRoof: A Comprehensive Hierarchical Profiling Framework for Deep Neural Networks with Roofline Analysis. ICPP 2024: 822-832
[c6] Kaige Zhang, Xiaoyan Liu, Hailong Yang, Tianyu Feng, Xinyu Yang, Yi Liu, Zhongzhi Luan, Depei Qian: Jigsaw: Accelerating SpMM with Vector Sparsity on Sparse Tensor Core. ICPP 2024: 1124-1134
[c5] Xiaoyan Liu, Xuegui Zheng, Hailong Yang, Zhongzhi Luan, Depei Qian: Tetris: Accelerating Sparse Convolution by Exploiting Memory Reuse on GPU. PPoPP 2024: 229-242
- 2023
[j11] Hailong Yang, Yi Liu, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian: Input-Aware Sparse Tensor Storage Format Selection for Optimizing MTTKRP. Computer 56(8): 4-7 (2023)
[j10] Pengyu Mu, Yi Liu, Rui Wang, Guoxiang Liu, Zhonghao Sun, Hailong Yang, Zhongzhi Luan, Depei Qian: HAOTuner: A Hardware Adaptive Operator Auto-Tuner for Dynamic Shape Tensor Compilers. IEEE Trans. Computers 72(11): 3178-3190 (2023)
[c4] Mingzhen Li, Hailong Yang, Shanjun Zhang, Fengwei Yu, Ruihao Gong, Yi Liu, Zhongzhi Luan, Depei Qian: Exploiting Subgraph Similarities for Efficient Auto-tuning of Tensor Programs. ICPP 2023: 786-796
[c3] Kelun Lei, Xin You, Hailong Yang, Zhongzhi Luan, Depei Qian: BiRFIA: Selective Binary Rewriting for Function Interception on ARM. ICS 2023: 87-98
[c2] Mingzhen Li, Wencong Xiao, Hailong Yang, Biao Sun, Hanyu Zhao, Shiru Ren, Zhongzhi Luan, Xianyan Jia, Yi Liu, Yong Li, Wei Lin, Depei Qian: EasyScale: Elastic Training with Consistent Accuracy and Improved Utilization on GPUs. SC 2023: 55:1-55:14
[c1] Xin You, Hailong Yang, Kelun Lei, Zhongzhi Luan, Depei Qian: TrivialSpy: Identifying Software Triviality via Fine-grained and Dataflow-based Value Profiling. SC 2023: 90:1-90:13
- 2022
[j9] Qingxiao Sun, Yi Liu, Hailong Yang, Ming Dun, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian: Input-Aware Sparse Tensor Storage Format Selection for Optimizing MTTKRP. IEEE Trans. Computers 71(8): 1968-1981 (2022)
[j8] Shaozhi Dai, Zhongzhi Luan, Shaohan Huang, Carol J. Fung, He Wang, Hailong Yang, Depei Qian: REVAL: Recommend Which Variables to Log With Pretrained Model and Graph Neural Network. IEEE Trans. Netw. Serv. Manag. 19(4): 4045-4057 (2022)
- 2021
[j7] Xiaogang Zhong, Mingzhen Li, Hailong Yang, Yi Liu, Depei Qian: swMR: A Framework for Accelerating MapReduce Applications on Sunway Taihulight. IEEE Trans. Emerg. Top. Comput. 9(2): 1020-1030 (2021)
[j6] Mingzhen Li, Yi Liu, Xiaoyan Liu, Qingxiao Sun, Xin You, Hailong Yang, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian: The Deep Learning Compiler: A Comprehensive Survey. IEEE Trans. Parallel Distributed Syst. 32(3): 708-727 (2021)
- 2020
[j5] Minxuan Zhou, Andreas Prodromou, Rui Wang, Hailong Yang, Depei Qian, Dean M. Tullsen: Temperature-Aware DRAM Cache Management - Relaxing Thermal Constraints in 3-D Systems. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 39(10): 1973-1986 (2020)
[j4] Lan Gao, Yunlong Xu, Rui Wang, Zhongzhi Luan, Zhibin Yu, Depei Qian: Thread-Level Locking for SIMT Architectures. IEEE Trans. Parallel Distributed Syst. 31(5): 1121-1136 (2020)
[j3] Yongmin Hu, Hailong Yang, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian: Massively Scaling Seismic Processing on Sunway TaihuLight Supercomputer. IEEE Trans. Parallel Distributed Syst. 31(5): 1194-1208 (2020)
[j2] Mingzhen Li, Yi Liu, Hailong Yang, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian: Accelerating Sparse Cholesky Factorization on Sunway Manycore Architecture. IEEE Trans. Parallel Distributed Syst. 31(7): 1636-1650 (2020)
2010 – 2019
- 2018
[j1] Chao Yu, Yuebin Bai, Hailong Yang, Kun Cheng, Yuhao Gu, Zhongzhi Luan, Depei Qian: SMGuard: A Flexible and Fine-Grained Resource Management Framework for GPUs. IEEE Trans. Parallel Distributed Syst. 29(12): 2849-2862 (2018)
last updated on 2026-02-05 00:11 CET by the dblp team
all metadata released as open data under CC0 1.0 license