default search action
Xuehai Qian
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j20]Zhiding Liang, Jinglei Cheng, Hang Ren, Hanrui Wang, Fei Hua, Zhixin Song, Yongshan Ding, Frederic T. Chong, Song Han, Xuehai Qian, Yiyu Shi:
NAPA: Intermediate-Level Variational Native-Pulse Ansatz for Variational Quantum Algorithms. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 43(6): 1834-1847 (2024) - [c66]Yalong Shan, Yongkui Yang, Xuehai Qian, Zhibin Yu:
Guser: A GPGPU Power Stressmark Generator. HPCA 2024: 1111-1124 - [i33]Qinyi Luo, Penghan Wang, Wei Zhang, Fan Lai, Jiachen Mao, Xiaohan Wei, Jun Song, Wei-Yu Tsai, Shuai Yang, Yuxi Hu, Xuehai Qian:
Fine-Grained Embedding Dimension Optimization During Training for Recommender Systems. CoRR abs/2401.04408 (2024) - 2023
- [j19]Edward Hanson, Shiyu Li, Xuehai Qian, Hai Helen Li, Yiran Chen:
DyNNamic: Dynamically Reshaping, High Data-Reuse Accelerator for Compact DNNs. IEEE Trans. Computers 72(3): 880-892 (2023) - [j18]Chao Wang, Xuehai Qian:
RDMA-Enabled Concurrency Control Protocols for Transactions in the Cloud Era. IEEE Trans. Cloud Comput. 11(1): 798-810 (2023) - [c65]Hongtao Chen, Mingxing Zhang, Ke Yang, Kang Chen, Albert Y. Zomaya, Yongwei Wu, Xuehai Qian:
Achieving Sub-second Pairwise Query over Evolving Graphs. ASPLOS (2) 2023: 1-15 - [c64]Jingji Chen, Xuehai Qian:
DecoMine: A Compilation-Based Graph Pattern Mining System with Pattern Decomposition. ASPLOS (1) 2023: 47-61 - [c63]Jingji Chen, Xuehai Qian:
Khuzdul: Efficient and Scalable Distributed Graph Pattern Mining Engine. ASPLOS (2) 2023: 413-426 - [c62]Zhiding Liang, Zhixin Song, Jinglei Cheng, Zichang He, Ji Liu, Hanrui Wang, Ruiyang Qin, Yiru Wang, Song Han, Xuehai Qian, Yiyu Shi:
Hybrid Gate-Pulse Model for Variational Quantum Algorithms. DAC 2023: 1-6 - [i32]Jingji Chen, Zhuoming Chen, Xuehai Qian:
GNNPipe: Accelerating Distributed Full-Graph GNN Training with Pipelined Model Parallelism. CoRR abs/2308.10087 (2023) - [i31]Hanrui Wang, Yilian Liu, Pengyu Liu, Jiaqi Gu, Zirui Li, Zhiding Liang, Jinglei Cheng, Yongshan Ding, Xuehai Qian, Yiyu Shi, David Z. Pan, Frederic T. Chong, Song Han:
RobustState: Boosting Fidelity of Quantum State Preparation via Noise-Aware Variational Training. CoRR abs/2311.16035 (2023) - 2022
- [j17]Le-Le Li, Jiang-Yi Liu, Jianping Fan, Xuehai Qian, Kai Hwang, Yeh-Ching Chung, Zhibin Yu:
SOCA-DOM: A Mobile System-on-Chip Array System for Analyzing Big Data on the Move. J. Comput. Sci. Technol. 37(6): 1271-1289 (2022) - [j16]Wei Niu, Zhengang Li, Xiaolong Ma, Peiyan Dong, Gang Zhou, Xuehai Qian, Xue Lin, Yanzhi Wang, Bin Ren:
GRIM: A General, Real-Time Deep Learning Inference Framework for Mobile Devices Based on Fine-Grained Structured Weight Sparsity. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 6224-6239 (2022) - [j15]Xiaolong Ma, Sheng Lin, Shaokai Ye, Zhezhi He, Linfeng Zhang, Geng Yuan, Sia Huat Tan, Zhengang Li, Deliang Fan, Xuehai Qian, Xue Lin, Kaisheng Ma, Yanzhi Wang:
Non-Structured DNN Weight Pruning - Is It Beneficial in Any Platform? IEEE Trans. Neural Networks Learn. Syst. 33(9): 4930-4944 (2022) - [j14]Jidong Zhai, Liyan Zheng, Jinghan Sun, Feng Zhang, Xiongchao Tang, Xuehai Qian, Bingsheng He, Wei Xue, Wenguang Chen, Weimin Zheng:
Leveraging Code Snippets to Detect Variations in the Performance of HPC Systems. IEEE Trans. Parallel Distributed Syst. 33(12): 3558-3574 (2022) - [c61]Gengyu Rao, Jingji Chen, Jason Yik, Xuehai Qian:
SparseCore: stream ISA and processor specialization for sparse computation. ASPLOS 2022: 186-199 - [c60]Lutan Zhao, Peinan Li, Rui Hou, Michael C. Huang, Xuehai Qian, Lixin Zhang, Dan Meng:
HyBP: Hybrid Isolation-Randomization Secure Branch Predictor. HPCA 2022: 346-359 - [c59]Zhiding Liang, Hanrui Wang, Jinglei Cheng, Yongshan Ding, Hang Ren, Zhengqi Gao, Zhirui Hu, Duane S. Boning, Xuehai Qian, Song Han, Weiwen Jiang, Yiyu Shi:
Variational Quantum Pulse Learning. QCE 2022: 556-565 - [i30]Zhiding Liang, Jinglei Cheng, Hang Ren, Hanrui Wang, Fei Hua, Yongshan Ding, Fred Chong, Song Han, Yiyu Shi, Xuehai Qian:
PAN: Pulse Ansatz on NISQ Machines. CoRR abs/2208.01215 (2022) - [i29]Jinglei Cheng, Hanrui Wang, Zhiding Liang, Yiyu Shi, Song Han, Xuehai Qian:
TopGen: Topology-Aware Bottom-Up Generator for Variational Quantum Circuits. CoRR abs/2210.08190 (2022) - [i28]Hanrui Wang, Pengyu Liu, Jinglei Cheng, Zhiding Liang, Jiaqi Gu, Zirui Li, Yongshan Ding, Weiwen Jiang, Yiyu Shi, Xuehai Qian, David Z. Pan, Frederic T. Chong, Song Han:
QuEst: Graph Transformer for Quantum Circuit Reliability Estimation. CoRR abs/2210.16724 (2022) - [i27]Zhiding Liang, Zhixin Song, Jinglei Cheng, Zichang He, Ji Liu, Hanrui Wang, Ruiyang Qin, Yiru Wang, Song Han, Xuehai Qian, Yiyu Shi:
Hybrid Gate-Pulse Model for Variational Quantum Algorithms. CoRR abs/2212.00661 (2022) - 2021
- [j13]Xuehai Qian:
Graph processing and machine learning architectures with emerging memory technologies: a survey. Sci. China Inf. Sci. 64(6) (2021) - [j12]Xue Li, Mingxing Zhang, Kang Chen, Yongwei Wu, Xuehai Qian, Weimin Zheng:
3-D Partitioning for Large-Scale Graph Processing. IEEE Trans. Computers 70(1): 111-127 (2021) - [j11]Xiongchao Tang, Chen Zhang, Jidong Zhai, Xuehai Qian, Wenguang Chen, Yong Jiang:
A Fast Lock for Explicit Message Passing Architectures. IEEE Trans. Computers 70(10): 1555-1568 (2021) - [c58]Lutan Zhao, Peinan Li, Rui Hou, Michael C. Huang, Jiazhen Li, Lixin Zhang, Xuehai Qian, Dan Meng:
A Lightweight Isolation Mechanism for Secure Branch Predictors. DAC 2021: 1267-1272 - [c57]Sung-En Chang, Yanyu Li, Mengshu Sun, Runbin Shi, Hayden K. H. So, Xuehai Qian, Yanzhi Wang, Xue Lin:
Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework. HPCA 2021: 208-220 - [c56]Geng Yuan, Payman Behnam, Zhengang Li, Ali Shafiee, Sheng Lin, Xiaolong Ma, Hang Liu, Xuehai Qian, Mahdi Nazm Bojnordi, Yanzhi Wang, Caiwen Ding:
FORMS: Fine-grained Polarized ReRAM-based In-situ Computation for Mixed-signal DNN Accelerator. ISCA 2021: 265-278 - [c55]Qingcheng Xiao, Size Zheng, Bingzhe Wu, Pengcheng Xu, Xuehai Qian, Yun Liang:
HASCO: Towards Agile HArdware and Software CO-design for Tensor Computation. ISCA 2021: 1055-1068 - [c54]Chunhua Deng, Yang Sui, Siyu Liao, Xuehai Qian, Bo Yuan:
GoSPA: An Energy-efficient High-performance Globally Optimized SParse Convolutional Neural Network Accelerator. ISCA 2021: 1110-1123 - [c53]Shiyu Li, Edward Hanson, Xuehai Qian, Hai (Helen) Li, Yiran Chen:
ESCALATE: Boosting the Efficiency of Sparse CNN Accelerator with Kernel Decomposition. MICRO 2021: 992-1004 - [i26]Qingcheng Xiao, Size Zheng, Bingzhe Wu, Pengcheng Xu, Xuehai Qian, Yun Liang:
HASCO: Towards Agile HArdware and Software CO-design for Tensor Computation. CoRR abs/2105.01585 (2021) - [i25]Jingji Chen, Xuehai Qian:
Kudu: An Efficient and Scalable Distributed Graph Pattern Mining Engine. CoRR abs/2105.03789 (2021) - [i24]Geng Yuan, Payman Behnam, Zhengang Li, Ali Shafiee, Sheng Lin, Xiaolong Ma, Hang Liu, Xuehai Qian, Mahdi Nazm Bojnordi, Yanzhi Wang, Caiwen Ding:
FORMS: Fine-grained Polarized ReRAM-based In-situ Computation for Mixed-signal DNN Accelerator. CoRR abs/2106.09144 (2021) - [i23]Wei Niu, Zhengang Li, Xiaolong Ma, Peiyan Dong, Gang Zhou, Xuehai Qian, Xue Lin, Yanzhi Wang, Bin Ren:
GRIM: A General, Real-Time Deep Learning Inference Framework for Mobile Devices based on Fine-Grained Structured Weight Sparsity. CoRR abs/2108.11033 (2021) - 2020
- [j10]Xuehai Qian, Yanzhi Wang, Avinash Karanth:
Guest Editors' Introduction to the Special Issue on Machine Learning Architectures and Accelerators. IEEE Trans. Computers 69(7): 929-930 (2020) - [j9]Hao Yan, Hebin R. Cherian, Ethan C. Ahn, Xuehai Qian, Lide Duan:
iCELIA: A Full-Stack Framework for STT-MRAM-Based Deep Learning Acceleration. IEEE Trans. Parallel Distributed Syst. 31(2): 408-422 (2020) - [j8]Xiebing Wang, Xuehai Qian, Alois C. Knoll, Kai Huang:
Efficient Performance Estimation and Work-Group Size Pruning for OpenCL Kernels on GPUs. IEEE Trans. Parallel Distributed Syst. 31(5): 1089-1106 (2020) - [c52]Xingbin Wang, Rui Hou, Boyan Zhao, Fengkai Yuan, Jun Zhang, Dan Meng, Xuehai Qian:
DNNGuard: An Elastic Heterogeneous DNN Accelerator Architecture against Adversarial Attacks. ASPLOS 2020: 19-34 - [c51]Qinyi Luo, Jiaao He, Youwei Zhuo, Xuehai Qian:
Prague: High-Performance Heterogeneity-Aware Asynchronous Decentralized Training. ASPLOS 2020: 401-416 - [c50]Teng Ma, Mingxing Zhang, Kang Chen, Zhuo Song, Yongwei Wu, Xuehai Qian:
AsymNVM: An Efficient Framework for Implementing Persistent Data Structures on Asymmetric NVM Architecture. ASPLOS 2020: 757-773 - [c49]Xuan Peng, Xuanhua Shi, Hulin Dai, Hai Jin, Weiliang Ma, Qian Xiong, Fan Yang, Xuehai Qian:
Capuchin: Tensor-based GPU Memory Management for Deep Learning. ASPLOS 2020: 891-905 - [c48]Wei Niu, Xiaolong Ma, Sheng Lin, Shihao Wang, Xuehai Qian, Xue Lin, Yanzhi Wang, Bin Ren:
PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning. ASPLOS 2020: 907-922 - [c47]Sheng Xu, Xiaoming Chen, Xuehai Qian, Yinhe Han:
TUPIM: A Transparent and Universal Processing-in-Memory Architecture for Unmodified Binaries. ACM Great Lakes Symposium on VLSI 2020: 199-204 - [c46]Linghao Song, Fan Chen, Youwei Zhuo, Xuehai Qian, Hai Li, Yiran Chen:
AccPar: Tensor Partitioning for Heterogeneous Deep Learning Accelerators. HPCA 2020: 342-355 - [c45]Jinglei Cheng, Haoqing Deng, Xuehai Qian:
AccQOC: Accelerating Quantum Optimal Control Based Pulse Generation. ISCA 2020: 543-555 - [c44]Youwei Zhuo, Jingji Chen, Qinyi Luo, Yanzhi Wang, Hailong Yang, Depei Qian, Xuehai Qian:
SympleGraph: distributed graph processing with precise loop-carried dependency guarantee. PLDI 2020: 592-607 - [i22]Wei Niu, Xiaolong Ma, Sheng Lin, Shihao Wang, Xuehai Qian, Xue Lin, Yanzhi Wang, Bin Ren:
PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning. CoRR abs/2001.00138 (2020) - [i21]Chao Wang, Kezhao Huang, Xuehai Qian:
A Comprehensive Evaluation of RDMA-enabled Concurrency Control Protocols. CoRR abs/2002.12664 (2020) - [i20]Chunhua Deng, Siyu Liao, Yi Xie, Keshab K. Parhi, Xuehai Qian, Bo Yuan:
PERMDNN: Efficient Compressed DNN Architecture with Permuted Diagonal Matrices. CoRR abs/2004.10936 (2020) - [i19]Lutan Zhao, Peinan Li, Rui Hou, Michael C. Huang, Jiazhen Li, Lixin Zhang, Xuehai Qian, Dan Meng:
A Lightweight Isolation Mechanism for Secure Branch Predictors. CoRR abs/2005.08183 (2020) - [i18]You Wu, Xuehai Qian:
ReversiSpec: Reversible Coherence Protocol for Defending Transient Attacks. CoRR abs/2006.16535 (2020) - [i17]Jingji Chen, Xuehai Qian:
DwarvesGraph: A High-Performance Graph Mining System with Pattern Decomposition. CoRR abs/2008.09682 (2020) - [i16]Linghao Song, Fan Chen, Xuehai Qian, Hai Li, Yiran Chen:
Low-Cost Floating-Point Processing in ReRAM for Scientific Computing. CoRR abs/2011.03190 (2020) - [i15]Sung-En Chang, Yanyu Li, Mengshu Sun, Runbin Shi, Hayden Kwok-Hay So, Xuehai Qian, Yanzhi Wang, Xue Lin:
Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework. CoRR abs/2012.04240 (2020) - [i14]Gengyu Rao, Jingji Chen, Xuehai Qian:
IntersectX: An Accelerator for Graph Mining. CoRR abs/2012.10848 (2020)
2010 – 2019
- 2019
- [j7]Sheng Xu, Xiaoming Chen, Ying Wang, Yinhe Han, Xuehai Qian, Xiaowei Li:
PIMSim: A Flexible and Detailed Processing-in-Memory Simulator. IEEE Comput. Archit. Lett. 18(1): 6-9 (2019) - [j6]Linghao Song, You Wu, Xuehai Qian, Hai Li, Yiran Chen:
ReBNN: in-situ acceleration of binarized neural networks in ReRAM using complementary resistive cell. CCF Trans. High Perform. Comput. 1(3-4): 196-208 (2019) - [j5]Zhe Li, Ji Li, Ao Ren, Ruizhe Cai, Caiwen Ding, Xuehai Qian, Jeffrey Draper, Bo Yuan, Jian Tang, Qinru Qiu, Yanzhi Wang:
HEIF: Highly Efficient Stochastic Computing-Based Inference Framework for Deep Neural Networks. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 38(8): 1543-1556 (2019) - [j4]Youwei Zhuo, Jingji Chen, Gengyu Rao, Qinyi Luo, Yanzhi Wang, Hailong Yang, Depei Qian, Xuehai Qian:
Distributed Graph Processing System and Processing-in-memory Architecture with Precise Loop-carried Dependency Guarantee. ACM Trans. Comput. Syst. 37(1-4): 5:1-5:37 (2019) - [j3]Zhiyuan Ai, Mingxing Zhang, Yongwei Wu, Xuehai Qian, Kang Chen, Weimin Zheng:
Clip: A Disk I/O Focused Parallel Out-of-Core Graph Processing System. IEEE Trans. Parallel Distributed Syst. 30(1): 45-62 (2019) - [c43]Xiongchao Tang, Jidong Zhai, Xuehai Qian, Wenguang Chen:
pLock: A Fast Lock for Architectures with Explicit Inter-core Message Passing. ASPLOS 2019: 765-778 - [c42]Qinyi Luo, Jinkun Lin, Youwei Zhuo, Xuehai Qian:
Hop: Heterogeneity-aware Decentralized Training. ASPLOS 2019: 893-907 - [c41]Ao Ren, Tianyun Zhang, Shaokai Ye, Jiayu Li, Wenyao Xu, Xuehai Qian, Xue Lin, Yanzhi Wang:
ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Methods of Multipliers. ASPLOS 2019: 925-938 - [c40]Linghao Song, Jiachen Mao, Youwei Zhuo, Xuehai Qian, Hai Li, Yiran Chen:
HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array. HPCA 2019: 56-68 - [c39]Zhe Li, Caiwen Ding, Siyue Wang, Wujie Wen, Youwei Zhuo, Chang Liu, Qinru Qiu, Wenyao Xu, Xue Lin, Xuehai Qian, Yanzhi Wang:
E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs. HPCA 2019: 69-80 - [c38]Xiebing Wang, Kai Huang, Alois C. Knoll, Xuehai Qian:
A Hybrid Framework for Fast and Accurate GPU Performance Estimation through Source-Level Analysis and Trace-Based Simulation. HPCA 2019: 506-518 - [c37]Yimin Jiang, Yong Cui, Wenfei Wu, Zhe Xu, Jiahan Gu, K. K. Ramakrishnan, Yongchao He, Xuehai Qian:
SpeedyBox: Low-Latency NFV Service Chains with Cross-NF Runtime Consolidation. ICDCS 2019: 68-79 - [c36]Chunhua Deng, Fangxuan Sun, Xuehai Qian, Jun Lin, Zhongfeng Wang, Bo Yuan:
TIE: energy-efficient tensor train-based inference engine for deep neural network. ISCA 2019: 264-278 - [c35]Yuzhao Wang, Lele Li, You Wu, Junqing Yu, Zhibin Yu, Xuehai Qian:
TPShare: a time-space sharing scheduling abstraction for shared cloud via vertical labels. ISCA 2019: 499-512 - [c34]Ruizhe Cai, Ao Ren, Olivia Chen, Ning Liu, Caiwen Ding, Xuehai Qian, Jie Han, Wenhui Luo, Nobuyuki Yoshikawa, Yanzhi Wang:
A stochastic-computing based deep learning framework using adiabatic quantum-flux-parametron superconducting technology. ISCA 2019: 567-578 - [c33]Youwei Zhuo, Chao Wang, Mingxing Zhang, Rui Wang, Dimin Niu, Yanzhi Wang, Xuehai Qian:
GraphQ: Scalable PIM-Based Graph Processing. MICRO 2019: 712-725 - [i13]Linghao Song, Jiachen Mao, Youwei Zhuo, Xuehai Qian, Hai Li, Yiran Chen:
HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array. CoRR abs/1901.02067 (2019) - [i12]Qinyi Luo, Jinkun Lin, Youwei Zhuo, Xuehai Qian:
Hop: Heterogeneity-Aware Decentralized Training. CoRR abs/1902.01064 (2019) - [i11]Yanzhi Wang, Shaokai Ye, Zhezhi He, Xiaolong Ma, Linfeng Zhang, Sheng Lin, Geng Yuan, Sia Huat Tan, Zhengang Li, Deliang Fan, Xuehai Qian, Xue Lin, Kaisheng Ma:
Non-structured DNN Weight Pruning Considered Harmful. CoRR abs/1907.02124 (2019) - [i10]Ruizhe Cai, Ao Ren, Olivia Chen, Ning Liu, Caiwen Ding, Xuehai Qian, Jie Han, Wenhui Luo, Nobuyuki Yoshikawa, Yanzhi Wang:
A Stochastic-Computing based Deep Learning Framework using Adiabatic Quantum-Flux-Parametron SuperconductingTechnology. CoRR abs/1907.09077 (2019) - [i9]Qinyi Luo, Jiaao He, Youwei Zhuo, Xuehai Qian:
Heterogeneity-Aware Asynchronous Decentralized Training. CoRR abs/1909.08029 (2019) - 2018
- [j2]Mengxing Liu, Mingxing Zhang, Kang Chen, Xuehai Qian, Yongwei Wu, Weimin Zheng, Jinglei Ren:
DudeTx: Durable Transactions Made Decoupled. ACM Trans. Storage 14(1): 7:1-7:28 (2018) - [c32]Yanzhi Wang, Caiwen Ding, Zhe Li, Geng Yuan, Siyu Liao, Xiaolong Ma, Bo Yuan, Xuehai Qian, Jian Tang, Qinru Qiu, Xue Lin:
Towards Ultra-High Performance and Energy Efficiency of Deep Learning Systems: An Algorithm-Hardware Co-Optimization Framework. AAAI 2018: 4235-4243 - [c31]Xiaoxiao Liu, Wei Wen, Xuehai Qian, Hai Li, Yiran Chen:
Neu-NoC: A high-efficient interconnection network for accelerated neuromorphic systems. ASP-DAC 2018: 141-146 - [c30]Ruizhe Cai, Ao Ren, Ning Liu, Caiwen Ding, Luhao Wang, Xuehai Qian, Massoud Pedram, Yanzhi Wang:
VIBNN: Hardware Acceleration of Bayesian Neural Networks. ASPLOS 2018: 476-488 - [c29]Zhibin Yu, Zhendong Bei, Xuehai Qian:
Datasize-Aware High Dimensional Configurations Auto-Tuning of In-Memory Cluster Computing. ASPLOS 2018: 564-577 - [c28]Mingxing Zhang, Yongwei Wu, Youwei Zhuo, Xuehai Qian, Chengying Huan, Kang Chen:
Wonderland: A Novel Abstraction-Based Out-Of-Core Graph Processing System. ASPLOS 2018: 608-621 - [c27]Bing Li, Linghao Song, Fan Chen, Xuehai Qian, Yiran Chen, Hai Helen Li:
ReRAM-based accelerator for deep learning. DATE 2018: 815-820 - [c26]Abdulaziz Tabbakh, Xuehai Qian, Murali Annavaram:
G-TSC: Timestamp Based Coherence for GPUs. HPCA 2018: 403-415 - [c25]Linghao Song, Youwei Zhuo, Xuehai Qian, Hai Helen Li, Yiran Chen:
GraphR: Accelerating Graph Processing Using ReRAM. HPCA 2018: 531-543 - [c24]Mingxing Zhang, Youwei Zhuo, Chao Wang, Mingyu Gao, Yongwei Wu, Kang Chen, Christos Kozyrakis, Xuehai Qian:
GraphP: Reducing Communication for PIM-Based Graph Processing with Efficient Data Partition. HPCA 2018: 544-557 - [c23]Youwei Zhuo, Jinglei Cheng, Qinyi Luo, Jidong Zhai, Yanzhi Wang, Zhongzhi Luan, Xuehai Qian:
CSE: Parallel Finite State Machines with Convergence Set Enumeration. MICRO 2018: 29-41 - [c22]Chunhua Deng, Siyu Liao, Yi Xie, Keshab K. Parhi, Xuehai Qian, Bo Yuan:
PermDNN: Efficient Compressed DNN Architecture with Permuted Diagonal Matrices. MICRO 2018: 189-202 - [c21]Yirong Lv, Bin Sun, Qingyi Luo, Jing Wang, Zhibin Yu, Xuehai Qian:
CounterMiner: Mining Big Performance Data from Hardware Counters. MICRO 2018: 613-626 - [c20]Xiongchao Tang, Jidong Zhai, Xuehai Qian, Bingsheng He, Wei Xue, Wenguang Chen:
vSensor: leveraging fixed-workload snippets of programs for performance variance detection. PPoPP 2018: 124-136 - [i8]Ruizhe Cai, Ao Ren, Ning Liu, Caiwen Ding, Luhao Wang, Xuehai Qian, Massoud Pedram, Yanzhi Wang:
VIBNN: Hardware Acceleration of Bayesian Neural Networks. CoRR abs/1802.00822 (2018) - [i7]Yanzhi Wang, Caiwen Ding, Zhe Li, Geng Yuan, Siyu Liao, Xiaolong Ma, Bo Yuan, Xuehai Qian, Jian Tang, Qinru Qiu, Xue Lin:
Towards Ultra-High Performance and Energy Efficiency of Deep Learning Systems: An Algorithm-Hardware Co-Optimization Framework. CoRR abs/1802.06402 (2018) - [i6]Teng Ma, Mingxing Zhang, Kang Chen, Xuehai Qian, Yongwei Wu:
An Efficient Framework for Implementing Persist Data Structures on Remote NVM. CoRR abs/1809.09395 (2018) - [i5]Zhe Li, Caiwen Ding, Siyue Wang, Wujie Wen, Youwei Zhuo, Chang Liu, Qinru Qiu, Wenyao Xu, Xue Lin, Xuehai Qian, Yanzhi Wang:
E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs. CoRR abs/1812.07106 (2018) - [i4]Ao Ren, Tianyun Zhang, Shaokai Ye, Jiayu Li, Wenyao Xu, Xuehai Qian, Xue Lin, Yanzhi Wang:
ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers. CoRR abs/1812.11677 (2018) - 2017
- [c19]Mengxing Liu, Mingxing Zhang, Kang Chen, Xuehai Qian, Yongwei Wu, Weimin Zheng, Jinglei Ren:
DudeTM: Building Durable Transactions with Decoupling for Persistent Memory. ASPLOS 2017: 329-343 - [c18]Ao Ren, Zhe Li, Caiwen Ding, Qinru Qiu, Yanzhi Wang, Ji Li, Xuehai Qian, Bo Yuan:
SC-DCNN: Highly-Scalable Deep Convolutional Neural Network using Stochastic Computing. ASPLOS 2017: 405-418 - [c17]Linghao Song, Xuehai Qian, Hai Li, Yiran Chen:
PipeLayer: A Pipelined ReRAM-Based Accelerator for Deep Learning. HPCA 2017: 541-552 - [c16]Abdulaziz Tabbakh, Murali Annavaram, Xuehai Qian:
Power Efficient Sharing-Aware GPU Data Management. IPDPS 2017: 698-707 - [c15]Caiwen Ding, Siyu Liao, Yanzhi Wang, Zhe Li, Ning Liu, Youwei Zhuo, Chao Wang, Xuehai Qian, Yu Bai, Geng Yuan, Xiaolong Ma, Yipeng Zhang, Jian Tang, Qinru Qiu, Xue Lin, Bo Yuan:
CirCNN: accelerating and compressing deep neural networks using block-circulant weight matrices. MICRO 2017: 395-408 - [c14]Zhiyuan Ai, Mingxing Zhang, Yongwei Wu, Xuehai Qian, Kang Chen, Weimin Zheng:
Squeezing out All the Value of Loaded Data: An Out-of-core Graph Processing System with Reduced Disk I/O. USENIX ATC 2017: 125-137 - [i3]Linghao Song, Youwei Zhuo, Xuehai Qian, Hai Li, Yiran Chen:
GraphR: Accelerating Graph Processing Using ReRAM. CoRR abs/1708.06248 (2017) - [i2]Caiwen Ding, Siyu Liao, Yanzhi Wang, Zhe Li, Ning Liu, Youwei Zhuo, Chao Wang, Xuehai Qian, Yu Bai, Geng Yuan, Xiaolong Ma, Yipeng Zhang, Jian Tang, Qinru Qiu, Xue Lin, Bo Yuan:
CirCNN: Accelerating and Compressing Deep Neural Networks Using Block-CirculantWeight Matrices. CoRR abs/1708.08917 (2017) - 2016
- [c13]Xuehai Qian, Koushik Sen, Paul Hargrove, Costin Iancu:
SReplay: Deterministic Sub-Group Replay for One-Sided Communication. ICS 2016: 17:1-17:13 - [c12]Mingxing Zhang, Yongwei Wu, Kang Chen, Xuehai Qian, Xue Li, Weimin Zheng:
Exploring the Hidden Dimension in Graph Processing. OSDI 2016: 285-300 - [c11]Xuehai Qian, Koushik Sen, Paul Hargrove, Costin Iancu:
OPR: deterministic group replay for one-sided communication. PPoPP 2016: 47:1-47:2 - [i1]Ao Ren, Ji Li, Zhe Li, Caiwen Ding, Xuehai Qian, Qinru Qiu, Bo Yuan, Yanzhi Wang:
SC-DCNN: Highly-Scalable Deep Convolutional Neural Network using Stochastic Computing. CoRR abs/1611.05939 (2016) - 2015
- [j1]Hui Wang, Rui Wang, Zhongzhi Luan, Xuehai Qian, Depei Qian:
Improving multiprocessor performance with fine-grain coherence bypass. Sci. China Inf. Sci. 58(1): 1-15 (2015) - 2014
- [c10]Xuehai Qian, Benjamín Sahelices, Josep Torrellas:
OmniOrder: Directory-based conflict serialization of transactions. ISCA 2014: 421-432 - [c9]Xuehai Qian, Benjamín Sahelices, Depei Qian:
Pacifier: Record and replay for relaxed-consistency multiprocessors with distributed directory protocol. ISCA 2014: 433-444 - 2013
- [b1]Xuehai Qian:
Scalable and flexible bulk architecture. University of Illinois Urbana-Champaign, USA, 2013 - [c8]Xuehai Qian, Josep Torrellas, Benjamín Sahelices