Yunquan Zhang
2020 – today
- 2024
- [j48]Yunquan Zhang, Guangming Tan, Liang Yuan:
Special issue of HPCChina 2023. CCF Trans. High Perform. Comput. 6(1): 1-2 (2024) - [j47]Cunyang Wei, Haipeng Jia, Yunquan Zhang, Jianyu Yao, Chendi Li, Wenxuan Cao:
IrGEMM: An Input-Aware Tuning Framework for Irregular GEMM on ARM and X86 CPUs. IEEE Trans. Parallel Distributed Syst. 35(9): 1672-1689 (2024) - [c90]Lei Xu, Haipeng Jia, Yunquan Zhang, Luhan Wang, Xianmeng Jiang:
HAM-SpMSpV: an Optimized Parallel Algorithm for Masked Sparse Matrix-Sparse Vector Multiplications on multi-core CPUs. HPDC 2024: 160-173 - [c89]Wenxuan Zhao, Liang Yuan, Baicheng Yan, Penghao Ma, Yunquan Zhang, Long Wang, Zhe Wang:
Stencil Computation with Vector Outer Product. ICS 2024: 247-258 - [c88]Luhan Wang, Haipeng Jia, Lei Xu, Cunyang Wei, Kun Li, Xianmeng Jiang, Yunquan Zhang:
VNEC: A Vectorized Non-Empty Column Format for SpMV on CPUs. IPDPS 2024: 14-25 - [c87]Zhiqian Xu, Honghui Shang, Yi Fan, Xiongzhi Zeng, Yunquan Zhang, Chu Guo:
Scalable and Differentiable Simulator for Quantum Computational Chemistry. IPDPS 2024: 230-240 - [c86]Ruge Zhang, Haipeng Jia, Yunquan Zhang, Baicheng Yan, Penghao Ma, Long Wang, Wenxuan Zhao:
OpenFFT-SME: An Efficient Outer Product Pattern FFT Library on ARM SME CPUs. IPDPS 2024: 938-949 - [c85]Yuetao Chen, Kun Li, Yuhao Wang, Donglin Bai, Lei Wang, Lingxiao Ma, Liang Yuan, Yunquan Zhang, Ting Cao, Mao Yang:
ConvStencil: Transform Stencil Computation to Matrix Multiplication on Tensor Cores. PPoPP 2024: 333-347 - 2023
- [j46]Yan Zeng, Yong Ding, Dongyang Ou, Jilin Zhang, Yongjian Ren, Yunquan Zhang:
MP-DPS: adaptive distributed training for deep learning based on node merging and path prediction. CCF Trans. High Perform. Comput. 5(4): 429-441 (2023) - [j45]Yan Zeng, Yuankai Mu, Junfeng Yuan, Siyuan Teng, Jilin Zhang, Jian Wan, Yongjian Ren, Yunquan Zhang:
Adaptive Federated Learning With Non-IID Data. Comput. J. 66(11): 2758-2772 (2023) - [j44]Hang Cao, Liang Yuan, He Zhang, Yunquan Zhang, Baodong Wu, Kun Li, Shigang Li, Minghua Zhang, Pengqi Lu, Junmin Xiao:
AGCM-3DLF: Accelerating Atmospheric General Circulation Model via 3-D Parallelization and Leap-Format. IEEE Trans. Parallel Distributed Syst. 34(3): 766-780 (2023) - [j43]Lei Xu, Honghui Shang, Xin Chen, Yunquan Zhang, Lifang Wang, Xingyu Gao, Haifeng Song:
Redesigning OpenKMC for Multi-Component Trillion-Atom Simulations on the New Sunway Supercomputer. IEEE Trans. Parallel Distributed Syst. 34(7): 1997-2010 (2023) - [c84]Yan Zeng, Chengchuang Huang, Yijie Ni, Chunbao Zhou, Jilin Zhang, Jue Wang, Mingyao Zhou, Meiting Xue, Yunquan Zhang:
An Auto-Parallel Method for Deep Learning Models Based on Genetic Algorithm. ICPADS 2023: 230-235 - [c83]Rongyuan Guo, Haipeng Jia, Yunquan Zhang, Mingsen Deng, Cunyang Wei, Wenbin Chang, Xiang Zhao:
SA_TRSM: A Shape-Aware Auto-Tuning Framework for Small-Scale Irregular-Shaped TRSM. ICPADS 2023: 765-774 - [c82]Tun Chen, Haipeng Jia, Yunquan Zhang, Kun Li, Zhihao Li, Xiang Zhao, Jianyu Yao, Chendi Li:
OpenFFT: An Adaptive Tuning Framework for 3D FFT on ARM Multicore CPUs. ICS 2023: 398-409 - [c81]Daning Cheng, Shigang Li, Yunquan Zhang:
Asynch-SGBDT: Train Stochastic Gradient Boosting Decision Trees in an Asynchronous Parallel Manner. IPDPS 2023: 256-267 - [c80]Zhihao Li, Haipeng Jia, Yunquan Zhang, Yuyan Sun, Yiwei Zhang, Tun Chen:
Generating Fast FFT Kernels on CPUs via FFT-Specific Intrinsics. PPoPP 2023: 427-428 - [i18]Kun Li, Zhichun Li, Yuetao Chen, Zixuan Wang, Yiwei Zhang, Liang Yuan, Haipeng Jia, Yunquan Zhang, Ting Cao, Mao Yang:
Gamify Stencil Dwarf on Cloud for Democratizing Scientific Computing. CoRR abs/2303.08365 (2023) - [i17]Wenxuan Zhao, Liang Yuan, Baicheng Yan, Penghao Ma, Yunquan Zhang, Long Wang, Zhe Wang:
Stencil Computation with Vector Outer Product. CoRR abs/2310.16298 (2023) - 2022
- [j42]Yan Zeng, Jiyang Wu, Jilin Zhang, Yongjian Ren, Yunquan Zhang:
Trinity: Neural Network Adaptive Distributed Parallel Training Method Based on Reinforcement Learning. Algorithms 15(4): 108 (2022) - [j41]Yuetao Chen, Keni Qiu, Li Chen, Haipeng Jia, Yunquan Zhang, Limin Xiao, Lei Liu:
Smart scheduler: an adaptive NVM-aware thread scheduling approach on NUMA systems. CCF Trans. High Perform. Comput. 4(4): 394-406 (2022) - [j40]Yuetao Chen, Keni Qiu, Li Chen, Haipeng Jia, Yunquan Zhang, Limin Xiao, Lei Liu:
Publisher Correction: Smart scheduler: an adaptive NVM-aware thread scheduling approach on NUMA systems. CCF Trans. High Perform. Comput. 4(4): 492 (2022) - [j39]Mingchuan Wu, Yangjun Wu, Honghui Shang, Ying Liu, Huimin Cui, Fang Li, Xiaohui Duan, Yunquan Zhang, Xiaobing Feng:
Scaling Poisson Solvers on Many Cores via MMEwald. IEEE Trans. Parallel Distributed Syst. 33(8): 1888-1901 (2022) - [j38]Kun Li, Liang Yuan, Yunquan Zhang, Gongwei Chen:
An Accurate and Efficient Large-Scale Regression Method Through Best Friend Clustering. IEEE Trans. Parallel Distributed Syst. 33(11): 3129-3140 (2022) - [c79]Cunyang Wei, Haipeng Jia, Yunquan Zhang, Kun Li, Luhan Wang:
LBBGEMM: A Load-balanced Batch GEMM Framework on ARM CPUs. HPCC/DSS/SmartCity/DependSys 2022: 59-66 - [c78]Luhan Wang, Haipeng Jia, Yunquan Zhang, Kun Li, Cunyang Wei:
EgpuIP: An Embedded GPU Accelerated Library for Image Processing. HPCC/DSS/SmartCity/DependSys 2022: 914-921 - [c77]Yan Zeng, Guangzheng Yi, Yuyu Yin, Jiyang Wu, Meiting Xue, Jilin Zhang, Jian Wan, Yunquan Zhang:
Aware: Adaptive Distributed Training with Computation, Communication and Position Awareness for Deep Learning Model. HPCC/DSS/SmartCity/DependSys 2022: 1299-1306 - [c76]Yunquan Zhang, Jidong Zhai, Rajiv Ranjan:
Message from the High Performance Computing and Communications 2022 Program Chairs. HPCC/DSS/SmartCity/DependSys 2022: lv - [c75]Cunyang Wei, Haipeng Jia, Yunquan Zhang, Liusha Xu, Ji Qi:
IATF: An Input-Aware Tuning Framework for Compact BLAS Based on ARMv8 CPUs. ICPP 2022: 66:1-66:11 - [c74]Kun Li, Liang Yuan, Yunquan Zhang, Yue Yue, Hang Cao:
An Efficient Vectorization Scheme for Stencil Computation. IPDPS 2022: 650-660 - [c73]Honghui Shang, Li Shen, Yi Fan, Zhiqian Xu, Chu Guo, Jie Liu, Wenhao Zhou, Huan Ma, Rongfen Lin, Yuling Yang, Fang Li, Zhuoya Wang, Yunquan Zhang, Zhenyu Li:
Large-Scale Simulation of Quantum Computational Chemistry on a New Sunway Supercomputer. SC 2022: 14:1-14:14 - [i16]Chendi Li, Haipeng Jia, Hang Cao, Jianyu Yao, Boqian Shi, Chunyang Xiang, Jinbo Sun, Pengqi Lu, Yunquan Zhang:
AutoTSMM: An Auto-tuning Framework for Building High-Performance Tall-and-Skinny Matrix-Matrix Multiplication on CPUs. CoRR abs/2208.08088 (2022) - [i15]Jianyu Yao, Boqian Shi, Chunyang Xiang, Haipeng Jia, Chendi Li, Hang Cao, Yunquan Zhang:
IAAT: A Input-Aware Adaptive Tuning framework for Small GEMM. CoRR abs/2208.09822 (2022) - 2021
- [j37]Zhixiang Ren, Yongheng Liu, Tianhui Shi, Lei Xie, Yue Zhou, Jidong Zhai, Youhui Zhang, Yunquan Zhang, Wenguang Chen:
AIPerf: Automated machine learning as an AI-HPC benchmark. Big Data Min. Anal. 4(3): 208-220 (2021) - [j36]Honghui Shang, WanZhen Liang, Yunquan Zhang, Jinlong Yang:
Efficient parallel linear scaling method to get the response density matrix in all-electron real-space density-functional perturbation theory. Comput. Phys. Commun. 258: 107613 (2021) - [j35]Honghui Shang, Xiaohui Duan, Fang Li, Libo Zhang, Zhiqian Xu, Kan Liu, Haiwen Luo, Yingrui Ji, Wenxuan Zhao, Wei Xue, Li Chen, Yunquan Zhang:
Many-core acceleration of the first-principles all-electron quantum perturbation calculations. Comput. Phys. Commun. 267: 108045 (2021) - [j34]Daning Cheng, Shigang Li, Hanping Zhang, Fen Xia, Yunquan Zhang:
Why Dataset Properties Bound the Scalability of Parallel Machine Learning Training Algorithms. IEEE Trans. Parallel Distributed Syst. 32(7): 1702-1712 (2021) - [c72]Tun Chen, Haipeng Jia, Zhihao Li, Chendi Li, Yunquan Zhang:
A Transpose-free Three-dimensional FFT Algorithm on ARM CPUs. HPCC/DSS/SmartCity/DependSys 2021: 1-8 - [c71]Pengqi Lu, Yue Yue, Liang Yuan, Yunquan Zhang:
AutoFlow: Hotspot-Aware, Dynamic Load Balancing for Distributed Stream Processing. ICA3PP (3) 2021: 133-151 - [c70]Jianyu Yao, Boqian Shi, Chunyang Xiang, Haipeng Jia, Chendi Li, Hang Cao, Yunquan Zhang:
IAAT: A Input-Aware Adaptive Tuning framework for Small GEMM. ICPADS 2021: 899-906 - [c69]Chendi Li, Haipeng Jia, Hang Cao, Jianyu Yao, Boqian Shi, Chunyang Xiang, Jinbo Sun, Pengqi Lu, Yunquan Zhang:
AutoTSMM: An Auto-tuning Framework for Building High-Performance Tall-and-Skinny Matrix-Matrix Multiplication on CPUs. ISPA/BDCloud/SocialCom/SustainCom 2021: 159-166 - [c68]Honghui Shang, Fang Li, Yunquan Zhang, Libo Zhang, You Fu, Yingxiang Gao, Yangjun Wu, Xiaohui Duan, Rongfen Lin, Xin Liu, Ying Liu, Dexun Chen:
Extreme-scale ab initio quantum raman spectra simulations on the leadership HPC system in China. SC 2021: 6 - [c67]Honghui Shang, Fang Li, Yunquan Zhang, Ying Liu, Libo Zhang, Mingchuan Wu, Yangjun Wu, Di Wei, Huimin Cui, Xin Liu, Fei Wang, Yuxi Ye, Yingxiang Gao, Shuang Ni, Xin Chen, Dexun Chen:
Accelerating all-electron ab initio simulation of raman spectra for biological systems. SC 2021: 41 - [c66]Honghui Shang, Xin Chen, Xingyu Gao, Rongfen Lin, Lifang Wang, Fang Li, Qian Xiao, Lei Xu, Qiang Sun, Leilei Zhu, Fei Wang, Yunquan Zhang, Haifeng Song:
TensorKMC: kinetic Monte Carlo simulation of 50 trillion atoms driven by deep learning on a new generation of Sunway supercomputer. SC 2021: 73 - [c65]Liang Yuan, Hang Cao, Yunquan Zhang, Kun Li, Pengqi Lu, Yue Yue:
Temporal vectorization for stencils. SC 2021: 82 - [c64]Kun Li, Liang Yuan, Yunquan Zhang, Yue Yue:
Reducing redundancy in data organization and arithmetic calculation for stencil computations. SC 2021: 84 - [i14]Kun Li, Liang Yuan, Yunquan Zhang, Yue Yue, Hang Cao, Pengqi Lu:
An Efficient Vectorization Scheme for Stencil Computation. CoRR abs/2103.08825 (2021) - [i13]Pengqi Lu, Liang Yuan, Yunquan Zhang, Hang Cao, Kun Li:
AutoFlow: Hotspot-Aware, Dynamic Load Balancing for Distributed Stream Processing. CoRR abs/2103.08888 (2021) - [i12]Kun Li, Liang Yuan, Yunquan Zhang, Yue Yue, Hang Cao, Pengqi Lu:
Reducing Redundancy in Data Organization and Arithmetic Calculation for Stencil Computations. CoRR abs/2103.09235 (2021) - [i11]Hang Cao, Liang Yuan, He Zhang, Yunquan Zhang:
Enhanced AGCM3D: A Highly Scalable Dynamical Core of Atmospheric General Circulation Model Based on Leap-Format. CoRR abs/2103.10114 (2021) - [i10]Kun Li, Liang Yuan, Yunquan Zhang, Gongwei Chen:
An Accurate and Efficient Large-scale Regression Method through Best Friend Clustering. CoRR abs/2104.10819 (2021) - 2020
- [j33]Honghui Shang, Lei Xu, Baodong Wu, Xinming Qin, Yunquan Zhang, Jinlong Yang:
The dynamic parallel distribution algorithm for hybrid density-functional calculations in HONPAS package. Comput. Phys. Commun. 254: 107204 (2020) - [j32]Wei Li, Jun Liang, Yunquan Zhang, Haipeng Jia, Lin Xiao, Qing Li:
Accelerated LiDAR data processing algorithm for self-driving cars on the heterogeneous computing platform. IET Comput. Digit. Tech. 14(5): 201-209 (2020) - [j31]Daobi Chen, Liang Yuan, Yunquan Zhang, Jingfu Yan, David K. Kahaner:
HPC software capability landscape in China. Int. J. High Perform. Comput. Appl. 34(1) (2020) - [j30]Xinming Qin, Honghui Shang, Lei Xu, Wei Hu, Jinlong Yang, Shigang Li, Yunquan Zhang:
The static parallel distribution algorithms for hybrid density-functional calculations in HONPAS package. Int. J. High Perform. Comput. Appl. 34(2) (2020) - [j29]Daning Cheng, Shigang Li, Yunquan Zhang:
WP-SGD: Weighted parallel SGD for distributed unbalanced-workload training system. J. Parallel Distributed Comput. 145: 202-216 (2020) - [j28]Liang Yuan, Yunquan Zhang, Xuerui Bai, Guangting Zhang:
并行程序设计语言中局部性机制的研究 (Research on Locality-aware Design Mechanism of State-of-the-art Parallel Programming Languages). 计算机科学 47(1): 7-16 (2020) - [j27]Kun Li, Shigang Li, Shan Huang, Yifeng Chen, Yunquan Zhang:
FastNBL: fast neighbor lists establishment for molecular dynamics simulation based on bitwise operations. J. Supercomput. 76(7): 5501-5520 (2020) - [j26]Zhihao Li, Haipeng Jia, Yunquan Zhang, Tun Chen, Liang Yuan, Richard W. Vuduc:
Automatic Generation of High-Performance FFT Kernels on Arm and X86 CPUs. IEEE Trans. Parallel Distributed Syst. 31(8): 1925-1941 (2020) - [c63]Ke Zhan, Zhonghua Lu, Yunquan Zhang:
Performance Optimization for Feature Extraction Section of DeepChem. ICA3PP (1) 2020: 290-304 - [c62]Hang Cao, Liang Yuan, He Zhang, Baodong Wu, Shigang Li, Pengqi Lu, Yunquan Zhang, Yongjun Xu, Minghua Zhang:
A Highly Efficient Dynamical Core of Atmospheric General Circulation Model based on Leap-Format. IPDPS 2020: 95-104 - [i9]Zhixiang Ren, Yongheng Liu, Tianhui Shi, Lei Xie, Yue Zhou, Jidong Zhai, Youhui Zhang, Yunquan Zhang, Wenguang Chen:
AIPerf: Automated machine learning as an AI-HPC benchmark. CoRR abs/2008.07141 (2020) - [i8]Liang Yuan, Hang Cao, Yunquan Zhang, Kun Li, Pengqi Lu, Yue Yue:
Temporal Vectorization for Stencils. CoRR abs/2010.04868 (2020)
2010 – 2019
- 2019
- [j25]Di Zhang, Yunquan Zhang, Qiang Niu, Xingbao Qiu:
Mining concise patterns on graph-connected itemsets. Neurocomputing 336: 27-35 (2019) - [j24]Zhihao Li, Haipeng Jia, Yunquan Zhang, Shice Liu, Shigang Li, Xiao Wang, Hao Zhang:
Efficient parallel optimizations of a high-performance SIFT on GPUs. J. Parallel Distributed Comput. 124: 78-91 (2019) - [j23]Yunquan Zhang:
2018年中国高性能计算机发展现状分析与展望 (State-of-the-art Analysis and Perspectives of 2018 China HPC Development). 计算机科学 46(1): 1-5 (2019) - [j22]Liang Yuan, Chen Ding, Wesley Smith, Peter J. Denning, Yunquan Zhang:
A Relational Theory of Locality. ACM Trans. Archit. Code Optim. 16(3): 33:1-33:26 (2019) - [j21]Kun Li, Shigang Li, Shan Huang, Yifeng Chen, Yunquan Zhang:
Correction to: FastNBL: fast neighbor lists establishment for molecular dynamics simulation based on bitwise operations. J. Supercomput. 75(12): 8339-8340 (2019) - [c61]Liang Yuan, Shan Huang, Yunquan Zhang, Hang Cao:
Tessellating Star Stencils. ICPP 2019: 43:1-43:10 - [c60]Daning Cheng, Hanping Zhang, Fen Xia, Shigang Li, Yunquan Zhang:
Using Gradient Based Multikernel Gaussian Process and Meta-Acquisition Function to Accelerate SMBO. ICTAI 2019: 440-447 - [c59]Kun Li, Shigang Li, Bei Wang, Yifeng Chen, Yunquan Zhang:
swMD: Performance Optimizations for Molecular Dynamics Simulation on Sunway Taihulight. ISPA/BDCloud/SocialCom/SustainCom 2019: 511-518 - [c58]Zhihao Li, Haipeng Jia, Yunquan Zhang, Tun Chen, Liang Yuan, Luning Cao, Xiao Wang:
AutoFFT: a template-based FFT codes auto-generation framework for ARM and X86 CPUs. SC 2019: 25:1-25:15 - [c57]Kun Li, Honghui Shang, Yunquan Zhang, Shigang Li, Baodong Wu, Dong Wang, Libo Zhang, Fang Li, Dexun Chen, Zhiqiang Wei:
OpenKMC: a KMC design for hundred-billion-atom simulation using millions of cores on Sunway Taihulight. SC 2019: 68:1-68:16 - [i7]Zihan Jiang, Wanling Gao, Lei Wang, Xingwang Xiong, Yuchen Zhang, Xu Wen, Chunjie Luo, Hainan Ye, Yunquan Zhang, Shengzhong Feng, Kenli Li, Weijia Xu, Jianfeng Zhan:
HPC AI500: A Benchmark Suite for HPC AI Systems. CoRR abs/1908.02607 (2019) - [i6]Daning Cheng, Hanping Zhang, Fen Xia, Shigang Li, Yunquan Zhang:
The Scalability for Parallel Machine Learning Training Algorithm: Dataset Matters. CoRR abs/1910.11510 (2019) - 2018
- [j20]Shigang Li, Yunquan Zhang, Torsten Hoefler:
Cache-Oblivious MPI All-to-All Communications Based on Morton Order. IEEE Trans. Parallel Distributed Syst. 29(3): 542-555 (2018) - [c56]Zihan Jiang, Wanling Gao, Lei Wang, Xingwang Xiong, Yuchen Zhang, Xu Wen, Chunjie Luo, Hainan Ye, Xiaoyi Lu, Yunquan Zhang, Shengzhong Feng, Kenli Li, Weijia Xu, Jianfeng Zhan:
HPC AI500: A Benchmark Suite for HPC AI Systems. Bench 2018: 10-22 - [c55]Xiao Wang, Haipeng Jia, Zhihao Li, Yunquan Zhang:
Implementation and Optimization of Multi-dimensional Real FFT on ARMv8 Platform. ICA3PP (2) 2018: 338-353 - [c54]Baodong Wu, Shigang Li, Hang Cao, Yunquan Zhang, He Zhang, Junmin Xiao, Minghua Zhang:
AGCM3D: A Highly Scalable Finite-Difference Dynamical Core of Atmospheric General Circulation Model Based on 3D Decomposition. ICPADS 2018: 355-364 - [c53]Junmin Xiao, Shigang Li, Baodong Wu, He Zhang, Kun Li, Erlin Yao, Yunquan Zhang, Guangming Tan:
Communication-Avoiding for Dynamical Core of Atmospheric General Circulation Model. ICPP 2018: 12:1-12:10 - [c52]Shigang Li, Baodong Wu, Yunquan Zhang, Xianmeng Wang, Jianjiang Li, Changjun Hu, Jue Wang, Yangde Feng, Ningming Nie:
Massively Scaling the Metal Microscopic Damage Simulation on Sunway TaihuLight Supercomputer. ICPP 2018: 47:1-47:11 - [c51]Liang Yuan, Wesley Smith, Sicong Fan, Zixu Chen, Chen Ding, Yunquan Zhang:
Footmark: A New Formulation for Working Set Statistics. LCPC 2018: 61-69 - [c50]Di Zhang, Yunquan Zhang, Qiang Niu, Xingbao Qiu:
Rolling Forecasting Forward by Boosting Heterogeneous Kernels. PAKDD (1) 2018: 248-260 - [e2]Zongben Xu, Xinbo Gao, Qiguang Miao, Yunquan Zhang, Jiajun Bu:
Big Data - 6th CCF Conference, Big Data 2018, Xi'an, China, October 11-13, 2018, Proceedings. Communications in Computer and Information Science 945, Springer 2018, ISBN 978-981-13-2921-0 [contents] - [i5]Liang Yuan, Chen Ding, Peter J. Denning, Yunquan Zhang:
A Measurement Theory of Locality. CoRR abs/1802.01254 (2018) - [i4]Daning Cheng, Fen Xia, Shigang Li, Yunquan Zhang:
Asynchronous Parallel Sampling Gradient Boosting Decision Tree. CoRR abs/1804.04659 (2018) - [i3]Daning Cheng, Hanping Zhang, Fen Xia, Shigang Li, Yunquan Zhang:
Using Known Information to Accelerate HyperParameters Optimization Based on SMBO. CoRR abs/1811.03322 (2018) - 2017
- [j19]Baodong Wu, Shigang Li, Yunquan Zhang, Ningming Nie:
Hybrid-optimization strategy for the communication of large-scale Kinetic Monte Carlo simulation. Comput. Phys. Commun. 211: 113-123 (2017) - [j18]Vijayalakshmi Srinivasan, Yunquan Zhang:
Special Issue on Network and Parallel Computing. Int. J. Parallel Program. 45(1): 1-3 (2017) - [c49]Zhihao Li, Haipeng Jia, Yunquan Zhang:
HartSift: A High-Accuracy and Real-Time SIFT Based on GPU. ICPADS 2017: 135-142 - [c48]Shigang Li, Yunquan Zhang, Torsten Hoefler:
POSTER: Cache-Oblivious MPI All-to-All Communications on Many-Core Architectures. PPoPP 2017: 445-446 - [c47]Liang Yuan, Yunquan Zhang, Peng Guo, Shan Huang:
Tessellating stencils. SC 2017: 49 - [i2]Daning Cheng, Shigang Li, Yunquan Zhang:
Weighted parallel SGD for distributed unbalanced-workload training system. CoRR abs/1708.04801 (2017) - [i1]Daning Cheng, Shigang Li, Yunquan Zhang:
Asynchronous COMID: the theoretic basis for transmitted data sparsification tricks on Parameter Server. CoRR abs/1709.02091 (2017) - 2016
- [j17]Yunquan Zhang, Ji-Lin Zhang:
Workshop on high performance data intensive computing. Concurr. Comput. Pract. Exp. 28(6): 1695-1696 (2016) - [j16]Renbo Pang, Yunquan Zhang, Guangming Tan, Jianliang Xu, Haipeng Jia, Qingchun Xie:
边缘海静力数值预报模式并行算法研究 (Parallelization of Hydrostatic Numerical Forecasting Model of Marginal Sea). 计算机科学 43(1): 14-17 (2016) - [j15]Tao Luo, Yin Liao, Guoliang Chen, Yunquan Zhang:
P-DOT: a model of computation for big data. Int. J. Parallel Emergent Distributed Syst. 31(3): 233-253 (2016) - [j14]Yunquan Zhang, Ting Cao, Shigang Li, Xinhui Tian, Liang Yuan, Haipeng Jia, Athanasios V. Vasilakos:
Parallel Processing Systems for Big Data: A Survey. Proc. IEEE 104(11): 2114-2136 (2016) - [j13]Yunquan Zhang, Shigang Li, Shengen Yan, Huiyang Zhou:
A Cross-Platform SpMV Framework on Many-Core Architectures. ACM Trans. Archit. Code Optim. 13(4): 33:1-33:25 (2016) - [c46]Chenxi Wang, Ting Cao, John N. Zigman, Fang Lv, Yunquan Zhang, Xiaobing Feng:
Efficient Management for Hybrid Memory in Managed Language Runtime. NPC 2016: 29-42 - 2015
- [j12]Shigang Li, Changjun Hu, Junchao Zhang, Yunquan Zhang:
Automatic tuning of sparse matrix-vector multiplication on multicore clusters. Sci. China Inf. Sci. 58(9): 1-14 (2015) - [j11]Qingkui Gong, Changyou Zhang, Xianyi Zhang, Yunquan Zhang:
基于Julia语言的并行计算方法初探 (Primary Investigation into Parallel Computing in Julia Language). 计算机科学 42(1): 44-46 (2015) - [j10]Ke Zhan, Yunquan Zhang, Ting Wang, Jingjing Zheng, Peng Zhang:
基于Pthreads的并行DSRC压缩算法设计与实现 (Design and Implementation of Parallel DSRC Compression Algorithm Based on Pthreads). 计算机科学 42(1): 90-91 (2015) - [j9]Xiaojing An, Yunquan Zhang, Haipeng Jia:
基于OpenCL的直方图生成算法优化方法研究 (Research on Histogram Generation Algorithm Optimization Based on OpenCL). 计算机科学 42(11): 32-36 (2015) - [c45]Renbo Pang, Jianliang Xu, Yunquan Zhang:
Parallel Solving Method of SOR Based on the Numerical Marine Forecasting Model. CCGRID 2015: 733-736 - [c44]Xiaomin Zhu, Junchao Zhang, Kazutomo Yoshii, Shigang Li, Yunquan Zhang, Pavan Balaji:
Analyzing MPI-3.0 Process-Level Shared Memory: A Case Study with Stencil Computations. CCGRID 2015: 1099-1106 - [c43]Shigang Li, Yunquan Zhang, Chunyang Xiang, Lei Shi:
Fast Convolution Operations on Many-Core Architectures. HPCC/CSS/ICESS 2015: 316-323 - [c42]Xiaojing An, Haipeng Jia, Yunquan Zhang:
Optimized Password Recovery for Encrypted RAR on GPUs. HPCC/CSS/ICESS 2015: 591-598 - [c41]Mengran Fan, Haipeng Jia, Yunquan Zhang, Xiaojing An, Ting Cao:
Optimizing Image Sharpening Algorithm on GPU. ICPP 2015: 230-239 - [c40]James Dinan, Wenguang Chen, Xiaosong Ma, Pavan Balaji, Satoshi Matsuoka, Jiayuan Meng, Yunquan Zhang:
AsHES Introduction and Committees. IPDPS Workshops 2015: 591-592 - [e1]Xiaohua Jia, Tharam S. Dillon, Kuan-Ching Li, Yong Zhang, Nei Kato, Kui Wu, Yunquan Zhang:
Ninth International Conference on Frontier of Computer Science and Technology, FCST 2015, Dalian, China, August 26-28, 2015. IEEE Computer Society 2015, ISBN 978-1-4673-9295-2 [contents] - 2014
- [j8]Yiqung Liu, Yan Li, Yunquan Zhang,