


default search action
Guangwen Yang 0002
Person information
- affiliation: Tsinghua University, Beijing, China
Other persons with the same name
- Guangwen Yang — disambiguation page
- Guangwen Yang 0001
— Hebei University, Baoding, Hebei, China
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2026
[c51]Zeyu Song
, Lin Gan
, Xiaohui Duan
, Zhengrui Li
, Jiayu Fu
, Yinuo Wang
, Guangzhao Li
, Guangwen Yang
:
HierCut: Enabling 16-bit Format Mixed Precision for Molecular Dynamics through Hierarchical Cutoff. PPoPP 2026: 315-328- 2025
[j39]Jingle Xu
, Jiayu Fu
, Lin Gan
, Yaojian Chen
, Zhaoqi Sun
, Zhenchun Huang
, Guangwen Yang
:
Leveraging the Hardware Resources to Accelerate cryo-EM Reconstruction of RELION on the New Sunway Supercomputer. ACM Trans. Archit. Code Optim. 22(1): 6:1-6:25 (2025)
[j38]Yinuo Wang
, Zeyu Song, Wubing Wan
, Xinpeng Zhao
, Lin Gan
, Ping Gao
, Wenqiang Wang
, Zhenguo Zhang
, Haohuan Fu
, Wei Xue
, Guangwen Yang
:
Accelerating Half-Precision Seismic Simulation on Neural Processing Unit. IEEE Trans. Parallel Distributed Syst. 36(9): 1998-2013 (2025)
[c50]Quan Deng, Lin Gan, Hongkun Yu, Wenlai Zhao, Guangwen Yang:
Auto-Stencil: Performance-Driven Stencil Optimization with Hardware Feedback for LLMs. ICPP 2025: 1-10
[c49]Xiaohui Duan
, Cheng Shen
, Gaowei Chen
, Shanshan Wu
, Yizhen Wang
, Yizhen Chen
, Qixin Chang
, Qiancheng Xia
, Zekun Yin
, Lin Gan
, Yibing Shan
, Guangwen Yang
, Weiguo Liu
, Niu Huang
:
Trillion Ligands per Day: Performance-Portable Virtual Screening via Compound Database Optimization and Multi-Target Docking. SC 2025: 2172-2185
[c48]Jiayu Fu
, Jingle Xu
, Lin Gan
, Tianqi Mao
, Zirong Shen
, Yinuo Wang
, Zeyu Song
, Xiaohui Duan
, Wei Xue
, Guangwen Yang
:
T2-RELION: Task Parallelism, Tensor Core Accelerated RELION for Cryo-EM 3D Reconstruction. SC 2025: 2186-2202
[i11]Yaojian Chen, Tianyu Ma, An Yang, Lin Gan, Wenlai Zhao, Guangwen Yang:
GenTT: Generate Vectorized Codes for General Tensor Permutation. CoRR abs/2506.03686 (2025)
[i10]Yinuo Wang, Tianqi Mao, Lin Gan, Wubing Wan, Zeyu Song, Jiayu Fu, Lanke He, Wenqiang Wang
, Zekun Yin, Wei Xue, Guangwen Yang:
MMStencil: Optimizing High-order Stencils on Multicore CPU using Matrix Unit. CoRR abs/2507.11067 (2025)- 2024
[j37]Haoran Lin, Lifeng Yan, Qixin Chang, Haitian Lu, Chenlin Li, Quanjie He, Zeyu Song, Xiaohui Duan
, Zekun Yin, Yuxuan Li, Zhao Liu, Wei Xue, Haohuan Fu, Lin Gan, Guangwen Yang, Weiguo Liu:
O2ath: an OpenMP offloading toolkit for the sunway heterogeneous manycore platform. CCF Trans. High Perform. Comput. 6(3): 274-286 (2024)
[j36]Mingzhen Li, Changxi Liu
, Jianjin Liao, Xuegui Zheng, Hailong Yang, Rujun Sun, Jun Xu, Lin Gan, Guangwen Yang, Zhongzhi Luan, Depei Qian:
Towards optimized tensor code generation for deep learning on sunway many-core processor. Frontiers Comput. Sci. 18(2): 182101 (2024)
[j35]Yushu Chen
, Shengzhuo Liu, Jinzhe Yang, Hao Jing, Wenlai Zhao
, Guangwen Yang:
A Joint Time-Frequency Domain Transformer for multivariate time series forecasting. Neural Networks 176: 106334 (2024)
[j34]Zhao Liu
, Xuesen Chu
, Xiaojing Lv
, Hongsong Meng
, Hanyue Liu
, Guanghui Zhu
, Haohuan Fu
, Guangwen Yang
:
SunwayLB: Enabling Extreme-Scale Lattice Boltzmann Method Based Computing Fluid Dynamics Simulations on Advanced Heterogeneous Supercomputers. IEEE Trans. Parallel Distributed Syst. 35(2): 324-337 (2024)
[j33]Quan Deng
, Qiang Liu
, Ming Yuan, Xiaohui Duan
, Lin Gan
, Jinzhe Yang, Wenlai Zhao
, Zhenxiang Zhang
, Guiming Wu
, Wayne Luk
, Haohuan Fu
, Guangwen Yang
:
Acceleration of Multi-Body Molecular Dynamics With Customized Parallel Dataflow. IEEE Trans. Parallel Distributed Syst. 35(12): 2297-2314 (2024)
[j32]Liang Wang
, Jinzhe Yang, Jidong Zhai
, Guangwen Yang
:
Optimizing I/O Performance Through Effective vCPU Scheduling Interference Management. IEEE Trans. Parallel Distributed Syst. 35(12): 2315-2330 (2024)
[c47]Zheng Zhang
, Yingsheng Ji
, Jiachen Shen
, Yushu Chen
, Xi Zhang
, Guangwen Yang
:
Collaborative Metapath Enhanced Corporate Default Risk Assessment on Heterogeneous Graph. WWW 2024: 446-456
[i9]Xiaohui Duan, Yuxuan Li, Zhao Liu, Bin Yang, Juepeng Zheng, Haohuan Fu, Shaoqing Zhang, Shiming Xu, Yang Gao, Wei Xue, Di Wei, Xiaojing Lv, Lifeng Yan, Haopeng Huang, Haitian Lu, Lingfeng Wan, Haoran Lin, Qixin Chang, Chenlin Li, Quanjie He, Zeyu Song, Xuantong Wang, Yangyang Yu, Xilong Fan, Zhaopeng Qu, Yankun Xu, Xiuwen Guo, Yunlong Fei, Zhaoying Wang, Mingkui Li, Yingjing Jiang, Lv Lu, Liang Su, Jiayu Fu, Peinan Yu, Weiguo Liu, Lixin Wu, Lanning Wang, Xin Liu, Dexun Chen, Guangwen Yang:
Kilometer-Level Coupled Modeling Using 40 Million Cores: An Eight-Year Journey of Model Development. CoRR abs/2404.10253 (2024)
[i8]Haoyu Ma, Yushu Chen, Wenlai Zhao, Jinzhe Yang, Yingsheng Ji, Xinghua Xu, Xiaozhu Liu
, Hao Jing, Shengzhuo Liu, Guangwen Yang:
A Mamba Foundation Model for Time Series Forecasting. CoRR abs/2411.02941 (2024)- 2023
[j31]Hailong Yang
, Yi Liu
, Zhongzhi Luan
, Lin Gan
, Guangwen Yang, Depei Qian
:
Input-Aware Sparse Tensor Storage Format Selection for Optimizing MTTKRP. Computer 56(8): 4-7 (2023)
[j30]Yushu Chen
, Guangwen Yang
, Lu Wang
, Haipeng Chen
, Qingzhong Gan
, Quanyong Xu
:
A Fast Algorithm for Onboard Atmospheric Powered Descent Guidance. IEEE Trans. Aerosp. Electron. Syst. 59(5): 6112-6123 (2023)
[j29]Xiaohui Duan
, Qi Shao, Junben Weng, Bertil Schmidt
, Lin Gan
, Guohui Li, Haohuan Fu
, Wei Xue
, Weiguo Liu
, Guangwen Yang:
Bio-ESMD: A Data Centric Implementation for Large-Scale Biological System Simulation on Sunway TaihuLight Supercomputer. IEEE Trans. Parallel Distributed Syst. 34(3): 881-893 (2023)
[j28]Ping Gao
, Xiaohui Duan
, Bertil Schmidt
, Wubing Wan
, Jiaxu Guo
, Wusheng Zhang
, Lin Gan
, Haohuan Fu
, Wei Xue
, Weiguo Liu
, Guangwen Yang
:
Redesign and Accelerate the AIREBO Bond-Order Potential on the New Sunway Supercomputer. IEEE Trans. Parallel Distributed Syst. 34(12): 3117-3132 (2023)
[c46]Wei Gao, Wenxiang Zhang, Wenzhao Wu, Yanjie Zhen, Wenlai Zhao, Guangwen Yang:
Automatic Deep Learning Operator Fusion on Sunway SW26010 Many-Core Processor. ICPADS 2023: 1943-1950
[c45]Zhao Liu
, Xuesen Chu
, Xiaojing Lv
, Hanyue Liu
, Haohuan Fu
, Guangwen Yang
:
Accelerating Large-Scale CFD Simulations with Lattice Boltzmann Method on a 40-Million-Core Sunway Supercomputer. ICPP 2023: 797-806
[c44]Yuhang Fu
, Weiqi Shen
, Jiahuan Cui
, Yao Zheng
, Guangwen Yang
, Zhao Liu
, Jifa Zhang
, Tingwei Ji
, Fangfang Xie
, Xiaojing Lv
, Hanyue Liu
, Xu Liu
, Xiyang Liu
, Xiaoyu Song
, Guocheng Tao
, Yan Yan
, Paul Tucker
, Steven A. E. Miller
, Shirui Luo
, Seid Koric
, Weimin Zheng
:
Toward Exascale Computation for Turbomachinery Flows. SC 2023: 4:1-4:12
[c43]Wubing Wan, Lin Gan, Wenqiang Wang
, Zekun Yin
, Haodong Tian, Zhenguo Zhang
, Yinuo Wang, Mengyuan Hua, Xiaohui Liu, Shengye Xiang, Zhongqiu He, Zijia Wang
, Ping Gao, Xiaohui Duan, Weiguo Liu, Wei Xue, Haohuan Fu, Guangwen Yang, Xiaofei Chen, Zeyu Song, Yaojian Chen, Xin Liu, Wei Zhang:
69.7-PFlops Extreme Scale Earthquake Simulation with Crossing Multi-faults and Topography on Sunway. SC 2023: 10:1-10:15
[c42]Xiaohui Duan
, Jin Wang
, Ping Gao
, Ming Ma
, Lin Gan
, Xin Liu
, Haohuan Fu
, Wei Xue
, Dexun Chen
, Guangwen Yang
, Weiguo Liu
:
Enabling Real World Scale Structural Superlubricity All-Atom Simulation on the Next-Generation Sunway Supercomputer. SC 2023: 99:1-99:14
[i7]Yushu Chen, Shengzhuo Liu, Jinzhe Yang, Hao Jing, Wenlai Zhao, Guangwen Yang:
A Joint Time-frequency Domain Transformer for Multivariate Time Series Forecasting. CoRR abs/2305.14649 (2023)
[i6]Yufan Jiang, Qiaozhi He, Xiaomin Zhuang, Zhihua Wu, Kunpeng Wang, Wenlai Zhao, Guangwen Yang:
RecycleGPT: An Autoregressive Language Model with Recyclable Module. CoRR abs/2308.03421 (2023)
[i5]Haoran Lin, Lifeng Yan, Qixin Chang, Haitian Lu, Chenlin Li, Quanjie He, Zeyu Song, Xiaohui Duan, Zekun Yin, Yuxuan Li, Zhao Liu, Wei Xue, Haohuan Fu, Lin Gan, Guangwen Yang, Weiguo Liu:
O2ATH: An OpenMP Offloading Toolkit for the Sunway Heterogeneous Manycore Platform. CoRR abs/2309.04945 (2023)- 2022
[j27]Bingwei Chen, Haohuan Fu, Wayne Luk, Guangwen Yang:
A fully-customized dataflow engine for 3D earthquake simulation with a complex topography. Sci. China Inf. Sci. 65(5): 1-16 (2022)
[j26]Yuxuan Li
, Xiaohui Duan, Lin Gan
, Wubing Wan, Yuhu Chen
, Kai Xu, Jinzhe Yang, Weiguo Liu, Wei Xue, Haohuan Fu
, Guangwen Yang:
Enabling Large-Scale Simulation of CAM on the Sunway TaihuLight Supercomputer. IEEE Trans. Computers 71(4): 824-837 (2022)
[j25]Qingxiao Sun
, Yi Liu
, Hailong Yang
, Ming Dun, Zhongzhi Luan
, Lin Gan
, Guangwen Yang, Depei Qian
:
Input-Aware Sparse Tensor Storage Format Selection for Optimizing MTTKRP. IEEE Trans. Computers 71(8): 1968-1981 (2022)
[j24]Ping Gao
, Xiaohui Duan, Bertil Schmidt
, Wusheng Zhang, Lin Gan, Haohuan Fu
, Wei Xue, Weiguo Liu, Guangwen Yang:
Optimization of Reactive Force Field Simulation: Refactor, Parallelization, and Vectorization for Interactions. IEEE Trans. Parallel Distributed Syst. 33(2): 359-373 (2022)
[j23]Yuxuan Li
, Lin Gan
, Mingcheng Chen, Yaojian Chen
, Haitian Lu, Chao-Yang Lu
, Jian-Wei Pan, Haohuan Fu
, Guangwen Yang:
Benchmarking 50-Photon Gaussian Boson Sampling on the Sunway TaihuLight. IEEE Trans. Parallel Distributed Syst. 33(6): 1357-1372 (2022)
[j22]Kai Xu
, Jinxiao Zhang, Xiaohui Duan
, Xiaobo Wan, Niu Huang, Bertil Schmidt
, Weiguo Liu
, Guangwen Yang:
Redesigning and Optimizing UCSF DOCK3.7 on Sunway TaihuLight. IEEE Trans. Parallel Distributed Syst. 33(10): 4458-4471 (2022)
[c41]Ming Yuan, Qiang Liu, Quan Deng, Shengye Xiang, Lin Gan, Jinzhe Yang, Xiaohui Duan, Haohuan Fu, Guangwen Yang:
FPGA-Accelerated Tersoff Multi-body Potential for Molecular Dynamics Simulations. ARC 2022: 17-31
[c40]Jingle Xu, Jiayu Fu, Lin Gan, Yaojian Chen
, Zhenchun Huang
, Guangwen Yang:
Accelerating cryo-EM Reconstruction of RELION on the New Sunway Supercomputer. ISPA/BDCloud/SocialCom/SustainCom 2022: 129-138
[i4]Zheng Zhang, Yingsheng Ji, Jiachen Shen, Xi Zhang, Guangwen Yang:
Heterogeneous Information Network based Default Analysis on Banking Micro and Small Enterprise Users. CoRR abs/2204.11849 (2022)- 2021
[j21]Ming Dun, Yunchun Li, Qingxiao Sun, Hailong Yang, Wei Li, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian:
Towards efficient canonical polyadic decomposition on sunway many-core processor. Inf. Sci. 549: 221-248 (2021)
[j20]Lin Gan
, Haohuan Fu, Guangwen Yang:
Translating novel HPC techniques into efficient geoscience solutions. J. Comput. Sci. 52: 101212 (2021)
[j19]Qingchang Han, Hailong Yang
, Ming Dun, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian:
Towards efficient tile low-rank GEMM computation on sunway many-core processors. J. Supercomput. 77(5): 4533-4564 (2021)
[j18]Mingzhen Li
, Yi Liu
, Xiaoyan Liu, Qingxiao Sun, Xin You, Hailong Yang
, Zhongzhi Luan
, Lin Gan, Guangwen Yang, Depei Qian
:
The Deep Learning Compiler: A Comprehensive Survey. IEEE Trans. Parallel Distributed Syst. 32(3): 708-727 (2021)
[c39]Ping Gao, Xiaohui Duan, Jiaxu Guo
, Jin Wang, Zhenya Song, Lizhen Cui, Xiangxu Meng, Xin Liu, Wusheng Zhang, Ming Ma, Guohui Li, Dexun Chen, Haohuan Fu, Wei Xue, Weiguo Liu, Guangwen Yang:
LMFF: efficient and scalable layered materials force field on heterogeneous many-core processors. SC 2021: 42- 2020
[j17]Xiaohui Duan, Meng Zhang, Weiguo Liu, Haohuan Fu, Lin Gan, Wei Xue, Guangwen Yang:
Tuning a general purpose software cache library for TaihuLight's SW26010 processor. CCF Trans. High Perform. Comput. 2(2): 164-182 (2020)
[j16]Lin Gan, Ming Yuan, Jinzhe Yang, Wenlai Zhao, Wayne Luk, Guangwen Yang:
High performance reconfigurable computing for numerical simulation and deep learning. CCF Trans. High Perform. Comput. 2(2): 196-208 (2020)
[j15]Teng Yu
, Wenlai Zhao
, Pan Liu
, Vladimir Janjic
, Xiaohan Yan, Shicai Wang
, Haohuan Fu
, Guangwen Yang, John Thomson
:
Large-Scale Automatic K-Means Clustering for Heterogeneous Many-Core Supercomputer. IEEE Trans. Parallel Distributed Syst. 31(5): 997-1008 (2020)
[j14]Yongmin Hu, Hailong Yang
, Zhongzhi Luan
, Lin Gan, Guangwen Yang, Depei Qian
:
Massively Scaling Seismic Processing on Sunway TaihuLight Supercomputer. IEEE Trans. Parallel Distributed Syst. 31(5): 1194-1208 (2020)
[j13]Mingzhen Li
, Yi Liu
, Hailong Yang
, Zhongzhi Luan
, Lin Gan, Guangwen Yang, Depei Qian
:
Accelerating Sparse Cholesky Factorization on Sunway Manycore Architecture. IEEE Trans. Parallel Distributed Syst. 31(7): 1636-1650 (2020)
[j12]Ping Gao
, Xiaohui Duan, Tingjian Zhang, Meng Zhang, Bertil Schmidt
, Xun Zhang, Hongliang Sun, Wusheng Zhang, Lin Gan, Wei Xue, Haohuan Fu
, Weiguo Liu, Guangwen Yang:
Millimeter-Scale and Billion-Atom Reactive Force Field Simulation on Sunway Taihulight. IEEE Trans. Parallel Distributed Syst. 31(12): 2954-2967 (2020)
[c38]Bangduo Chen, Mingzhen Li, Hailong Yang, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian:
swRodinia: A Benchmark Suite for Exploiting Architecture Properties of Sunway Processor. Bench 2020: 22-38
[c37]Xiaohui Duan, Ping Gao, Meng Zhang, Tingjian Zhang, Hongsong Meng, Yuxuan Li, Bertil Schmidt
, Haohuan Fu, Lin Gan, Wei Xue, Guangwen Yang, Weiguo Liu:
Neighbor-list-free molecular dynamics on sunway TaihuLight supercomputer. PPoPP 2020: 413-414
[c36]Qingxiao Sun, Yi Liu, Ming Dun, Hailong Yang, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian:
SpTFS: sparse tensor format selection for MTTKRP via deep learning. SC 2020: 18
[c35]Xiaohui Duan, Ping Gao, Meng Zhang, Tingjian Zhang, Hongsong Meng, Yuxuan Li, Bertil Schmidt
, Haohuan Fu, Lin Gan, Wei Xue, Weiguo Liu, Guangwen Yang:
Cell-list based molecular dynamics on many-core processors: a case study on sunway TaihuLight supercomputer. SC 2020: 22
2010 – 2019
- 2019
[j11]Xiaogang Zhong, Hailong Yang
, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian:
swTensor: accelerating tensor decomposition on Sunway architecture. CCF Trans. High Perform. Comput. 1(3-4): 161-176 (2019)
[j10]Jingheng Xu
, Haohuan Fu, Wen Shi, Lin Gan, Yuxuan Li, Wayne Luk, Guangwen Yang:
Performance Tuning and Analysis for Stencil-Based Applications on POWER8 Processor. ACM Trans. Archit. Code Optim. 15(4): 41:1-41:25 (2019)
[j9]Jingheng Xu
, Guangwen Yang, Haohuan Fu
, Wayne Luk, Lin Gan
, Wen Shi, Wei Xue, Chao Yang
, Yong Jiang, Conghui He
:
Optimizing Finite Volume Method Solvers on Nvidia GPUs. IEEE Trans. Parallel Distributed Syst. 30(12): 2790-2805 (2019)
[c34]Liang Qiao, Hongkun Yu, Kunpeng Wang, Ruixin Sun, Wenlai Zhao, Haohuan Fu, Guangwen Yang:
Large-scale Parallel Design for Cryo-EM Structure Determination on Heterogeneous Many-core Architectures. BIBM 2019: 711-716
[c33]Ouyi Li, Wenlai Zhao, Xuancheng Huang, Yushu Chen, Lin Gan, Hongkun Yu, Jiacheng Zhang, Yang Liu, Haohuan Fu, Guangwen Yang:
Scaling the Training of Recurrent Neural Networks on Sunway TaihuLight Supercomputer. ICCS (1) 2019: 427-440
[c32]Wei Gao, Jiarui Fang
, Wenlai Zhao, Jinzhe Yang, Long Wang, Lin Gan, Haohuan Fu, Guangwen Yang:
swATOP: Automatically Optimizing Deep Learning Operators on SW26010 Many-Core Processor. ICPP 2019: 89:1-89:10
[c31]Kunpeng Wang, Shizhen Xu, Haohuan Fu, Hongkun Yu, Wenlai Zhao, Guangwen Yang:
Parallelizing cryo-EM 3D reconstruction on GPU cluster with a partitioned and streamed model. ICS 2019: 13-23
[c30]Tingjian Zhang, Yuxuan Li, Ping Gao, Qi Shao, Mingshan Shao, Meng Zhang, Jinxiao Zhang, Xiaohui Duan, Zhao Liu, Lin Gan, Haohuan Fu, Wei Xue, Weiguo Liu, Guangwen Yang:
SW_GROMACS: accelerate GROMACS on Sunway TaihuLight. SC 2019: 66:1-66:14
[i3]Jiarui Fang, Liandeng Li, Haohuan Fu, Jinlei Jiang, Wenlai Zhao, Conghui He, Xin You, Guangwen Yang:
swCaffe: a Parallel Framework for Accelerating Deep Learning Applications on Sunway TaihuLight. CoRR abs/1903.06934 (2019)
[i2]Changxi Liu, Hailong Yang, Rujun Sun, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian:
swTVM: Exploring the Automated Compilation for Deep Learning on Sunway Architecture. CoRR abs/1904.07404 (2019)
[i1]Yushu Chen, Hao Jing, Wenlai Zhao, Zhiqiang Liu, Liang Qiao, Wei Xue, Haohuan Fu, Guangwen Yang:
NAMSG: An Efficient Method For Training Neural Networks. CoRR abs/1905.01422 (2019)- 2018
[j8]Guangwen Yang
, Haohuan Fu:
Application software beyond exascale: challenges and possible trends. Frontiers Inf. Technol. Electron. Eng. 19(10): 1267-1272 (2018)
[j7]Wenlai Zhao, Haohuan Fu, Jiarui Fang
, Weijie Zheng
, Lin Gan, Guangwen Yang:
Optimizing Convolutional Neural Networks on the Sunway TaihuLight Supercomputer. ACM Trans. Archit. Code Optim. 15(1): 13:1-13:26 (2018)
[c29]Liandeng Li, Jiarui Fang
, Haohuan Fu, Jinlei Jiang, Wenlai Zhao, Conghui He, Xin You, Guangwen Yang:
swCaffe: A Parallel Framework for Accelerating Deep Learning Applications on Sunway TaihuLight. CLUSTER 2018: 413-422
[c28]Xinliang Wang, Ping Xu, Wei Xue, Yulong Ao, Chao Yang, Haohuan Fu, Lin Gan, Guangwen Yang, Weimin Zheng:
A Fast Sparse Triangular Solver for Structured-grid Problems on Sunway Many-core Processor SW26010. ICPP 2018: 53:1-53:11
[c27]Shizhen Xu, Yuanchao Xu
, Wei Xue, Xipeng Shen
, Fang Zheng, Xiaomeng Huang, Guangwen Yang:
Taming the "Monster": Overcoming Program Optimization Challenges on SW26010 Through Precise Performance Modeling. IPDPS 2018: 763-773
[c26]Xiaohui Duan, Ping Gao, Tingjian Zhang, Meng Zhang, Weiguo Liu, Wusheng Zhang, Wei Xue, Haohuan Fu, Lin Gan, Dexun Chen, Xiangxu Meng, Guangwen Yang:
Redesigning LAMMPS for peta-scale and hundred-billion-atom simulation on Sunway TaihuLight. SC 2018: 12:1-12:12
[c25]Liandeng Li, Teng Yu, Wenlai Zhao, Haohuan Fu, Chenyu Wang, Li Tan, Guangwen Yang, John Thomson:
Large-scale hierarchical k-means for heterogeneous many-core supercomputers. SC 2018: 13:1-13:11- 2017
[j6]Lin Gan, Haohuan Fu, Oskar Mencer, Wayne Luk, Guangwen Yang:
Chapter Four - Data Flow Computing in Geoscience Applications. Adv. Comput. 104: 125-158 (2017)
[j5]Lin Gan, Haohuan Fu, Wayne Luk, Chao Yang, Wei Xue, Guangwen Yang:
Solving Mesoscale Atmospheric Dynamics Using a Reconfigurable Dataflow Architecture. IEEE Micro 37(4): 40-50 (2017)
[j4]Conghui He, Haohuan Fu, Ce Guo, Wayne Luk, Guangwen Yang:
A Fully-Pipelined Hardware Design for Gaussian Mixture Models. IEEE Trans. Computers 66(11): 1837-1850 (2017)
[c24]Haohuan Fu, Conghui He, Wayne Luk, Weijia Li, Guangwen Yang:
A Nanosecond-Level Hybrid Table Design for Financial Market Data Generators. FCCM 2017: 227-234
[c23]Haohuan Fu, Conghui He, Huabin Ruan, Itay Greenspon, Wayne Luk, Yongkang Zheng, Junfeng Liao, Qing Zhang, Guangwen Yang:
Accelerating Financial Market Server through Hybrid List Design (Abstract Only). FPGA 2017: 289-290
[c22]Jiarui Fang
, Haohuan Fu, Wenlai Zhao, Bingwei Chen, Weijie Zheng
, Guangwen Yang:
swDNN: A Library for Accelerating Deep Learning Applications on Sunway TaihuLight. IPDPS 2017: 615-624
[c21]Haohuan Fu, Conghui He, Bingwei Chen, Zekun Yin, Zhenguo Zhang
, Wenqiang Zhang
, Tingjian Zhang, Wei Xue, Weiguo Liu, Wanwang Yin, Guangwen Yang, Xiaofei Chen:
18.9-Pflops nonlinear earthquake simulation on Sunway TaihuLight: enabling depiction of 18-Hz and 8-meter scenarios. SC 2017: 2- 2016
[j3]Haohuan Fu, Junfeng Liao, Jinzhe Yang, Lanning Wang, Zhenya Song, Xiaomeng Huang, Chao Yang
, Wei Xue, Fangfang Liu, Fangli Qiao, Wei Zhao, Xunqiang Yin, Chaofeng Hou, Chenglong Zhang, Wei Ge, Jian Zhang, Yangang Wang, Chunbo Zhou, Guangwen Yang:
The Sunway TaihuLight supercomputer: system and applications. Sci. China Inf. Sci. 59(7): 072001:1-072001:16 (2016)
[c20]Haohuan Fu, Jingheng Xu, Lin Gan, Chao Yang, Wei Xue, Wenlai Zhao, Wen Shi, Xinliang Wang, Guangwen Yang:
Unleashing the performance potential of CPU-GPU platforms for the 3D atmospheric Euler solver. ASAP 2016: 41-49
[c19]Wenlai Zhao, Haohuan Fu, Wayne Luk, Teng Yu, Shaojun Wang, Bo Feng, Yuchun Ma, Guangwen Yang:
F-CNN: An FPGA-based framework for training Convolutional Neural Networks. ASAP 2016: 107-114
[c18]Jingheng Xu, Haohuan Fu, Lin Gan, Chao Yang, Wei Xue, Shizhen Xu, Wenlai Zhao, Xinliang Wang, Bingwei Chen, Guangwen Yang:
Generalized GPU Acceleration for Applications Employing Finite-Volume Methods. CCGrid 2016: 126-135
[c17]Jingheng Xu, Haohuan Fu, Lin Gan, Chao Yang
, Wei Xue, Guangwen Yang:
Accelerating the 3D euler atmospheric solver through heterogeneous CPU-GPU platforms. Conf. Computing Frontiers 2016: 353-356
[c16]Chao Yang, Wei Xue, Haohuan Fu, Hongtao You, Xinliang Wang, Yulong Ao, Fangfang Liu, Lin Gan, Ping Xu, Lanning Wang, Guangwen Yang, Weimin Zheng:
10M-core scalable fully-implicit solver for nonhydrostatic atmospheric dynamics. SC 2016: 57-68
[c15]Haohuan Fu, Junfeng Liao, Wei Xue, Lanning Wang, Dexun Chen, Long Gu, Jinxiu Xu, Nan Ding, Xinliang Wang, Conghui He, Shizhen Xu, Yishuang Liang, Jiarui Fang
, Yuanchao Xu
, Weijie Zheng
, Jingheng Xu, Zhen Zheng, Wanjing Wei, Xu Ji, He Zhang, Bingwei Chen, Kaiwei Li, Xiaomeng Huang, Wenguang Chen, Guangwen Yang:
Refactoring and optimizing the community atmosphere model (CAM) on the sunway taihulight supercomputer. SC 2016: 969-980- 2015
[j2]Lin Gan, Haohuan Fu, Wayne Luk, Chao Yang
, Wei Xue, Xiaomeng Huang, Youhui Zhang, Guangwen Yang:
Solving the Global Atmospheric Equations through Heterogeneous Reconfigurable Platforms. ACM Trans. Reconfigurable Technol. Syst. 8(2): 11:1-11:16 (2015)
[c14]Bangtian Liu, Haohuan Fu, Lin Gan, Wenlai Zhao, Guangwen Yang:
Optimizing Residue Number Reverse Converters through Bitwise Arithmetic on FPGAs. FCCM 2015: 236-243- 2014
[j1]Yang You, Haohuan Fu, Shuaiwen Leon Song, Maryam Mehri Dehnavi, Lin Gan, Xiaomeng Huang, Guangwen Yang:
Evaluating multi-core and many-core architectures through accelerating the three-dimensional Lax-Wendroff correction stencil. Int. J. High Perform. Comput. Appl. 28(3): 301-318 (2014)
[c13]Yanhua Li, Youhui Zhang, Jianfeng Yang, Wayne Luk, Guangwen Yang, Weimin Zheng:
An approach of processor core customization for stencil computation. ASAP 2014: 182-183
[c12]Wenlai Zhao, Haohuan Fu, Guangwen Yang:
A Fully-Pipelined FPGA Design for Tree-Reweighted Message Passing Algorithm. FCCM 2014: 177
[c11]Lin Gan, Haohuan Fu, Chao Yang
, Wayne Luk, Wei Xue, Oskar Mencer, Xiaomeng Huang, Guangwen Yang:
A highly-efficient and green data flow engine for solving euler atmospheric equations. FPL 2014: 1-6
[c10]Wenlai Zhao, Haohuan Fu, Guangwen Yang, Wayne Luk:
Patra: Parallel tree-reweighted message passing architecture. FPL 2014: 1-6
[c9]Lin Gan, Haohuan Fu, Wei Xue, Yangtong Xu, Chao Yang, Xinliang Wang, Zihong Lv, Yang You, Guangwen Yang, Kaijian Ou:
Scaling and analyzing the stencil performance on multi-core and many-core architectures. ICPADS 2014: 103-110- 2013
[c8]Yong Hu, Xiaomeng Huang, Xiaoge Wang, Haohuan Fu, Shizhen Xu, Huabin Ruan, Wei Xue, Guangwen Yang:
A Scalable Barotropic Mode Solver for the Parallel Ocean Program. Euro-Par 2013: 739-750
[c7]Huabin Ruan, Xiaomeng Huang, Haohuan Fu, Guangwen Yang, Wayne Luk, Sébastien Racanière, Oliver Pell, Wenjing Han:
An FPGA-Based Data Flow Engine for Gaussian Copula Model. FCCM 2013: 218-225
[c6]Lin Gan, Haohuan Fu, Wayne Luk, Chao Yang, Wei Xue, Guangwen Yang:
Global Atmospheric Simulation on a Reconfigurable Platform. FCCM 2013: 230
[c5]Lin Gan, Haohuan Fu, Wayne Luk, Chao Yang
, Wei Xue, Xiaomeng Huang, Youhui Zhang, Guangwen Yang:
Accelerating solvers for global atmospheric equations through mixed-precision data flow engine. FPL 2013: 1-6
[c4]Yang You, Haohuan Fu, Xiaomeng Huang, Guojie Song, Lin Gan, Wenjian Yu, Guangwen Yang:
Accelerating the 3D Elastic Wave Forward Modeling on GPU and MIC. IPDPS Workshops 2013: 1088-1096
[c3]Chao Yang
, Wei Xue, Haohuan Fu, Lin Gan, Linfeng Li, Yangtong Xu, Yutong Lu, Jiachang Sun, Guangwen Yang, Weimin Zheng:
A peta-scalable CPU-GPU algorithm for global atmospheric simulations. PPoPP 2013: 1-12- 2010
[c2]Shifeng Shang, Jinlei Jiang, Yongwei Wu, Zhenchun Huang
, Guangwen Yang, Weimin Zheng:
DABGPM: A Double Auction Bayesian Game-Based Pricing Model in Cloud Market. NPC 2010: 155-164
2000 – 2009
- 2008
[c1]Shifeng Shang, Jinlei Jiang, Zhenchun Huang
, Xiaomeng Huang, Guangwen Yang, Weimin Zheng, Lan Yu:
A Grid Workflow Framework with High Scalability and Usability. GCC 2008: 503-509
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-02-07 00:05 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







