


default search action
International Journal of High Performance Computing Applications, Volume 30
Volume 30, Number 1, February 2016
- Jack J. Dongarra, Michael A. Heroux, Piotr Luszczek:

High-performance conjugate-gradient benchmark: A new metric for ranking high-performance computing systems. 3-10 - Jongsoo Park, Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Alexander Heinecke, Dhiraj D. Kalamkar, Md. Mostofa Ali Patwary, Vadim O. Pirogov, Pradeep Dubey, Xing Liu

, Carlos Rosales, Cyril Mazauric, Christopher S. Daley
:
Optimizations in a high-performance conjugate gradient benchmark for IA-based multi- and many-core processors. 11-27 - Everett H. Phillips, Massimiliano Fatica:

Performance analysis of the high-performance conjugate gradient benchmark on GPUs. 28-38
- Yiqung Liu, Chao Yang

, Fangfang Liu, Xianyi Zhang, Yutong Lu, Yunfei Du
, Canqun Yang, Min Xie, Xiangke Liao:
623 Tflop/s HPCG run on Tianhe-2: Leveraging millions of hybrid cores. 39-54 - Kiyoshi Kumahata, Kazuo Minami, Naoya Maruyama:

High-performance conjugate gradient performance improvement on the K computer. 55-70 - Toshitaka Baba

, Kazuto Ando, Daisuke Matsuoka, Mamoru Hyodo, Takane Hori, Narumi Takahashi, Ryoko Obayashi, Yoshiyuki Imato, Dai Kitamura, Hitoshi Uehara, Toshihiro Kato, Ryotaro Saka:
Large-scale, high-speed tsunami prediction for the Great Nankai Trough Earthquake on the K computer. 71-84 - Edmond Chow, Xing Liu

, Sanchit Misra, Marat Dukhan, Mikhail Smelyanskiy, Jeff R. Hammond, Yunfei Du
, Xiangke Liao, Pradeep Dubey:
Scaling up Hartree-Fock calculations on Tianhe-2. 85-102 - Dahai Guo, William Gropp, Luke N. Olson:

A hybrid format for better performance of sparse matrix-vector multiplication on a GPU. 103-120 - Patrick M. Widener

, Scott Levy, Kurt B. Ferreira, Torsten Hoefler:
On noise and the performance benefit of nonblocking collectives. 121-133
Volume 30, Number 2, May 2016
- Jirí Jaros

, Alistair P. Rendell
, Bradley E. Treeby
:
Full-wave nonlinear ultrasound simulation on distributed clusters with applications in high-intensity focused ultrasound. 137-155 - Xinqiang Miao, Xianlong Jin, Junhong Ding:

Improving the parallel efficiency of large-scale structural dynamic analysis using a hierarchical approach. 156-168 - Bozhong Liu, Weidong Qiu, Lin Jiang, Zheng Gong:

Software pipelining for graphic processing unit acceleration: Partition, scheduling and granularity. 169-185 - Rone Kwei Lim, J. William Pro, Matthew R. Begley, Marcel Utz, Linda R. Petzold:

High-performance simulation of fracture in idealized 'brick and mortar' composites using adaptive Monte Carlo minimization on the GPU. 186-199 - Hoang-Vu Dang, Bertil Schmidt

, Andreas Hildebrandt
, Tuan Tu Tran, Anna Katharina Hildebrandt:
CUDA-enabled hierarchical ward clustering of protein structures based on the nearest neighbour chain algorithm. 200-211 - Tanzima Z. Islam

, Kathryn M. Mohror
, Martin Schulz
:
Exploring the MPI tool information interface: features and capabilities. 212-222 - Bruce Palmer

, William A. Perkins, Yousu Chen
, Shuangshuang Jin, David Callahan, Kevin Glass, Ruisheng Diao, Mark Rice, Stephen T. Elbert, Mallikarjuna Vallem, Zhenyu Huang:
GridPACKTM: A framework for developing power grid simulations on high-performance computing platforms. 223-240 - Teng Wang, Kevin Vasko, Zhuo Liu, Hui Chen, Weikuan Yu

:
Enhance parallel input/output with cross-bundle aggregation. 241-256
Volume 30, Number 3, August 2016
- Yi Liu, Xiongzi Ge, David Hung-Chang Du, Xiaoxia Huang:

Par-BF: A parallel partitioned Bloom filter for dynamic data sets. 259-275 - Adnan Ozsoy:

An efficient parallelization of longest prefix match and application on data compression. 276-289 - Daniele Pianu, Roberto Nerino, Claudia Ferraris, Antonio Chimienti:

A novel approach to train random forests on GPU for computer vision applications using local features. 290-304 - Ignacio Laguna, David F. Richards, Todd Gamblin, Martin Schulz

, Bronis R. de Supinski, Kathryn M. Mohror
, Howard Pritchard:
Evaluating and extending user-level fault tolerance in MPI applications. 305-319 - Matthew Otten

, Jing Gong, Azamat Mametjanov
, Aaron Vose, John M. Levesque, Paul F. Fischer, Misun Min:
An MPI/OpenACC implementation of a high-order electromagnetics solver with GPUDirect communication. 320-334 - Md. Mohsin Ali

, Peter E. Strazdins, Brendan Harding
, Markus Hegland
:
Complex scientific applications made fault-tolerant with the sparse grid combination technique. 335-359 - William Boyd, Andrew R. Siegel, Shuo He, Benoit Forget, Kord Smith:

Parallel performance results for the OpenMOC neutron transport code on multicore platforms. 360-375
Volume 30, Number 4, November 2016
- Zsolt Horváth, Rui A. P. Perdigão

, Jürgen Waser, Daniel Cornel
, Artem Konev, Günter Blöschl
:
Kepler shuffle for real-world flood simulations on GPUs. 379-395 - Shuibing He, Yan Liu, Yang Wang, Xian-He Sun, Chuanhe Huang:

Enhancing hybrid parallel file system through performance and space-aware data layout. 396-410 - Seiji Tsuboi

, Kazuto Ando, Takayuki Miyoshi, Daniel Peter
, Dimitri Komatitsch
, Jeroen Tromp
:
A 1.8 trillion degrees-of-freedom, 1.24 petaflops global seismic wave simulation on the K computer. 411-422 - Huda Ibeid

, Rio Yokota
, David E. Keyes
:
A performance model for the communication in fast multipole methods on high-performance computing platforms. 423-437 - Pavol Bauer

, Stefan Engblom
, Stefan Widgren
:
Fast event-based epidemiological simulations on national scales. 438-453 - Kazuto Ando, Mamoru Hyodo, Toshitaka Baba

, Takane Hori, Toshihiro Kato, Masaru Watanabe, Shin'ichi Ichikawa, Hisakuni Kitahara, Hitoshi Uehara, Hikaru Inoue:
Parallel-algorithm extension for tsunami and earthquake-cycle simulators for massively parallel execution on the K computer. 454-468 - Alejandro Calderón

, Alberto García Fernández, Félix García Carballeira, Jesús Carretero
, Javier Fernández
:
Improving performance using computational compression through memoization: A case study using a railway power consumption simulator. 469-485 - Jonathan Y. Kemal, Roger L. Davis, John D. Owens:

Multidisciplinary simulation acceleration using multiple shared memory graphical processing units. 486-508

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














