


default search action
SC 2015: Austin, TX, USA
- Jackie Kern, Jeffrey S. Vetter:

Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2015, Austin, TX, USA, November 15-20, 2015. ACM 2015, ISBN 978-1-4503-3723-6
ACM Gordon Bell finalists
- Amanda Randles

, Erik W. Draeger, Tomas Oppelstrup, Liam Krauss, John A. Gunnels:
Massively parallel models of the human circulatory system. 1:1-1:11 - Diego Rossinelli, Yu-Hang Tang

, Kirill Lykov, Dmitry Alexeev, Massimo Bernaschi, Panagiotis E. Hadjidoukas, Mauro Bisson, Wayne Joubert, Christian Conti, George E. Karniadakis, Massimiliano Fatica, Igor Pivkin
, Petros Koumoutsakos
:
The in-silico lab-on-a-chip: petascale and high-throughput simulations of microfluidics at cell resolution. 2:1-2:12 - Mauro Calderara, Sascha Brück, Andreas Pedersen

, Mohammad H. Bani-Hashemian
, Joost VandeVondele, Mathieu Luisier:
Pushing back the limit of ab-initio quantum transport simulations on hybrid supercomputers. 3:1-3:12 - Tsuyoshi Ichimura, Kohei Fujita, Pher Errol Balde Quinay, Lalith Maddegedara, Muneo Hori, Seizo Tanaka, Yoshihisa Shizawa, Hiroshi Kobayashi, Kazuo Minami:

Implicit nonlinear wave simulation with 1.08T DOF and 0.270T unstructured finite elements to enhance comprehensive earthquake simulation. 4:1-4:12 - Johann Rudi

, A. Cristiano I. Malossi
, Tobin Isaac, Georg Stadler, Michael Gurnis, Peter W. J. Staar, Yves Ineichen, Costas Bekas, Alessandro Curioni, Omar Ghattas
:
An extreme-scale implicit solver for complex PDEs: highly heterogeneous flow in earth's mantle. 5:1-5:12
Technical papers: data clustering
- Md. Mostofa Ali Patwary, Surendra Byna

, Nadathur Rajagopalan Satish, Narayanan Sundaram, Zarija Lukic, Vadim Roytershteyn
, Michael J. Anderson, Yushu Yao, Prabhat, Pradeep Dubey:
BD-CATS: big data clustering at trillion particle scale. 6:1-6:12 - Chenhan D. Yu, Jianyu Huang

, Woody Austin, Bo Xiao, George Biros:
Performance optimization for the k-nearest neighbors kernel on x86 architectures. 7:1-7:12
Technical papers: applications: material science
- Martin Bauer, Johannes Hötzer

, Marcus Jainta, Philipp Steinmetz, Marco Berghoff
, Florian Schornbaum, Christian Godenschwager, Harald Köstler
, Britta Nestler, Ulrich Rüde
:
Massively parallel phase-field simulations for ternary eutectic directional solidification. 8:1-8:12 - Hongzhang Shan, Samuel Williams

, Calvin W. Johnson, Kenneth S. McElvain
, W. Erich Ormand:
Parallel implementation and performance optimization of the configuration-interaction method. 9:1-9:12 - Raffaele Solcà

, Anton Kozhevnikov
, Azzam Haidar, Stanimire Tomov
, Jack J. Dongarra, Thomas C. Schulthess:
Efficient implementation of quantum materials simulations on distributed CPU-GPU systems. 10:1-10:12
Technical papers: cache and memory subsystems
- Abhisek Pan, Vijay S. Pai:

Runtime-driven shared last-level cache management for task-parallel programs. 11:1-11:12 - Jungrae Kim, Michael B. Sullivan, Seong-Lyong Gong, Mattan Erez

:
Frugal ECC: efficient and versatile memory error protection through fine-grained compression. 12:1-12:12 - Malek Musleh, Vijay S. Pai:

Automatic sharing classification and timely push for cache-coherent systems. 13:1-13:12
Technical papers: applications: biophysics and genomics
- Evangelos Georganas, Aydin Buluç

, Jarrod Chapman, Steven A. Hofmeyr, Chaitanya Aluru, Rob Egan
, Leonid Oliker, Daniel Rokhsar
, Katherine A. Yelick
:
HipMer: an extreme-scale de novo genome assembler. 14:1-14:11 - Patrick Flick, Chirag Jain, Tony Pan

, Srinivas Aluru:
A parallel connectivity algorithm for de Bruijn graphs in metagenomic applications. 15:1-15:11 - Patrick Flick, Srinivas Aluru:

Parallel distributed memory construction of suffix and longest common prefix arrays. 16:1-16:10
Technical papers: GPU memory management
- Ang Li, Gert-Jan van den Braak, Akash Kumar, Henk Corporaal:

Adaptive and transparent cache bypassing for GPUs. 17:1-17:12 - Jason Jong Kyu Park, Yongjun Park, Scott A. Mahlke:

ELF: maximizing memory-level parallelism for GPUs with coordinated warp and fetch scheduling. 18:1-18:12 - Tal Ben-Nun, Ely Levy, Amnon Barak, Eri Rubin:

Memory access patterns: the missing piece of the multi-GPU puzzle. 19:1-19:12
Technical papers: scalable storage systems
- Hyogi Sim, Youngjae Kim, Sudharshan S. Vazhkudai, Devesh Tiwari, Ali Anwar

, Ali Raza Butt
, Lavanya Ramakrishnan:
AnalyzeThis: an analysis workflow-aware storage system. 20:1-20:12 - Michael A. Sevilla, Noah Watkins, Carlos Maltzahn, Ike Nassi

, Scott A. Brandt, Sage A. Weil, Greg Farnum, Sam Fineberg:
Mantle: a programmable metadata load balancer for the ceph file system. 21:1-21:12 - Yandong Wang, Li Zhang, Jian Tan, Min Li, Yuqing Gao, Xavier Guerin, Xiaoqiao Meng, Shicong Meng:

HydraDB: a resilient RDMA-driven key-value middleware for in-memory cluster computing. 22:1-22:11
Technical papers: applications: folding, imaging. and proteins
- Yida Wang, Michael J. Anderson, Jonathan D. Cohen, Alexander Heinecke, Kai Li, Nadathur Satish, Narayanan Sundaram, Nicholas B. Turk-Browne, Theodore L. Willke:

Full correlation matrix analysis of fMRI data on Intel® Xeon Phi™ coprocessors. 23:1-23:12 - William B. March, Bo Xiao, Sameer Tharakan, Chenhan D. Yu, George Biros:

A kernel-independent FMM in general dimensions. 24:1-24:12 - Andrew Schoenrock, Daniel J. Burnside, Houman Moteshareie, Alex Wong, Ashkan Golshani, Frank Dehne:

Engineering inhibitory proteins with InSiPS: the in-silico protein synthesizer. 25:1-25:11
Technical papers: graph analytics on HPC systems
- Xinyu Que, Fabio Checconi, Fabrizio Petrini, Xing Liu

, Daniele Buono:
Exploring network optimizations for large-scale graph analytics. 26:1-26:10 - Seung-Hee Bae, Bill Howe

:
GossipMap: a distributed community detection algorithm for billion-edge directed graphs. 27:1-27:12 - Dipanjan Sengupta, Shuaiwen Leon Song, Kapil Agarwal, Karsten Schwan:

GraphReduce: processing large-scale graphs on accelerator-based systems. 28:1-28:12
Technical papers: MPI/communication
- Akshay Venkatesh, Abhinav Vishnu, Khaled Hamidouche, Nathan R. Tallent

, Dhabaleswar K. Panda, Darren J. Kerbyson, Adolfy Hoisie
:
A case for application-oblivious energy-efficient MPI runtime. 29:1-29:12 - Karthikeyan Vaidyanathan, Dhiraj D. Kalamkar, Kiran Pamnany, Jeff R. Hammond, Pavan Balaji, Dipankar Das, Jongsoo Park, Bálint Joó:

Improving concurrency and asynchrony in multithreaded MPI applications using software offloading. 30:1-30:12 - Thomas Hérault

, Aurélien Bouteiller
, George Bosilca, Marc Gamell, Keita Teranishi, Manish Parashar, Jack J. Dongarra:
Practical scalable consensus for pseudo-synchronous distributed systems. 31:1-31:12
Technical papers: cloud resource management
- Yifan Gong, Bingsheng He

, Amelie Chi Zhou:
Monetary cost optimizations for MPI-based HPC applications on Amazon clouds: checkpoints and replicated execution. 32:1-32:12 - Feng Liu, Jon B. Weissman:

Elastic job bundling: an adaptive resource request strategy for large-scale parallel applications. 33:1-33:12 - Yanfei Guo, Wesley Bland, Pavan Balaji, Xiaobo Zhou:

Fault tolerant MapReduce-MPI for HPC clusters. 34:1-34:12
Technical papers: interconnection networks
- Nan Jiang, Larry R. Dennison, William J. Dally:

Network endpoint congestion control for fine-grained communication. 35:1-35:12 - Georgios Kathareios, Cyriel Minkenberg, Bogdan Prisacari, Germán Rodríguez, Torsten Hoefler:

Cost-effective diameter-two topologies: analysis and evaluation. 36:1-36:11 - Shinobu Miwa, Hiroshi Nakamura

:
Profile-based power shifting in interconnection networks with on/off links. 37:1-37:11
Technical papers: state of the practice: infrastructure management
- Devesh Tiwari, Saurabh Gupta, George Gallarno, Jim Rogers, Don Maxwell:

Reliability lessons learned from GPU experience with the Titan supercomputer at Oak Ridge leadership computing facility. 38:1-38:12 - Patricia H. Kovatch, Anthony Costa

, Zachary Giles, Eugene Fluder, Hyung Min Cho, Svetlana Mazurkova:
Big omics data experience. 39:1-39:12 - Todd Gamblin, Matthew P. LeGendre, Michael R. Collette, Gregory L. Lee, Adam Moody, Bronis R. de Supinski, Scott Futral:

The Spack package manager: bringing order to HPC software chaos. 40:1-40:12
Technical papers: applications: climate and weather
- Tobias Gysi, Carlos Osuna

, Oliver Fuhrer
, Mauro Bianco, Thomas C. Schulthess
:
STELLA: a domain-specific tool for structured grid methods in weather and climate models. 41:1-41:12 - Yong Hu, Xiaomeng Huang, Allison H. Baker

, Yu-heng Tseng
, Frank O. Bryan
, John M. Dennis, Guangwen Yang:
Improving the scalability of the ocean barotropic solver in the community earth system model. 42:1-42:12 - Kalin Kanov, Randal C. Burns

:
Particle tracking in open simulation laboratories. 43:1-43:11
Technical papers: data transfers and data-intensive applications
- Ismail Alan, Engin Arslan, Tevfik Kosar

:
Energy-aware data transfer algorithms. 44:1-44:12 - Ron Chi-Lung Chiang, H. Howie Huang

, Timothy Wood
, Changbin Liu, Oliver Spatscheck:
IOrchestra: supporting high-performance data-intensive applications in the cloud via collaborative virtualization. 45:1-45:12 - Rajkumar Kettimuthu, Gayane Vardoyan

, Gagan Agrawal, P. Sadayappan, Ian T. Foster:
An elegant sufficiency: load-aware differentiated scheduling of data transfers. 46:1-46:12
Technical papers: performance tools and models
- Xu Liu, Bo Wu:

ScaAnalyzer: a tool to identify memory scalability bottlenecks in parallel programs. 47:1-47:12 - Yuhang Liu, Xian-He Sun:

C2-bound: a capacity and concurrency driven analytical model for many-core design. 48:1-48:11 - Katherine E. Isaacs, Abhinav Bhatele, Jonathan Lifflander, David Böhme

, Todd Gamblin, Martin Schulz
, Bernd Hamann, Peer-Timo Bremer:
Recovering logical structure from Charm++ event traces. 49:1-49:12
Technical papers: in-situ (simulation time) analysis
- Christopher M. Sewell, Katrin Heitmann, Hal Finkel, George Zagaris, Suzanne Parete-Koon

, Patricia K. Fasel, Adrian Pope, Nicholas Frontiere, Li-Ta Lo
, O. E. Bronson Messer
, Salman Habib, James P. Ahrens
:
Large-scale compute-intensive analysis via a combined in-situ and co-scheduling workflow approach. 50:1-50:11 - Yi Wang, Gagan Agrawal, Tekin Bicer

, Wei Jiang:
Smart: a MapReduce-like framework for in-situ scientific analytics. 51:1-51:12 - Preeti Malakar, Venkatram Vishwanath, Todd S. Munson, Christopher Knight, Mark Hereld, Sven Leyffer

, Michael E. Papka
:
Optimal scheduling of in-situ analysis for large-scale scientific simulations. 52:1-52:11
Technical papers: linear algebra
- Luc Jaulmes

, Marc Casas
, Miquel Moretó
, Eduard Ayguadé, Jesús Labarta, Mateo Valero
:
Exploiting asynchrony from exact forward recovery for DUE in iterative solvers. 53:1-53:12 - Jongsoo Park, Mikhail Smelyanskiy, Ulrike Meier Yang

, Dheevatsa Mudigere, Pradeep Dubey:
High-performance algebraic multigrid solver optimized for multi-core based distributed parallel systems. 54:1-54:12 - Humayun Kabir

, Joshua Dennis Booth, Guillaume Aupy, Anne Benoit
, Yves Robert
, Padma Raghavan:
STS-k: a multilevel sparse triangular solution scheme for NUMA multicores. 55:1-55:11
Technical papers: management of graph workloads
- Michael LeBeane, Shuang Song, Reena Panda, Jee Ho Ryoo, Lizy K. John:

Data partitioning strategies for graph workloads on heterogeneous clusters. 56:1-56:12 - Kisung Lee, Ling Liu, Karsten Schwan, Calton Pu, Qi Zhang, Yang Zhou, Emre Yigitoglu, Pingpeng Yuan:

Scaling iterative graph computations with GraphMap. 57:1-57:12 - Sungpack Hong, Siegfried Depner, Thomas Manhardt, Jan Van Der Lugt, Merijn Verstraaten, Hassan Chafi:

PGX.D: a fast distributed graph processing engine. 58:1-58:12
Technical papers: sampling in matrix computations
- Ichitaro Yamazaki, Jakub Kurzak, Piotr Luszczek, Jack J. Dongarra:

Randomized algorithms to update partial singular value decomposition on a hybrid CPU/GPU cluster. 59:1-59:12 - Théo Mary

, Ichitaro Yamazaki, Jakub Kurzak, Piotr Luszczek, Stanimire Tomov
, Jack J. Dongarra:
Performance of random sampling for computing low-rank approximations of a dense matrix on GPUs. 60:1-60:11
Technical papers: programming tools
- Stephen F. Siegel

, Manchun Zheng, Ziqing Luo, Timothy K. Zirkel, Andre V. Marianiello, John G. Edenhofner, Matthew B. Dwyer
, Michael S. Rogers:
CIVL: the concurrency intermediate verification language. 61:1-61:12 - Kento Sato, Dong H. Ahn, Ignacio Laguna, Gregory L. Lee, Martin Schulz

:
Clock delta compression for scalable order-replay of non-deterministic parallel applications. 62:1-62:12 - Luiz De Rose, Andrew Gontarek, Aaron Vose, Robert Moench, David Abramson

, Minh Ngoc Dinh
, Chao Jin:
Relative debugging for a highly parallel hybrid computer system. 63:1-63:12
Technical papers: resource management
- Éric Gaussier, David Glesser, Valentin Reis, Denis Trystram:

Improving backfilling by using machine learning to predict running times. 64:1-64:10 - Qian Sun, Tong Jin, Melissa Romanus

, Hoang Bui, Fan Zhang, Hongfeng Yu, Hemanth Kolla, Scott Klasky, Jacqueline Chen, Manish Parashar:
Adaptive data placement for staging-based coupled scientific workflows. 65:1-65:12 - Sergey Blagodurov, Alexandra Fedorova, Evgeny Vinnik, Tyler Dwyer, Fabien Hermenier:

Multi-objective job placement in clusters. 66:1-66:12
Technical papers: graph algorithms and benchmarks
- Umut A. Acar, Arthur Charguéraud, Mike Rainey:

A work-efficient algorithm for parallel unordered depth-first search. 67:1-67:12 - Hang Liu, H. Howie Huang

:
Enterprise: breadth-first graph traversal on GPUs. 68:1-68:12 - Lifeng Nai, Yinglong Xia, Ilie Gabriel Tanase, Hyesoon Kim, Ching-Yung Lin:

GraphBIG: understanding graph computing in the context of industrial solutions. 69:1-69:12
Technical papers: resilience
- Marc Gamell, Keita Teranishi, Michael A. Heroux

, Jackson R. Mayo, Hemanth Kolla, Jacqueline Chen, Manish Parashar:
Local recovery and failure masking for stencil-based applications at extreme scales. 70:1-70:12 - Antonio J. Peña

, Wesley Bland, Pavan Balaji:
VOCL-FT: introducing techniques for efficient soft error coprocessor recovery. 71:1-71:12 - Rizwan A. Ashraf

, Roberto Gioiosa, Gokcen Kestor
, Ronald F. DeMara
, Chen-Yong Cher, Pradip Bose:
Understanding the propagation of transient errors in HPC applications. 72:1-72:12
Technical papers: state of the practice: measuring systems
- Torsten Hoefler, Roberto Belli:

Scientific benchmarking of parallel computing systems: twelve ways to tell the masses when reporting performance results. 73:1-73:12 - Thomas Scogland, Jonathan Azose, David Rohr, Suzanne Rivoire, Natalie J. Bates, Daniel Hackenberg

:
Node variability in large-scale power measurements: perspectives from the Green500, Top500 and EEHPCWG. 74:1-74:11 - Lipeng Wan, Feiyi Wang, Sarp Oral

, Devesh Tiwari, Sudharshan S. Vazhkudai, Qing Cao:
A practical approach to reconciling availability, performance, and capacity in provisioning extreme-scale storage systems. 75:1-75:12
Technical papers: tensor computation
- Jiajia Li

, Casey Battaglino, Ioakeim Perros, Jimeng Sun
, Richard W. Vuduc:
An input-adaptive and in-place approach to dense tensor-times-matrix multiply. 76:1-76:12 - Oguz Kaya, Bora Uçar

:
Scalable sparse tensor decompositions in distributed memory systems. 77:1-77:11
Technical papers: power-constrained computing
- Yuichi Inadomi, Tapasya Patki, Koji Inoue, Mutsumi Aoyagi, Barry Rountree, Martin Schulz

, David K. Lowenthal
, Yasutaka Wada
, Keiichiro Fukazawa, Masatsugu Ueda, Masaaki Kondo, Ikuo Miyoshi:
Analyzing and mitigating the impact of manufacturing variability in power-constrained supercomputing. 78:1-78:12 - Peter E. Bailey, Aniruddha Marathe

, David K. Lowenthal
, Barry Rountree, Martin Schulz
:
Finding the limits of power-constrained application performance. 79:1-79:12 - Daniel A. Ellsworth, Allen D. Malony, Barry Rountree, Martin Schulz

:
Dynamic power sharing for higher job throughput. 80:1-80:11
Technical papers: programming systems
- Elliott Slaughter, Wonchan Lee, Sean Treichler, Michael Bauer, Alex Aiken

:
Regent: a high-productivity programming language for HPC with logical regions. 81:1-81:12 - Junghyun Kim, Thanh Tuan Dao, Jaehoon Jung, Jinyoung Joo, Jaejin Lee:

Bridging OpenCL and CUDA: a comparative analysis and translation. 82:1-82:12 - Shaizeen Aga, Sriram Krishnamoorthy

, Satish Narayanasamy
:
CilkSpec: optimistic concurrency for Cilk. 83:1-83:12

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














