default search action
17th CLUSTER 2015: Chicago, IL, USA
- 2015 IEEE International Conference on Cluster Computing, CLUSTER 2015, Chicago, IL, USA, September 8-11, 2015. IEEE Computer Society 2015, ISBN 978-1-4673-6598-7
Session 1: Best Paper Candidates
- Jianping Zeng, Hongfeng Yu:
Parallel Modularity-Based Community Detection on Large-Scale Graphs. 1-10 - Edgar A. León, Ian Karlin, Ryan E. Grant:
Optimizing Explicit Hydrodynamics for Power, Energy, and Performance. 11-21 - Lorenz Fischer, Shen Gao, Abraham Bernstein:
Machines Tuning Machines: Configuring Distributed Stream Processors with Bayesian Optimization. 22-31 - Jiaan Zeng, Beth Plale:
Workload-Aware Resource Reservation for Multi-tenant NoSQL. 32-41
Session 2: Task Parallel Computing
- Ashwin Mandayam Aji, Antonio J. Peña, Pavan Balaji, Wu-chun Feng:
Automatic Command Queue Scheduling for Task-Parallel Workloads in OpenCL. 42-51
Session 3: Big Data Processing
- Ze Yu, Min Li, Xin Yang, Han Zhao, Xiaolin Li:
Taming Non-local Stragglers Using Efficient Prefetching in MapReduce. 52-61 - Bo Feng, Xi Yang, Kun Feng, Yanlong Yin, Xian-He Sun:
IOSIG+: On the Role of I/O Tracing and Analysis for Hadoop Systems. 62-65 - Wenzhao Zhang, Houjun Tang, Xiaocheng Zou, Steve Harenberg, Qing Liu, Scott Klasky, Nagiza F. Samatova:
Exploring Memory Hierarchy to Improve Scientific Data Read Performance. 66-69 - Dixin Tang, Taoying Liu, Rubao Lee, Hong Liu, Wei Li:
A Case Study of Optimizing Big Data Analytical Stacks Using Structured Data Shuffling. 70-73 - Cailiang Xu, Wei Wang, Deng Zhou, Tao Xie:
An SSD-HDD Integrated Storage Architecture for Write-Once-Read-Once Applications on Clusters. 74-77
Session 4: GPU Computing
- Khaled Hamidouche, Akshay Venkatesh, Ammar Ahmad Awan, Hari Subramoni, Ching-Hsiang Chu, Dhabaleswar K. Panda:
Exploiting GPUDirect RDMA in Designing High Performance OpenSHMEM for NVIDIA GPU Clusters. 78-87 - Toshihiro Hanawa, Hisafumi Fujii, Norihisa Fujita, Tetsuya Odajima, Kazuya Matsumoto, Yuetsu Kodama, Taisuke Boku:
Improving Strong-Scaling on GPU Cluster Based on Tightly Coupled Accelerators Architecture. 88-91 - Adrián Castelló, Antonio J. Peña, Rafael Mayo, Pavan Balaji, Enrique S. Quintana-Ortí:
Exploring the Suitability of Remote GPGPU Virtualization for the OpenACC Programming Model Using rCUDA. 92-95 - Luis Sant'Ana, Daniel Cordeiro, Raphael Y. de Camargo:
PLB-HeC: A Profile-Based Load-Balancing Algorithm for Heterogeneous CPU-GPU Clusters. 96-105 - Langshi Chen, Serge G. Petiton:
A TSQR Based Krylov Basis Computation Method on Hybrid GPU Cluster. 106-109
Session 5: Machine Learning and Data Mining
- Abhinav Vishnu, Jeyanthi Narasimhan, Lawrence Holder, Darren J. Kerbyson, Adolfy Hoisie:
Fast and Accurate Support Vector Machines on Large Scale Systems. 110-119 - Nikela Papadopoulou, Georgios I. Goumas, Nectarios Koziris:
A Machine-Learning Approach for Communication Prediction of Large-Scale Applications. 120-123 - Florin Isaila, Prasanna Balaprakash, Stefan M. Wild, Dries Kimpe, Robert Latham, Robert B. Ross, Paul D. Hovland:
Collective I/O Tuning Using Analytical and Machine Learning Models. 128-137 - Abhinav Vishnu, Khushbu Agarwal:
Large Scale Frequent Pattern Mining Using MPI One-Sided Model. 138-147
Session 6: Resilience and Reliability
- Kun Feng, Manjunath Gorentla Venkata, Dong Li, Xian-He Sun:
Fast Fault Injection and Sensitivity Analysis for Collective Communications. 148-157 - Jiaqi Liu, Mehmet Can Kurt, Gagan Agrawal:
A Practical Approach for Handling Soft Errors in Iterative Applications. 158-161 - Veronica Estrada Galiñanes, Pascal Felber:
Ensuring Data Durability with Increasingly Interdependent Content. 162-165 - Dylan Chapp, Travis Johnston, Michela Taufer:
On the Need for Reproducible Numerical Accuracy through Intelligent Runtime Selection of Reduction Algorithms at the Extreme Scale. 166-175 - Qiang Guan, Nathan DeBardeleben, Brian Atkinson, Robert W. Robey, William M. Jones:
Towards Building Resilient Scientific Applications: Resilience Analysis on the Impact of Soft Error and Transient Error Tolerance with the CLAMR Hydrodynamics Mini-App. 176-179 - Arash Rezaei, Frank Mueller:
DINO: Divergent Node Cloning for Sustained Redundancy in HPC. 180-183
Session 7: High Performance I/O
- Babak Behzad, Surendra Byna, Stefan M. Wild, Prabhat, Marc Snir:
Dynamic Model-Driven Parallel I/O Performance Tuning. 184-193 - Teng Wang, Sarp Oral, Michael Pritchard, Bin Wang, Weikuan Yu:
TRIO: Burst Buffer Based I/O Orchestration. 194-203 - Congjin Du, Chentao Wu, Jie Li, Minyi Guo, Xubin He:
BPS: A Balanced Partial Stripe Write Scheme to Improve the Write Performance of RAID-6. 204-213 - Shin Sasaki, Kazushi Takahashi, Yoshihiro Oyama, Osamu Tatebe:
RDMA-Based Direct Transfer of File Data to Remote Page Cache. 214-225
Session 8: MPI
- Mingzhe Li, Hari Subramoni, Khaled Hamidouche, Xiaoyi Lu, Dhabaleswar K. Panda:
High Performance MPI Datatype Support with User-Mode Memory Registration: Challenges, Designs, and Benefits. 226-235
Session 9: Distributed Data Processing
- Ke Wang, Ning Liu, Iman Sadooghi, Xi Yang, Xiaobing Zhou, Tonglin Li, Michael Lang, Xian-He Sun, Ioan Raicu:
Overcoming Hadoop Scaling Limitations through Distributed Task Execution. 236-245 - Yin Huai, Yuan Yuan, Rubao Lee, Xiaodong Zhang:
SideWalk: A Facility of Lightweight Out-of-Band Communications for Augmenting Distributed Data Processing Flows. 246-249 - Alessandro Morari, Jesse Weaver, Oreste Villa, David J. Haglin, Antonino Tumeo, Vito Giovanni Castellana, John Feo:
High-Performance, Distributed Dictionary Encoding of RDF Datasets. 250-253 - Zhou Zhou, Xu Yang, Dongfang Zhao, Paul Rich, Wei Tang, Jia Wang, Zhiling Lan:
I/O-Aware Batch Scheduling for Petascale Computing Systems. 254-263
Session 10: Energy Efficiency
- Xiaojun Ruan, Haiquan Chen:
Performance-to-Power Ratio Aware Virtual Machine (VM) Allocation in Energy-Efficient Clouds. 264-273 - Vincenzo De Maio, Gabor Kecskemeti, Radu Prodan:
A Workload-Aware Energy Model for Virtual Machine Migration. 274-283
Session 11: Graph Processing
- Dong Dai, Philip H. Carns, Robert B. Ross, John Jenkins, Kyle Blauer, Yong Chen:
GraphTrek: Asynchronous Graph Traversal for Property Graph-Based Metadata Management. 284-293 - Luis Pineda-Morales, Alexandru Costan, Gabriel Antoniu:
Towards Multi-site Metadata Management for Geographically Distributed Cloud Workflows. 294-303
Session 12: Application Acceleration
- Anthony Danalis, Heike Jagode, George Bosilca, Jack J. Dongarra:
PaRSEC in Practice: Optimizing a Legacy Chemistry Application through Distributed Task-Based Execution. 304-313 - Sebastian Rettenberger, Michael Bader:
Optimizing I/O for Petascale Seismic Simulations on Unstructured Meshes. 314-317 - Pedro Valero-Lara, Johan Jansson:
LBM-HPC - An Open-Source Tool for Fluid Simulations. Case Study: Unified Parallel C (UPC-PGAS). 318-321 - Anna Woodard, Matthias Wolf, Charles Mueller, Nil Valls, Ben Tovar, Patrick Donnelly, Peter Ivie, Kenyi Hurtado Anampa, Paul R. Brenner, Douglas Thain, Kevin Lannon, Michael D. Hildreth:
Scaling Data Intensive Physics Applications to 10k Cores on Non-dedicated Clusters with Lobster. 322-331 - Mücahid Kutlu, Gagan Agrawal:
RE-PAGE: Domain-Specific REplication and PArallel Processing of GEnomic Data. 332-341
Session 13: Network and High Performance Communication
- Matthew G. F. Dosanjh, Ryan E. Grant, Patrick G. Bridges, Ron Brightwell:
Re-evaluating Network Onload vs. Offload for the Many-Core Era. 342-350 - Md Atiqul Mollah, Xin Yuan, Scott Pakin, Michael Lang:
Fast Calculation of Max-Min Fair Rates for Multi-commodity Flows in Fat-Tree Networks. 351-360 - Emily M. Hastings, David Rincon-Cruz, Marc Spehlmann, Sofia Meyers, Anda Xu, David P. Bunde, Vitus J. Leung:
Comparing Global Link Arrangements for Dragonfly Networks. 361-370 - Evangelos Tasoulas, Ernst Gunnar Gran, Bjørn Dag Johnsen, Kyrre M. Begnum, Tor Skeie:
Towards the InfiniBand SR-IOV vSwitch Architecture. 371-380
Session 14: Parallel Algorithms
- Annie Yang, Hari Mukka, Farbod Hesaaraki, Martin Burtscher:
MPC: A Massively Parallel Compression Algorithm for Scientific Data. 381-389 - Olivia Choudhury, Dinesh Rajan, Nicholas L. Hazekamp, Sandra Gesing, Douglas Thain, Scott J. Emrich:
Balancing Thread-Level and Task-Level Parallelism for Data-Intensive Workloads on Clusters and Clouds. 390-393 - Tan Nguyen, Scott B. Baden:
LU Factorization: Towards Hiding Communication Overheads with a Lookahead-Free Algorithm. 394-397 - Ariful Azad, Aydin Buluç:
Distributed-Memory Algorithms for Maximal Cardinality Matching Using Matrix Algebra. 398-407
Session 15: Task and Process Scheduling
- Ivy Bo Peng, Stefano Markidis, Erwin Laure:
The Cost of Synchronizing Imbalanced Processes in Message Passing Systems. 408-417 - Hormozd Gahvari, Martin Schulz, Ulrike Meier Yang:
An Approach to Selecting Thread + Process Mixes for Hybrid MPI + OpenMP Applications. 418-427 - Dana Akhmetova, Gokcen Kestor, Roberto Gioiosa, Stefano Markidis, Erwin Laure:
On the Application Task Granularity and the Interplay with the Scheduling Overhead in Many-Core Shared Memory Systems. 428-437
Session 16: PGAS and Shared Memory Programming
- Naveen Namashivayam, Deepak Eachempati, Dounia Khaldi, Barbara M. Chapman:
OpenSHMEM as a Portable Communication Layer for PGAS Models: A Case Study with Coarray Fortran. 438-447 - Dounia Khaldi, Deepak Eachempati, Shiyao Ge, Pierre Jouvelot, Barbara M. Chapman:
A Team-Based Methodology of Memory Hierarchy-Aware Runtime Support in Coarray Fortran. 448-451 - Sai Charan Koduru, Keval Vora, Rajiv Gupta:
Optimizing Caching DSM for Distributed Software Speculation. 452-455 - Hajime Fujita, Kamil Iskra, Pavan Balaji, Andrew A. Chien:
Empirical Comparison of Three Versioning Architectures. 456-459 - Hongyi Ma, Liqiang Wang, Krishanthan Krishnamoorthy:
Detecting Thread-Safety Violations in Hybrid OpenMP/MPI Programs. 460-463
Session 17: Cluster Tools
- Anthony M. Agelastos, Benjamin A. Allan, Jim M. Brandt, Ann C. Gentile, Sophia Lefantzi, Steve Monk, Jeff Ogden, Mahesh Rajan, Joel Stevenson:
Toward Rapid Understanding of Production HPC Applications and Systems. 464-473 - Alan Nussbaum, Shwetha Mathangi Chandra Choodamani, Karsten Schwan:
ObsCon: Integrated Monitoring and Control for Parallel, Real-Time Applications. 474-477 - Joshua Peraza, Ananta Tiwari, William A. Ward Jr., Roy L. Campbell, Laura Carrington:
VecMeter: Measuring Vectorization on the Xeon Phi. 478-481 - Justin M. Wozniak, Timothy G. Armstrong, Ketan C. Maheshwari, Daniel S. Katz, Michael Wilde, Ian T. Foster:
Toward Interlanguage Parallel Scripting for Distributed-Memory Scientific Computing. 482-485
Poster Papers
- Wei Xie, Yong Chen:
A Cache Management Scheme for Hiding Garbage Collection Latency in Flash-Based Solid State Drives. 486-487 - Carlos Reaño, Federico Silla:
A Performance Comparison of CUDA Remote GPU Virtualization Frameworks. 488-489 - Sean McDaniel, Stephen Herbein, Michela Taufer:
A Two-Tiered Approach to I/O Quality of Service in Docker Containers. 490-491 - Ke Yue, Nicholas Schwarz, Jonathan Z. Tischler:
Accelerating Laue Depth Reconstruction Algorithm with CUDA. 492-493 - Xiang Ma, Chao Wang, Qi Yu, Xi Li, Xuehai Zhou:
An FPGA-Based Accelerator for Neighborhood-Based Collaborative Filtering Recommendation Algorithms. 494-495 - Xinkui Zhao, Jianwei Yin, Chen Zhi, Pengxiang Lin, Zuoning Chen:
Can Cloud Service Get His Family? A Step Towards Service Family Detecting. 496-497 - Shih-Wen Hsu, Tseng-Yi Chen, Yung-Chun Chang, Shuo-Han Chen, Han-Chieh Chao, Tsen-Yeh Lin, Wei-Kuan Shih:
Design a Hash-Based Control Mechanism in vSwitch for Software-Defined Networking Environment. 498-499 - Mejdl S. Safran, Saad Al-qahtani, Michelle Zhu, Dunren Che:
Development of MapReduce and MPI Programs for Motif Search. 500-501 - Konstantin S. Stefanov, Vladimir V. Voevodin:
Distributed Modular Monitoring (DiMMon) Approach to Supercomputer Monitoring. 502-503 - Jia Li, Dongsheng Li, Yiming Zhang:
Efficient Distributed Data Clustering on Spark. 504-505 - Mustafa Ibrahim Khaleel, Michelle M. Zhu:
Energy-Aware Job Management Approaches for Workflow in Cloud. 506-507 - Mei Liang, Cesar Trejo, Lavanya Muthu, Linh Bao Ngo, André Luckow, Amy W. Apon:
Evaluating R-Based Big Data Analytic Frameworks. 508-509 - Poornima Nookala, Serapheim Dimitropoulos, Karl Stough, Ioan Raicu:
Evaluating the Support of MTC Applications on Intel Xeon Phi Many-Core Accelerators. 510-511 - Nan Dun, Hajime Fujita, Aiman Fang, Yan Liu, Andrew A. Chien, Pavan Balaji, Kamil Iskra, Wesley Bland, Andrew R. Siegel:
Flexible Error Recovery Using Versions in Global View Resilience. 512-513 - Olivier Sallou, Cyril Monjeaud:
GO-Docker: A Batch Scheduling System with Docker Containers. 514-515 - Tonglin Li, Chaoqi Ma, Jiabao Li, Xiaobing Zhou, Ke Wang, Dongfang Zhao, Iman Sadooghi, Ioan Raicu:
GRAPH/Z: A Key-Value Store Based Scalable Graph Processing System. 516-517 - Faisal N. Abu-Khzam, Amer E. Mouawad, Karim A. Jahed:
Highly Scalable Parallel Search-Tree Algorithms: The Virtual Topology Approach. 518 - Jason Arnold, Boris Glavic, Ioan Raicu:
HRDBMS: A NewSQL Database for Analytics. 519-520 - Jie Wei, Shangguang Wang, Lingyan Zhang, Ao Zhou, Qibo Sun, Ruisheng Shi, Fangchun Yang:
Minimizing Data Transmission Latency by Bipartite Graph in MapReduce. 521-522 - Xinkui Zhao, Jianwei Yin, Chen Zhi, Pengxiang Lin, Shichun Feng, Hao Wu, Zuoning Chen:
monBench: A Database Performance Benchmark for Cloud Monitoring System. 523-524 - Sanjaya Gajurel, Roger Bielefeld:
Mutated Near Optimal Vertex Cover Algorithm (NOVCA) Visualization on a Tile Display. 525-526 - Ayush Dusia, Yang Yang, Michela Taufer:
Network Quality of Service in Docker Containers. 527-528 - Yuming Ye, Ziyang Li, Dongsheng Li, Yiming Zhang, Feng Liu, Yuxing Peng:
Pallas: An Application-Driven Task and Network Simulation Framework. 529-530 - Gregor von Laszewski, Fugang Wang, Geoffrey Charles Fox, David L. Hart, Thomas R. Furlani, Robert L. DeLeon, Steven M. Gallo:
Peer Comparison of XSEDE and NCAR Publication Data. 531-532 - Yash Ukidave, David R. Kaeli, Umesh Gupta, Kurt Keville:
Performance of the NVIDIA Jetson TK1 in HPC. 533-534 - José Monsalve Diaz, Aaron Myles Landwehr, Michela Taufer:
Dynamic CPU Resource Allocation in Containerized Cloud Environments. 535-536 - Langshi Chen, Serge Petition:
Toward Auto-tuned Krylov Basis Computation for Different Sparse Matrix Formats and Interconnects on GPU Clusters. 537-538 - Jiaan Zeng, Beth Plale:
Towards Building a Lightweight Key-Value Store on Parallel File System. 539-540 - Jon Calhoun, Marc Snir, Luke N. Olson, María Jesús Garzarán:
Understanding the Propagation of Error Due to a Silent Data Corruption in a Sparse Matrix Vector Multiply. 541-542
FTS 2015
- Rob Hunt, Simon McIntosh-Smith:
Exploiting Spatial Information in Datasets to Enable Fault Tolerant Sparse Matrix Solvers. 543-551 - Francesco Rizzi, Karla Morris, Khachik Sargsyan, Paul Mycek, Cosmin Safta, Olivier P. Le Maître, Omar M. Knio, Bert J. Debusschere:
Partial Differential Equations Preconditioner Resilient to Soft and Hard Faults. 552-562 - Tatiana V. Martsinkevich, Omer Subasi, Osman S. Unsal, Franck Cappello, Jesús Labarta:
Fault-Tolerant Protocol for Hybrid Task-Parallel Message-Passing Applications. 563-570 - David E. Bernholdt, Wael R. Elwasif, Christos Kartsaklis, Seyong Lee, Tiffany M. Mintz:
Programmer-Guided Reliability for Extreme-Scale Applications. 571-579 - Faisal Shahzad, Moritz Kreutzer, Thomas Zeiser, Rui Machado, Andreas Pieper, Georg Hager, Gerhard Wellein:
Building a Fault Tolerant Application Using the GASPI Communication Layer. 580-587 - Dong-Wan Kim, Mattan Erez:
Stay Alive, Don't Give Up: DUE and SDC Reduction with Memory Repair. 588-594 - Leonardo Arturo Bautista-Gomez, Franck Cappello:
Detecting and Correcting Data Corruption in Stencil Applications through Multivariate Interpolation. 595-602
HUCAA 2015
- Thomas C. Carroll, Jude-Thaddeus Ojiaku, Prudence W. H. Wong:
Pairwise Sequence Alignment with Gaps with GPU. 603-610 - Forrest Wolfgang Glines, Matthew Anderson, David Neilsen:
Scalable Relativistic High-Resolution Shock-Capturing for Heterogeneous Computing. 611-618 - Santiago Mislata Valero, Federico Silla:
On the Execution of Computationally Intensive CPU-Based Libraries on Remote Accelerators for Increasing Performance: Early Experience with the OpenBLAS and FFTW Libraries. 619-626 - Tetsuya Odajima, Taisuke Boku, Toshihiro Hanawa, Hitoshi Murai, Masahiro Nakao, Akihiro Tabuchi, Mitsuhisa Sato:
Hybrid Communication with TCA and InfiniBand on a Parallel Programming Language XcalableACC for GPU Clusters. 627-634 - Toshihiro Hanawa, Hisafumi Fujii, Norihisa Fujita, Tetsuya Odajima, Kazuya Matsumoto, Taisuke Boku:
Evaluation of FFT for GPU Cluster Using Tightly Coupled Accelerators Architecture. 635-641
HPCMASPA 2015
- Steven M. Gallo, Joseph P. White, Robert L. DeLeon, Thomas R. Furlani, Helen Ngo, Abani K. Patra, Matthew D. Jones, Jeffrey T. Palmer, Nikolay Simakov, Jeanette M. Sperhac, Martins Innus, Thomas Yearke, Ryan Rathsam:
Analysis of XDMoD/SUPReMM Data Using Machine Learning Techniques. 642-649 - Gideon Juve, Benjamín Tovar, Rafael Ferreira da Silva, Dariusz Król, Douglas Thain, Ewa Deelman, William E. Allcock, Miron Livny:
Practical Resource Monitoring for Robust High Throughput Computing. 650-657 - Jim M. Brandt, Ann C. Gentile, Cindy Martin, Jason Repik, Narate Taerat:
New Systems, New Behaviors, New Patterns: Monitoring Insights from System Standup. 658-665 - Taylor L. Groves, Samuel K. Gutierrez, Dorian C. Arnold:
A LogP Extension for Modeling Tree Aggregation Networks. 666-673 - Omar Aaziz, Jonathan Cook, Hadi Sharifi:
Push Me Pull You: Integrating Opposing Data Transport Modes for Efficient HPC Application Monitoring. 674-681 - Patricia Grubel, Hartmut Kaiser, Jeanine E. Cook, Adrian Serio:
The Performance Implication of Task Size for Applications on the HPX Runtime System. 682-689 - Sean Wallace, Venkatram Vishwanath, Susan Coghlan, Zhiling Lan, Michael E. Papka:
Comparison of Vendor Supplied Environmental Data Collection Mechanisms. 690-697 - Mohammad J. Rashti, Gerald Sabin, David Vansickle, Boyana Norris:
WattProf: A Flexible Platform for Fine-Grained HPC Power Profiling. 698-705 - Michael T. Showerman:
Real Time Visualization of Monitoring Data for Large Scale HPC Systems. 706-709 - Adam DeConinck, Kathleen Kelly:
Evolution of Monitoring over the Lifetime of a High Performance Computing Cluster. 710-713 - Christopher Lee Moore, Prabhu Singh Khalsa, Todd Alan Yilk, Michael Mason:
Monitoring High Performance Computing Systems for the End User. 714-716 - Steven D. Feldman, Deli Zhang, Damian Dechev, James Brandt:
Extending LDMS to Enable Performance Monitoring in Multi-core Applications. 717-720