


IPDPS 2011: Anchorage, Alaska, USA
- 25th IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2011, Anchorage, Alaska, USA, 16-20 May, 2011 - Conference Proceedings. IEEE 2011, ISBN 978-1-61284-372-8

Keynote
- Peter Sanders: Algorithm Engineering for Scalable Parallel External Sorting. 1
Resource Management
- Anne Benoit, Paul Renaud-Goud, Yves Robert: Power-Aware Replica Placement and Update Strategies in Tree Networks. 2-13
- Venkatesan T. Chakaravarthy, Gyana R. Parija, Sambuddha Roy, Yogish Sabharwal, Amit Kumar: Minimum Cost Resource Allocation for Meeting Job Requirements. 14-23
- Kaiqi Xiong: Power and Performance Management in Priority-Type Cluster Computing Systems. 24-35
- Krishna Kant, Muthukumar Murugan, David H. C. Du: Willow: A Control System for Energy and Thermal Adaptive Computing. 36-47
Communication and I/O Optimization
- Michael J. Anderson, Grey Ballard, James Demmel, Kurt Keutzer: Communication-Avoiding QR Decomposition for GPUs. 48-58
- James Buford White III, Jack J. Dongarra: Overlapping Computation and Communication for Advection on Hybrid Parallel Computers. 59-67
- Christopher Mitchell, James P. Ahrens, Jun Wang: VisIO: Enabling Interactive Visualization of Ultra-Scale, Time Series Data via High-Bandwidth Distributed I/O Systems. 68-79
- Abhinav Bhatele, Pritish Jetley, Hormozd Gahvari, Lukasz Wesolowski, William D. Gropp, Laxmikant V. Kalé: Architectural Constraints to Attain 1 Exaflop/s for Three Scientific Application Classes. 80-91
Hardware-Software Interaction
- Pengju Shang, Jun Wang: A Novel Power Management for CMP Systems in Data-Intensive Environment. 92-103
- Seetharami Seelam, Liana Fong, John Lewars, John Divirgilio, Brian F. Veale, Kevin J. Gildea: Characterization of System Services and Their Performance Impact in Multi-core Nodes. 104-117
- Jiahua He, Allan Snavely, Rob F. Van der Wijngaart, Michael A. Frumkin: Automatic Recognition of Performance Idioms in Scientific Applications. 118-127
- Shuaiwen Song, Chun-Yi Su, Rong Ge, Abhinav Vishnu, Kirk W. Cameron: Iso-Energy-Efficiency: An Approach to Power-Constrained Parallel Computation. 128-139
Runtime Systems
- Pieter Bellens, Josep M. Pérez, Rosa M. Badia, Jesús Labarta: A Study of Speculative Distributed Scheduling on the Cell/B.E. 140-151
- Susmit Biswas, Bronis R. de Supinski, Martin Schulz, Diana Franklin, Timothy Sherwood, Frederic T. Chong: Exploiting Data Similarity to Reduce Memory Footprints. 152-163
- Andriy Kot, Andrey N. Chernikov, Nikos Chrisochoides: The Evaluation of an Effective Out-of-Core Run-Time System in the Context of Parallel Mesh Generation. 164-175
- Romain Cledat, Tushar Kumar, Jaswanth Sreeram, Santosh Pande: Enriching 3-D Video Games on Multicores. 176-187
Routing and Communication
- Xin Yuan: On Nonblocking Folded-Clos Networks in Computer Communication Environments. 188-196
- Wei Lin Guay, Bartosz Bogdanski, Sven-Arne Reinemo, Olav Lysne, Tor Skeie: vFtree - A Fat-Tree Routing Algorithm Using Virtual Lanes to Alleviate Congestion. 197-208
- Arnaud Casteigts, Paola Flocchini, Bernard Mans, Nicola Santoro: Measuring Temporal Lags in Delay-Tolerant Networks. 209-218
Self-Stabilization and Security
- Ali Ebnenasir, Aly Farahat: A Lightweight Method for Automated Design of Convergence. 219-230
- Borzoo Bonakdarpour, Stéphane Devismes, Franck Petit: Snap-Stabilizing Committee Coordination. 231-242
- Zhongjian Le, Naixue Xiong, Bo Yang, Yuezhi Zhou: SC-OA: A Secure and Efficient Scheme for Origin Authentication of Interdomain Routing in Cloud Computing Networks. 243-254
Numerical Algorithms
- Huimin Cui, Lei Wang, Jingling Xue, Yang Yang, Xiaobing Feng: Automatic Library Generation for BLAS3 on GPUs. 255-265
- Che-Rung Lee, Zhaojun Bai: Redesign of Higher-Level Matrix Algorithms for Multicore and Distributed Architectures and Applications in Quantum Monte Carlo Simulation. 266-274
- Allison H. Baker, Todd Gamblin, Martin Schulz, Ulrike Meier Yang: Challenges of Scaling Algebraic Multigrid Across Modern Multicore Architectures. 275-286
Reliability and Security
- Keun Soo Yim, Cuong Manh Pham, Mushfiq Saleheen, Zbigniew Kalbarczyk, Ravishankar K. Iyer: Hauberk: Lightweight Silent Data Corruption Error Detector for GPGPU. 287-300
- Govind Sreekar Shenoy, Jordi Tubella, Antonio González: A Performance and Area Efficient Architecture for Intrusion Detection Systems. 301-310
- Martin Dimitrov, Huiyang Zhou: Time-Ordered Event Traces: A New Debugging Primitive for Concurrency Bugs. 311-321
Wireless and Sensor Networks
- Murat Demirbas, Serafettin Tasci, Hanifi Gunes, Atri Rudra: Singlehop Collaborative Feedback Primitives for Threshold Querying in Wireless Sensor Networks. 322-333
- Bo Jiang, Binoy Ravindran: Completely Distributed Particle Filters for Target Tracking in Sensor Networks. 334-344
- Evangelos Kranakis, Danny Krizanc, Ashish Modi, Oscar Morales-Ponce: Connectivity Trade-offs in 3D Wireless Sensor Networks Using Directional Antennae. 345-351
- Sushmita Ruj, Amiya Nayak, Ivan Stojmenovic: Distributed Fine-Grained Access Control in Wireless Sensor Networks. 352-362
GPU Acceleration
- Guochun Shi, Steven A. Gottlieb, Aaron Torok, Volodymyr V. Kindratenko: Design of MILC Lattice QCD Application for GPU Clusters. 363-371
- Thomas George, Vaibhav Saxena, Anshul Gupta, Amik Singh, Anamitra R. Choudhury: Multifrontal Factorization of Sparse SPD Matrices on GPUs. 372-383
- Mamadou Diao, Chrysostomos Nicopoulos, Jongman Kim: Large-Scale Semantic Concept Detection on Manycore Platforms for Multimedia Mining. 384-394
- Rejith George Joseph, Girish Ravunnikutty, Sanjay Ranka, Eduardo F. D'Azevedo, Scott Klasky: Efficient GPU Implementation for Particle in Cell Algorithm. 395-406
Multiprocessing and Concurrency
- Junghee Lee, Chrysostomos Nicopoulos, Yongjae Lee, Hyung Gyu Lee, Jongman Kim: Hardware-Based Job Queue Management for Manycore Architectures and OpenMP Environments. 407-418
- Javier Lira, Carlos Molina, Antonio González: HK-NUCA: Boosting Data Searches in Dynamic Non-Uniform Cache Architectures for Chip Multiprocessors. 419-430
- Juan M. Cebrian, Juan L. Aragón, Stefanos Kaxiras: Power Token Balancing: Adapting CMPs to Power Constraints for Parallel Multithreaded Workloads. 431-442
- Olivier Certner, Zheng Li, Arun Raman, Olivier Temam: A Very Fast Simulator for Exploring the Many-Core Future. 443-454
Compilers
- Sandya S. Mannarswamy, Ramaswamy Govindarajan: Variable Granularity Access Tracking Scheme for Improving the Performance of Software Transactional Memory. 455-466
- Andrei Hagiescu, Huynh Phung Huynh, Weng-Fai Wong, Rick Siow Mong Goh: Automated Architecture-Aware Mapping of Streaming Applications Onto GPUs. 467-478
- Haibo Lin, Tao Liu, Lakshminarayanan Renganarayanan, Huoding Li, Tong Chen, Kevin O'Brien, Ling Shao: Automatic Loop Tiling for Direct Memory Access. 479-489
- Nathaniel Azuelos, Idit Keidar, Ayal Zaks: Tolerant Value Speculation in Coarse-Grain Streaming Computations. 490-501
Special 25th IPDPS Panel: Looking Back
- Yves Robert, William J. Dally, Jack J. Dongarra, Satoshi Matsuoka, Robert Schreiber, Horst D. Simon, Uzi Vishkin: Panel Statement. 505
Tutorial: Parallel Programming Using the Global Arrays Toolkit: Now and into the Future
- Bruce J. Palmer, Manojkumar Krishnan, Abhinav Vishnu: Tutorial Statement. 506
Keynote
- Jack J. Dongarra: Architecture-aware Algorithms and Software for Peta and Exascale Computing. 507
Distributed Algorithms and Models
- Florent Becker, Martín Matamala, Nicolas Nisse, Ivan Rapaport, Karol Suchan, Ioan Todinca: Adding a Referee to an Interconnection Network: What Can(not) Be Computed in One Round. 508-514
- Venkatesan T. Chakaravarthy, Anamitra R. Choudhury, Yogish Sabharwal: Improved Algorithms for the Distributed Trigger Counting Problem. 515-523
- Vijay K. Garg, John Bridgman: The Weighted Byzantine Agreement Problem. 524-531
- Ze Li, Haiying Shen, Karan Sapra: Leveraging Social Networks to Combat Collusion in Reputation Systems for Peer-to-Peer Networks. 532-543
Parallel Graph and Particle Algorithms
- Jiri Barnat, Petr Bauch, Lubos Brim, Milan Ceska: Computing Strongly Connected Components in Parallel on CUDA. 544-555
- Mathias Jacquelin, Loris Marchal, Yves Robert, Bora Uçar: On Optimal Tree Traversals for Sparse Matrix Factorization. 556-567
- Jyothish Soman, Ankur Narang: Fast Community Detection Algorithm with GPUs and Multicore Architectures. 568-579
- Tom Peterka, Robert B. Ross, Boonthanome Nouanesengsy, Teng-Yok Lee, Han-Wei Shen, Wesley Kendall, Jian Huang: A Study of Parallel Particle Tracing for Steady-State and Time-Varying Flow Fields. 580-591
Distributed Systems and Networks
- Lizhong Chen, Ruisheng Wang, Timothy Mark Pinkston: Critical Bubble Scheme: An Efficient Implementation of Globally Aware Network Flow Control. 592-603
- Junyao Zhang, Pengju Shang, Jun Wang: A Scalable Reverse Lookup Scheme Using Group-Based Shifted Declustering Layout. 604-615
- Jens Domke, Torsten Hoefler, Wolfgang E. Nagel: Deadlock-Free Oblivious Routing for Arbitrary Topologies. 616-627
- Ryan E. Grant, Mohammad J. Rashti, Ahmad Afsahi, Pavan Balaji: RDMA Capable iWARP over Datagrams. 628-639
Programming Environments and Tools
- Zoltán Szebenyi, Todd Gamblin, Martin Schulz, Bronis R. de Supinski, Felix Wolf, Brian J. N. Wylie: Reconciling Sampling and Direct Instrumentation for Unintrusive Call-Path Profiling of MPI Programs. 640-651
- Bogdan Marius Tudor, Yong Meng Teo: A Practical Approach for Performance Analysis of Shared-Memory Programs. 652-663
- Pierre-Nicolas Clauss, Mark Stillwell, Stéphane Genaud, Frédéric Suter, Henri Casanova, Martin Quinson: Single Node On-Line Simulation of MPI Applications with SMPI. 664-675
- Matthias Christen, Olaf Schenk, Helmar Burkhart: PATUS: A Code Generation and Autotuning Framework for Parallel Iterative Stencil Computations on Modern Microarchitectures. 676-687
Parallel Algorithms
- Guojing Cong, Konstantin Makarychev: Optimizing Large-Scale Graph Analysis on a Multi-threaded, Multi-core Platform. 688-697
- Rasmus Resen Amossen, Rasmus Pagh: A New Data Layout for Set Intersection on GPUs. 698-708
- Erik Saule, Erdeniz Ö. Bas, Ümit V. Çatalyürek: Partitioning Spatially Located Computations Using Rectangles. 709-720
- Aydin Buluç, Samuel Williams, Leonid Oliker, James Demmel: Reduced-Bandwidth Multithreaded Algorithms for Sparse Matrix-Vector Multiplication. 721-733
Distributed Systems
- Nikos Tziritas, Thanasis Loukopoulos, Spyros Lalis, Petros Lampsas: GRAL: A Grouping Algorithm to Optimize Application Placement in Wireless Embedded Systems. 734-745
- Fatemeh Rahimian, Sarunas Girdzijauskas, Amir Hossein Payberah, Seif Haridi: Vitis: A Gossip-based Hybrid Overlay for Internet-scale Publish/Subscribe Enabling Rendezvous Routing in Unstructured Overlay Networks. 746-757
- Ciprian Docan, Manish Parashar, Julian Cummings, Scott Klasky: Moving the Code to the Data - Dynamic Code Deployment Using ActiveSpaces. 758-769
- Karthik Channakeshava, Keith R. Bisset, V. S. Anil Kumar, Madhav V. Marathe, Shrirang M. Yardi: High Performance Scalable and Expressive Modeling Environment to Study Mobile Malware in Large Dynamic Networks. 770-781
Storage Systems and Memory
- Chentao Wu, Shenggang Wan, Xubin He, Qiang Cao, Changsheng Xie: H-Code: A Hybrid MDS Array Code to Optimize Partial Stripe Writes in RAID-6. 782-793
- Yong Chen, Xian-He Sun, Rajeev Thakur, Philip C. Roth, William D. Gropp: LACIO: A New Collective I/O Strategy for Parallel I/O Systems. 794-804
- Feng Ji, Xiaosong Ma: Using Shared Memory to Accelerate MapReduce on Graphics Processing Units. 805-816
- Woojin Choi, Jeff Draper: Unified Signatures for Improving Performance in Transactional Memory. 817-827
Operating Systems and Resource Management
- Wei Tang, Zhiling Lan, Narayan Desai, Daniel Buettner, Yongen Yu: Reducing Fragmentation on Torus-Connected Supercomputers. 828-839
- Ziming Zheng, Li Yu, Wei Tang, Zhiling Lan, Rinku Gupta, Narayan Desai, Susan Coghlan, Daniel Buettner: Co-analysis of RAS Log and Job Log on Blue Gene/P. 840-851
- Alessandro Morari, Roberto Gioiosa, Robert W. Wisniewski, Francisco J. Cazorla, Mateo Valero: A Quantitative Analysis of OS Noise. 852-863
- Hiroyuki Takizawa, Kentaro Koyama, Katsuto Sato, Kazuhiko Komatsu, Hiroaki Kobayashi: CheCL: Transparent Checkpointing and Process Migration of OpenCL Applications. 864-876
Special 25th IPDPS Panel: What's Ahead
- Per Stenström, Doug Burger, Wen-mei W. Hwu, Vipin Kumar, Kunle Olukotun, David A. Padua, Burton Smith: Panel Statement. 877
Keynote
- Bill Dally: Power, Programmability, and Granularity: The Challenges of ExaScale Computing. 878
Plenary Session: Best Papers
- Ananta Tiwari, Jeffrey K. Hollingsworth: Online Adaptive Code Generation and Tuning. 879-892
- José L. Abellán, Juan Fernández, Manuel E. Acacio: GLocks: Efficient Support for Highly-Contended Locks in Many-Core CMPs. 893-905
- Andrew Nere, Atif Hashmi, Mikko H. Lipasti: Profiling Heterogeneous Multi-GPU Systems to Accelerate Cortically Inspired Learning Algorithms. 906-920
- Daniel Delling, Andrew V. Goldberg, Andreas Nowatzyk, Renato Fonseca F. Werneck: PHAST: Hardware-Accelerated Shortest Path Trees. 921-931
Numerical Algorithms
- Emmanuel Agullo, Cédric Augonnet, Jack J. Dongarra, Mathieu Faverge, Hatem Ltaief, Samuel Thibault, Stanimire Tomov: QR Factorization on a Multicore Node Enhanced with Multiple GPU Accelerators. 932-943
- Piotr Luszczek, Hatem Ltaief, Jack J. Dongarra: Two-Stage Tridiagonal Reduction for Dense Symmetric Matrices Using Tile Algorithms on Multicore Architectures. 944-955
- Andrew A. Davidson, Yao Zhang, John D. Owens: An Auto-tuned Method for Solving Large Tridiagonal Systems on the GPU. 956-965
- Mark Hoemmen: A Communication-Avoiding, Hybrid-Parallel, Rank-Revealing Orthogonalization Method. 966-977
Fault Tolerance
- Björn Kolbeck, Mikael Högqvist, Jan Stender, Felix Hupfeld: Flease - Lease Coordination Without a Lock Server. 978-988
- Amina Guermouche, Thomas Ropars, Elisabeth Brunet, Marc Snir, Franck Cappello: Uncoordinated Checkpointing Without Domino Effect for Send-Deterministic MPI Applications. 989-1000
- Tristan Fevat, Emmanuel Godard: Minimal Obstructions for the Coordinated Attack Problem and Beyond. 1001-1011
- Henri Casanova, Fanny Dufossé, Yves Robert, Frédéric Vivien: Scheduling Parallel Iterative Applications on Volatile Resources. 1012-1023
Resource Utilization
- Jaideep Moses, Ravi R. Iyer, Ramesh Illikkal, Sadagopan Srinivasan, Konstantinos Aisopos: Shared Resource Monitoring and Throughput Optimization in Cloud-Computing Datacenters. 1024-1033
- Qingyang Wang, Simon Malkowski, Deepal Jayasinghe, PengCheng Xiong, Calton Pu, Yasuhiko Kanemasa, Motoyuki Kawaba, Lilian Harada: The Impact of Soft Resource Allocation on n-Tier Application Scalability. 1034-1045
- Rui Yang, Joseph Antony, Alistair P. Rendell, Danny Robson, Peter E. Strazdins: Profiling Directed NUMA Optimization on Linux Systems: A Case Study of the Gaussian Computational Chemistry Code. 1046-1057
- Kevin Stock, Thomas Henretty, Iyyappa Murugandi, P. Sadayappan, Robert J. Harrison: Model-Driven SIMD Code Generation for a Multi-resolution Tensor Kernel. 1058-1067
Parallel Programming Models and Languages
- Jeff A. Stuart, John D. Owens: Multi-GPU MapReduce on GPU Clusters. 1068-1079
- Josh Milthorpe, V. Ganesh, Alistair P. Rendell, David Grove: X10 as a Parallel Language for Scientific Computation: Practice and Experience. 1080-1088
- Guohua Jin, John M. Mellor-Crummey, Laksono Adhianto, William N. Scherer III, Chaoran Yang: Implementation and Performance Evaluation of the HPC Challenge Benchmarks in Coarray Fortran 2.0. 1089-1100
- Rajkishore Barik, Jisheng Zhao, David Grove, Igor Peshansky, Zoran Budimlic, Vivek Sarkar: Communication Optimizations for Distributed-Memory X10 Programs. 1101-1113
Algorithms for Distributed Computing
- Deepak Ajwani, Nodari Sitchinava, Norbert Zeh: I/O-Optimal Distribution Sweeping on Private-Cache Chip Multiprocessors. 1114-1123
- Zheng Wei, Joseph F. JáJá: A Fast Algorithm for Constructing Inverted Files on Heterogeneous Platforms. 1124-1134
- Daniel Delling, Andrew V. Goldberg, Ilya P. Razenshteyn, Renato Fonseca F. Werneck: Graph Partitioning with Natural Cuts. 1135-1146
- Shaojie Tang, Cheng Wang, Xiang-Yang Li, Changjun Jiang: Reader Activation Scheduling in Multi-reader RFID Systems: A Study of General Case. 1147-1155
Scheduling
- Peter Sanders, Jochen Speck: Efficient Parallel Scheduling of Malleable Tasks. 1156-1166
- Veronika Rehn-Sonigo, Denis Trystram, Frédéric Wagner, Haifeng Xu, Guochuan Zhang: Offline Scheduling of Multi-threaded Request Streams on a Caching Server. 1167-1176
- Daniel Cordeiro, Pierre-François Dutot, Grégory Mounié, Denis Trystram: Tight Analysis of Relaxed Multi-organization Scheduling Algorithms. 1177-1186
- Yuxiong He, Jie Liu, Hongyang Sun: Scheduling Functionally Heterogeneous Systems with Utilization Balancing. 1187-1198
Computational Biology and Simulations
- Edans Flavius de Oliveira Sandes, Alba Cristina Magalhaes Alves de Melo: Smith-Waterman Alignment of Huge Sequences with GPU in Linear Space. 1199-1211
- Shucai Xiao, Heshan Lin, Wu-chun Feng: Accelerating Protein Sequence Search in a Heterogeneous Computing System. 1212-1222
- Xiao Yang, Jaroslaw Zola, Srinivas Aluru: Parallel Metagenomic Sequence Clustering Via Sketching and Maximal Quasi-clique Enumeration on Map-Reduce Clouds. 1223-1233
- Tobias C. Kerscher, Stefan Müller, Quinn O. Snell, Gus L. W. Hart: Large-Scale Lattice Gas Monte Carlo Simulations for the Generalized Ising Model. 1234-1241
Cloud Computing
- Henry M. Monti, Ali Raza Butt, Sudharshan S. Vazhkudai: CATCH: A Cloud-Based Adaptive Data Transfer Service for HPC. 1242-1253
- Ming Li, Fan Ye, Minkyong Kim, Han Chen, Hui Lei: A Scalable and Elastic Publish/Subscribe Service. 1254-1265
- Yujuan Tan, Hong Jiang, Dan Feng, Lei Tian, Zhichao Yan: CABdedupe: A Causality-Based Deduplication Performance Booster for Cloud Backup Services. 1266-1277
- Mihai Budiu, Daniel Delling, Renato Fonseca F. Werneck: DryadOpt: Branch-and-Bound on Distributed Data-Parallel Execution Engines. 1278-1289
