Rajeev Thakur
2020 – today
- 2024
- [c136]Jiajun Huang, Sheng Di, Xiaodong Yu, Yujia Zhai, Jinyang Liu, Yafan Huang, Ken Raffenetti, Hui Zhou, Kai Zhao, Xiaoyi Lu, Zizhong Chen, Franck Cappello, Yanfei Guo, Rajeev Thakur:
gZCCL: Compression-Accelerated Collective Communication Framework for GPU Clusters. ICS 2024: 437-448 - [c135]Murali Emani, Sam Foreman, Varuni Sastry, Zhen Xie, Siddhisanket Raskar, William Arnold, Rajeev Thakur, Venkatram Vishwanath, Michael E. Papka, Sanjif Shanmugavelu, Darshan Gandhi, Hengyu Zhao, Dun Ma, Kiran Ranganath, Rick Weisner, Jiunn-yeu Chen, Yuting Yang, Natalia Vassilieva, Bin C. Zhang, Sylvia Howland, Alexander Tsyplikhin:
Toward a Holistic Performance Evaluation of Large Language Models Across Diverse AI Accelerators. IPDPS (Workshops) 2024: 1-10 - [c134]Jiajun Huang, Sheng Di, Xiaodong Yu, Yujia Zhai, Zhaorui Zhang, Jinyang Liu, Xiaoyi Lu, Ken Raffenetti, Hui Zhou, Kai Zhao, Zizhong Chen, Franck Cappello, Yanfei Guo, Rajeev Thakur:
An Optimized Error-controlled MPI Collective Framework Integrated with Lossy Compression. IPDPS 2024: 752-764 - [c133]Jiajun Huang, Sheng Di, Xiaodong Yu, Yujia Zhai, Jinyang Liu, Yafan Huang, Ken Raffenetti, Hui Zhou, Kai Zhao, Zizhong Chen, Franck Cappello, Yanfei Guo, Rajeev Thakur:
POSTER: Optimizing Collective Communications with Error-bounded Lossy Compression for GPU Clusters. PPoPP 2024: 454-456 - [i16]Hui Zhou, Ken Raffenetti, Junchao Zhang, Yanfei Guo, Rajeev Thakur:
Frustrated with MPI+Threads? Try MPIxThreads! CoRR abs/2401.16551 (2024) - [i15]Hui Zhou, Ken Raffenetti, Yanfei Guo, Thomas Gillis, Robert Latham, Rajeev Thakur:
Designing and Prototyping Extensions to MPI in MPICH. CoRR abs/2402.12274 (2024) - [i14]Hui Zhou, Robert Latham, Ken Raffenetti, Yanfei Guo, Rajeev Thakur:
MPI Progress For All. CoRR abs/2405.13807 (2024) - [i13]Logan T. Ward, J. Gregory Pauloski, Valérie Hayot-Sasson, Yadu N. Babuji, Alexander Brace, Ryan Chard, Kyle Chard, Rajeev Thakur, Ian T. Foster:
Employing Artificial Intelligence to Steer Exascale Workflows with Colmena. CoRR abs/2408.14434 (2024) - 2023
- [c132]Michael Wilkins, Hanming Wang, Peizhi Liu, Bangyen Pham, Yanfei Guo, Rajeev Thakur, Peter A. Dinda, Nikos Hardavellas:
Generalized Collective Algorithms for the Exascale Era. CLUSTER 2023: 60-71 - [c131]Jiajun Huang, Kaiming Ouyang, Yujia Zhai, Jinyang Liu, Min Si, Ken Raffenetti, Hui Zhou, Atsushi Hori, Zizhong Chen, Yanfei Guo, Rajeev Thakur:
PiP-MColl: Process-in-Process-based Multi-object MPI Collectives. CLUSTER 2023: 354-364 - [c130]Jiajun Huang, Kaiming Ouyang, Yujia Zhai, Jinyang Liu, Min Si, Ken Raffenetti, Hui Zhou, Atsushi Hori, Zizhong Chen, Yanfei Guo, Rajeev Thakur:
Accelerating MPI Collectives with Process-in-Process-based Multi-object Techniques. HPDC 2023: 333-334 - [c129]Thomas Gillis, Ken Raffenetti, Hui Zhou, Yanfei Guo, Rajeev Thakur:
Quantifying the Performance Benefits of Partitioned Communication in MPI. ICPP 2023: 285-294 - [c128]Logan T. Ward, J. Gregory Pauloski, Valérie Hayot-Sasson, Ryan Chard, Yadu N. Babuji, Ganesh Sivaraman, Sutanay Choudhury, Kyle Chard, Rajeev Thakur, Ian T. Foster:
Cloud Services Enable Efficient AI-Guided Simulation Workflows across Heterogeneous Resources. IPDPS Workshops 2023: 32-41 - [c127]Hui Zhou, Ken Raffenetti, Junchao Zhang, Yanfei Guo, Rajeev Thakur:
Frustrated With MPI+Threads? Try MPIxThreads! EuroMPI 2023: 2:1-2:10 - [i12]Logan T. Ward, J. Gregory Pauloski, Valérie Hayot-Sasson, Ryan Chard, Yadu N. Babuji, Ganesh Sivaraman, Sutanay Choudhury, Kyle Chard, Rajeev Thakur, Ian T. Foster:
Cloud Services Enable Efficient AI-Guided Simulation Workflows across Heterogeneous Resources. CoRR abs/2303.08803 (2023) - [i11]Jiajun Huang, Sheng Di, Xiaodong Yu, Yujia Zhai, Jinyang Liu, Ken Raffenetti, Hui Zhou, Kai Zhao, Zizhong Chen, Franck Cappello, Yanfei Guo, Rajeev Thakur:
C-Coll: Introducing Error-bounded Lossy Compression into MPI Collectives. CoRR abs/2304.03890 (2023) - [i10]Jiajun Huang, Kaiming Ouyang, Yujia Zhai, Jinyang Liu, Min Si, Ken Raffenetti, Hui Zhou, Atsushi Hori, Zizhong Chen, Yanfei Guo, Rajeev Thakur:
Accelerating MPI Collectives with Process-in-Process-based Multi-object Techniques. CoRR abs/2305.10612 (2023) - [i9]Thomas Gillis, Ken Raffenetti, Hui Zhou, Yanfei Guo, Rajeev Thakur:
Quantifying the Performance Benefits of Partitioned Communication in MPI. CoRR abs/2308.03930 (2023) - [i8]Jiajun Huang, Sheng Di, Xiaodong Yu, Yujia Zhai, Jinyang Liu, Yafan Huang, Ken Raffenetti, Hui Zhou, Kai Zhao, Zizhong Chen, Franck Cappello, Yanfei Guo, Rajeev Thakur:
gZCCL: Compression-Accelerated Collective Communication Framework for GPU Clusters. CoRR abs/2308.05199 (2023) - [i7]Murali Emani, Sam Foreman, Varuni Sastry, Zhen Xie, Siddhisanket Raskar, William Arnold, Rajeev Thakur, Venkatram Vishwanath, Michael E. Papka:
A Comprehensive Performance Study of Large Language Models on Novel AI Accelerators. CoRR abs/2310.04607 (2023) - 2022
- [c126]Michael Wilkins, Yanfei Guo, Rajeev Thakur, Peter A. Dinda, Nikos Hardavellas:
ACCLAiM: Advancing the Practicality of MPI Collective Communication Autotuning Using Machine Learning. CLUSTER 2022: 161-171 - [c125]Murali Emani, Zhen Xie, Siddhisanket Raskar, Varuni Sastry, William Arnold, Bruce Wilson, Rajeev Thakur, Venkatram Vishwanath, Zhengchun Liu, Michael E. Papka, Cindy Orozco Bohorquez, Rick Weisner, Karen Li, Yongning Sheng, Yun Du, Jian Zhang, Alexander Tsyplikhin, Gurdaman Khaira, Jeremy Fowers, Ramakrishnan Sivakumar, Victoria Godsoe, Adrián Macías, Chetan Tekur, Matthew Boyd:
A Comprehensive Evaluation of Novel AI Accelerators for Deep Learning Workloads. PMBS@SC 2022: 13-25 - [c124]Hui Zhou, Ken Raffenetti, Yanfei Guo, Rajeev Thakur:
MPIX Stream: An Explicit Solution to Hybrid MPI+X Programming. EuroMPI 2022: 1-10 - [i6]Hui Zhou, Ken Raffenetti, Yanfei Guo, Rajeev Thakur:
MPIX Stream: An Explicit Solution to Hybrid MPI+X Programming. CoRR abs/2208.13707 (2022) - 2021
- [j47]Anshu Dubey, Lois Curfman McInnes, Rajeev Thakur, Erik W. Draeger, Thomas M. Evans, Timothy C. Germann, William E. Hart:
Performance Portability in the Exascale Computing Project: Exploration Through a Panel Series. Comput. Sci. Eng. 23(5): 46-54 (2021) - [j46]Francis J. Alexander, James A. Ang, Jenna A. Bilbrey, Jan Balewski, Tiernan Casey, Ryan Chard, Jong Choi, Sutanay Choudhury, Bert J. Debusschere, Anthony M. DeGennaro, Nikoli Dryden, J. Austin Ellis, Ian T. Foster, Cristina Garcia-Cardona, Sayan Ghosh, Peter Harrington, Yunzhi Huang, Shantenu Jha, Travis Johnston, Ai Kagawa, Ramakrishnan Kannan, Neeraj Kumar, Zhengchun Liu, Naoya Maruyama, Satoshi Matsuoka, Erin McCarthy, Jamaludin Mohd-Yusof, Peter Nugent, Yosuke Oyama, Thomas Proffen, David Pugmire, Sivasankaran Rajamanickam, Vinay Ramakrishnaiah, Malachi Schram, Sudip K. Seal, Ganesh Sivaraman, Christine Sweeney, Li Tan, Rajeev Thakur, Brian Van Essen, Logan T. Ward, Paul M. Welch, Michael Wolf, Sotiris S. Xantheas, Kevin G. Yager, Shinjae Yoo, Byung-Jun Yoon:
Co-design Center for Exascale Machine Learning Technologies (ExaLearn). Int. J. High Perform. Comput. Appl. 35(6): 598-616 (2021) - [j45]William Gropp, Rajeev Thakur, Pavan Balaji:
Translational research in the MPICH project. J. Comput. Sci. 52: 101203 (2021) - [c123]Michael Wilkins, Yanfei Guo, Rajeev Thakur, Nikos Hardavellas, Peter A. Dinda, Min Si:
A FACT-based Approach: Making Machine Learning Collective Autotuning Feasible on Exascale Systems. ExaMPI@SC 2021: 36-45 - [c122]Logan T. Ward, Ganesh Sivaraman, J. Gregory Pauloski, Yadu N. Babuji, Ryan Chard, Naveen Dandu, Paul C. Redfern, Rajeev S. Assary, Kyle Chard, Larry A. Curtiss, Rajeev Thakur, Ian T. Foster:
Colmena: Scalable Machine-Learning-Based Steering of Ensemble Simulations for High Performance Computing. MLHPC@SC 2021: 9-20 - [i5]Logan T. Ward, Ganesh Sivaraman, J. Gregory Pauloski, Yadu N. Babuji, Ryan Chard, Naveen Dandu, Paul C. Redfern, Rajeev S. Assary, Kyle Chard, Larry A. Curtiss, Rajeev Thakur, Ian T. Foster:
Colmena: Scalable Machine-Learning-Based Steering of Ensemble Simulations for High Performance Computing. CoRR abs/2110.02827 (2021)
2010 – 2019
- 2019
- [j44]William Gropp, Rajeev Thakur:
Guest editor's introduction: Special issue on best papers from EuroMPI/USA 2017. Parallel Comput. 84: 62 (2019) - 2017
- [j43]Anthony Kougkas, Hassan Eslami, Xian-He Sun, Rajeev Thakur, William Gropp:
Rethinking key-value store for parallel I/O optimization. Int. J. High Perform. Comput. Appl. 31(4): 335-356 (2017) - [e1]Antonio J. Peña, Pavan Balaji, William Gropp, Rajeev Thakur:
Proceedings of the 24th European MPI Users' Group Meeting, EuroMPI/USA 2017, Chicago, IL, USA, September 25-28, 2017. ACM 2017, ISBN 978-1-4503-4849-2 [contents] - 2016
- [j42]Rajeev Thakur:
Scanning LIDAR in Advanced Driver Assistance Systems and Beyond. IEEE Consumer Electron. Mag. 5(3): 48-54 (2016) - [j41]James Dinan, Pavan Balaji, Darius Buntinas, David Goodell, William Gropp, Rajeev Thakur:
An implementation and evaluation of the MPI 3.0 one-sided communication interface. Concurr. Comput. Pract. Exp. 28(17): 4385-4404 (2016) - [j40]Ashwin M. Aji, Lokendra S. Panwar, Feng Ji, Karthik Murthy, Milind Chabbi, Pavan Balaji, Keith R. Bisset, James Dinan, Wu-chun Feng, John M. Mellor-Crummey, Xiaosong Ma, Rajeev Thakur:
MPI-ACC: Accelerator-Aware MPI for Scientific Applications. IEEE Trans. Parallel Distributed Syst. 27(5): 1401-1414 (2016) - [c121]Yong Chen, Chao Chen, Yanlong Yin, Xian-He Sun, Rajeev Thakur, William Gropp:
Rethinking High Performance Computing System Architecture for Scientific Big Data Applications. Trustcom/BigDataSE/ISPA 2016: 1605-1612 - 2015
- [j39]Yin Lu, Yong Chen, Yu Zhuang, Jialin Liu, Rajeev Thakur:
Collective input/output under memory constraints. Int. J. High Perform. Comput. Appl. 29(1): 21-36 (2015) - [j38]Seong Jo Kim, Yuanrui Zhang, Seung Woo Son, Mahmut T. Kandemir, Wei-keng Liao, Rajeev Thakur, Alok N. Choudhary:
IOPro: a parallel I/O profiling and visualization framework for high-performance storage systems. J. Supercomput. 71(3): 840-870 (2015) - [j37]Yong Chen, Yin Lu, Prathamesh Amritkar, Rajeev Thakur, Yu Zhuang:
Performance model-directed data sieving for high-performance I/O. J. Supercomput. 71(6): 2066-2090 (2015) - [j36]Torsten Hoefler, James Dinan, Rajeev Thakur, Brian Barrett, Pavan Balaji, William Gropp, Keith D. Underwood:
Remote Memory Access Programming in MPI-3. ACM Trans. Parallel Comput. 2(2): 9:1-9:26 (2015) - [c120]Swann Perarnau, Rajeev Thakur, Kamil Iskra, Ken Raffenetti, Franck Cappello, Rinku Gupta, Peter H. Beckman, Marc Snir, Henry Hoffmann, Martin Schulz, Barry Rountree:
Distributed Monitoring and Management of Exascale Systems in the Argo Project. DAIS 2015: 173-178 - [c119]Hassan Eslami, Anthony Kougkas, Maria Kotsifakou, Theodoros Kasampalis, Kun Feng, Yin Lu, William Gropp, Xian-He Sun, Yong Chen, Rajeev Thakur:
Efficient disk-to-disk sorting: a case study in the decoupled execution paradigm. DISCS@SC 2015: 2:1-2:8 - 2014
- [j35]James Dinan, Ryan E. Grant, Pavan Balaji, David Goodell, Douglas Miller, Marc Snir, Rajeev Thakur:
Enabling communication concurrency through flexible MPI endpoints. Int. J. High Perform. Comput. Appl. 28(4): 390-405 (2014) - [j34]John Jenkins, James Dinan, Pavan Balaji, Tom Peterka, Nagiza F. Samatova, Rajeev Thakur:
Processing MPI Derived Datatypes on Noncontiguous GPU-Resident Data. IEEE Trans. Parallel Distributed Syst. 25(10): 2627-2637 (2014) - [c118]Chao Chen, Yong Chen, Kun Feng, Yanlong Yin, Hassan Eslami, Rajeev Thakur, Xian-He Sun, William D. Gropp:
Decoupled I/O for Data-Intensive High Performance Computing. ICPP Workshops 2014: 312-320 - [c117]Yanlong Yin, Antonios Kougkas, Kun Feng, Hassan Eslami, Yin Lu, Xian-He Sun, Rajeev Thakur, William Gropp:
Rethinking key-value store for parallel I/O optimization. DISCS@SC 2014: 33-40 - 2013
- [j33]Torsten Hoefler, James Dinan, Darius Buntinas, Pavan Balaji, Brian Barrett, Ron Brightwell, William Gropp, Vivek Kale, Rajeev Thakur:
MPI + MPI: a new hybrid approach to parallel programming with MPI plus shared memory. Computing 95(12): 1121-1136 (2013) - [c116]Xin Zhao, Darius Buntinas, Judicael A. Zounmevo, James Dinan, David Goodell, Pavan Balaji, Rajeev Thakur, Ahmad Afsahi, William Gropp:
Toward Asynchronous and MPI-Interoperable Active Messages. CCGRID 2013: 87-94 - [c115]Kun Feng, Yanlong Yin, Chao Chen, Hassan Eslami, Xian-He Sun, Yong Chen, Rajeev Thakur, William Gropp:
Runtime system design of decoupled execution paradigm for data-intensive high-end computing. CLUSTER 2013: 1 - [c114]Xin Zhao, Pavan Balaji, William Gropp, Rajeev Thakur:
Optimization Strategies for MPI-Interoperable Active Messages. DASC 2013: 508-515 - [c113]Ashwin M. Aji, Lokendra S. Panwar, Feng Ji, Milind Chabbi, Karthik Murthy, Pavan Balaji, Keith R. Bisset, James Dinan, Wu-chun Feng, John M. Mellor-Crummey, Xiaosong Ma, Rajeev Thakur:
On the efficacy of GPU-integrated MPI for scientific applications. HPDC 2013: 191-202 - [c112]Palden Lama, Yan Li, Ashwin M. Aji, Pavan Balaji, James Dinan, Shucai Xiao, Yunquan Zhang, Wu-chun Feng, Rajeev Thakur, Xiaobo Zhou:
pVOCL: Power-Aware Dynamic Placement and Migration in Virtualized GPU Environments. ICDCS 2013: 145-154 - [c111]Xin Zhao, Pavan Balaji, William Gropp, Rajeev Thakur:
MPI-Interoperable Generalized Active Messages. ICPADS 2013: 200-207 - [c110]Yin Lu, Yong Chen, Yu Zhuang, Rajeev Thakur:
Memory-conscious collective I/O for extreme scale HPC systems. ROSS@ICS 2013: 5:1-5:8 - [c109]Yanlong Yin, Jibing Li, Jun He, Xian-He Sun, Rajeev Thakur:
Pattern-Direct and Layout-Aware Replication Scheme for Parallel I/O Systems. IPDPS 2013: 345-356 - [c108]Ashwin M. Aji, Pavan Balaji, James Dinan, Wu-chun Feng, Rajeev Thakur:
Synchronization and Ordering Semantics in Hybrid MPI+GPU Programming. IPDPS Workshops 2013: 1020-1029 - [c107]James Dinan, Pavan Balaji, David Goodell, Douglas Miller, Marc Snir, Rajeev Thakur:
Enabling MPI interoperability through flexible communication endpoints. EuroMPI 2013: 13-18 - [c106]Antonio J. Peña, Ralf G. Correa Carvalho, James Dinan, Pavan Balaji, Rajeev Thakur, William Gropp:
Analysis of topology-dependent MPI performance on Gemini networks. EuroMPI 2013: 61-66 - [i4]Anshu Dubey, Steven R. Brandt, Richard C. Brower, M. Giles, Paul D. Hovland, Don Q. Lamb, Frank Löffler, Boyana Norris, Brian W. O'Shea, Claudio Rebbi, Marc Snir, Rajeev Thakur:
Software Abstractions and Methodologies for HPC Simulation Codes on Future Architectures. CoRR abs/1309.1780 (2013) - 2012
- [c105]Shucai Xiao, Pavan Balaji, James Dinan, Qian Zhu, Rajeev Thakur, Susan Coghlan, Heshan Lin, Gaojin Wen, Jue Hong, Wu-chun Feng:
Transparent Accelerator Migration in a Virtualized GPU Environment. CCGRID 2012: 124-131 - [c104]Yanlong Yin, Surendra Byna, Huaiming Song, Xian-He Sun, Rajeev Thakur:
Boosting Application-Specific Parallel I/O Optimization Using IOSIG. CCGRID 2012: 196-203 - [c103]Yong Chen, Chao Chen, Xian-He Sun, William D. Gropp, Rajeev Thakur:
A Decoupled Execution Paradigm for Data-Intensive High-End Computing. CLUSTER 2012: 200-208 - [c102]Jun He, Xian-He Sun, Rajeev Thakur:
KNOWAC: I/O Prefetch via Accumulated Knowledge. CLUSTER 2012: 429-437 - [c101]John Jenkins, James Dinan, Pavan Balaji, Nagiza F. Samatova, Rajeev Thakur:
Enabling Fast, Noncontiguous GPU Data Movement in Hybrid MPI+GPU Environments. CLUSTER 2012: 468-476 - [c100]Feng Ji, Ashwin M. Aji, James Dinan, Darius Buntinas, Pavan Balaji, Rajeev Thakur, Wu-chun Feng, Xiaosong Ma:
DMA-Assisted, Intranode Communication in GPU Accelerated Systems. HPCC-ICESS 2012: 461-468 - [c99]Ashwin M. Aji, James Dinan, Darius Buntinas, Pavan Balaji, Wu-chun Feng, Keith R. Bisset, Rajeev Thakur:
MPI-ACC: An Integrated and Extensible Approach to Data Movement in Accelerator-based Systems. HPCC-ICESS 2012: 647-654 - [c98]Hui Jin, Jiayu Ji, Xian-He Sun, Yong Chen, Rajeev Thakur:
CHAIO: Enabling HPC Applications on Data-Intensive File Systems. ICPP 2012: 369-378 - [c97]Huaiming Song, Hui Jin, Jun He, Xian-He Sun, Rajeev Thakur:
A Server-Level Adaptive Data Layout Strategy for Parallel File Systems. IPDPS Workshops 2012: 2095-2103 - [c96]William Gropp, Ewing L. Lusk, Rajeev Thakur:
Advanced MPI Including New MPI-3 Features. EuroMPI 2012: 14 - [c95]James Dinan, David Goodell, William Gropp, Rajeev Thakur, Pavan Balaji:
Efficient Multithreaded Context ID Allocation in MPI. EuroMPI 2012: 57-66 - [c94]Torsten Hoefler, James Dinan, Darius Buntinas, Pavan Balaji, Brian W. Barrett, Ron Brightwell, William Gropp, Vivek Kale, Rajeev Thakur:
Leveraging MPI's One-Sided Communication Interface for Shared-Memory Programming. EuroMPI 2012: 132-141 - [c93]Seong Jo Kim, Seung Woo Son, Wei-keng Liao, Mahmut T. Kandemir, Rajeev Thakur, Alok N. Choudhary:
IOPin: Runtime Profiling of Parallel I/O in HPC Systems. SC Companion 2012: 18-23 - [c92]Yin Lu, Yong Chen, Rajeev Thakur, Yu Zhuang:
Abstract: Memory-Conscious Collective I/O for Extreme-Scale HPC Systems. SC Companion 2012: 1360-1361 - [c91]Yin Lu, Yong Chen, Rajeev Thakur, Yu Zhuang:
Poster: Memory-Conscious Collective I/O for Extreme-Scale HPC Systems. SC Companion 2012: 1362 - 2011
- [j32]Ganesh Gopalakrishnan, Robert M. Kirby, Stephen F. Siegel, Rajeev Thakur, William Gropp, Ewing L. Lusk, Bronis R. de Supinski, Martin Schulz, Greg Bronevetsky:
Formal analysis of MPI-based parallel programs. Commun. ACM 54(12): 82-91 (2011) - [j31]Torsten Hoefler, Rolf Rabenseifner, Hubert Ritzdorf, Bronis R. de Supinski, Rajeev Thakur, Jesper Larsson Träff:
The scalable process topology interface of MPI 2.2. Concurr. Comput. Pract. Exp. 23(4): 293-310 (2011) - [j30]Jack J. Dongarra, Peter H. Beckman, Terry Moore, Patrick Aerts, Giovanni Aloisio, Jean-Claude Andre, David Barkai, Jean-Yves Berthou, Taisuke Boku, Bertrand Braunschweig, Franck Cappello, Barbara M. Chapman, Xuebin Chi, Alok N. Choudhary, Sudip S. Dosanjh, Thom H. Dunning, Sandro Fiore, Al Geist, Bill Gropp, Robert J. Harrison, Mark Hereld, Michael A. Heroux, Adolfy Hoisie, Koh Hotta, Zhong Jin, Yutaka Ishikawa, Fred Johnson, Sanjay Kale, Richard Kenway, David E. Keyes, Bill Kramer, Jesús Labarta, Alain Lichnewsky, Thomas Lippert, Bob Lucas, Barney Maccabe, Satoshi Matsuoka, Paul Messina, Peter Michielse, Bernd Mohr, Matthias S. Müller, Wolfgang E. Nagel, Hiroshi Nakashima, Michael E. Papka, Daniel A. Reed, Mitsuhisa Sato, Edward Seidel, John Shalf, David Skinner, Marc Snir, Thomas L. Sterling, Rick Stevens, Frederick H. Streitz, Bob Sugar, Shinji Sumimoto, William M. Tang, John A. Taylor, Rajeev Thakur, Anne E. Trefethen, Mateo Valero, Aad J. van der Steen, Jeffrey S. Vetter, Peg Williams, Robert W. Wisniewski, Katherine A. Yelick:
The International Exascale Software Project roadmap. Int. J. High Perform. Comput. Appl. 25(1): 3-60 (2011) - [j29]Pavan Balaji, Darius Buntinas, David Goodell, William Gropp, Torsten Hoefler, Sameer Kumar, Ewing L. Lusk, Rajeev Thakur, Jesper Larsson Träff:
MPI on Millions of Cores. Parallel Process. Lett. 21(1): 45-60 (2011) - [c90]Huaiming Song, Yanlong Yin, Xian-He Sun, Rajeev Thakur, Samuel Lang:
A Segment-Level Adaptive Data Layout Scheme for Improved Load Balance in Parallel File Systems. CCGRID 2011: 414-423 - [c89]Yong Chen, Xian-He Sun, Rajeev Thakur, Philip C. Roth, William D. Gropp:
LACIO: A New Collective I/O Strategy for Parallel I/O Systems. IPDPS 2011: 794-804 - [c88]David Goodell, William Gropp, Xin Zhao, Rajeev Thakur:
Scalable Memory Use in MPI: A Case Study with MPICH2. EuroMPI 2011: 140-149 - [c87]William Gropp, Torsten Hoefler, Rajeev Thakur, Jesper Larsson Träff:
Performance Expectations and Guidelines for MPI Derived Datatypes. EuroMPI 2011: 150-159 - [c86]Huaiming Song, Yanlong Yin, Xian-He Sun, Rajeev Thakur, Samuel Lang:
Server-side I/O coordination for parallel file systems. SC 2011: 17:1-17:11 - 2010
- [j28]Pavan Balaji, Wu-chun Feng, Heshan Lin, Jeremy S. Archuleta, Satoshi Matsuoka, Andrew S. Warren, João Carlos Setubal, Ewing L. Lusk, Rajeev Thakur, Ian T. Foster, Daniel S. Katz, Shantenu Jha, K. Shinpaugh, Susan Coghlan, Daniel A. Reed:
Global-scale distributed I/O with ParaMEDIC. Concurr. Comput. Pract. Exp. 22(16): 2266-2281 (2010) - [j27]Pavan Balaji, Anthony Chan, William Gropp, Rajeev Thakur, Ewing L. Lusk:
The Importance of Non-Data-Communication Overheads in MPI. Int. J. High Perform. Comput. Appl. 24(1): 5-15 (2010) - [j26]Pavan Balaji, Darius Buntinas, David Goodell, William Gropp, Rajeev Thakur:
Fine-Grained Multithreading Support for Hybrid Threaded MPI Programming. Int. J. High Perform. Comput. Appl. 24(1): 49-57 (2010) - [j25]Jesper Larsson Träff, Andreas Ripke, Christian Siebert, Pavan Balaji, Rajeev Thakur, William Gropp:
A Pipelined Algorithm for Large, Irregular All-Gather Problems. Int. J. High Perform. Comput. Appl. 24(1): 58-68 (2010) - [j24]Zhiling Lan, Jiexing Gu, Ziming Zheng, Rajeev Thakur, Susan Coghlan:
A study of dynamic meta-learning for failure prediction in large-scale systems. J. Parallel Distributed Comput. 70(6): 630-643 (2010) - [j23]Salman Pervez, Ganesh Gopalakrishnan, Robert M. Kirby, Rajeev Thakur, William Gropp:
Formal methods applied to high-performance computing software design: a case study of MPI one-sided communication-based locking. Softw. Pract. Exp. 40(1): 23-43 (2010) - [j22]Jesper Larsson Träff, William D. Gropp, Rajeev Thakur:
Self-Consistent MPI Performance Guidelines. IEEE Trans. Parallel Distributed Syst. 21(5): 698-709 (2010) - [c85]James Dinan, Pavan Balaji, Ewing L. Lusk, P. Sadayappan, Rajeev Thakur:
Hybrid parallel programming with MPI and Unified Parallel C. Conf. Computing Frontiers 2010: 177-186 - [c84]David Goodell, Pavan Balaji, Darius Buntinas, Gábor Dózsa, William Gropp, Sameer Kumar, Bronis R. de Supinski, Rajeev Thakur:
Minimizing MPI Resource Contention in Multithreaded Multicore Environments. CLUSTER 2010: 1-8 - [c83]Yong Chen, Xian-He Sun, Rajeev Thakur, Huaiming Song, Hui Jin:
Improving Parallel I/O Performance with Data Layout Awareness. CLUSTER 2010: 302-311 - [c82]Yong Chen, Huaiming Song, Rajeev Thakur, Xian-He Sun:
A layout-aware optimization strategy for collective I/O. HPDC 2010: 360-363 - [c81]Seung Woo Son, Samuel Lang, Philip H. Carns, Robert B. Ross, Rajeev Thakur, Berkin Özisikyilmaz, Prabhat Kumar, Wei-keng Liao, Alok N. Choudhary:
Enabling active storage on parallel I/O software stacks. MSST 2010: 1-12 - [c80]Gábor Dózsa, Sameer Kumar, Pavan Balaji, Darius Buntinas, David Goodell, William Gropp, Joe Ratterman, Rajeev Thakur:
Enabling Concurrent Multithreaded MPI Communication on Multicore Petascale Systems. EuroMPI 2010: 11-20 - [c79]Torsten Hoefler, William Gropp, Rajeev Thakur, Jesper Larsson Träff:
Toward Performance Models of MPI Implementations for Understanding Application Scaling Issues. EuroMPI 2010: 21-30 - [c78]Pavan Balaji, Darius Buntinas, David Goodell, William Gropp, Jayesh Krishna, Ewing L. Lusk, Rajeev Thakur:
PMI: A Scalable Parallel Process-Management Interface for Extreme-Scale Systems. EuroMPI 2010: 31-41 - [c77]Jayesh Krishna, Pavan Balaji, Ewing L. Lusk, Rajeev Thakur, Fabian Tiller:
Implementing MPI on Windows: Comparison with Common Approaches on Unix. EuroMPI 2010: 160-169 - [c76]Wei-Fan Chiang, Grzegorz Szubzda, Ganesh Gopalakrishnan, Rajeev Thakur:
Dynamic Verification of Hybrid Programs. EuroMPI 2010: 298-301
2000 – 2009
- 2009
- [j21]Ping Lai, Pavan Balaji, Rajeev Thakur, Dhabaleswar K. Panda:
ProOnE: a general-purpose protocol onload engine for multi- and many-core architectures. Comput. Sci. Res. Dev. 23(3-4): 133-142 (2009) - [j20]Pavan Balaji, Anthony Chan, Rajeev Thakur, William Gropp, Ewing L. Lusk:
Toward message passing for a million processes: characterizing MPI on a massive scale Blue Gene/P. Comput. Sci. Res. Dev. 24(1-2): 11-19 (2009) - [j19]Rajeev Thakur, William Gropp:
Test suite for evaluating performance of multithreaded MPI communication. Parallel Comput. 35(12): 608-617 (2009) - [c75]Gopalakrishnan Santhanaraman, Pavan Balaji, K. Gopalakrishnan, Rajeev Thakur, William Gropp, Dhabaleswar K. Panda:
Natively Supporting True One-Sided Communication in. CCGRID 2009: 380-387 - [c74]Vinod Tipparaju, William Gropp, Hubert Ritzdorf, Rajeev Thakur, Jesper Larsson Träff:
Investigating High Performance RMA Interfaces for the MPI-3 Standard. ICPP 2009: 293-300 - [c73]