default search action
Guang R. Gao
Guang Rong Gao
Person information
- affiliation: University of Delaware, Newark, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [c228]Siddhisanket Raskar, Thomas Applencourt, Kalyan Kumaran, Guang R. Gao:
Codelet Pipe: Realization of Dataflow Software Pipelining for Extended Codelet Model. ICPP Workshops 2023: 127-135 - 2022
- [j70]Tongsheng Geng, Marcos Amaris, Stéphane Zuckerman, Alfredo Goldman, Guang R. Gao, Jean-Luc Gaudiot:
A Profile-Based AI-Assisted Dynamic Scheduling Approach for Heterogeneous Architectures. Int. J. Parallel Program. 50(1): 115-151 (2022) - [j69]Joshua Suetterlein, Joseph B. Manzano, Andres Marquez, Guang R. Gao:
Extending an asynchronous runtime system for high throughput applications: A case study. J. Parallel Distributed Comput. 163: 214-231 (2022) - [c227]Jose Manuel Monsalve Diaz, Kevin Harms, Rafael A. Herrera Guaitero, Diego A. Roa Perdomo, Kalyan Kumaran, Guang R. Gao:
The SuperCodelet architecture. ExHET@PPOPP 2022: 2:1-2:6 - 2021
- [j68]Guangming Tan, Guang R. Gao:
Guest Editorial: Special issue on Network and Parallel Computing for Emerging Architectures and Applications. Int. J. Parallel Program. 49(5): 625-627 (2021) - [j67]Mingfan Li, Han Lin, Junshi Chen, Jose Monsalve Diaz, Qian Xiao, Rongfen Lin, Fei Wang, Guang R. Gao, Hong An:
swFLOW: A large-scale distributed framework for deep learning on Sunway TaihuLight supercomputer. Inf. Sci. 570: 831-847 (2021) - [c226]Shiyang Chen, Shaoyi Huang, Santosh Pandey, Bingbing Li, Guang R. Gao, Long Zheng, Caiwen Ding, Hang Liu:
E.T.: re-thinking self-attention for transformer models on GPUs. SC 2021: 25 - [i1]Shaoshan Liu, Yuhao Zhu, Bo Yu, Jean-Luc Gaudiot, Guang R. Gao:
The Promise of Dataflow Architectures in the Design of Processing Systems for Autonomous Machines. CoRR abs/2109.07047 (2021) - 2020
- [c225]Joshua Suetterlein, Joseph B. Manzano, Andres Marquez, Guang R. Gao:
On the Marriage of Asynchronous Many Task Runtimes and Big Data: A Glance. HiPC 2020: 233-242 - [c224]Tongsheng Geng, Marcos Amaris, Stéphane Zuckerman, Alfredo Goldman, Guang R. Gao, Jean-Luc Gaudiot:
PDAWL: Profile-Based Iterative Dynamic Adaptive WorkLoad Balance on Heterogeneous Architectures. JSSPP 2020: 145-162 - [c223]Diego A. Roa Perdomo, Ryan Kabrick, Jose Manuel Monsalve Diaz, Siddhisanket Raskar, Dawson Fox, Guang R. Gao:
DEMAC: A Modular Platform for HW-SW Co-Design. IPDRM@SC 2020: 25-32 - [c222]Ryan Kabrick, Diego A. Roa Perdomo, Siddhisanket Raskar, Jose Manuel Monsalve Diaz, Dawson Fox, Guang R. Gao:
CODIR: Towards an MLIR Codelet Model Dialect. IPDRM@SC 2020: 33-40
2010 – 2019
- 2019
- [j66]Guangming Tan, Guang R. Gao:
Editorial for the special issue on innovations in supercomputing techniques. CCF Trans. High Perform. Comput. 1(2): 61-62 (2019) - [c221]Han Lin, Zeng Lin, Jose Monsalve Diaz, Mingfan Li, Hong An, Guang R. Gao:
swFLOW: A Dataflow Deep Learning Framework on Sunway TaihuLight Supercomputer. HPCC/SmartCity/DSS 2019: 2467-2475 - 2018
- [j65]Xiangyu Ju, Quan Chen, Zhenning Wang, Minyi Guo, Guang R. Gao:
DCF: A Dataflow-Based Collaborative Filtering Training Algorithm. Int. J. Parallel Program. 46(4): 686-698 (2018) - 2017
- [j64]Yao Wu, Long Zheng, Brian Heilig, Guang R. Gao:
HAMR: A dataflow-based real-time in-memory cluster computing engine. Int. J. High Perform. Comput. Appl. 31(5): 361-374 (2017) - [j63]Peng Qu, Jin Yan, Youhui Zhang, Guang R. Gao:
Parallel Turing Machine, a Proposal. J. Comput. Sci. Technol. 32(2): 269-285 (2017) - [j62]Jaime Arteaga, Stéphane Zuckerman, Guang R. Gao:
Generating Fine-Grain Multithreaded Applications Using a Multigrain Approach. ACM Trans. Archit. Code Optim. 14(4): 47:1-47:26 (2017) - [c220]Joshua Landwehr, Joshua Suetterlein, Joseph B. Manzano, Andrès Márquez, Kevin J. Barker, Guang R. Gao:
Designing Scalable Distributed Memory Models: A Case Study. Conf. Computing Frontiers 2017: 174-182 - [c219]Fateme S. Hosseini, Pouya Fotouhi, Chengmo Yang, Guang R. Gao:
Leveraging Compiler Optimizations to Reduce Runtime Fault Recovery Overhead. DAC 2017: 20:1-20:6 - [c218]Hoda Aghaei Khouzani, Pouya Fotouhi, Chengmo Yang, Guang R. Gao:
Leveraging access port positions to accelerate page table walk in DWM-based main memory. DATE 2017: 1450-1455 - [c217]Jaime Arteaga Molina, Stéphane Zuckerman, Guang R. Gao:
Multigrain Parallelism: Bridging Coarse-Grain Parallel Programs and Fine-Grain Event-Driven Multithreading. IPDPS 2017: 799-808 - [c216]Joshua Suetterlein, Joshua Landwehr, Andres Marquez, Joseph B. Manzano, Kevin J. Barker, Guang R. Gao:
Verification of the Extended Roofline Model for Asynchronous Many Task Runtimes. ESPM2@SC 2017: 6:1-6:8 - 2016
- [j61]Daniel A. Orozco, Elkin Garcia, Robert S. Pavel, Jaime Arteaga, Guang R. Gao:
The Design and Implementation of TIDeFlow: A Dataflow-Inspired Execution Model for Parallel Loops and Task Pipelining. Int. J. Parallel Program. 44(2): 278-307 (2016) - [c215]Joshua Landwehr, Joshua Suetterlein, Andres Marquez, Joseph B. Manzano, Guang R. Gao:
Application characterization at scale: lessons learned from developing a distributed open community runtime system for high performance computing. Conf. Computing Frontiers 2016: 164-171 - [c214]Joshua D. Suetterlein, Joshua Landwehr, Andrès Márquez, Joseph B. Manzano, Guang R. Gao:
Extending the Roofline Model for Asynchronous Many-Task Runtimes. CLUSTER 2016: 493-496 - [c213]Joshua Suetterlein, Joshua Landwehr, Andrès Márquez, Joseph B. Manzano, Guang R. Gao:
Asynchronous Runtimes in Action: An Introspective Framework for a Next Gen Runtime. IPDPS Workshops 2016: 1744-1751 - [c212]Kelly Livingston, Aaron Myles Landwehr, José Monsalve Diaz, Stéphane Zuckerman, Benoît Meister, Guang R. Gao:
Energy Avoiding Matrix Multiply. LCPC 2016: 55-70 - [c211]Tongsheng Geng, Stéphane Zuckerman, José Monsalve Diaz, Alfredo Goldman, Sami Habib, Jean-Luc Gaudiot, Guang R. Gao:
The Importance of Efficient Fine-Grain Synchronization for Many-Core Systems. LCPC 2016: 203-217 - [c210]Peng Qu, Jin Yan, Guang R. Gao:
Toward a Parallel Turing Machine Model. NPC 2016: 191-204 - [e10]Guang R. Gao, Depei Qian, Xinbo Gao, Barbara M. Chapman, Wenguang Chen:
Network and Parallel Computing - 13th IFIP WG 10.3 International Conference, NPC 2016, Xi'an, China, October 28-29, 2016, Proceedings. Lecture Notes in Computer Science 9966, 2016, ISBN 978-3-319-47098-6 [contents] - 2015
- [j60]R. Govindarajan, Guang R. Gao:
Author Rebuttal to Rocha et al. "Comments on Minimizing Buffer Requirements under Rate-Optimal Schedule in Regular Dataflow Networks". J. Signal Process. Syst. 81(1): 135-136 (2015) - [c209]Sunil Shrestha, Guang R. Gao, Joseph B. Manzano, Andrès Márquez, John Feo:
Locality aware concurrent start for stencil applications. CGO 2015: 157-166 - [c208]Haitao Wei, Guang R. Gao, Elkin Garcia:
Energy efficient multi-level tiling for dense matrix multiplication on many-core architecture. IGSC 2015: 1-6 - [c207]Sunil Shrestha, Joseph B. Manzano, Andrès Márquez, Stéphane Zuckerman, Shuaiwen Song, Guang R. Gao:
Gregarious Data Re-structuring in a Many Core Architecture. HPCC/CSS/ICESS 2015: 712-720 - [c206]Xiaoming Li, Jack B. Dennis, Guang R. Gao, Willie Y.-P. Lim, Haitao Wei, Chao Yang, Robert S. Pavel:
FreshBreeze: A Data Flow Approach for Meeting DDDAS Challenges. ICCS 2015: 2573-2582 - [c205]Sam Kaplan, Sergio Pino, Aaron Myles Landwehr, Guang R. Gao:
Landing Containment Domains on SWARM: Toward a Robust Resiliency Solution on a Dynamic Adaptive Runtime Machine. PARCO 2015: 753-761 - [c204]Yao Wu, Long Zheng, Brian Heilig, Guang R. Gao:
Design and evaluation of a novel dataflow based bigdata solution. PMAM@PPoPP 2015: 40-48 - 2014
- [j59]Roberto Giorgi, Rosa M. Badia, François Bodin, Albert Cohen, Paraskevas Evripidou, Paolo Faraboschi, Bernhard Fechner, Guang R. Gao, Arne Garbade, Rahulkumar Gayatri, Sylvain Girbal, Daniel Goodman, Behram Khan, Souad Koliai, Joshua Landwehr, Nhat Minh Lê, Feng Li, Mikel Luján, Avi Mendelson, Laurent Morin, Nacho Navarro, Tomasz Patejko, Antoniu Pop, Pedro Trancoso, Theo Ungerer, Ian Watson, Sebastian Weis, Stéphane Zuckerman, Mateo Valero:
TERAFLUX: Harnessing dataflow in next generation teradevices. Microprocess. Microsystems 38(8): 976-990 (2014) - [c203]Haitao Wei, Stéphane Zuckerman, Xiaoming Li, Guang R. Gao:
A Dataflow Programming Language and its Compiler for Streaming Systems. ICCS 2014: 1289-1298 - [c202]Andres Marquez, Joseph B. Manzano, Shuaiwen Leon Song, Benoît Meister, Sunil Shrestha, Thomas St. John, Guang R. Gao:
ACDT: Architected Composite Data Types trading-in unfettered data access for improved execution. ICPADS 2014: 289-297 - [c201]Jaime Arteaga, Stéphane Zuckerman, Elkin Garcia, Guang R. Gao:
Position Paper: Locality-Driven Scheduling of Tasks for Data-Dependent Multithreading. IPDPS Workshops 2014: 1363-1367 - [c200]Sunil Shrestha, Joseph B. Manzano, Andrès Márquez, John Feo, Guang R. Gao:
Jagged Tiling for Intra-tile Parallelism and Fine-Grain Multithreading. LCPC 2014: 161-175 - 2013
- [j58]Haitao Wei, Mingkang Qin, Weiwei Zhang, Junqing Yu, Dongrui Fan, Guang R. Gao:
StreamTMC: Stream compilation for tiled multi-core architectures. J. Parallel Distributed Comput. 73(4): 484-494 (2013) - [c199]Elkin Garcia, Guang R. Gao:
Strategies for improving performance and energy efficiency on a many-core. Conf. Computing Frontiers 2013: 9:1-9:4 - [c198]Marco Solinas, Rosa M. Badia, François Bodin, Albert Cohen, Paraskevas Evripidou, Paolo Faraboschi, Bernhard Fechner, Guang R. Gao, Arne Garbade, Sylvain Girbal, Daniel Goodman, Behram Khan, Souad Koliai, Feng Li, Mikel Luján, Laurent Morin, Avi Mendelson, Nacho Navarro, Antoniu Pop, Pedro Trancoso, Theo Ungerer, Mateo Valero, Sebastian Weis, Ian Watson, Stéphane Zuckerman, Roberto Giorgi:
The TERAFLUX Project: Exploiting the DataFlow Paradigm in Next Generation Teradevices. DSD 2013: 272-279 - [c197]Joshua Suetterlein, Stéphane Zuckerman, Guang R. Gao:
An Implementation of the Codelet Model. Euro-Par 2013: 633-644 - [c196]Aaron Myles Landwehr, Stéphane Zuckerman, Guang R. Gao:
Toward a Self-aware System for Exascale Architectures. Euro-Par Workshops 2013: 812-822 - [c195]Elkin Garcia, Daniel A. Orozco, Rishi Khan, Ioannis E. Venetis, Kelly Livingston, Guang R. Gao:
A dynamic schema to increase performance in many-core architectures through percolation operations. HiPC 2013: 276-285 - [c194]Chen Chen, Yao Wu, Stéphane Zuckerman, Guang R. Gao:
Towards Memory-Load Balanced Fast Fourier Transformations in Fine-Grain Execution Models. IPDPS Workshops 2013: 1607-1617 - [c193]Elkin Garcia, Jaime Arteaga, Robert S. Pavel, Guang R. Gao:
Optimizing the LU Factorization for Energy Efficiency on a Many-Core Architecture. LCPC 2013: 237-251 - [c192]Chen Chen, Yao Wu, Joshua Suetterlein, Long Zheng, Minyi Guo, Guang R. Gao:
Automatic Locality Exploitation in the Codelet Model. TrustCom/ISPA/IUCC 2013: 853-862 - 2012
- [j57]Daniel A. Orozco, Elkin Garcia, Rishi Khan, Kelly Livingston, Guang R. Gao:
Toward high-throughput algorithms on many-core architectures. ACM Trans. Archit. Code Optim. 8(4): 49:1-49:21 (2012) - [j56]Haitao Wei, Junqing Yu, Huafei Yu, Mingkang Qin, Guang R. Gao:
Software Pipelining for Stream Programs on Resource Constrained Multicore Architectures. IEEE Trans. Parallel Distributed Syst. 23(12): 2338-2350 (2012) - [c191]Elkin Garcia, Daniel A. Orozco, Rishi Khan, Ioannis E. Venetis, Kelly Livingston, Guang R. Gao:
Dynamic percolation: a case of study on the shortcomings of traditional optimization in many-core architectures. Conf. Computing Frontiers 2012: 245-248 - [c190]Juergen Ributzka, Joseph B. Manzano, Guang R. Gao:
The Role of Non-strict Fine-grain Synchronization. High Performance Computing Workshop (2) 2012: 121-140 - [c189]Elkin Garcia, Daniel A. Orozco, Robert S. Pavel, Guang R. Gao:
A Discussion in Favor of Dynamic Scheduling for Regular Applications in Many-core Architectures. IPDPS Workshops 2012: 1591-1600 - [c188]Daniel A. Orozco, Elkin Garcia, Robert S. Pavel, Orlando Ayala, Lian-Ping Wang, Guang R. Gao:
Demystifying Performance Predictions of Distributed FFT3D Implementations. NPC 2012: 196-207 - [c187]Tom St. John, Jack B. Dennis, Guang R. Gao:
Massively parallel breadth first search using a tree-structured memory model. PMAM 2012: 115-123 - 2011
- [j55]Jack B. Dennis, Guang R. Gao, Xiao X. Meng:
Experiments with the Fresh Breeze tree-based memory model. Comput. Sci. Res. Dev. 26(3-4): 325-337 (2011) - [j54]Guangming Tan, Vugranam C. Sreedhar, Guang R. Gao:
Analysis and performance results of computing betweenness centrality on IBM Cyclops64. J. Supercomput. 56(1): 1-24 (2011) - [c186]Long Chen, Oreste Villa, Guang R. Gao:
Exploring Fine-Grained Task-Based Execution on Multi-GPU Systems. CLUSTER 2011: 386-394 - [c185]Yonghong Yan, Sanjay Chatterjee, Daniel A. Orozco, Elkin Garcia, Zoran Budimlic, Jun Shirako, Robert S. Pavel, Guang R. Gao, Vivek Sarkar:
Hardware and Software Tradeoffs for Task Synchronization on Manycore Architectures. Euro-Par (2) 2011: 112-123 - [c184]Juergen Ributzka, Yuhei Hayashi, Fei Chen, Guang R. Gao:
DEEP: an iterative fpga-based many-core emulation system for chip verification and architecture research. FPGA 2011: 115-118 - [c183]Murat Bolat, Kirk Kelsey, Xiaoming Li, Guang R. Gao:
Source Code Partitioning in Program Optimization. ICPADS 2011: 56-63 - [c182]Juergen Ributzka, Yuhei Hayashi, Joseph B. Manzano, Guang R. Gao:
The elephant and the mice: the role of non-strict fine-grain synchronization for modern many-core architectures. ICS 2011: 338-347 - [c181]Joseph B. Manzano, Ge Gan, Juergen Ributzka, Sunil Shrestha, Guang R. Gao:
OPELL and PM: A Case Study on Porting Shared Memory Programming Models to Accelerators Architectures. LCPC 2011: 106-123 - [c180]Daniel A. Orozco, Elkin Garcia, Robert S. Pavel, Rishi Khan, Guang R. Gao:
Polytasks: A Compressed Task Representation for HPC Runtimes. LCPC 2011: 268-282 - [c179]Jack B. Dennis, Guang R. Gao, Xiao X. Meng, Brian Lucas, Joshua Slocum:
The Fresh Breeze Program Execution Model. PARCO 2011: 335-342 - [e9]Guang R. Gao, Yu-Chee Tseng:
International Conference on Parallel Processing, ICPP 2011, Taipei, Taiwan, September 13-16, 2011. IEEE Computer Society 2011, ISBN 978-1-4577-1336-1 [contents] - 2010
- [c178]Haitao Wei, Junqing Yu, Huafei Yu, Guang R. Gao:
Minimizing communication in rate-optimal software pipelining for stream programs. CGO 2010: 210-217 - [c177]Elkin Garcia, Ioannis E. Venetis, Rishi Khan, Guang R. Gao:
Optimized Dense Matrix Multiplication on a Many-Core Architecture. Euro-Par (2) 2010: 316-327 - [c176]Chen Chen, Joseph B. Manzano, Ge Gan, Guang R. Gao, Vivek Sarkar:
A Study of a Software Cache Implementation of the OpenMP Memory Model for Multicore and Manycore Architectures. Euro-Par (2) 2010: 341-352 - [c175]Long Chen, Oreste Villa, Sriram Krishnamoorthy, Guang R. Gao:
Dynamic load balancing on single- and multi-GPU systems. IPDPS 2010: 1-12 - [c174]Handong Ye, Robert S. Pavel, Aaron Myles Landwehr, Guang R. Gao:
TiNy threads on BlueGene/P: Exploring many-core parallelisms beyond The traditional OS. IPDPS Workshops 2010: 1-8 - [c173]Daniel A. Orozco, Elkin Garcia, Guang R. Gao:
Locality Optimization of Stencil Applications Using Data Dependency Graphs. LCPC 2010: 77-91 - [c172]Long Chen, Guang R. Gao:
Performance analysis of Cooley-Tukey FFT algorithms for a many-core architecture. SpringSim 2010: 81 - [e8]Guang R. Gao, Lori L. Pollock, John Cavazos, Xiaoming Li:
Languages and Compilers for Parallel Computing, 22nd International Workshop, LCPC 2009, Newark, DE, USA, October 8-10, 2009, Revised Selected Papers. Lecture Notes in Computer Science 5898, Springer 2010, ISBN 978-3-642-13373-2 [contents]
2000 – 2009
- 2009
- [j53]Guangming Tan, Ninghui Sun, Guang R. Gao:
Improving Performance of Dynamic Programming via Parallelism and Locality on Multicore Architectures. IEEE Trans. Parallel Distributed Syst. 20(2): 261-274 (2009) - [c171]Ioannis E. Venetis, Guang R. Gao:
Mapping the LU decomposition on a many-core architecture: challenges and solutions. Conf. Computing Frontiers 2009: 71-80 - [c170]Ge Gan, Xu Wang, Joseph B. Manzano, Guang R. Gao:
Tile Percolation: An OpenMP Tile Aware Parallelization Technique for the Cyclops-64 Multicore Processor. Euro-Par 2009: 839-850 - [c169]Daniel A. Orozco, Guang R. Gao:
Mapping the FDTD Application to Many-Core Chip Architectures. ICPP 2009: 309-316 - [c168]Alejandro Segovia, Xiaoming Li, Guang R. Gao:
Iterative layer-based raytracing on CUDA. IPCCC 2009: 248-255 - [c167]Ge Gan, Xu Wang, Joseph B. Manzano, Guang R. Gao:
Tile Reduction: The First Step towards Tile Aware Parallelization in OpenMP. IWOMP 2009: 140-153 - 2008
- [j52]Guang R. Gao, Mitsuhisa Sato, Eduard Ayguadé:
Guest Editors Introduction: Special Issue on OpenMP. Int. J. Parallel Program. 36(3): 287-288 (2008) - [j51]Mihailo Kaplarevic, Alison E. Murray, Stephen C. Cary, Guang R. Gao:
Engenius - Environmental genome Informational Utility System. J. Bioinform. Comput. Biol. 6(6): 1193-1211 (2008) - [j50]Hongbo Rong, Alban Douillet, Guang R. Gao:
Register allocation for software pipelined multidimensional loops. ACM Trans. Program. Lang. Syst. 30(4): 23:1-23:68 (2008) - [c166]Sun C. Chan, Guang R. Gao, Barbara M. Chapman, T. Linthicum, A. Dasgupta:
Open64 compiler infrastructure for emerging multicore/manycore architecture All Symposium Tutorial. IPDPS 2008: 1 - [c165]Yuan Zhang, Vugranam C. Sreedhar, Weirong Zhu, Vivek Sarkar, Guang R. Gao:
Minimum Lock Assignment: A Method for Exploiting Concurrency among Critical Sections. LCPC 2008: 141-155 - [c164]Guangming Tan, Vugranam C. Sreedhar, Guang R. Gao:
Just-In-Time Locality and Percolation for Optimizing Irregular Applications on a Manycore Architecture. LCPC 2008: 331-342 - [c163]Guangming Tan, Dongrui Fan, Junchao Zhang, Andrew Russo, Guang R. Gao:
Experience on optimizing irregular computation for memory hierarchy in manycore architecture. PPoPP 2008: 279-280 - [e7]Barbara M. Chapman, Weimin Zheng, Guang R. Gao, Mitsuhisa Sato, Eduard Ayguadé, Dongsheng Wang:
A Practical Programming Model for the Multi-Core Era, 3rd International Workshop on OpenMP, IWOMP 2007, Beijing, China, June 3-7, 2007, Proceedings. Lecture Notes in Computer Science 4935, Springer 2008, ISBN 978-3-540-69302-4 [contents] - 2007
- [j49]Weirong Zhu, Yanwei Niu, Guang R. Gao:
Performance portability on EARTH: a case study across several parallel architectures. Clust. Comput. 10(2): 115-126 (2007) - [j48]Hongbo Rong, Zhizhong Tang, Ramaswamy Govindarajan, Alban Douillet, Guang R. Gao:
Single-dimension software pipelining for multidimensional loops. ACM Trans. Archit. Code Optim. 4(1): 7 (2007) - [c162]Alban Douillet, Guang R. Gao:
Software-Pipelining on Multi-Core Architectures. PACT 2007: 39-48 - [c161]Long Chen, Ziang Hu, Junmin Lin, Guang R. Gao:
Optimizing the Fast Fourier Transform on a Multi-core Architecture. IPDPS 2007: 1-8 - [c160]Ge Gan, Ziang Hu, Juan del Cuvillo, Guang R. Gao:
Exploring a Multithreaded Methodology to Implement a Network Communication Protocol on the Cyclops-64 Multithreaded Architecture. IPDPS 2007: 1-8 - [c159]Guang R. Gao, Thomas L. Sterling, Rick Stevens, Mark Hereld, Weirong Zhu:
ParalleX: A Study of A New Parallel Computation Model. IPDPS 2007: 1-6 - [c158]Daniel A. Orozco, Liping Xue, Murat Bolat, Xiaoming Li, Guang R. Gao:
Experience of Optimizing FFT on Intel Architectures. IPDPS 2007: 1-8 - [c157]Haiping Wu, Eunjung Park, Mihailo Kaplarevic, Yingping Zhang, Murat Bolat, Xiaoming Li, Guang R. Gao:
Automatic Program Segment Similarity Detection in Targeted Program Performance Improvement. IPDPS 2007: 1-8 - [c156]Weirong Zhu, Ziang Hu, Guang R. Gao:
On the Role of Deterministic Fine-Grain Data Synchronization for Scientific Applications: A Revisit in the Emerging Many-Core Era. IPDPS 2007: 1-8 - [c155]Weirong Zhu, Ziang Hu, Guang R. Gao:
On the Role of Deterministic Fine-Grain Data Synchronization for Scientific Applications: A Revisit in the Emerging Many-Core Era. IPDPS 2007: 1-8 - [c154]Weirong Zhu, Vugranam C. Sreedhar, Ziang Hu, Guang R. Gao:
Synchronization state buffer: supporting efficient fine-grain synchronization on many-core architectures. ISCA 2007: 35-45 - [c153]Yuan Zhang, Evelyn Duesterwald, Guang R. Gao:
Concurrency Analysis for Shared Memory Programs with Textually Unaligned Barriers. LCPC 2007: 95-109 - [c152]Guang R. Gao:
On Parallel Models of Computation. NPC 2007: 541 - [c151]Yuan Zhang, Vugranam C. Sreedhar, Weirong Zhu, Vivek Sarkar, Guang R. Gao:
Optimized lock assignment and allocation: a method for exploiting concurrency among critical sections. PPoPP 2007: 146-147 - [c150]Peiheng Zhang, Guangming Tan, Guang R. Gao:
Implementation of the Smith-Waterman algorithm on a reconfigurable supercomputing platform. HPRCTA 2007: 39-48 - [c149]Guangming Tan, Ninghui Sun, Guang R. Gao:
A parallel dynamic programming algorithm on a multi-core architecture. SPAA 2007: 135-144 - 2006
- [c148]Guang R. Gao:
The Era of Multi-core Chips -A Fresh Look on Software Challenges. Asia-Pacific Computer Systems Architecture Conference 2006: 1 - [c147]Juan del Cuvillo, Weirong Zhu, Guang R. Gao:
Landing openMP on cyclops-64: an efficient mapping of openMP to a many-core system-on-a-chip. Conf. Computing Frontiers 2006: 41-50 - [c146]Ziang Hu, Juan del Cuvillo, Weirong Zhu, Guang R. Gao:
Optimization of Dense Matrix Multiplication on IBM Cyclops-64: Challenges and Experiences. Euro-Par 2006: 134-144 - [c145]Alban Douillet, Hongbo Rong, Guang R. Gao:
Multi-dimensional Kernel Generation for Loop Nest Software Pipelining. Euro-Par 2006: 311-322 - [c144]Juan del Cuvillo, Weirong Zhu, Ziang Hu, Guang R. Gao:
Toward a Software Infrastructure for the Cyclops-64 Cellular Architecture. HPCS 2006: 9 - [c143]Guang R. Gao, Thomas L. Sterling, Rick L. Stevens, Mark Hereld, Weirong Zhu:
Hierarchical multithreading: programming model and system software. IPDPS 2006 - [c142]Yingping Zhang, Taikyeong Jeong, Fei Chen, Haiping Wu, Ronny Nitzsche, Guang R. Gao:
A study of the on-chip interconnection network for the IBM Cyclops64 multi-core architecture. IPDPS 2006 - [c141]