


John D. Owens
Person information

- affiliation: University of California, Davis, US
2020 – today
- 2023
- [c99]Toluwanimi O. Odemuyiwa, Hadi Asghari Moghaddam, Michael Pellauer, Kartik Hegde, Po-An Tsai, Neal Clayton Crago, Aamer Jaleel, John D. Owens, Edgar Solomonik, Joel S. Emer, Christopher W. Fletcher:
Accelerating Sparse Data Orchestration via Dynamic Reflexive Tiling. ASPLOS (3) 2023: 18-32 - [c98]Toluwanimi O. Odemuyiwa, Hadi Asghari Moghaddam, Michael Pellauer, Kartik Hegde, Po-An Tsai, Neal Clayton Crago, Aamer Jaleel, John D. Owens, Edgar Solomonik, Joel S. Emer, Christopher W. Fletcher:
Accelerating Sparse Data Orchestration via Dynamic Reflexive Tiling (Extended Abstract). HOPC@SPAA 2023: 15-16 - [c97]Afton Geil, Serban D. Porumbescu, John D. Owens:
Maximum Clique Enumeration on the GPU. IPDPS Workshops 2023: 234-244 - [c96]Jonathan D. Wapman, Sean Treichler, Serban D. Porumbescu, John D. Owens:
Harmonic CUDA: Asynchronous Programming on GPUs. PMAM@PPoPP 2023: 39-49 - [c95]Muhammad Osama, Serban D. Porumbescu, John D. Owens:
A Programming Model for GPU Load Balancing. PPoPP 2023: 79-91 - [c94]Muhammad Osama, Duane Merrill, Cris Cecka, Michael Garland, John D. Owens:
Stream-K: Work-Centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU. PPoPP 2023: 429-431 - [i33]Muhammad Osama, Duane Merrill, Cris Cecka, Michael Garland, John D. Owens:
Stream-K: Work-centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU. CoRR abs/2301.03598 (2023) - [i32]Muhammad Osama, Serban D. Porumbescu, John D. Owens:
A Programming Model for GPU Load Balancing. CoRR abs/2301.04792 (2023) - [i31]Matthew Drescher, Muhammad A. Awad, Serban D. Porumbescu, John D. Owens:
BOBA: A Parallel Lightweight Graph Reordering Algorithm with Heavyweight Implications. CoRR abs/2306.10410 (2023) - 2022
- [j34]Kerry A. Seitz, Theresa Foley, Serban D. Porumbescu, John D. Owens:
Supporting Unified Shader Specialization by Co-opting C++ Features. Proc. ACM Comput. Graph. Interact. Tech. 5(3): 25:1-25:17 (2022) - [j33]Carl Yang, Aydin Buluç, John D. Owens:
GraphBLAST: A High-Performance Linear Algebra-based Graph Framework on the GPU. ACM Trans. Math. Softw. 48(1): 1:1-1:51 (2022) - [c93]Muhammad A. Awad, Serban D. Porumbescu, John D. Owens:
A GPU Multiversion B-Tree. PACT 2022: 481-493 - [c92]Zhongyi Lin, Louis Feng, Ehsan K. Ardestani, Jaewon Lee, John Lundell, Changkyu Kim, Arun Kejariwal, John D. Owens:
Building a Performance Model for Deep Learning Recommendation Model Training on GPUs. HIPC 2022: 48-58 - [c91]Yuxin Chen, Benjamin Brock, Serban D. Porumbescu, Aydin Buluç, Katherine A. Yelick, John D. Owens:
Atos: A Task-Parallel GPU Scheduler for Graph Analytics. ICPP 2022: 50:1-50:11 - [c90]Muhammad Osama, Serban D. Porumbescu, John D. Owens:
Essentials of Parallel Graph Analytics. IPDPS Workshops 2022: 314-317 - [c89]Zhongyi Lin, Louis Feng, Ehsan K. Ardestani, Jaewon Lee, John Lundell, Changkyu Kim, Arun Kejariwal, John D. Owens:
Building a Performance Model for Deep Learning Recommendation Model Training on GPUs. ISPASS 2022: 227-229 - [c88]Yuxin Chen, Benjamin Brock, Serban D. Porumbescu, Aydin Buluç, Katherine A. Yelick, John D. Owens:
Scalable Irregular Parallelism with GPUs: Getting CPUs Out of the Way. SC 2022: 50:1-50:16 - [i30]Zhongyi Lin, Louis Feng, Ehsan K. Ardestani, Jaewon Lee, John Lundell, Changkyu Kim, Arun Kejariwal, John D. Owens:
Building a Performance Model for Deep Learning Recommendation Model Training on GPUs. CoRR abs/2201.07821 (2022) - [i29]Muhammad Osama, Serban D. Porumbescu, John D. Owens:
Essentials of Parallel Graph Analytics. CoRR abs/2212.08200 (2022) - 2021
- [j32]Ahmed H. Mahmoud, Serban D. Porumbescu, John D. Owens:
RXMesh: a GPU mesh data structure. ACM Trans. Graph. 40(4): 104:1-104:16 (2021) - [c87]Zhongyi Lin, Evangelos Georganas, John D. Owens:
Towards Flexible and Compiler-Friendly Layer Fusion for CNNs on Multicore CPUs. Euro-Par 2021: 232-248 - [i28]Muhammad A. Awad, Saman Ashkiani, Serban D. Porumbescu, Martin Farach-Colton, John D. Owens:
Better GPU Hash Tables. CoRR abs/2108.07232 (2021) - [i27]Kerry A. Seitz Jr., Theresa Foley, Serban D. Porumbescu, John D. Owens:
Unified Shader Programming in C++. CoRR abs/2109.14682 (2021) - [i26]Yuxin Chen, Benjamin Brock, Serban D. Porumbescu, Aydin Buluç, Katherine A. Yelick, John D. Owens:
Atos: A Task-Parallel GPU Dynamic Scheduling Framework for Dynamic Irregular Computations. CoRR abs/2112.00132 (2021) - 2020
- [j31]Ahmed Abdelkader, Chandrajit L. Bajaj, Mohamed S. Ebeida, Ahmed H. Mahmoud, Scott A. Mitchell, John D. Owens, Ahmad A. Rushdi:
VoroCrust: Voronoi Meshing Without Clipping. ACM Trans. Graph. 39(3): 23:1-23:16 (2020) - [c86]Muhammad A. Awad, Saman Ashkiani, Serban D. Porumbescu, John D. Owens:
Dynamic Graphs on the GPU. IPDPS 2020: 739-748 - [c85]Weitang Liu, Xiaoyun Wang, John D. Owens, Yixuan Li:
Energy-based Out-of-distribution Detection. NeurIPS 2020 - [i25]Leyuan Wang, John D. Owens:
Fast Gunrock Subgraph Matching (GSM) on GPUs. CoRR abs/2003.01527 (2020) - [i24]Weitang Liu, Xiaoyun Wang, John D. Owens, Yixuan Li:
Energy-based Out-of-distribution Detection. CoRR abs/2010.03759 (2020)
2010 – 2019
- 2019
- [j30]Zhongyi Lin, Matthew Yih, Jeffrey M. Ota, John D. Owens, Pinar Muyan-Özçelik:
Benchmarking Deep Learning Frameworks and Investigating FPGA Deployment for Traffic Sign Classification and Detection. IEEE Trans. Intell. Veh. 4(3): 385-395 (2019) - [j29]Kerry A. Seitz Jr., Theresa Foley, Serban D. Porumbescu, John D. Owens:
Staged metaprogramming for shader system development. ACM Trans. Graph. 38(6): 202:1-202:15 (2019) - [c84]Xiaoyun Wang, Zhongyi Lin, Carl Yang, John D. Owens:
Accelerating DNN Inference with GraphBLAS and the GPU. HPEC 2019: 1-6 - [c83]Leyuan Wang, John D. Owens:
Fast BFS-Based Triangle Counting on GPUs. HPEC 2019: 1-6 - [c82]Muhammad Osama, Minh Truong, Carl Yang, Aydin Buluç, John D. Owens:
Graph Coloring on the GPU. IPDPS Workshops 2019: 231-240 - [c81]Muhammad A. Awad, Saman Ashkiani, Rob Johnson, Martin Farach-Colton, John D. Owens:
Engineering a high-performance GPU B-Tree. PPoPP 2019: 145-157 - [c80]Benjamin A. Brock, Yuxin Chen, Jiakun Yan, John D. Owens, Aydin Buluç, Katherine A. Yelick:
RDMA vs. RPC for Implementing Distributed Data Structures. IA3@SC 2019: 17-22 - [i23]Ahmed Abdelkader, Chandrajit L. Bajaj, Mohamed S. Ebeida, Ahmed H. Mahmoud, Scott A. Mitchell, John D. Owens, Ahmad A. Rushdi:
VoroCrust: Voronoi Meshing Without Clipping. CoRR abs/1902.08767 (2019) - [i22]Carl Yang, Aydin Buluç, John D. Owens:
GraphBLAST: A High-Performance Linear Algebra-based Graph Framework on the GPU. CoRR abs/1908.01407 (2019) - [i21]Leyuan Wang, John D. Owens:
Fast BFS-Based Triangle Counting on GPUs. CoRR abs/1909.02127 (2019) - [i20]Benjamin Brock, Yuxin Chen, Jiakun Yan, John D. Owens, Aydin Buluç, Katherine A. Yelick:
RDMA vs. RPC for Implementing Distributed Data Structures. CoRR abs/1910.02158 (2019) - [i19]Weitang Liu, Lifeng Wei, James Sharpnack, John D. Owens:
Unsupervised Object Segmentation with Explicit Localization Module. CoRR abs/1911.09228 (2019) - 2018
- [j28]John D. Owens:
Technical perspective: Graphs, betweenness centrality, and the GPU. Commun. ACM 61(8): 84 (2018) - [c79]Ahmed Abdelkader, Chandrajit L. Bajaj, Mohamed S. Ebeida, Ahmed H. Mahmoud, Scott A. Mitchell, John D. Owens, Ahmad A. Rushdi:
Sampling Conditions for Conforming Voronoi Meshing by the VoroCrust Algorithm. SoCG 2018: 1:1-1:16 - [c78]Ahmed Abdelkader, Chandrajit L. Bajaj, Mohamed S. Ebeida, Ahmed H. Mahmoud, Scott A. Mitchell, John D. Owens, Ahmad A. Rushdi:
VoroCrust Illustrated: Theory and Challenges (Multimedia Exposition). SoCG 2018: 77:1-77:4 - [c77]Carl Yang, Aydin Buluç, John D. Owens:
Design Principles for Sparse Matrix Multiplication on the GPU. Euro-Par 2018: 672-687 - [c76]Carl Yang, Aydin Buluç, John D. Owens:
Implementing Push-Pull Efficiently in GraphBLAS. ICPP 2018: 89:1-89:11 - [c75]Saman Ashkiani, Martin Farach-Colton, John D. Owens:
A Dynamic Hash Table for the GPU. IPDPS 2018: 419-429 - [c74]Saman Ashkiani, Shengren Li, Martin Farach-Colton, Nina Amenta, John D. Owens:
GPU LSM: A Dynamic Dictionary Data Structure for the GPU. IPDPS 2018: 430-440 - [c73]Afton Geil, Martin Farach-Colton, John D. Owens:
Quotient Filters: Approximate Membership Queries on the GPU. IPDPS 2018: 451-462 - [c72]Yuechao Pan, Roger Pearce, John D. Owens:
Scalable Breadth-First Search on a GPU Cluster. IPDPS 2018: 1090-1101 - [c71]Matthew Yih, Jeffrey M. Ota, John D. Owens, Pinar Muyan-Özçelik:
FPGA versus GPU for Speed-Limit-Sign Recognition. ITSC 2018: 843-850 - [c70]Zhongyi Lin, Jeffrey M. Ota, John D. Owens, Pinar Muyan-Özçelik:
Benchmarking Deep Learning Frameworks with FPGA-suitable Models on a Traffic Sign Dataset. Intelligent Vehicles Symposium 2018: 1197-1203 - [i18]Yuechao Pan, Roger Pearce, John D. Owens:
Scalable Breadth-First Search on a GPU Cluster. CoRR abs/1803.03922 (2018) - [i17]Ahmed Abdelkader, Chandrajit L. Bajaj, Mohamed S. Ebeida, Ahmed H. Mahmoud, Scott A. Mitchell, John D. Owens, Ahmad A. Rushdi:
Sampling Conditions for Conforming Voronoi Meshing by the VoroCrust Algorithm. CoRR abs/1803.06078 (2018) - [i16]Carl Yang, Aydin Buluç, John D. Owens:
Design Principles for Sparse Matrix Multiplication on the GPU. CoRR abs/1803.08601 (2018) - [i15]Carl Yang, Aydin Buluç, John D. Owens:
Implementing Push-Pull Efficiently in GraphBLAS. CoRR abs/1804.03327 (2018) - [i14]Leyuan Wang, Yangzihao Wang, Carl Yang, John D. Owens:
A Comparative Study on Exact Triangle Counting Algorithms on the GPU. CoRR abs/1804.06926 (2018) - [i13]Weitang Liu, Emad Barsoum, John D. Owens:
Object Localization and Motion Transfer learning with Capsules. CoRR abs/1805.07706 (2018) - 2017
- [j27]Ahmed Abdelkader, Ahmed H. Mahmoud, Ahmad A. Rushdi, Scott A. Mitchell, John D. Owens, Mohamed S. Ebeida:
A Constrained Resampling Strategy for Mesh Improvement. Comput. Graph. Forum 36(5): 189-201 (2017) - [j26]Pinar Muyan-Özçelik, John D. Owens:
Methods for multitasking among real-time embedded compute tasks running on the GPU. Concurr. Comput. Pract. Exp. 29(15) (2017) - [j25]Saman Ashkiani, Andrew A. Davidson, Ulrich Meyer, John D. Owens:
GPU Multisplit: An Extended Study of a Parallel Algorithm. ACM Trans. Parallel Comput. 4(1): 2:1-2:44 (2017) - [j24]Yangzihao Wang, Yuechao Pan, Andrew A. Davidson, Yuduo Wu, Carl Yang, Leyuan Wang, Muhammad Osama, Chenshan Yuan, Weitang Liu, Andy T. Riffel, John D. Owens:
Gunrock: GPU Graph Analytics. ACM Trans. Parallel Comput. 4(1): 3:1-3:49 (2017) - [c69]Yuechao Pan, Yangzihao Wang, Yuduo Wu, Carl Yang, John D. Owens:
Multi-GPU Graph Analytics. IPDPS 2017: 479-490 - [c68]Yangzihao Wang, Sean Baxter, John D. Owens:
Mini-Gunrock: A Lightweight Graph Analytics Framework on the GPU. IPDPS Workshops 2017: 616-626 - [i12]Yangzihao Wang, Yuechao Pan, Andrew A. Davidson, Yuduo Wu, Carl Yang, Leyuan Wang, Muhammad Osama, Chenshan Yuan, Weitang Liu, Andy T. Riffel, John D. Owens:
Gunrock: GPU Graph Analytics. CoRR abs/1701.01170 (2017) - [i11]Saman Ashkiani, Andrew A. Davidson, Ulrich Meyer, John D. Owens:
GPU Multisplit. CoRR abs/1701.01189 (2017) - [i10]Saman Ashkiani, Shengren Li, Martin Farach-Colton, Nina Amenta, John D. Owens:
GPU LSM: A Dynamic Dictionary Data Structure for the GPU. CoRR abs/1707.05354 (2017) - [i9]Saman Ashkiani, Martin Farach-Colton, John D. Owens:
A Dynamic Hash Table for the GPU. CoRR abs/1710.11246 (2017) - 2016
- [j23]Mohamed S. Ebeida, Ahmad A. Rushdi, Muhammad A. Awad, Ahmed H. Mahmoud, Dong-Ming Yan, Shawn A. English, John D. Owens, Chandrajit L. Bajaj, Scott A. Mitchell:
Disk Density Tuning of a Maximal Random Packing. Comput. Graph. Forum 35(5): 259-269 (2016) - [j22]Leyuan Wang, Sean Baxter, John D. Owens:
Fast parallel skew and prefix-doubling suffix array construction on the GPU. Concurr. Comput. Pract. Exp. 28(12): 3466-3484 (2016) - [j21]Jonathan Y. Kemal, Roger L. Davis, John D. Owens:
Multidisciplinary simulation acceleration using multiple shared memory graphical processing units. Int. J. High Perform. Comput. Appl. 30(4): 486-508 (2016) - [c67]Ross K. Gegan, Vishal Ahuja, John D. Owens, Dipak Ghosal:
Real-time GPU-based timing channel detection using entropy. CNS 2016: 296-305 - [c66]Leyuan Wang, Yangzihao Wang, Carl Yang, John D. Owens:
A Comparative Study on Exact Triangle Counting Algorithms on the GPU. HPGP@HPDC 2016: 1-8 - [c65]Jeremy Kepner, Peter Aaltonen, David A. Bader, Aydin Buluç, Franz Franchetti, John R. Gilbert, Dylan Hutchison, Manoj Kumar, Andrew Lumsdaine, Henning Meyerhenke, Scott McMillan, Carl Yang, John D. Owens, Marcin Zalewski, Timothy G. Mattson, José E. Moreira:
Mathematical foundations of the GraphBLAS. HPEC 2016: 1-9 - [c64]Yangzihao Wang, Andrew A. Davidson, Yuechao Pan, Yuduo Wu, Andy Riffel, John D. Owens:
Gunrock: a high-performance graph processing library on the GPU. PPoPP 2016: 11:1-11:12 - [c63]Saman Ashkiani, Andrew A. Davidson, Ulrich Meyer, John D. Owens:
GPU multisplit. PPoPP 2016: 12:1-12:13 - [c62]Pinar Muyan-Özçelik, John D. Owens:
Multitasking Real-time Embedded GPU Computing Tasks. PMAM@PPoPP 2016: 78-87 - [c61]Saman Ashkiani, Nina Amenta, John D. Owens:
Parallel Approaches to the String Matching Problem on the GPU. SPAA 2016: 275-285 - [i8]Jeremy Kepner, Peter Aaltonen, David A. Bader, Aydin Buluç, Franz Franchetti, John R. Gilbert, Dylan Hutchison, Manoj Kumar, Andrew Lumsdaine, Henning Meyerhenke, Scott McMillan, José E. Moreira, John D. Owens, Carl Yang, Marcin Zalewski, Timothy G. Mattson:
Mathematical Foundations of the GraphBLAS. CoRR abs/1606.05790 (2016) - 2015
- [j20]Anjul Patney, Stanley Tzeng, Kerry A. Seitz Jr., John D. Owens:
Piko: a framework for authoring programmable graphics pipelines. ACM Trans. Graph. 34(4): 147:1-147:13 (2015) - [c60]Mikhail M. Shashkov, Jason Mak, Shawn Recker, Connie S. Nguyen, John D. Owens, Kenneth I. Joy:
Efficient dense reconstruction using geometry and image consistency constraints. AIPR 2015: 1-7 - [c59]Leyuan Wang, Sean Baxter, John D. Owens:
Fast Parallel Suffix Array on the GPU. Euro-Par 2015: 573-587 - [c58]Yuduo Wu, Yangzihao Wang, Yuechao Pan, Carl Yang, John D. Owens:
Performance Characterization of High-Level Programming Models for GPU Graph Analytics. IISWC 2015: 66-75 - [c57]Carl Yang, Yangzihao Wang, John D. Owens:
Fast Sparse Matrix and Sparse Vector Multiplication Algorithm on the GPU. IPDPS Workshops 2015: 841-847 - [c56]Yangzihao Wang, Andrew A. Davidson, Yuechao Pan, Yuduo Wu, Andy Riffel, John D. Owens:
Gunrock: a high-performance graph processing library on the GPU. PPoPP 2015: 265-266 - [c55]Thomas Weber, Michael Wimmer, John D. Owens:
Parallel Reyes-style adaptive subdivision with bounded memory usage. I3D 2015: 39-45 - [p2]Mohamed S. Ebeida, Scott A. Mitchell, Anjul Patney, Andrew A. Davidson, Stanley Tzeng, Muhammad A. Awad, Ahmed H. Mahmoud, John D. Owens:
Exercises in High-Dimensional Sampling: Maximal Poisson-Disk Sampling and k-d Darts. Topological and Statistical Methods for Complex Data, Tackling Large-Scale, High-Dimensional, and Multivariate Data Spaces 2015: 221-238 - [i7]Yangzihao Wang, Andrew A. Davidson, Yuechao Pan, Yuduo Wu, Andy Riffel, John D. Owens:
Gunrock: A High-Performance Graph Processing Library on the GPU. CoRR abs/1501.05387 (2015) - [i6]Yuechao Pan, Yangzihao Wang, Yuduo Wu, Carl Yang, John D. Owens:
Multi-GPU Graph Analytics. CoRR abs/1504.04804 (2015) - 2014
- [j19]Mohamed S. Ebeida, Anjul Patney, Scott A. Mitchell, Keith R. Dalbey, Andrew A. Davidson, John D. Owens:
k-d Darts: Sampling by k-dimensional flat searches. ACM Trans. Graph. 33(1): 3:1-3:16 (2014) - [c54]Jason Mak, Mauricio Hess-Flores, Shawn Recker, John D. Owens, Kenneth I. Joy:
A Comparative Study of GPU-Accelerated Multi-view Sequential Reconstruction Triangulation Methods for Large-Scale Scenes. ACCV Workshops (1) 2014: 254-269 - [c53]Afton Geil, Yangzihao Wang, John D. Owens:
WTF, GPU! computing twitter's who-to-follow on the GPU. COSN 2014: 63-68 - [c52]Andrew A. Davidson, Sean Baxter, Michael Garland, John D. Owens:
Work-Efficient Parallel GPU Methods for Single-Source Shortest Paths. IPDPS 2014: 349-359 - [c51]Jason Mak, Mauricio Hess-Flores, Shawn Recker, John D. Owens, Kenneth I. Joy:
GPU-accelerated and efficient multi-view triangulation for scene reconstruction. WACV 2014: 61-68 - [i5]Anjul Patney, Stanley Tzeng, Kerry A. Seitz Jr., John D. Owens:
Piko: A Design Framework for Programmable Graphics Pipelines. CoRR abs/1404.6293 (2014) - 2013
- [j18]Mohamed S. Ebeida, Ahmed H. Mahmoud, Muhammad A. Awad, Mohammed A. Mohammed, Scott A. Mitchell, Alexander Rand, John D. Owens:
Sifted Disks. Comput. Graph. Forum 32(2): 509-518 (2013) - [i4]Mohamed S. Ebeida, Anjul Patney, Scott A. Mitchell, Keith R. Dalbey, Andrew A. Davidson, John D. Owens:
k-d Darts: Sampling by k-Dimensional Flat Searches. CoRR abs/1302.3917 (2013) - [i3]Kerry A. Seitz Jr., Alex Kennedy, Owen Ransom, Bassam A. Younis, John D. Owens:
A GPU Implementation for Two-Dimensional Shallow Water Modeling. CoRR abs/1309.1230 (2013) - 2012
- [j17]Mohamed S. Ebeida, Scott A. Mitchell, Anjul Patney, Andrew A. Davidson, John D. Owens:
A Simple Algorithm for Maximal Poisson-Disk Sampling in High Dimensions. Comput. Graph. Forum 31(2pt4): 785-794 (2012) - [j16]Stanley Tzeng, Brandon Lloyd, John D. Owens:
A GPU Task-Parallel Model with Dependency Resolution. Computer 45(8): 34-41 (2012) - [c50]Stanley Tzeng, Anjul Patney, Andrew A. Davidson, Mohamed S. Ebeida, Scott A. Mitchell, John D. Owens:
High-Quality Parallel Depth-of-Field Using Line Samples. High Performance Graphics 2012: 23-31 - [c49]Shengren Li, Lance Simons, Jagadeesh Bhaskar Pakaravoor, Fatemeh Abbasinejad, John D. Owens, Nina Amenta:
kANN on the GPU with Shifted Sorting. High Performance Graphics 2012: 39-47 - [c48]Yao Zhang, John Recker, Robert Ulichney, Ingeborg Tastl, John D. Owens:
Plane-dependent error diffusion on a GPU. Image Processing: Algorithms and Systems / Parallel Processing for Imaging Applications 2012: 829515 - [i2]Stanley Tzeng, John D. Owens:
Finding Convex Hulls Using Quickhull on the GPU. CoRR abs/1201.2936 (2012) - 2011
- [j15]Mohamed S. Ebeida, Scott A. Mitchell, Andrew A. Davidson, Anjul Patney, Patrick M. Knupp, John D. Owens:
Efficient and good Delaunay meshes from random points. Comput. Aided Des. 43(11): 1506-1515 (2011) - [j14]Everett H. Phillips, Yao Zhang, Roger L. Davis, John D. Owens:
Acceleration of 2-D Compressible Flow Solvers with Graphics Processing Unit Clusters. J. Aerosp. Comput. Inf. Commun. 8(8): 237-249 (2011) - [j13]Mohamed S. Ebeida, Andrew A. Davidson, Anjul Patney, Patrick M. Knupp, Scott A. Mitchell, John D. Owens:
Efficient maximal poisson-disk sampling. ACM Trans. Graph. 30(4): 49 (2011) - [c47]Andrew A. Davidson, John D. Owens:
Register packing for cyclic reduction: a case study. GPGPU 2011: 4 - [c46]John Jenkins, Isha Arkatkar, John D. Owens, Alok N. Choudhary, Nagiza F. Samatova:
Lessons Learned from Exploring the Backtracking Paradigm on the GPU. Euro-Par (2) 2011: 425-437 - [c45]Kshitij Gupta, John D. Owens:
Compute & memory optimizations for high-quality speech recognition on low-end GPU processors. HiPC 2011: 1-10 - [c44]Yao Zhang, John D. Owens:
A quantitative performance analysis model for GPU architectures. HPCA 2011: 382-393 - [c43]Andrew A. Davidson, Yao Zhang, John D. Owens:
An Auto-tuned Method for Solving Large Tridiagonal Systems on the GPU. IPDPS 2011: 956-965 - [c42]Jeff A. Stuart, John D. Owens:
Multi-GPU MapReduce on GPU Clusters. IPDPS 2011: 1068-1079 - [c41]Vladimir Glavtchev, Pinar Muyan-Özçelik, Jeffrey M. Ota, John D. Owens:
Feature-based speed limit sign detection using a graphics processing unit. Intelligent Vehicles Symposium 2011: 195-200 - [c40]Yao Zhang, John Recker, Robert Ulichney, Giordano B. Beretta, Ingeborg Tastl, I-Jong Lin, John D. Owens:
A parallel error diffusion implementation on a GPU. Parallel Processing for Imaging Applications 2011: 78720K - [c39]Stanley Tzeng, Anjul Patney, John D. Owens:
Efficient adaptive tiling for programmable rendering. SI3D 2011: 201 - [e2]John D. Owens, I-Jong Lin, Yu-Jin Zhang, Giordano B. Beretta:
Proceedings of the Conference on Parallel Processing for Imaging Applications 2011, San Francisco Airport, CA, USA, January 24-25, 2011. SPIE Proceedings 7872, SPIE 2011, ISBN 978-0-8194-8409-3 - [i1]Jeff A. Stuart, John D. Owens:
Efficient Synchronization Primitives for GPUs. CoRR abs/1110.4623 (2011) - 2010
- [j12]Anjul Patney, Stanley Tzeng, John D. Owens:
Fragment-Parallel Composite and Filter. Comput. Graph. Forum 29(4): 1251-1258 (2010) - [c38]Pinar Muyan-Özçelik, Vladimir Glavtchev, Jeffrey M. Ota, John D. Owens:
A Template-Based Approach for Real-Time Speed-Limit-Sign Recognition on an Embedded System Using GPU Computing. DAGM-Symposium 2010: 162-171 - [c37]Stanley Tzeng, Anjul Patney, John D. Owens:
Task management for irregular-parallel workloads on the GPU. High Performance Graphics 2010: 29-37 - [c36]Jeff A. Stuart, Michael Cox, John D. Owens:
GPU-to-CPU Callbacks. Euro-Par Workshops 2010: 365-372 - [c35]