default search action
Tarek S. Abdelrahman
Person information
- affiliation: University of Toronto, Canada
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c47]Kyle Zhao Bin Chen, Tarek S. Abdelrahman, Reza Azimi, Tomasz S. Czajkowski, Maziar Goudarzi:
RoDMap: A Reserve-on-Demand Mapper for Spatially-Configured Coarse-Grained Reconfigurable Arrays. ICPP 2024: 876-886 - 2023
- [c46]Mahdi Abbaszadeh, Tarek S. Abdelrahman, Reza Azimi, Tomasz S. Czajkowski, Maziar Goudarzi:
Efficient Data Streaming for a Tightly-Coupled Coarse-Grained Reconfigurable Array. IPDPS Workshops 2023: 435-443 - 2022
- [c45]Seung-Hun Chung, Tarek S. Abdelrahman:
Optimization of Compiler-Generated OpenCL CNN Kernels and Runtime for FPGAs. IPDPS Workshops 2022: 100-103 - [c44]Nikhil Sambhus, Tarek S. Abdelrahman:
Reuse-Aware Partitioning of Dataflow Graphs on a Tightly-Coupled CGRA. ISPA/BDCloud/SocialCom/SustainCom 2022: 458-467 - [i4]Seung-Hun Chung, Tarek S. Abdelrahman:
A Compilation Flow for the Generation of CNN Inference Accelerators on FPGAs. CoRR abs/2203.04015 (2022) - 2021
- [j20]Lifu Zhang, Tarek S. Abdelrahman:
Pipelined Training with Stale Weights in Deep Convolutional Neural Networks. Appl. Comput. Intell. Soft Comput. 2021: 3839543:1-3839543:16 (2021) - [c43]Francis O'Brien, Matthew Agostini, Tarek S. Abdelrahman:
A Streaming Accelerator for Heterogeneous CPU-FPGA Processing of Graph Applications. IPDPS Workshops 2021: 26-35 - 2020
- [j19]Tarek S. Abdelrahman:
Cooperative Software-hardware Acceleration of K-means on a Tightly Coupled CPU-FPGA System. ACM Trans. Archit. Code Optim. 17(3): 20:1-20:24 (2020) - [c42]Matthew Agostini, Francis O'Brien, Tarek S. Abdelrahman:
Balancing Graph Processing Workloads Using Work Stealing on Heterogeneous CPU-FPGA Systems. ICPP 2020: 50:1-50:12 - [c41]Seung-Hun Chung, Tarek S. Abdelrahman:
Optimizing OpenCL Kernels and Runtime for DNN Inference on FPGAs. IPDPS Workshops 2020: 151-154 - [e2]Federico Silla, Tarek S. Abdelrahman:
ICPP Workshops '20: Workshops, Edmonton, AB, Canada, August 17-20, 2020. ACM 2020, ISBN 978-1-4503-8868-9 [contents]
2010 – 2019
- 2019
- [j18]Amir H. Ashouri, Tarek S. Abdelrahman, Alwyn Dos Remedios:
Retraining-free methods for fast on-the-fly pruning of convolutional neural networks. Neurocomputing 370: 56-69 (2019) - [i3]Lifu Zhang, Tarek S. Abdelrahman:
Pipelined Training with Stale Weights of Deep Convolutional Neural Networks. CoRR abs/1912.12675 (2019) - 2018
- [j17]Joseph D. Garvey, Tarek S. Abdelrahman:
A Strategy for Automatic Performance Tuning of Stencil Computations on GPUs. Sci. Program. 2018: 6093054:1-6093054:24 (2018) - [c40]Leslie Barron, Tarek S. Abdelrahman:
User-Transparent Translation of Machine Instructions to Programmable Hardware. IPDPS Workshops 2018: 7-14 - [i2]Amir H. Ashouri, Tarek S. Abdelrahman, Alwyn Dos Remedios:
Fast On-the-fly Retraining-free Sparsification of Convolutional Neural Networks. CoRR abs/1811.04199 (2018) - 2017
- [j16]Alton Chiu, Joseph Garvey, Tarek S. Abdelrahman:
A Language and Preprocessor for User-Controlled Generation of Synthetic Programs. Sci. Program. 2017: 1046161:1-1046161:24 (2017) - [c39]Wilson Feng, Tarek S. Abdelrahman:
A Sampling Based Strategy to Automatic Performance Tuning of GPU Programs. IPDPS Workshops 2017: 1342-1349 - [c38]Tianyi David Han, Tarek S. Abdelrahman:
Use of Synthetic Benchmarks for Machine-Learning-Based Performance Auto-Tuning. IPDPS Workshops 2017: 1350-1361 - [c37]Andrew S. D. Lee, Tarek S. Abdelrahman:
Launch-Time Optimization of OpenCL GPU Kernels. GPGPU@PPoPP 2017: 32-41 - 2016
- [c36]Tarek S. Abdelrahman:
Accelerating K-means clustering on a tightly-coupled processor-FPGA heterogeneous system. ASAP 2016: 176-181 - 2015
- [c35]Alton Chiu, Joseph Garvey, Tarek S. Abdelrahman:
Genesis: a language for generating synthetic training programs for machine learning. Conf. Computing Frontiers 2015: 8:1-8:8 - [c34]Joseph D. Garvey, Tarek S. Abdelrahman:
Automatic Performance Tuning of Stencil Computations on GPUs. ICPP 2015: 300-309 - [c33]Cedomir Segulja, Tarek S. Abdelrahman:
Clean: a race detector with cleaner semantics. ISCA 2015: 401-413 - 2014
- [c32]Cedomir Segulja, Tarek S. Abdelrahman:
What is the cost of weak determinism? PACT 2014: 99-112 - [c31]Davor Capalija, Tarek S. Abdelrahman:
Tile-based bottom-up compilation of custom mesh-of-functional-units FPGA overlays. FPL 2014: 1-8 - [i1]Tianyi David Han, Tarek S. Abdelrahman:
Automatic Tuning of Local Memory Use on GPGPUs. CoRR abs/1412.6986 (2014) - 2013
- [j15]Davor Capalija, Tarek S. Abdelrahman:
Microarchitecture of a Coarse-Grain Out-of-Order Superscalar Processor. IEEE Trans. Parallel Distributed Syst. 24(2): 392-405 (2013) - [c30]Tianyi David Han, Tarek S. Abdelrahman:
Reducing divergence in GPGPU programs with loop merging. GPGPU@ASPLOS 2013: 12-23 - [c29]Davor Capalija, Tarek S. Abdelrahman:
A high-performance overlay architecture for pipelined execution of data flow graphs. FPL 2013: 1-8 - [c28]Michael C. Delorme, Tarek S. Abdelrahman, Chengyan Zhao:
Parallel Radix Sort on the AMD Fusion Accelerated Processing Unit. ICPP 2013: 339-348 - 2012
- [j14]Borys J. Bradel, Tarek S. Abdelrahman:
Inlining with traces in Java programs. Comput. Syst. Sci. Eng. 27(4) (2012) - [j13]Utku Aydonat, Tarek S. Abdelrahman:
Relaxed Concurrency Control in Software Transactional Memory. IEEE Trans. Parallel Distributed Syst. 23(7): 1312-1325 (2012) - [c27]Ivan Matosevic, Tarek S. Abdelrahman:
Efficient bottom-up heap analysis for symbolic path-based data access summaries. CGO 2012: 252-263 - [c26]Cedomir Segulja, Tarek S. Abdelrahman:
Architectural support for synchronization-free deterministic parallel programming. HPCA 2012: 337-348 - 2011
- [j12]Utku Aydonat, Tarek S. Abdelrahman:
Parallelization of multimedia applications on the multi-level computing architecture. J. Embed. Comput. 4(3-4): 87-106 (2011) - [j11]Tianyi David Han, Tarek S. Abdelrahman:
hiCUDA: High-Level GPGPU Programming. IEEE Trans. Parallel Distributed Syst. 22(1): 78-90 (2011) - [c25]Tianyi David Han, Tarek S. Abdelrahman:
Reducing branch divergence in GPU programs. GPGPU 2011: 3 - [c24]Davor Capalija, Tarek S. Abdelrahman:
Towards Synthesis-Free JIT Compilation to Commodity FPGAs. FCCM 2011: 202-205 - 2010
- [c23]Utku Aydonat, Tarek S. Abdelrahman:
Hardware Support for Relaxed Concurrency Control in Transactional Memory. MICRO 2010: 15-26
2000 – 2009
- 2009
- [j10]Borys J. Bradel, Tarek S. Abdelrahman:
A study of potential parallelism among traces in Java programs. Sci. Comput. Program. 74(5-6): 296-313 (2009) - [c22]Tianyi David Han, Tarek S. Abdelrahman:
hiCUDA: a high-level directive-based language for GPU programming. GPGPU 2009: 52-61 - [c21]Borys J. Bradel, Tarek S. Abdelrahman:
The use of hardware transactional memory for the trace-based parallelization of recursive Java programs. PPPJ 2009: 101-110 - 2007
- [c20]Borys J. Bradel, Tarek S. Abdelrahman:
Automatic Trace-Based Parallelization of Java Programs. ICPP 2007: 26 - [c19]Borys J. Bradel, Tarek S. Abdelrahman:
The potential of trace-level parallelism in Java programs. PPPJ 2007: 167-174 - 2006
- [c18]Ahmed Abdelkhalek, Tarek S. Abdelrahman:
Locality management using multiple SPMs on the Multi-Level Computing Architecture. ESTIMedia 2006: 67-72 - 2005
- [c17]Borys J. Bradel, Tarek S. Abdelrahman:
A Characterization of Traces in Java Programs. PLC 2005: 87-93 - [c16]Ivan Matosevic, Tarek S. Abdelrahman, Faraydon Karim, Alain Mellan:
Power Optimization for the MLCA Using Dynamic Voltage Scaling. SCOPES 2005: 109-123 - 2004
- [j9]Tarek S. Abdelrahman, Robert Sawaya:
Improving the structure of loop nests in scientific programs. Comput. Syst. Sci. Eng. 19(1): 11-25 (2004) - [j8]Faraydon Karim, Alain Mellan, Anh Nguyen, Utku Aydonat, Tarek S. Abdelrahman:
A Multilevel Computing Architecture for Embedded Multimedia Applications. IEEE Micro 24(3): 56-66 (2004) - [j7]Patrick Doyle, Carlos Cavanna, Tarek S. Abdelrahman:
The design and implementation of a modular and extensible Java Virtual Machine. Softw. Pract. Exp. 34(3): 287-313 (2004) - [j6]Bryan Chan, Tarek S. Abdelrahman:
Run-Time Support for the Automatic Parallelization of Java Programs. J. Supercomput. 28(1): 91-117 (2004) - [c15]Benjamin Vitale, Tarek S. Abdelrahman:
Catenation and specialization for Tcl virtual machine performance. IVME 2004: 42-50 - [c14]Borys J. Bradel, Tarek S. Abdelrahman:
The Use of Traces for Inlining in Java Programs. LCPC 2004: 179-193 - [e1]Tarek S. Abdelrahman:
Proceedings of the 3rd Virtual Machine Research and Technology Symposium, May 6-7, 2004, San Jose, CA, USA. USENIX 2004 [contents] - 2002
- [c13]Patrick Doyle, Tarek S. Abdelrahman:
A Modular and Extensible JVM Infrastructure. Java Virtual Machine Research and Technology Symposium 2002: 65-78 - 2001
- [j5]Naraig Manjikian, Tarek S. Abdelrahman:
Exploiting Wavefront Parallelism on Large-Scale Shared-Memory Multiprocessors. IEEE Trans. Parallel Distributed Syst. 12(3): 259-271 (2001) - [c12]Neil V. Brewster, Tarek S. Abdelrahman:
A Compiler Infrastructure for High-Performance Java. HPCN Europe 2001: 675-684 - 2000
- [c11]R. Grindley, Tarek S. Abdelrahman, Stephen Dean Brown, S. Caranci, D. DeVries, Benjamin Gamsa, A. Grbic, M. Gusat, R. Ho, Orran Krieger, Guy G. Lemieux, K. Loveless, Naraig Manjikian, P. McHardy, Sinisa Srbljic, Michael Stumm, Zvonko G. Vranesic, Zeljko Zilic:
The NUMAchine Multiprocessor. ICPP 2000: 487-496
1990 – 1999
- 1999
- [j4]Tarek S. Abdelrahman, Gary Liu:
Overlap of Computation and Communication on Shared-Memory. Parallel Distributed Comput. Pract. 2(2) (1999) - 1998
- [j3]Tarek S. Abdelrahman, Thomas N. Wong:
Compiler Support for Array Distribution on NUMA Shared Memory Multiprocessors. J. Supercomput. 12(4): 349-371 (1998) - [c10]Tarek S. Abdelrahman, Naraig Manjikian, Gary Liu, Sudarsan Tandri:
Locality Enhancement for Large-Scale Shared-Memory Multiprocessors. LCR 1998: 335-342 - 1997
- [j2]Naraig Manjikian, Tarek S. Abdelrahman:
Fusion of Loops for Parallelism and Locality. IEEE Trans. Parallel Distributed Syst. 8(2): 193-209 (1997) - [c9]Sudarsan Tandri, Tarek S. Abdelrahman:
Automatic Partitioning of Data and Computations on Scalable Shared Memory Multiprocessors. ICPP 1997: 64-73 - [c8]Edward D. Moreno, Sergio Takeo Kofuji, Michael Stumm, Tarek S. Abdelrahman:
Tuning Shared Network Cache Size vs. Second-Level Cache Size in Clusters-Based Multiprocessors. PaCT 1997: 398-399 - 1996
- [j1]Tarek S. Abdelrahman:
Latency hiding on COMA multiprocessors. J. Supercomput. 10(3): 225-242 (1996) - [c7]Naraig Manjikian, Tarek S. Abdelrahman:
Scheduling of Wavefront Parallelism on Scalable Shared-memory Multiprocessors. ICPP, Vol. 3 1996: 122-131 - [c6]Sudarsan Tandri, Tarek S. Abdelrahman:
Automatic Data and Computation Partitioning on Scalable Shared Memory Multiprocessors. LCPC 1996: 600-602 - [c5]Tarek S. Abdelrahman, Sum Huynh:
Exploiting Task-Level Parallelism Using pTask. PDPTA 1996: 252-263 - [c4]Tarek S. Abdelrahman, Kenneth L. Ma:
Evaluation of Dynamic Data Distributions on NUMA Shared Memory Multiprocessors. PDPTA 1996: 1319-1322 - 1995
- [c3]Naraig Manjikian, Tarek S. Abdelrahman:
Fusion of Loops for Parallelism and Locality. ICPP (2) 1995: 19-28 - [c2]Sudarsan Tandri, Tarek S. Abdelrahman:
Computation and Data Partitioning on Scalable Shared Memory Multiprocessors. PDPTA 1995: 41-50 - [c1]Tarek S. Abdelrahman:
Latency Hiding on COMA Multiprocessors. PDPTA 1995: 363-372
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-08-23 18:31 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint