


default search action
50th MICRO 2017: Cambridge, MA, USA
- Hillery C. Hunter, Jaime Moreno, Joel S. Emer, Daniel Sánchez:
Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2017, Cambridge, MA, USA, October 14-18, 2017. ACM 2017, ISBN 978-1-4503-4952-9
DRAM
- Xiangyao Yu, Christopher J. Hughes
, Nadathur Satish, Onur Mutlu, Srinivas Devadas:
Banshee: bandwidth-efficient DRAM caching via software/hardware cooperation. 1-14 - Bharat Sukhwani, Thomas Roewer, Charles L. Haymes, Kyu-Hyoun Kim
, Adam J. McPadden, Daniel M. Dreps, Dean Sanner, Jan van Lunteren, Sameh W. Asaad:
Contutto: a novel FPGA-based prototyping platform enabling innovation in the memory subsystem of a server class processor. 15-26 - Samira Manabi Khan, Chris Wilkerson, Zhe Wang, Alaa R. Alameldeen, Donghyuk Lee, Onur Mutlu:
Detecting and mitigating data-dependent DRAM failures by exploiting current memory content. 27-40 - Mike O'Connor
, Niladrish Chatterjee, Donghyuk Lee, John M. Wilson, Aditya Agrawal, Stephen W. Keckler, William J. Dally:
Fine-grained DRAM: energy-efficient DRAM for extreme bandwidth systems. 41-54
Accelerators
- Yuanwei Fang, Chen Zou, Aaron J. Elmore
, Andrew A. Chien:
UDP: a programmable accelerator for extract-transform-load workloads and more. 55-68 - Reza Yazdani, José-María Arnau, Antonio González:
UNFOLD: a memory-efficient speech recognizer using on-the-fly WFST composition. 69-81 - Mostafa Mahmoud, Bojian Zheng, Alberto Delmas Lascorz, Felix Heide, Jonathan Assouline, Paul Boucher, Emmanuel Onzon, Andreas Moshovos:
IDEAL: image denoising accelerator. 82-95 - Thomas J. Repetti, João Pedro Cerqueira, Martha A. Kim
, Mingoo Seok:
Pipelining a triggered processing element. 96-108
GPU-I
- Ivan Tanasic, Isaac Gelado, Marc Jordà, Eduard Ayguadé, Nacho Navarro:
Efficient exception handling support for GPUs. 109-122 - Ugljesa Milic, Oreste Villa, Evgeny Bolotin, Akhil Arunkumar, Eiman Ebrahimi, Aamer Jaleel, Alex Ramírez, David W. Nellans:
Beyond the socket: NUMA-aware GPUs. 123-135 - Rachata Ausavarungnirun, Joshua Landgraf, Vance Miller, Saugata Ghose, Jayneel Gandhi
, Christopher J. Rossbach
, Onur Mutlu:
Mosaic: a GPU memory manager with application-transparent support for multiple page sizes. 136-150 - John Kloosterman
, Jonathan Beaumont
, Davoud Anoushe Jamshidi, Jonathan Bailey, Trevor N. Mudge, Scott A. Mahlke:
Regless: just-in-time operand staging for GPUs. 151-164 - Pedro Duarte, Pedro Tomás
, Gabriel Falcão
:
SCRATCH: an end-to-end application-aware soft-GPGPU architecture and trimming tool. 165-177
Non-volatile memory/storage
- Seunghee Shin, Satish Kumar Tirukkovalluri, James Tuck, Yan Solihin:
Proteus: a flexible and fast software supported hardware logging approach for NVM. 178-190 - Guoyang Chen, Lei Zhang, Richa Budhiraja, Xipeng Shen
, Youfeng Wu:
Efficient support of position independence on non-volatile memory. 191-203 - Kaisheng Ma
, Xueqing Li, Jinyang Li, Yongpan Liu, Yuan Xie, Jack Sampson, Mahmut Taylan Kandemir, Vijaykrishnan Narayanan:
Incidental computing on IoT nonvolatile processors. 204-218 - Gunjae Koo
, Kiran Kumar Matam, Te I, H. V. Krishna Giri Narra, Jing Li, Hung-Wei Tseng
, Steven Swanson
, Murali Annavaram
:
Summarizer: trading communication with computing near storage. 219-231 - Zhaoxia Deng, Lunkai Zhang, Nikita Mishra, Henry Hoffmann, Frederic T. Chong
:
Memory cocktail therapy: a general learning-based framework to optimize dynamic tradeoffs in NVMs. 232-244
In/near memory computing
- Sandeep R. Agrawal, Sam Idicula, Arun Raghavan, Evangelos Vlachos, Venkatraman Govindaraju, Venkatanathan Varadarajan, Cagri Balkesen, Georgios Giannikis, Charlie Roth, Nipun Agarwal, Eric Sedlar:
A many-core architecture for in-memory data processing. 245-258 - Arun Subramaniyan, Jingcheng Wang, Ezhil R. M. Balasubramanian, David T. Blaauw, Dennis Sylvester, Reetuparna Das
:
Cache automaton. 259-272 - Vivek Seshadri, Donghyuk Lee, Thomas Mullins, Hasan Hassan, Amirali Boroumand, Jeremie S. Kim, Michael A. Kozuch, Onur Mutlu, Phillip B. Gibbons, Todd C. Mowry:
Ambit: in-memory accelerator for bulk bitwise operations using commodity DRAM technology. 273-287 - Shuangchen Li, Dimin Niu, Krishna T. Malladi, Hongzhong Zheng, Bob Brennan, Yuan Xie:
DRISA: a DRAM-based reconfigurable in-situ accelerator. 288-301 - Dimitrios Skarlatos, Nam Sung Kim, Josep Torrellas:
Pageforge: a near-memory content-aware page-merging architecture. 302-314
Security
- Khaled N. Khasawneh, Nael B. Abu-Ghazaleh
, Dmitry Ponomarev, Lei Yu:
RHMD: evasion-resilient hardware malware detectors. 315-327 - Hari Cherupalli, Henry Duwe, Weidong Ye, Rakesh Kumar, John Sartori:
Software-based gate-level information flow security for IoT systems. 328-340 - Zecheng He, Ruby B. Lee:
How secure is your cache against side-channel attacks? 341-353 - Hoda Naghibijouybari
, Khaled N. Khasawneh, Nael B. Abu-Ghazaleh
:
Constructing and characterizing covert channels on GPGPUs. 354-366
Deep learning
- Jongse Park, Hardik Sharma
, Divya Mahajan
, Joon Kyung Kim, Preston Olds, Hadi Esmaeilzadeh:
Scale-out acceleration for machine learning. 367-381 - Jorge Albericio, Alberto Delmas, Patrick Judd, Sayeh Sharify, Gerard O'Leary, Roman Genov, Andreas Moshovos:
Bit-pragmatic deep neural network computing. 382-394 - Caiwen Ding
, Siyu Liao, Yanzhi Wang, Zhe Li, Ning Liu
, Youwei Zhuo, Chao Wang, Xuehai Qian, Yu Bai, Geng Yuan, Xiaolong Ma, Yipeng Zhang, Jian Tang, Qinru Qiu, Xue Lin, Bo Yuan:
CirCNN: accelerating and compressing deep neural networks using block-circulant weight matrices. 395-408
Prediction
- Abhishek Bhattacharjee
:
Using branch predictors to predict brain activity in brain-machine implants. 409-422 - Rami Sheikh, Harold W. Cain, Raguram Damodaran:
Load value prediction via path-based address prediction: avoiding mispredictions due to conflicting stores. 423-435 - Daniel A. Jiménez
, Elvira Teran:
Multiperspective reuse prediction. 436-448
Coherence/consistency, translation and virtual memory
- Yashwant Marathe, Nagendra Gulur, Jee Ho Ryoo, Shuang Song, Lizy K. John:
CSALT: context switch aware large TLB. 449-462 - Yatin A. Manerkar, Daniel Lustig, Margaret Martonosi, Michael Pellauer:
RTLcheck: verifying the memory consistency of RTL designs. 463-476 - Opeoluwa Matthews, Daniel J. Sorin:
Architecting hierarchical coherence protocols for push-button parametric verification. 477-489 - Yuanfeng Peng, Benjamin P. Wood, Joseph Devietti
:
PARSNIP: performant architecture for race safety with no impact on precision. 490-502
Energy
- George Papadimitriou
, Manolis Kaliorakis, Athanasios Chatzidimitriou, Dimitris Gizopoulos, Peter Lawthers, Shidhartha Das:
Harnessing voltage margins for energy efficiency in multicore CPUs. 503-516 - Haibo Zhang, Prasanna Venkatesh Rengasamy, Shulin Zhao, Nachiappan Chidambaram Nachiappan, Anand Sivasubramaniam, Mahmut T. Kandemir, Ravi R. Iyer, Chita R. Das:
Race-to-sleep + content caching + display caching: a recipe for energy-efficient video streaming on handhelds. 517-531 - Ang Li, Wenfeng Zhao, Shuaiwen Leon Song:
BVF: enabling significant on-chip power savings via bit-value-favor for throughput processors. 532-545 - Aditya Agrawal, Josep Torrellas, Sachin Idgunji:
Xylem: enhancing vertical thermal conduction in 3D processor-memory stacks. 546-559
GPU-II
- Ya-Shuai Lü, Libo Huang, Li Shen, Zhiying Wang:
Unleashing the power of GPU for physically-based rendering via dynamic ray shuffling. 560-573 - Youngsok Kim
, Jae-Eon Jo, Hanhwi Jang, Minsoo Rhu, Hanjun Kim
, Jangwoo Kim:
GPUpd: a fast and scalable multi-GPU architecture using cooperative projection and distribution. 574-586 - Zhen Zheng, Chanyoung Oh, Jidong Zhai, Xipeng Shen
, Youngmin Yi, Wenguang Chen:
Versapipe: a versatile programming framework for pipelined computing on GPU. 587-599 - AmirAli Abdolrashidi, Devashree Tripathy
, Mehmet Esat Belviranli, Laxmi Narayan Bhuyan, Daniel Wong
:
Wireframe: supporting data-dependent parallelism through dependency graph execution in GPUs. 600-611
OS and system design
- Prathmesh Kallurkar, Smruti R. Sarangi:
Schedtask: a hardware-assisted task scheduler. 612-624 - Md. Enamul Haque, Yuxiong He, Sameh Elnikety, Thu D. Nguyen, Ricardo Bianchini, Kathryn S. McKinley:
Exploiting heterogeneity for tail latency and energy efficiency. 625-638 - Christian DeLozier
, Ariel Eizenberg, Shiliang Hu, Gilles Pokam, Joseph Devietti
:
TMI: thread memory isolation for false sharing repair. 639-650 - Weilong Cui, Timothy Sherwood
:
Estimating and understanding architectural risk. 651-664
Unconventional architectures
- Yipeng Huang
, Ning Guo, Mingoo Seok, Yannis P. Tsividis, Kyle T. Mandli
, Simha Sethumadhavan:
Hybrid analog-digital solution of nonlinear partial differential equations. 665-678 - Swamit S. Tannu
, Zachary A. Myers, Prashant J. Nair, Douglas M. Carmean, Moinuddin K. Qureshi:
Taming the instruction bandwidth of quantum computers via hardware-managed error correction. 679-691 - Ali JavadiAbhari, Pranav Gokhale, Adam Holmes, Diana Franklin
, Kenneth R. Brown
, Margaret Martonosi, Frederic T. Chong
:
Optimized surface code communication in superconducting quantum computers. 692-705 - Ting-Jung Chang, Zhuozhi Yao, Paul J. Jackson, Barry P. Rand
, David Wentzlaff:
Architectural tradeoffs for biodegradable computing. 706-717
Compilers and microarchitecture
- Joonmoo Huh, James Tuck:
Improving the effectiveness of searching for isomorphic chains in superword level parallelism. 718-729 - Xulong Tang
, Orhan Kislal, Mahmut T. Kandemir, Mustafa Karaköy:
Data movement aware computation partitioning. 730-744 - Shruti Padmanabha, Andrew Lukefahr, Reetuparna Das, Scott A. Mahlke:
Mirage cores: the illusion of many out-of-order cores using in-order hardware. 745-758 - Ji Kim, Shunning Jiang, Christopher Torng, Moyang Wang, Shreesha Srinath, Berkin Ilbeyi, Khalid Al-Hawaj, Christopher Batten:
Using intra-core loop-task accelerators to improve the productivity and performance of task-based parallel programs. 759-773
Best paper nominees
- Daphne I. Gorman, Matthew R. Guthaus, Jose Renau:
Architectural opportunities for novel dynamic EMI shifting (DEMIS). 774-785 - Parker Hill, Animesh Jain, Mason Hill, Babak Zamirai, Chang-Hong Hsu, Michael A. Laurenzano, Scott A. Mahlke, Lingjia Tang, Jason Mars:
DeftNN: addressing bottlenecks for DNN execution on GPUs via synapse vector elimination and near-compute data fission. 786-799 - Tiancong Wang, Sakthikumaran Sambasivam, Yan Solihin, James Tuck:
Hardware supported persistent object address translation. 800-812 - Xiang Fu, Michiel Adriaan Rol
, Cornelis Christiaan Bultink, J. van Someren, Nader Khammassi, Imran Ashraf
, R. F. L. Vermeulen, J. C. de Sterke, W. J. Vlothuizen, R. N. Schouten, Carmen G. Almudéver
, Leonardo DiCarlo, Koen Bertels:
An experimental microarchitecture for a superconducting quantum processor. 813-825

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.