In-datacenter performance analysis of a tensor processing unit
76 auth.
N. Jouppi,
C. Young,
Nishant Patil,
David Patterson,
Gaurav Agrawal,
Raminder Bajwa,
Sarah Bates,
Suresh Bhatia,
Nan Boden,
Al Borchers,
Rick Boyle,
Pierre-luc Cantin,
Clifford Chao,
Chris Clark,
Jeremy Coriell,
...
Mike Daley,
Matt Dau,
Jeffrey Dean,
Ben Gelb,
Taraneh Ghaemmaghami,
Rajendra Gottipati,
William Gulland,
Robert Hagmann,
C. R. Ho,
Doug Hogberg,
John Hu,
R. Hundt,
Dan Hurt,
Julian Ibarz,
A. Jaffey,
Alek Jaworski,
Alexander Kaplan,
Harshit Khaitan,
Daniel Killebrew,
Andy Koch,
Naveen Kumar,
Steve Lacy,
James Laudon,
James Law,
Diemthu Le,
Chris Leary,
Zhuyuan Liu,
Kyle Lucke,
Alan Lundin,
Gordon MacKean,
Adriana Maggiore,
Maire Mahony,
Kieran Miller,
R. Nagarajan,
Ravi Narayanaswami,
Ray Ni,
Kathy Nix,
Thomas Norrie,
Mark Omernick,
Narayana Penukonda,
Andy Phelps,
Jonathan Ross,
Matt Ross,
Amir Salek,
Emad Samadiani,
Chris Severn,
Gregory Sizikov,
Matthew Snelham,
Jed Souter,
Dan Steinberg,
Andy Swing,
Mercedes Tan,
Gregory Thorson,
Bo Tian,
Horia Toma,
Erick Tuttle,
Vijay Vasudevan,
Richard Walter,
Walter Wang,
Eric Wilcox,
Doe Hyun Yoon
|
12 |
2017 |
12
2017
|
EIE: Efficient Inference Engine on Compressed Deep Neural Network
7 auth.
Song Han,
Xingyu Liu,
Huizi Mao,
Jing Pu,
A. Pedram,
M. Horowitz,
...
W. Dally
|
11 |
2016 |
11
2016
|
Power provisioning for a warehouse-sized computer
Xiaobo Fan,
W. Weber,
L. Barroso
|
11 |
2007 |
11
2007
|
Dark silicon and the end of multicore scaling
H. Esmaeilzadeh,
Emily R. Blem,
Renée St. Amant,
Karthikeyan Sankaralingam,
D. Burger
|
11 |
2011 |
11
2011
|
ISAAC: A Convolutional Neural Network Accelerator with In-Situ Analog Arithmetic in Crossbars
8 auth.
Ali Shafiee,
Anirban Nag,
N. Muralimanohar,
R. Balasubramonian,
J. Strachan,
Miao Hu,
...
R. S. Williams,
Vivek Srikumar
|
10 |
2016 |
10
2016
|
Architecting phase change memory as a scalable dram alternative
Benjamin C. Lee,
Engin Ipek,
O. Mutlu,
D. Burger
|
10 |
2009 |
10
2009
|
Eyeriss: A Spatial Architecture for Energy-Efficient Dataflow for Convolutional Neural Networks
Yu-hsin Chen,
J. Emer,
V. Sze
|
10 |
2016 |
10
2016
|
Scalable high performance main memory system using phase-change memory technology
Moinuddin K. Qureshi,
Vijayalakshmi Srinivasan,
J. Rivers
|
10 |
2009 |
10
2009
|
A reconfigurable fabric for accelerating large-scale datacenter services
23 auth.
Andrew Putnam,
Adrian M. Caulfield,
Eric S. Chung,
Derek Chiou,
Kypros Constantinides,
J. Demme,
H. Esmaeilzadeh,
J. Fowers,
Gopi Prashanth Gopal,
J. Gray,
...
M. Haselman,
S. Hauck,
Stephen Heil,
Amir Hormati,
Joo-Young Kim,
S. Lanka,
J. Larus,
Eric Peterson,
Simon Pope,
Aaron Smith,
J. Thong,
Phillip Yi Xiao,
D. Burger
|
10 |
2014 |
10
2014
|
SCNN: An accelerator for compressed-sparse convolutional neural networks
9 auth.
A. Parashar,
Minsoo Rhu,
Anurag Mukkara,
A. Puglielli,
Rangharajan Venkatesan,
Brucek Khailany,
...
J. Emer,
S. Keckler,
W. Dally
|
10 |
2017 |
10
2017
|
A durable and energy efficient main memory using phase change memory technology
Ping Zhou,
Bo Zhao,
Jun Yang,
Youtao Zhang
|
9 |
2009 |
9
2009
|
Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU
12 auth.
V. Lee,
Changkyu Kim,
J. Chhugani,
M. Deisher,
Daehyun Kim,
A. Nguyen,
...
N. Satish,
M. Smelyanskiy,
Srinivas Chennupaty,
Per Hammarlund,
Ronak Singhal,
P. Dubey
|
9 |
2010 |
9
2010
|
A scalable processing-in-memory accelerator for parallel graph processing
Junwhan Ahn,
Sungpack Hong,
S. Yoo,
O. Mutlu,
Kiyoung Choi
|
9 |
2015 |
9
2015
|
Retrospective: memory consistency and event ordering in scalable shared-memory multiprocessors
K. Gharachorloo,
D. Lenoski,
J. Laudon,
Phillip B. Gibbons,
Anoop Gupta,
J. Hennessy
|
9 |
1998 |
9
1998
|
Technology-Driven, Highly-Scalable Dragonfly Topology
John Kim,
W. Dally,
Steve Scott,
D. Abts
|
9 |
2008 |
9
2008
|
Adaptive insertion policies for high performance caching
Moinuddin K. Qureshi,
A. Jaleel,
Y. Patt,
S. Steely,
J. Emer
|
9 |
2007 |
9
2007
|
Retrospective: a study of branch prediction strategies
James E. Smith
|
9 |
1998 |
9
1998
|