On-chip Networks for Manycore Architecture Myong ... - People - MIT
On-chip Networks for Manycore Architecture Myong ... - People - MIT
On-chip Networks for Manycore Architecture Myong ... - People - MIT
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
[63] Gordon E. Moore. Cramming more components onto integrated circuits. Electronics,<br />
38(8):114–117, April 1965.<br />
[64] Robert D. Mullins, Andrew F. West, and Simon W. Moore. Low-latency virtualchannel<br />
routers <strong>for</strong> on-<strong>chip</strong> networks. In Proc. of the 31st Annual Intl. Symp. on<br />
Computer <strong>Architecture</strong> (ISCA), pages 188–197, 2004.<br />
[65] Ted Nesson and S. Lennart Johnsson. ROMM Routing: A Class of E cient Minimal<br />
Routing Algorithms. In in Proc. Parallel Computer Routing and Communication<br />
Workshop, pages 185–199, 1994.<br />
[66] Ted Nesson and S. Lennart Johnsson. ROMM routing on mesh and torus networks. In<br />
Proc. 7th Annual ACM Symposium on Parallel Algorithms and <strong>Architecture</strong>s SPAA’95,<br />
pages 275–287, 1995.<br />
[67] George Nychis, Chris Fallin, Thomas Moscibroda, and <strong>On</strong>ur Mutlu. Next generation<br />
on-<strong>chip</strong> networks: what kind of congestion control do we need? In Proceedings of<br />
the 9th ACM SIGCOMM Workshop on Hot Topics in <strong>Networks</strong>, Hotnets-IX, pages<br />
12:1–12:6, 2010.<br />
[68] Cheolmin Park, Roy Badeau, Larry Biro, Jonathan Chang, Tejpal Singh, Jim Vash,<br />
Bo Wang, and Tom Wang. A 1.2 TB/s on-<strong>chip</strong> ring interconnect <strong>for</strong> 45nm 8-core<br />
enterprise Xeon R processor. In Solid-State Circuits Conference Digest of Technical<br />
Papers (ISSCC), 2010 IEEE International, pages 180–181, 2010.<br />
[69] Sunghyun Park, Masood Qazi, Li-Shiuan Peh, and Anantha P Chandrakasan. 40.4<br />
fJ/bit/mm low-swing on-<strong>chip</strong> signaling with self-resetting logic repeaters embedded<br />
within a mesh NoC in 45nm SOI CMOS. In Proceedings of the Conference on Design,<br />
Automation and Test in Europe, pages 1637–1642. EDA Consortium, 2013.<br />
[70] Li-Shiuan Peh and William J. Dally. A Delay Model and Speculative <strong>Architecture</strong> <strong>for</strong><br />
Pipelined Routers. In Proc. International Symposium on High-Per<strong>for</strong>mance Computer<br />
<strong>Architecture</strong> (HPCA), pages 255–266, January 2001.<br />
[71] Michael D. Powell, Arijit Biswas, Shantanu Gupta, and Shubhendu S. Mukherjee.<br />
Architectural core salvaging in a multi-core processor <strong>for</strong> hard-error tolerance. In<br />
Proceedings of ISCA 2009, pages 93–104, 2009.<br />
[72] Krishna K. Rangan, Gu-Yeon Wei, and David Brooks. Thread motion: Fine-grained<br />
power management <strong>for</strong> multi-core systems. In Proceedings of ISCA 2009, pages 302–<br />
313, 2009.<br />
[73] Pengju Ren, Mieszko Lis, <strong>Myong</strong> Hyon Cho, Keun Sup Shim, Christopher W Fletcher,<br />
Omer Khan, Nanning Zheng, and Srinivas Devadas. Hornet: A cycle-level multicore<br />
simulator. Computer-Aided Design of Integrated Circuits and Systems, IEEE Transactions<br />
on, 31(6):890–903, 2012.<br />
[74] R.J. Riedlinger, R. Bhatia, L. Biro, B. Bowhill, E. Fetzer, P. Gronowski, and<br />
T. Grutkowski. A 32nm 3.1 billion transistor 12-wide-issue Itanium R processor <strong>for</strong><br />
mission-critical servers. In Solid-State Circuits Conference Digest of Technical Papers<br />
(ISSCC), 2011 IEEE International, pages 84–86, 2011.<br />
114