30.12.2012 Views

Superconducting Technology Assessment - nitrd

Superconducting Technology Assessment - nitrd

Superconducting Technology Assessment - nitrd

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

The key challenges at the processor design level are:<br />

26<br />

■ Microarchitecture<br />

– a partitioned organization.<br />

– long pipelines.<br />

– small area reachable in a single cycle.<br />

– mechanisms for memory latency avoidance and tolerance.<br />

– clocking, communication, and synchronization for 50-100 GHz processors and chipsets.<br />

■ Memory<br />

– High-speed, low-latency, high-bandwidth, hybrid-technology memory hierarchy.<br />

■ Interconnect<br />

– Low-latency on-chip point-to-point interconnect (no shared buses are allowed).<br />

– Low-latency and high-bandwidth for processor-memory switches and system interconnect.<br />

Most of the architectural and design challenges are not peculiar to superconductor circuitry but, rather, stem from<br />

the processor circuit speed itself. At the same time, some of the unique characteristics of the RSFQ logic will<br />

certainly influence the microarchitecture for superconductor processors.<br />

Pipelines<br />

With their extremely high processing rates, fine-grained superconductor processor pipelines are longer than those in<br />

current complementary metal oxide semiconductors (CMOS) processors. The on-chip gate-to-gate communication<br />

delays in 50 GHz microprocessors will limit the space reachable in one cycle to ~1-2 mm. A time to read data from<br />

local, off-chip memory can be up to 50 cycles, while long-distance memory access and interprocessor synchronization<br />

latencies can be easily an order of thousands of processor cycles.<br />

Latency Problem<br />

The sheer size of the latency problem at each design level requires very aggressive latency avoidance and tolerance<br />

mechanisms. Superconductor processors need a microarchitecture in which most processing occurs in close proximity<br />

to data. Latency tolerance must be used to mitigate costs of unavoidable, multi-cycle memory access or other<br />

on-/off-chip communication latencies. Some latency tolerance techniques (e.g., multithreading and vector processing)<br />

that are successfully used in current processors can also work for superconductor processors. Other potential<br />

aggressive architectural options may focus on computation (threads) migrating towards data in order to decrease<br />

memory access latency.<br />

Bandwidth Issues<br />

Petaflops-level computing requires very-high-bandwidth memory systems and processor-memory interconnect<br />

switches. In order to reach the required capacity, latency, and bandwidth characteristics, the memory subsystems<br />

for superconductor processors will likely be built with both superconductor and other technologies (e.g., hybrid<br />

SFQ-CMOS memory). Multi-technology superconductor petaflops systems will need high-speed, high-bandwidth<br />

(electrical and optical) interfaces between sections operating at different temperatures.<br />

Successful resolution of these design issues and the continuing development of the superconductor technology will<br />

allow us to produce a full-fledged 100 GHz 64/128-bit, million-gate microprocessor for HEC on a single chip.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!