15.01.2013 Views

U. Glaeser

U. Glaeser

U. Glaeser

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

FIGURE 31.20 Scaling of link performance with process technology.<br />

at the input of the phase detector can compensate for the setup delay. The most accurate compensation<br />

is to use a replica of the receiver as the phase detector since the setup time is inherent to the receiver;<br />

however, this poses challenge because a receiver does not give phase information proportional to the<br />

phase difference. The output only indicates that the loop clock is either earlier or later than an reference<br />

clock transition. An oscillator-based loop is not stable with this type of bang-bang control so only DLLs<br />

can be built. In order to also lock to the input frequency, a clever design uses a dual-loop architecture<br />

that locks to the input frequency using a core loop [21,27,44]. Coarsely-spaced clock phases from the<br />

core loop are interpolated 25 to generate a clock phase that can be finely controlled. This loop clock is<br />

locked to the input phase using a receiver replica phase detector. Using these techniques, phase offsets<br />

can be smaller than 2% of the bit time.<br />

31.5 Conclusion<br />

This chapter has described the design goals and challenges for high-performance I/O. Performance using<br />

2:1 multiplexing of greater than 5 Gb/s has been demonstrated using a 0.18-µm CMOS technology<br />

[16,18,40]. Higher bit-rates have been shown using higher degree of multiplexing and demultiplexing.<br />

Because transistor speeds will scale with technology, link speeds are expected to scale as well. Unfortunately,<br />

noise coupling due to parasitic capacitances and inductances increases with frequency requiring designs<br />

to be even more robust to noise. Designs employ many of the noise reductions techniques described in<br />

this chapter and have continued to scale. Figure 31.20 illustrates the scaling so far. Future designs will need<br />

to improve these noise reducing and filtering techniques. Furthermore, wire bandwidth does not scale<br />

with technology scaling so the compensating for the low-pass filtering will be even more important.<br />

Methods are being researched that can squeeze more bits into existing bandwidth. Given an SNR,<br />

Shannon’s limit shows the maximum channel capacity to be Capacity/f bw = log(1 + SNR). Researchers<br />

are beginning to show that multilevel (4 + PAM) can be encoded in each bit period at the gigabits per<br />

second broadband data rate [10,13]. This and many techniques [37] that have been demonstrated in<br />

phone modems [4,23] can dramatically increase capacity to 10 bits/Hz in extremely noisy conditions,<br />

but all require accurate A/D and D/A converters. Research has shown that they are feasible but require<br />

extremely accurate timing. Low-jitter PLLs that lock accurately to the data phase are critical in maintaining<br />

the resolution at the gigahertz sampling rates.<br />

25 Interpolation takes two clock phases and performs a weighted average to generate an intermediate clock phase<br />

[32,52].<br />

© 2002 by CRC Press LLC<br />

bit time (psec)<br />

2000<br />

1500<br />

1000<br />

[14] [29]<br />

[49]<br />

[11]∗<br />

[9]∗<br />

[21]<br />

500<br />

1 FO-4<br />

0<br />

0.0<br />

[16,18,30]]<br />

[40]<br />

[52]<br />

[52]<br />

0.2 0.4 0.6 0.8<br />

[52]<br />

1.0 1.2<br />

feature size (µm)<br />

[2]<br />

3 FO-4

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!