15.01.2013 Views

U. Glaeser

U. Glaeser

U. Glaeser

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

FIGURE 13.4<br />

TABLE 13.1<br />

combinational logic circuits can be evaluated. In the end, final device sizings, placements, wire level<br />

assignments, buffer sizing and placement, and indeed some circuit designs will be adjusted, as the design<br />

becomes fixed. The final process is aided by device tuning, extraction, simulation, and static timing tools.<br />

In high-frequency designs, the accurate analysis of the wiring becomes critical. Because the designer<br />

is only able to reduce to a limited extent the amount of the cycle time, which must be allocated to clock<br />

uncertainty, latching overhead, and the minimum required function (e.g., a 32-bit addition/subtraction<br />

and muxing of a logical result) pressure to minimize time in signal distribution, is intense. To date, the<br />

shortest technology-independent cycle time for a 64-bit processor, 14.5 FO4 inverter delays, has been an<br />

5<br />

IBM research prototype. The cycle time for this design was allocated according to the budget presented<br />

in Table 13.1.<br />

The time available for distribution of results limits the placement of communicating macros. For results,<br />

which must be driven out of a macro through a wire into a remote receiving latch, an inverter was placed<br />

at the macro output to provide gain to drive the wire and another inverter was placed at the input of the<br />

9<br />

latch to isolate the dynamic multiplexor from any noise on the wire. Thus, the distribution wires could<br />

only be at most about 3.5 mm and thus the core of such a short cycle design with full forwarding must<br />

be quite small. For full-cycle latch-to-latch transfers, the wires were limited to about 10 mm. Figure 13.4<br />

presents the floorplan of the processor with a representative full-cycle latch-to-latch transfer path of<br />

6.5 mm from the fixed point instruction register to the floating-point decoder latches. Figure 13.5 presents<br />

a portion of the fixed point data-flow, FXU, with a representative maximum length forwarding wire of<br />

© 2002 by CRC Press LLC<br />

Gigahertz PowerPC Delay Allocation<br />

Function Delay (ps) FO4 Function Delay (ps) FO4<br />

Mux-latch Clk-Q 200 2.9 Mux-latch Clk-Q 200 2.9<br />

Control logic 470 6.8 Datapath logic 610 8.8<br />

Control distribution 140 2.0 Datapath distribution 140 2.0<br />

Control latch setup 140 2.0 Datapath latch setup 0 0<br />

Clock jitter/skew 50 0.7 Clock jitter/skew 50 0.7<br />

A 1 GHz PowerPC processor floorplan with representative global signal.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!