05.12.2012 Views

NASA Scientific and Technical Aerospace Reports

NASA Scientific and Technical Aerospace Reports

NASA Scientific and Technical Aerospace Reports

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

operation of LSI is pursuited to achieve real time processing. Continuous speech recognition with an approximate vocabulary<br />

of 1000 words can be achieved using only 20 to 30 of these PE-LSIs.<br />

Author<br />

Linear Arrays; Very Large Scale Integration; Speech Recognition<br />

20060001689 Florida Univ., FL, USA<br />

A Reconfigurable Binary/RNS/LNS Architecture for DSP<br />

Taylor, Fred J.; IEEE International Conference on Acoustics, Speech, <strong>and</strong> Signal Processing (ICASSP ‘87); Volume 1; 1987,<br />

pp. 503-506; In English; See also 20060001583; Copyright; Avail.: Other Sources<br />

An integrated digital signal processing machine will accept data from a plethera of sensors <strong>and</strong>/or subsystems which may<br />

be running at different speeds <strong>and</strong> with different precision/dynamic range metrics. Data would be manipulated using a set of<br />

algorithms which perform a variety of filtering or transform tasks. The design of such a machine considered in this paper,<br />

integrates recent advancements in mesh array synthesis with processor technology. The result is a high-throughput GIPS class<br />

DSP machine capable of responding to a wide mix of system <strong>and</strong> user defined DSP problems.<br />

Author<br />

Binary Codes; Signal Processing; Architecture (Computers); Digital Systems<br />

20060001692 Motorola, Inc., Schaumburg, IL, USA<br />

A Cascadable Adaptive FIR Filter VLSI IC<br />

Borth, David E.; Gerson, Ira A.; Haug, John R.; IEEE International Conference on Acoustics, Speech, <strong>and</strong> Signal Processing<br />

(ICASSP ‘87); Volume 1; 1987, pp. 515-518; In English; See also 20060001583; Copyright; Avail.: Other Sources<br />

This paper describes the architecture <strong>and</strong> features of the Motorola DSP56200, an algorithm-specific cascadable digital<br />

signal processing peripheral designed to perform the computationally intensive tasks associated with FIR <strong>and</strong> adaptive FIR<br />

digital filtering applications. The DSP56200 is implemented in high performance, low power 1.51 micron HCMOS technology<br />

<strong>and</strong> is available in a 28 pin DIP package. The on-chip computation unit includes a 97.5 ns 24x16-bit multiplier with a 40-bit<br />

accumulator, a 256x24-bit coefficient RAM,<strong>and</strong> a 256x16-bit data RAM. Three modes of operation allow the part to be used<br />

as a single FIR filter, a dual FIR filter, or a single adaptive FIR filter, with up to 256 taps/chip. In the adaptive FIR filter mode,<br />

the part performs the FIR filtering <strong>and</strong> LMS coefficient update operations for a single tap in 195 ns, permitting use of the part<br />

as a 19kHz sampling rate. 256 tap adaptive FIR filter, Programmable DC tan coefficient leakage <strong>and</strong> adaption coefficient<br />

parameters in the adaptive FIR mode allow the DSP56200 to be used in a wide variety of adaptive FIR filtering applications.<br />

The performance of the part in an echo canceller configuration will be presented. Typical applications of the part will also be<br />

described.<br />

Author<br />

Cascade Control; Architecture (Computers); Signal Processing; Chips; Adaptive Filters<br />

20060001819 Duke Univ., Durham, NC USA<br />

Load Latency Tolerance in Dynamically Scheduled Processors<br />

Srinivasan, Srikanth T.; Lebeck, Alvin R.; Jan. 1, 2005; 13 pp.; In English<br />

Contract(s)/Grant(s): DABT63-98-1-0001; CDA-97-2637<br />

Report No.(s): AD-A440304; No Copyright; Avail.: Defense <strong>Technical</strong> Information Center (DTIC)<br />

This paper provides quantitative measurements of load latency tolerance in a dynamically scheduled processor. To<br />

determine the latency tolerance of each memory load operation, our simulations use flexible load completion policies instead<br />

of a fixed memory hierarchy that dictates the latency. Although our policies delay load completion as long as possible, they<br />

produce performance (instructions committed per cycle (IPC)) comparable to an ideal memory system where all loads<br />

complete in one cycle. Our measurements reveal that to produce IPC values within 8% of the ideal memory system, between<br />

1% <strong>and</strong> 62% of loads need to be satisfied within a single cycle <strong>and</strong> that up to 84% can be satisfied in as many as 32 cycles,<br />

depending on the benchmark <strong>and</strong> processor configuration. Load latency tolerance is largely determined by whether an<br />

unpredictable branch is in the load s data dependence graph <strong>and</strong> the depth of the dependence graph. Our results also show that<br />

up to 36% of all loads miss in the level one cache yet have latency dem<strong>and</strong>s lower than second level cache access times. We<br />

also show that up to 37% of loads hit in the level one cache even though they possess enough latency tolerance to be satisfied<br />

by lower levels of the memory hierarchy.<br />

DTIC<br />

Computer Storage Devices; Loads (Forces)<br />

126

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!