23.08.2013 Views

MNEMEE - Electronic Systems - Technische Universiteit Eindhoven

MNEMEE - Electronic Systems - Technische Universiteit Eindhoven

MNEMEE - Electronic Systems - Technische Universiteit Eindhoven

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

• Read energy: the (average) energy needed for a read access. As for cycles, each access mode<br />

can have a different figure.<br />

• Write energy: the (average) energy needed for a write access. This is similar to the energy<br />

needed for a read access.<br />

• Static power consumption of a single memory module.<br />

• Maximum size of a memory module. If a memory partition of this type has a larger word<br />

depth, it is must consist of multiple modules.<br />

Processor partitions<br />

A processor partition represents a set of processors (possibly only one) with similar characteristics and<br />

that share their connections to the memory partitions.<br />

DMA controllers<br />

The transfers of data between different memory layers (i.e., the loading and storing of copies), can<br />

often be handled very efficiently by a direct memory access (DMA) controller. A DMA controller can<br />

typically perform transfers independently, while the processor is processing other data. Data re-use can<br />

be better exploited such DMA controllers is present in a platform. Only a high-level model of the<br />

controllers is needed. Its specification should include the following properties:<br />

• Possible source and destination memory partitions for transfers<br />

• Maximum number of transfers that can be handled in parallel by the unit.<br />

• Cycle cost of a DMA transfer. The cost can be derived from 4 parameters:<br />

o Processor overhead: the number of cycles it takes for the processor to set up a DMA<br />

transfer, and during which it cannot perform other processing.<br />

o DMA overhead: the number of cycles it takes for the DMA unit to prepare a whole<br />

transfer, after it has received the necessary parameters from the processor.<br />

o Line setup: the number of cycles it takes for the DMA unit to prepare the transfer of a<br />

line of data. A line of data consists of a set of elements that can be transferred at once.<br />

Usually, they are stored in adjacent addresses, but they could be separated by a certain<br />

stride.<br />

o Line gap: the number of cycles it takes for the DMA unit to switch to another line.<br />

Connections<br />

To optimize the allocation and assignment of copy candidates to memory partitions, it should be<br />

known, which transfers the architecture allows. For instance, it needs to be known whether there is a<br />

connection between memory partitions that the DMA controller can use to transfer data. Also, the<br />

memory partitions which are directly accessible by the processor partition need to be known. It is<br />

possible that all accesses to a background memory must pass via an on-chip local memory, for<br />

instance, because the processor has no direct access to the background memory.<br />

Copy selection and assignment<br />

In the assignment phase, a set of copy-candidates is selected and assigned to the memory layers such<br />

that an optimal solution is obtained.<br />

Public Page 52 of 87

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!