MNEMEE - Electronic Systems - Technische Universiteit Eindhoven
MNEMEE - Electronic Systems - Technische Universiteit Eindhoven
MNEMEE - Electronic Systems - Technische Universiteit Eindhoven
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
• Read energy: the (average) energy needed for a read access. As for cycles, each access mode<br />
can have a different figure.<br />
• Write energy: the (average) energy needed for a write access. This is similar to the energy<br />
needed for a read access.<br />
• Static power consumption of a single memory module.<br />
• Maximum size of a memory module. If a memory partition of this type has a larger word<br />
depth, it is must consist of multiple modules.<br />
Processor partitions<br />
A processor partition represents a set of processors (possibly only one) with similar characteristics and<br />
that share their connections to the memory partitions.<br />
DMA controllers<br />
The transfers of data between different memory layers (i.e., the loading and storing of copies), can<br />
often be handled very efficiently by a direct memory access (DMA) controller. A DMA controller can<br />
typically perform transfers independently, while the processor is processing other data. Data re-use can<br />
be better exploited such DMA controllers is present in a platform. Only a high-level model of the<br />
controllers is needed. Its specification should include the following properties:<br />
• Possible source and destination memory partitions for transfers<br />
• Maximum number of transfers that can be handled in parallel by the unit.<br />
• Cycle cost of a DMA transfer. The cost can be derived from 4 parameters:<br />
o Processor overhead: the number of cycles it takes for the processor to set up a DMA<br />
transfer, and during which it cannot perform other processing.<br />
o DMA overhead: the number of cycles it takes for the DMA unit to prepare a whole<br />
transfer, after it has received the necessary parameters from the processor.<br />
o Line setup: the number of cycles it takes for the DMA unit to prepare the transfer of a<br />
line of data. A line of data consists of a set of elements that can be transferred at once.<br />
Usually, they are stored in adjacent addresses, but they could be separated by a certain<br />
stride.<br />
o Line gap: the number of cycles it takes for the DMA unit to switch to another line.<br />
Connections<br />
To optimize the allocation and assignment of copy candidates to memory partitions, it should be<br />
known, which transfers the architecture allows. For instance, it needs to be known whether there is a<br />
connection between memory partitions that the DMA controller can use to transfer data. Also, the<br />
memory partitions which are directly accessible by the processor partition need to be known. It is<br />
possible that all accesses to a background memory must pass via an on-chip local memory, for<br />
instance, because the processor has no direct access to the background memory.<br />
Copy selection and assignment<br />
In the assignment phase, a set of copy-candidates is selected and assigned to the memory layers such<br />
that an optimal solution is obtained.<br />
Public Page 52 of 87