MNEMEE - Electronic Systems - Technische Universiteit Eindhoven

With modern applications, the streams and their encodings can be very dynamic. Smart compression, 

encoding and scalability features make these streams less regular then they used to be. Furthermore, 

these streams are typically part of a larger application. Other parts of such an application may be 

event-driven and interact with the stream components. Consider for example an imaginary game 

application shown in Figure 1, which is taken from [25]. This game includes modes of 3-dimensional 

game play with streaming video based modes. The rendering pipeline, used in the 3D mode, is a 

dynamic streaming application. Characters or objects may enter or leave the scene because of player 

interaction, rendering parameters may be adapted to achieve the required frame rates. Overlay graphics 

(for instance text or scores) may change. This happens under control of the event-driven game control 

logic. At the core of the application, the streaming kernels, a lot of intensive pixel based operations are 

required to perform the various texture mapping or video filtering operations. These operations are 

performed on a stream of data items. At each point in time, only a small number of data items from the 

stream are being processed. The order in which these data items are accessed has typically little 

variation. This makes it possible to optimize the memory access behaviour of the streaming kernels to 

achieve the required performance while minimising the energy consumption. 

Future embedded systems are not only characterized by their increasingly dynamic behaviour due to 

different operations modes that may occur at run-time, but also by their dynamism in memory 

requirements. Next generation embedded multimedia systems will have intensive data transfer and 

storage requirements. Partially these requirements will be static, but partially they will also be 

changing at run-time. Therefore, efficient memory management and optimization techniques are 

needed to increase system performance and decreased cost in energy consumption and memory 

footprint due to the larger memory needs of future applications. It is essential that these memory 

management and optimization techniques are supported by design automation tools, because due to the 

very complex nature of the source code, it would be impossible to apply them manually in a 

reasonable time frame. The automation tools should be able to optimize both statically and 

dynamically allocated data, in order to cope with design-time needs and adapt to run-time memory 

needs (scalable multimedia input changing at run-time, unpredictable network traffic, etc.) in the most 

efficient way. A design flow that realizes these objectives will be developed within MNEMEE. An 

overview of the preliminary flow is shown in Figure 2 (The final flow will be presented in D5.3). The 

flow takes as input the source code of an application and optimizes the memory behaviour of the 

application. The source code of the optimized application is the output of this flow. The first step of 

the flow models the source code as a task graph. The output of this step, and all other steps, is a 

combination of source code and models that are transferred to the next step. In this step, the task graph 

is mapped onto the processing elements that are available in the hardware platform. There will be two 

different mapping options available within the MNEMEE flow. The first option, called scenariobased, 

takes the dynamic behaviour of the application into account when mapping it onto the platform. 

The second option, called memory-aware, considers the memory hierarchy in the platform. After the 

mapping, the static and dynamic access behaviour of the application is optimized in the steps labelled 

‘parallelization implementation’ and ‘dynamic memory management’. Finally, additional memory 

optimizations are applied on a per processor element basis. These optimizations are based on existing 

single processor scratchpad optimization techniques [49]. 

The remainder of this document presents a number of analysis approaches that can be used, within the 

context of the design flow, to identify memory access behaviour and dynamic behaviour in 

applications. Each of these analysis approaches focuses on different sources of dynamism in 

multimedia applications. Section 7 presents techniques to analyze and exploit the dynamically 

changing resource requirements of applications. These analysis techniques are part of the scenariobased 

approach that can be used within the ‘task graph to processing element mapping’ step of the 

design flow. The memory-aware approach that can also be used in this step will be developed within 

WP4 and described in D4.2. Section 8 presents techniques to analyze the access behaviour to statically 

allocated data objects. This section discusses the step labelled ‘parallelization implementation’ in the 

design flow of Figure 2. Section 9 focuses on the access behaviour to dynamically allocated data 

objects. This section discusses the details of the step labelled ‘dynamic memory management 

optimizations’. Finally, Section 10 concludes this deliverable. 

Public Page 10 of 87

Previous page

Next page

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

61

62

63

64

65

66

67

68

69

70

71

72

73

74

75

76

77

78

79

80

81

82

83

84

85

86

87

MNEMEE - Electronic Systems - Technische Universiteit Eindhoven

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?