Dual-pipeline heterogeneous ASIP design - School of Computer ...

Figure 9: 2-pipes .vs. 1-pipe 

ness of the created two-pipe processors, and our design a p 

proach, we also implemented those functions with single 

pipe ASIPs produced by ASIPMeister. The verification sys- 

tem for sin le pipe and two-pi e processors are shown in 

Figure 10. %,e that the sirnurated results are for the in- 

struction set after Phase I. All applications used the same 

sample data set for both single and parallel implementations. 

Figure 10: Synthesis/Simulation System 

the moeram in'terms of clock cvcles ICC,). We also o kin 

microns from AMI. 

The simulation results are shown in Table 2. The first col- 

umn gives the name of the application, the second and third 

the area cost of single and arallel implementations respec- 

tively, the fourth and the &th columns give the number of 

clock cycles for execution of the application, and the sixth 

and the seventh gives the switching activity when the appli- 

cation was executed. Eight and ninth columns give the clock 

period for the processors designed (results obtained from 

Synplify ASIC). Tenth, eleventh and the twelfth columns 

give the percentage increase in area, percentage speed im- 

provements, and percentage switching activity reductions. 

These three columns (10,ll and 12) are plotted in Figure 9. 

As can be seen from the table, there is up to 36% im rove- 

ment in speed. The speed improvement is at the cost orsome 

extra chip area. The two-instruction sets in each design 

were created with very little overlap, i.e., few instructions 

were in both pipes. The clock eriod reductions came from 

the decreased controller compyexity. The extra area cost 

mainly comes from the second controller, additional func- 

tional blocks and data buses for parallel processin which 

is common to all dual-pi e designs. Therefore, tfe extra 

area cost is almost same t% all the design, which is around 

16% 

Switching activity reductions are obtained by reduced switch- 

ing in program counter and simplified functional circuitry. 

For exam le, two parallel add instructions can have less 

number ofswitches than the two instructions executed in 

a single pipe, since addition is implemented with an adder 

instead of an ALU in second pipe. 

17 

5. CONCLUSIONS 

In this paper we have described a system which expands 

the design space of ASIPs, by increasing the number of 

pipelines. We allocate load/store operations to one of the 

pipes, and in general ALU operation to the other pipe. If 

there are additional ALU operations which can be paral- 

lelized, then they are spread over the next pipe. We see 

speed improvements of up to 36% and switching activity 

reductions of up to 11%. The additional area costs approx- 

imately 16%. 

6. REFERENCES 

Arctangent processor. ARC International. 

(http://www.arc.com). 

Asip-meister. (http://www.~d~-m~i~t~~.~~g/asi~m~i~t~~/) 

Xtensa processor. Tensilica Inc. (http://www.tensilica.com) 

N. Binh. M. Imai. and Y. Takeuchi. A Derformance 

maximization dgbrithm to design =ips- under the constm.int of 

chip area including ram and rom sine. In ASP-DAC, 1998. 

Nguyen Ngoc Binh, Masaharu Imai, Akichika Shiomi, and 

Nobuyuki Hikichi. A hardware/software partitioning algorithm 

for desi "in ipelined asips with least gate counts. In Pmc. of 

the 33r2 DdE pages 527-532. ACM Press, 1996. 

161 P. Brisk, A. Kaplan, R. Kastner, and M. Suriafiadeh. 

Instruction eneration and re ularit extraction for 

reconfigurade processorb. in &ASEX 2002. 

171 Hoon Chai, Ju Hwan Yi. Jong-Yeol Lee, In-Cheol Park, and 

Chong-Min Kyun Ex biting intellectual pro erties in asi 

desi ns for embehed fz software. In Pmcee&~gs of the hth 

DA& pages 939-944. A M press, 1999. 

181 J.G. Cousin, M. Denoual, D. Saille, and 0. Sentieys. Fast asip 

synthesis and power estimation for ds ap lication. IEEE Sips 

2000 : Deszon and Imolementotmn. gct& 2000. 

191 Kayhan K &ai. An asip design methodology for embedded 

systems. In Pmceedzngs of the seventh internotionol workshop 

on Hardwove/so,fLware codesign, pages 17-21. ACM Press, 

1999. 

Tilman Glokler and Heinrich Meyi. Power reduction for sips: 

A case study. 

M. Jacome, G. de Veciana, and C. Akturan. Resource 

constrained dataflow retimin heuristics for vliw asips. In 

Proceedings of the seventh BODES, pages 12-16. ACM Press, 

1uou 

Margarida F. Jacome, Gustavo de Veciana, and Viktor 

Lapinskii. Exploring performance tradeoffs for clustered vliw 

asi s. In Proceedings of the 2000 ICCAD. pages 504-510. 

IEEE press. 2000. 

hl. K. Jain, L. Wehmeyer, S. Steinke, P. Marwedel, and 

M. Bal-akrishnan. Evaluating register file size in asip design. In 

CODES, 2001. 

R. Kastner, S. Ogrenci-Memik, E. Bozorgzadeh, and 

M. Sar-rafiadeh. Instruction eneration for hybrid 

reconfigurable systems. In IC!CAO, 2001. 

V. Kathail, shail Aditya, R. Schreiber, B. R. Rau, D. C. 

Cron-quist. and M. Sivaraman. Pico: Automatically designing 

CUStom computers. in computer, 2002. 

Akira Kitajima, Makiko Itoh, Jun Sato, Akichika Shiomi, 

Yoshinori Takeuchi, and Masaharu Imai. Effectiveness of the 

asip design system eas iii in design of i dined rocessors. In 

Pmc. of the 2001 hP-DAC, pages 64&4. A& Press, 2001. 

S. Kobayashi, H. Mita, Y. Takeuchi, and M. Irnai. Design space 

exploration for dsp a lications using the asip development 

system peas-iii. I" ABPP, 2002. 

J. Lee, K. Choi, and N. Dutt. Efficient instruction encoding for 

automatic instmction set desifn of configurable asips. In 

ICCAD. znnz. 

~~ ~~~~ 

A. Pyttel, A. Sedlmeier, and C. Veith. Pscp: a scalable parallel 

asip architecture for reactive systems. In Proceedings of DATE, 

.I oases 370-376. IEEE Comouter Societv. ", 1998. 

~~~ 

James E. Smith. Decoupled access/execute computer 

architectures. ACM Trans. Comput. Syst., 2(4):289-308, 1984 

F. Sun, S. Ravi, A. Raghunathan, and N. Jha. Synthesis of 

custom processors based on extenbible platforms. In ICCAD, 

nnnn 

J.-H. Yang, B.-W. Kim, et al. Metacore: an application specific 

dsp development system. In DAC, 1998. 

4. Zhao. B. Mesman, and T. Basten. Practical instruction set 

desi n and corn iler retargetability using statio resource 

mo& ~n DA~E, 2002.

Previous page

Next page

1

2

3

4

5

6

Dual-pipeline heterogeneous ASIP design - School of Computer ...

Create successful ePaper yourself

Delete template?

Save as template?