80960KA EMBEDDED 32-BIT MICROPROCESSOR - Datasheet ...

More documents

Recommendations

Info

80960KApurpose registers provided in other popular microprocessors.The term global refers to the fact that theseregisters retain their contents across procedure calls.The local registers, on the other hand, are procedurespecific. For each procedure call, the 80960KAallocates 16 local registers (R0 through R15). Eachlocal register is 32 bits wide.1.1.4. Multiple Register SetsTo further increase the efficiency of the register set,multiple sets of local registers are stored on-chip (SeeFigure 4). This cache holds up to four local registerframes, which means that up to three procedure callscan be made without having to access the procedurestack resident in memory.Although programs may have procedure calls nestedmany calls deep, a program typically oscillates backand forth between only two to three levels. As aresult, with four stack frames in the cache, the probabilityof having a free frame available on the cachewhen a call is made is very high. In fact, runs of representativeC-language programs show that 80% of thecalls are handled without needing to access memory.If four or more procedures are active and a newprocedure is called, the 80960KA moves the oldestlocal register set in the stack-frame cache to aprocedure stack in memory to make room for a newset of registers. Global register G15 is the framepointer (FP) to the procedure stack.Global registers are not exchanged on a procedurecall, but retain their contents, making them availableto all procedures for fast parameter passing.1.1.5. Instruction CacheTo further reduce memory accesses, the 80960KAincludes a 512-byte on-chip instruction cache. Theinstruction cache is based on the concept of localityof reference; most programs are not usually executedin a steady stream but consist of many branches,loops and procedure calls that lead to jumping backand forth in the same small section of code. Thus, bymaintaining a block of instructions in cache, thenumber of memory references required to readinstructions into the processor is greatly reduced.To load the instruction cache, instructions are fetchedin 16-byte blocks; up to four instructions can befetched at one time. An efficient prefetch algorithmincreases the probability that an instruction willalready be in the cache when it is needed.Code for small loops often fits entirely within thecache, leading to a great increase in processingspeed since further memory references might not benecessary until the program exits the loop. Similarly,when calling short procedures, the code for thecalling procedure is likely to remain in the cache so itwill be there on the procedure’s return.1.1.6. Register ScoreboardingThe instruction decoder is optimized in several ways.One optimization method is the ability to overlapinstructions by using register scoreboarding.Register scoreboarding occurs when a LOAD movesa variable from memory into a register. When theinstruction initiates, a scoreboard bit on the targetregister is set. Once the register is loaded, the bit isreset. In between, any reference to the registercontents is accompanied by a test of the scoreboardbit to ensure that the load has completed beforeprocessing continues. Since the processor does notneed to wait for the LOAD to complete, it can executeadditional instructions placed between the LOAD andthe instruction that uses the register contents, asshown in the following example:ld data_2, r4ld data_2, r5Unrelated instructionUnrelated instructionadd R4, R5, R6In essence, the two unrelated instructions betweenLOAD and ADD are executed “for free” (i.e., take noapparent time to execute) because they are executedwhile the register is being loaded. Up to three loadinstructions can be pending at one time with threecorresponding scoreboard bits set. By exploiting thisfeature, system programmers and compiler writershave a useful tool for optimizing execution speed.5
80960KAONE OF FOURLOCALREGISTER SETSREGISTERCACHELOCAL REGISTER SETR 031 0R 15Figure 4. Multiple Register Sets Are Stored On-Chip1.1.7. High Bandwidth Local BusThe 80960KA CPU resides on a high-bandwidthaddress/data bus known as the local bus (L-Bus). TheL-Bus provides a direct communication path betweenthe processor and the memory and I/O subsysteminterfaces. The processor uses the L-Bus to fetchinstructions, manipulate memory and respond tointerrupts. L-Bus features include:• 32-bit multiplexed address/data path• Four-word burst capability which allows transfersfrom 1 to 16 bytes at a time• High bandwidth reads and writes with66.7 MBytes/s burst (at 25 MHz)Table 3 defines L-bus signal names and functions;Table 4 defines other component-support signalssuch as interrupt lines.1.1.8. Interrupt HandlingThe 80960KA can be interrupted in two ways: by theactivation of one of four interrupt pins or by sending amessage on the processor’s data bus.The 80960KA is unusual in that it automaticallyhandles interrupts on a priority basis and can keeptrack of pending interrupts through its on-chipinterrupt controller. Two of the interrupt pins can beconfigured to provide 8259A-style handshaking forexpansion beyond four interrupt lines.1.1.9. Debug FeaturesThe 80960KA has built-in debug capabilities. Thereare two types of breakpoints and six trace modes.Debug features are controlled by two internal 32-bitregisters: the Process-Controls Word and the Trace-Controls Word. By setting bits in these control words,a software debug monitor can closely control how theprocessor responds during program execution.The 80960KA provides two hardware breakpointregisters on-chip which, by using a special command,can be set to any value. When the instruction pointermatches either breakpoint register value, thebreakpoint handling routine is automatically called.The 80960KA also provides software breakpointsthrough the use of two instructions: MARK andFMARK. These can be placed at any point in aprogram and cause the processor to halt execution atthat point and call the breakpoint handling routine.The breakpoint mechanism is easy to use andprovides a powerful debugging tool.Tracing is available for instructions (single stepexecution), calls and returns and branching. Eachtrace type may be enabled separately by a special6
Page 3 and 4: CONTENTSFigure 24.Figure 25.Figure
Page 6 and 7: 80960KATable 1. 80960KA Instruction
Page 10 and 11: 80960KAdebug instruction. In each c
Page 12 and 13: 80960KABE3:0OO.D.BYTE ENABLE LINES
Page 14 and 15: 80960KA2.3. Connection Recommendati
Page 16 and 17: 80960KATEMP = +22°C@5.5V@5.0V@4.5V
Page 18 and 19: 80960KA2.6. Absolute Maximum Rating
Page 20 and 21: 80960KA2.8.1. AC Specification Tabl
Page 22 and 23: 80960KA1. Clock rise and fall times
Page 24 and 25: 80960KA3.0 MECHANICAL DATA3.1. Pack
Page 26 and 27: 80960KA14 13 12 11 10 9 8 7 6 5 4 3
Page 28 and 29: 80960KA3.2. PinoutTable 9. 80960KA
Page 30 and 31: 80960KATable 11. 80960KA PQFP Pinou
Page 32 and 33: 80960KA3.3. Package Thermal Specifi
Page 34 and 35: 80960KATEMPERATURE ( o C)9085807570
Page 36 and 37: 80960KA4.0 WAVEFORMSFigures 27, 28,
Page 38 and 39: 80960KAT a T w T w T d T w T d T w
Page 40 and 41: 80960KAPREVIOUSCYCLEINTERRUPTACKNOW
Page 42 and 43: 80960KA39

80960KA EMBEDDED 32-BIT MICROPROCESSOR - Datasheet ...

Create successful ePaper yourself

Delete template?

Save as template?