13.07.2015 Views

Intel® 64 and IA-32 Architectures Optimization Reference Manual

Intel® 64 and IA-32 Architectures Optimization Reference Manual

Intel® 64 and IA-32 Architectures Optimization Reference Manual

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

INTEL® <strong>64</strong> AND <strong>IA</strong>-<strong>32</strong> PROCESSOR ARCHITECTURES2.4.2 Data PrefetchingIntel Core Solo <strong>and</strong> Intel Core Duo processors provide hardware mechanisms toprefetch data from memory to the second-level cache. There are two techniques:1. One mechanism activates after the data access pattern experiences two cachereferencemisses within a trigger-distance threshold (see Table 2-6). Thismechanism is similar to that of the Pentium M processor, but can track 16 forwarddata streams <strong>and</strong> 4 backward streams.2. The second mechanism fetches an adjacent cache line of data after experiencinga cache miss. This effectively simulates the prefetching capabilities of 128-bytesectors (similar to the sectoring of two adjacent <strong>64</strong>-byte cache lines available inPentium 4 processors).Hardware prefetch requests are queued up in the bus system at lower priority thannormal cache-miss requests. If bus queue is in high dem<strong>and</strong>, hardware prefetchrequests may be ignored or cancelled to service bus traffic required by dem<strong>and</strong>cache-misses <strong>and</strong> other bus transactions. Hardware prefetch mechanisms areenhanced over that of Pentium M processor by:• Data stores that are not in the second-level cache generate read for ownershiprequests. These requests are treated as loads <strong>and</strong> can trigger a prefetch stream.• Software prefetch instructions are treated as loads, they can also trigger aprefetch stream.2.5 INTEL ® HYPER-THREADING TECHNOLOGYIntel ® Hyper-Threading Technology (HT Technology) is supported by specificmembers of the Intel Pentium 4 <strong>and</strong> Xeon processor families. The technology enablessoftware to take advantage of task-level, or thread-level parallelism by providingmultiple logical processors within a physical processor package. In its first implementationin Intel Xeon processor, Hyper-Threading Technology makes a single physicalprocessor appear as two logical processors.The two logical processors each have a complete set of architectural registers whilesharing one single physical processor's resources. By maintaining the architecturestate of two processors, an HT Technology capable processor looks like two processorsto software, including operating system <strong>and</strong> application code.By sharing resources needed for peak dem<strong>and</strong>s between two logical processors, HTTechnology is well suited for multiprocessor systems to provide an additional performanceboost in throughput when compared to traditional MP systems.Figure 2-7 shows a typical bus-based symmetric multiprocessor (SMP) based onprocessors supporting HT Technology. Each logical processor can execute a softwarethread, allowing a maximum of two software threads to execute simultaneously onone physical processor. The two software threads execute simultaneously, meaningthat in the same clock cycle an “add” operation from logical processor 0 <strong>and</strong> another2-37

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!