13.07.2015 Views

Intel® 64 and IA-32 Architectures Optimization Reference Manual

Intel® 64 and IA-32 Architectures Optimization Reference Manual

Intel® 64 and IA-32 Architectures Optimization Reference Manual

SHOW MORE
SHOW LESS
  • No tags were found...

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

APPENDIX BUSING PERFORMANCE MONITORING EVENTSPerformance monitoring events provide facilities to characterize the interactionbetween programmed sequences of instructions <strong>and</strong> microarchitectural subsystems.Performance monitoring events are described in Chapter 18 <strong>and</strong> AppendixA of the Intel® <strong>64</strong> <strong>and</strong> <strong>IA</strong>-<strong>32</strong> <strong>Architectures</strong> Software Developer’s <strong>Manual</strong>, Volume 3B.The first part of this chapter provides information on how to use performance eventsspecific to processors based on the Intel NetBurst microarchitecture. Section B.5discusses similar topics for performance events available on Intel Core Solo <strong>and</strong> IntelCore Duo processors.B.1 PENTIUM ® 4 PROCESSOR PERFORMANCE METRICSThe descriptions of Intel Pentium 4 processor performance metrics use terminologythat is specific to the Intel NetBurst microarchitecture <strong>and</strong> to implementations in thePentium 4 <strong>and</strong> Intel Xeon processors. The performance metrics in Table B-1 throughTable B-13 apply to processors with a CPUID signature that matches family encoding15, mode encoding 0, 1, 2, 3, 4, or 6. Several new performance metrics are availableto <strong>IA</strong>-<strong>32</strong> processors with a CPUID signature that matches family encoding 15, modeencoding 3; the new metrics are listed in Table B-11.The performance metrics listed in Tables B-1 through B-7 may be applicable toprocessors that support HT Technology. See Appendix B.4, “Using PerformanceMetrics with Hyper-Threading Technology.”B.1.1Pentium ® 4 Processor-Specific TerminologyB.1.1.1Bogus, Non-bogus, RetireBranch mispredictions incur a large penalty on microprocessors with deep pipelines.In general, the direction of branches can be predicted with a high degree of accuracyby the front end of the Intel Pentium 4 processor, such that most computations canbe performed along the predicted path while waiting for the resolution of the branch.In the event of a misprediction, instructions <strong>and</strong> μops that were scheduled to executealong the mispredicted path must be cancelled. These instructions <strong>and</strong> μops arereferred to as bogus instructions <strong>and</strong> bogus μops. A number of Pentium 4 processorperformance monitoring events, for example, instruction_ retired <strong>and</strong> μops_retired,can count instructions or mops that are retired based on the characterization ofbogus versus non-bogus.Vol. 1 B-1

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!