
general compiler directives
    affecting data prefetches
    for auto-parallelization
    for inlining functions
    for profile-guided optimization
    for vectorization
       Vectorization Overview
       Key Programming Guidelines for Vectorization
    instrumented code
    processor-specific code
    profile-optimized executable
    profiling information
    reports



high performance
high performance programming
    applications for
    dispatch options for
    options for
    processors for
       Parallelism Overview
       Automatic Processor-specific Optimization (IA-32 Architecture)
    report generation
high-level optimization
high-level optimizer
   HLO Overview
   Optimizer Report Generation
HLO
   HLO Overview
   High-Level Optimization (HLO) Report
    reports
hotspots
Hyper-Threading Technology
    parallel loops
    thread pools



IA-32 architecture based applications
    HLO
    methods of parallelization
    options
       Targeting IA-32 and Intel(R) 64 Architecture Processors Automatically
       Targeting Multiple IA-32 and Intel 64 Architecture Processors for Run-time Performance
    targeting
       Targeting IA-32 and Intel(R) 64 Architecture Processors Automatically
       Targeting Multiple IA-32 and Intel 64 Architecture Processors for Run-time Performance
IA-64 architecture based applications
    auto-vectorization in
    HLO
    methods of parallelization
    options
    report generation
    targeting
ILO
inlining
   Controlling Inline Expansion of User Functions
   Profile-guided Optimizations Overview
   User Directed Inline Expansion of User Functions
   Inline Function Expansion
    compiler directed
    developer directed
    preemption
instruction-level parallelism
instrumentation
    compilation
    execution
    feedback compilation
    generating
    program
Intel(R) 64 architecture based applications
    HLO
    methods of parallelization
    options
       Targeting IA-32 and Intel(R) 64 Architecture Processors Automatically
       Targeting Multiple IA-32 and Intel 64 Architecture Processors for Run-time Performance
    targeting
       Targeting IA-32 and Intel(R) 64 Architecture Processors Automatically
       Targeting Multiple IA-32 and Intel 64 Architecture Processors for Run-time Performance
Intel(R) compatibility libraries for OpenMP*
Intel(R) extension environment variables
Intel(R) extension routines
Intel(R) linking tools
INTEL_PROF_DUMP_CUMULATIVE environment variable
INTEL_PROF_DUMP_INTERVAL environment variable
intermediate language scalar optimizer
intermediate representation (IR)
   Using IPO
   Interprocedural Optimization (IPO) Overview
interprocedural optimizations
   Controlling Inline Expansion of User Functions
   Profile-guided Optimizations Overview
   Optimizer Report Generation
    capturing intermediate output
    code layout
    compilation
    compiling
    considerations
    creating libraries
    initiating
    issues
    large programs
    linking
       Interprocedural Optimization (IPO) Overview
       Using IPO
    options
    overview
    performance
    reports
    using
    whole program analysis
    xiar
    xild
    xilibtool
intrinsics
introduction to Optimizing Applications
IR
   Using IPO
   Interprocedural Optimization (IPO) Overview
IVDEP
    effect when tuning applications



KMP_AFFINITY
   Thread Affinity Interface (Linux* and Windows*)
   OpenMP* Environment Variables
    modifier
    offset
    permute
    type
KMP_ALL_THREADS
KMP_BLOCKTIME
KMP_LIBRARY
   OpenMP* Environment Variables
   OpenMP* Support Libraries
KMP_MONITOR_STACKSIZE
KMP_STACKSIZE
KMP_VERSION



language support
    __declspec
libraries
    OpenMP* run-time routines
       OpenMP* Run-time Library Routines
       Intel Extension Routines/Functions
library functions
    Intel extension
    OpenMP* run-time routines
linking tools
    xild
       Interprocedural Optimization (IPO) Overview
       Creating a Library from IPO Objects
       IPO-Related Performance Issues
    xilibtool
    xilink
       Interprocedural Optimization (IPO) Overview
       IPO-Related Performance Issues
linking with IPO
load balancing
lock routines
loop interchange
loop scheduling
loop unrolling
    limitations of
    using the HLO optimizer
       Optimizer Report Generation
       HLO Overview
loops
    anti dependency
    constructs
    dependencies
    distribution
    flow dependency
    independence
    interchange
       HLO Overview
       Applying Optimization Strategies
       Understanding Runtime Performance
    manual transformation
    output dependency
    parallelization
       Loop Parallelization and Vectorization
       Programming with Auto-parallelization
       Parallelism Overview
    reductions
    transformations
       HLO Overview
       Understanding Runtime Performance
    unrolling
    vectorization



maintainability
    allocation
manual dispatch
    cpu_dispatch
    cpu_specific
manual transformations
memory aliasing
mixing vectorizable types in a loop
MMX(TM)
mock object files
multithreaded programs
   Auto-parallelization Overview
   Parallelism Overview
multithreading
   OpenMP* Support Libraries
   Programming with Auto-parallelization



non-unit memory access
NOPREFETCH
    using



OMP directives
   Parallelism Overview
   Examples of OpenMP* Usage
    advanced issues
    cluster
       Cluster OpenMP* Support (Linux*)
       Thread Affinity Interface (Linux* and Windows*)
    compatibility with other compilers
    compatibility libraries
       OpenMP* Support Libraries
       OpenMP* Source Compatibility and Interoperability with Other Compilers
    debugging
    directives
    environment variables
       OpenMP* Environment Variables
       Thread Affinity Interface (Linux* and Windows*)
    guidelines for using libraries
    KMP_AFFINITY
       OpenMP* Environment Variables
       Thread Affinity Interface (Linux* and Windows*)
    KMP_ALL_THREADS
    KMP_BLOCKTIME
    KMP_LIBRARY
    KMP_MONITOR_STACKSIZE
    KMP_STACKSIZE
    KMP_VERSION
    legacy libraries
       OpenMP* Support Libraries
       OpenMP* Source Compatibility and Interoperability with Other Compilers
    library file names
    load balancing
    loop scheduling
    object-level interoperability
    omp.h
    OMP_DYNAMIC
    OMP_NESTED
    OMP_NUM_THREADS
    OMP_SCHEDULE
    parallel processing thread model
    performance
    reports
    run-time library routines
    source compatibility
    support libraries
    task
    taskq
    threading
    worksharing
OMP_DYNAMIC
OMP_NESTED
OMP_NUM_THREADS
OMP_SCHEDULE
OpenMP* Fortran directives
    clauses for
    examples of
    Intel extensions for
optimal records to improve performance
    analyzing applications
    application-specific
    hardware-related
    library-related
    methodology
    options
       Optimizing Different Application Types Overview
       Enabling Automatic Optimizations
        restricting
        setting
    OS-related
    reports
    resources
    restricting
    strategies
    system-related
optimization support
optimizations
   Restricting Optimizations
   Optimizing Applications Overview
   Enabling Automatic Optimizations
    compilation process
    for specific processors
    high-level language
    options for IA-32 architecture
    options for IA-64 architecture
    options for Intel(R) 64 architecture
    overview of
       Automatic Optimizations Overview
       Profile-guided Optimizations Overview
    parallelization
    PGO methodology
    profile-guided
    profile-guided optimization
optimizer report generation
optimizing
    applications
    helping the compiler
    overview
    technical applications
optimizing performance
options for IA-32 architectures
options used for IPO
OptReport support
ORDERED
    overview of OpenMP* directives and clauses
overflow
    call to a runtime library routine
overview
    of optimizing for specific processors
    of parallelism

