general compiler directives
affecting data prefetches
for auto-parallelization
for inlining functions
for profile-guided optimization
for vectorization
Vectorization Overview
Key Programming Guidelines for Vectorization
instrumented code
processor-specific code
profile-optimized executable
profiling information
reports
high performance
high performance programming
applications for
dispatch options for
options for
processors for
Parallelism Overview
Automatic Processor-specific Optimization (IA-32 Architecture)
report generation
high-level optimization
high-level optimizer
HLO Overview
Optimizer Report Generation
HLO
HLO Overview
High-Level Optimization (HLO) Report
reports
hotspots
Hyper-Threading Technology
parallel loops
thread pools
IA-32 architecture based applications
HLO
methods of parallelization
options
Targeting IA-32 and Intel(R) 64 Architecture Processors Automatically
Targeting Multiple IA-32 and Intel(R) 64 Architecture Processors for Run-time Performance
targeting
Targeting IA-32 and Intel(R) 64 Architecture Processors Automatically
Targeting Multiple IA-32 and Intel(R) 64 Architecture Processors for Run-time Performance
IA-64 architecture based applications
auto-vectorization in
HLO
methods of parallelization
options
report generation
targeting
ILO
inlining
Controlling Inline Expansion of User Functions
Profile-guided Optimizations Overview
User Directed Inline Expansion of User Functions
Inline Function Expansion
compiler directed
developer directed
preemption
instruction-level parallelism
instrumentation
compilation
execution
feedback compilation
generating
program
Intel(R) 64 architecture based applications
HLO
methods of parallelization
options
Targeting IA-32 and Intel(R) 64 Architecture Processors Automatically
Targeting Multiple IA-32 and Intel(R) 64 Architecture Processors for Run-time Performance
targeting
Targeting IA-32 and Intel(R) 64 Architecture Processors Automatically
Targeting Multiple IA-32 and Intel(R) 64 Architecture Processors for Run-time Performance
Intel(R) compatibility libraries for OpenMP*
Intel(R) extension environment variables
Intel(R) extension routines
Intel(R) linking tools
INTEL_PROF_DUMP_CUMULATIVE environment variable
INTEL_PROF_DUMP_INTERVAL environment variable
intermediate language scalar optimizer
intermediate representation (IR)
Using IPO
Interprocedural Optimization (IPO) Overview
interprocedural optimizations
Controlling Inline Expansion of User Functions
Profile-guided Optimizations Overview
Optimizer Report Generation
capturing intermediate output
code layout
compilation
compiling
considerations
creating libraries
initiating
issues
large programs
linking
Interprocedural Optimization (IPO) Overview
Using IPO
options
overview
performance
reports
using
whole program analysis
xiar
xild
xilibtool
intrinsics
introduction to Optimizing Applications
IR
Using IPO
Interprocedural Optimization (IPO) Overview
IVDEP
effect when tuning applications
KMP_AFFINITY
Thread Affinity Interface (Linux* and Windows*)
OpenMP* Environment Variables
modifier
offset
permute
type
KMP_ALL_THREADS
KMP_BLOCKTIME
KMP_LIBRARY
OpenMP* Environment Variables
OpenMP* Support Libraries
KMP_MONITOR_STACKSIZE
KMP_STACKSIZE
KMP_VERSION
language support
__declspec
libraries
OpenMP* run-time routines
OpenMP* Run-time Library Routines
Intel Extension Routines/Functions
library functions
Intel extension
OpenMP* run-time routines
linking tools
xild
Interprocedural Optimization (IPO) Overview
Creating a Library from IPO Objects
IPO-Related Performance Issues
xilibtool
xilink
Interprocedural Optimization (IPO) Overview
IPO-Related Performance Issues
linking with IPO
load balancing
lock routines
loop interchange
loop scheduling
loop unrolling
limitations of
using the HLO optimizer
Optimizer Report Generation
HLO Overview
loops
anti dependency
constructs
dependencies
distribution
flow dependency
independence
interchange
HLO Overview
Applying Optimization Strategies
Understanding Runtime Performance
manual transformation
output dependency
parallelization
Loop Parallelization and Vectorization
Programming with Auto-parallelization
Parallelism Overview
reductions
transformations
HLO Overview
Understanding Runtime Performance
unrolling
vectorization
maintainability
allocation
manual dispatch
cpu_dispatch
cpu_specific
manual transformations
memory aliasing
mixing vectorizable types in a loop
MMX(TM)
mock object files
multithreaded programs
Auto-parallelization Overview
Parallelism Overview
multithreading
OpenMP* Support Libraries
Programming with Auto-parallelization
non-unit memory access
NOPREFETCH
using
OMP directives
Parallelism Overview
Examples of OpenMP* Usage
advanced issues
cluster
Cluster OpenMP* Support (Linux*)
Thread Affinity Interface (Linux* and Windows*)
compatibility with other compilers
compatibility libraries
OpenMP* Support Libraries
OpenMP* Source Compatibility and Interoperability with Other Compilers
debugging
directives
environment variables
OpenMP* Environment Variables
Thread Affinity Interface (Linux* and Windows*)
guidelines for using libraries
KMP_AFFINITY
OpenMP* Environment Variables
Thread Affinity Interface (Linux* and Windows*)
KMP_ALL_THREADS
KMP_BLOCKTIME
KMP_LIBRARY
KMP_MONITOR_STACKSIZE
KMP_STACKSIZE
KMP_VERSION
legacy libraries
OpenMP* Support Libraries
OpenMP* Source Compatibility and Interoperability with Other Compilers
library file names
load balancing
loop scheduling
object-level interoperability
omp.h
OMP_DYNAMIC
OMP_NESTED
OMP_NUM_THREADS
OMP_SCHEDULE
parallel processing thread model
performance
reports
run-time library routines
source compatibility
support libraries
task
taskq
threading
worksharing
OMP_DYNAMIC
OMP_NESTED
OMP_NUM_THREADS
OMP_SCHEDULE
OpenMP* Fortran directives
clauses for
examples of
Intel extensions for
optimal records to improve performance
analyzing applications
application-specific
hardware-related
library-related
methodology
options
Optimizing Different Application Types Overview
Enabling Automatic Optimizations
restricting
setting
OS-related
reports
resources
restricting
strategies
system-related
optimization support
optimizations
Restricting Optimizations
Optimizing Applications Overview
Enabling Automatic Optimizations
compilation process
for specific processors
high-level language
options for IA-32 architecture
options for IA-64 architecture
options for Intel(R) 64 architecture
overview of
Automatic Optimizations Overview
Profile-guided Optimizations Overview
parallelization
PGO methodology
profile-guided
profile-guided optimization
optimizer report generation
optimizing
applications
helping the compiler
overview
technical applications
optimizing performance
options for IA-32 architectures
options used for IPO
OptReport support
ORDERED
overview of OpenMP* directives and clauses
overflow
call to a runtime library routine
overview
of optimizing for specific processors
of parallelism