<<

packed structures
parallel control structures
PARALLEL DO
    summary of OpenMP* directives and clauses
parallel invocations with makefile
PARALLEL OpenMP* directive
parallel processing
    thread model
parallel programming
   Parallelism Overview
   Optimizing Applications Overview
parallel regions
PARALLEL SECTIONS
    summary of OpenMP* directives
parallelism
   Parallelism: an Overview
   OpenMP* Run-time Library Routines
   Auto-parallelization Overview
parallelization
   Parallelization Overview
   Loop Parallelization and Vectorization
   Auto-parallelization Overview
   Programming with Auto-parallelization
    diagnostic
performance analyzer
performance issues with IPO
PGO
PGO API
    _PGOPTI_Prof_Dump_And_Reset
    _PGOPTI_Prof_Reset
    _PGOPTI_Set_Interval_Prof_Dump
    enable
PGO tools
    code coverage tool
    profmerge
    proforder
    test prioritization tool
pgopti.dpi file
   PGO Environment Variables
   Basic PGO Options
pgopti.spi file
pgouser.h header file
pipelining
pragmas
    capturedprivate
    ivdep
    novector
    restrict
    task
       Intel Workqueuing Model Overview
       Workqueuing Constructs
    taskq
       Intel Workqueuing Model Overview
       Workqueuing Constructs
       Workqueuing Example Function
    using with vectorization
    vector
    vector always
    vector nontemporal
pragmas for language support
preempting functions
PREFETCH
    options used for
    using
prefetches of data
    optimizations for
prioritizing application tests
    manual dispatch
PRIVATE
    summary of data scope attribute clauses
private data
processors
    targeting IA-32 architecture processors using options
    targeting IA-64 architecture processors using options
    targeting Intel(R) 64 architecture processors using options
PROF_DIR environment variable
PROF_DUMP_INTERVAL environment variable (deprecated)
PROF_NO_CLOBBER environment variable
profile-guided optimization
    API support
    data ordering optimization
    dumping profile information
    environment variables
    example of
    function grouping optimization
    function order lists optimization
    function ordering optimization
    interval profile dumping
    options
    overview
    phases
    resetting dynamic profile counters
    resetting profile information
    support
    usage model
profile-optimized code
   Basic PGO Options
   PGO API Support Overview
    dumping
       Dumping Profile Information
       Interval Profile Dumping
    generating information
    resetting dynamic counters for
profmerge
program loops
    parallel processing model



quick reference
   Interprocedural Optimization (IPO) Quick Reference
   Compiler Reports Quick Reference
   Enabling Automatic Optimizations
    automatic optimizations
    compiler reports
    IPO options
    PGO options



REDUCTION
    summary of data scope attribute clauses
reductions
reductions in loops
register allocation
report generation
    dynamic profile counters
    Intel extension
    OpenMP* run-time
    profile information
    timing
    using compiler commands
    using xi* tools
report software pipelining (SWP)
restrict
restricting optimization
run-time checking



sample of timing
scheduling loops
shared data
shared scalars
SMP systems
software pipelining
   Parallelism Overview
   SWP Reports
    reports
specialized code
   Automatic Processor-specific Optimization (IA-32 Architecture Only)
   Parallelism Overview
specifying symbol visibility
SSE
SSE2
strategies for optimization
Streaming SIMD Extensions
subroutines in the OpenMP* run-time library
    for OpenMP*
    for prefetching
    parallel run-time
SWP
    SWP reports
symbol visibility
    specifying
symbol visibility on Linux*
symbol visibility on Mac OS* X
synchronization
   Parallelism Overview
   Auto-parallelization Overview
    parallel processing model for
    thread sleep time
    thread-level parallelism



targeting
   Targeting a Processor
   Targeting IA-32 Architecture Processors Manually
   Targeting Itanium(R) Processors Automatically
    IA-32 architecture processors
    Intel(R) 64 architecture processors
    Itanium(R) 2 processors
    manual dispatch
    manual targeting
    run-time checking
technical applications
test prioritization tool
    examples
    options
    requirements
thread affinity
thread pooling
threshold control for auto-parallelization
    OpenMP* routines for
    reordering
tool options
    code coverage tool
    profmerge
    proforder
    test prioritization
tools
topology maps



unvectorizable copy
user functions
   Controlling Inline Expansion of User Functions
   Developer Directed Inline Expansion of User Functions
    auto-parallelization
       Auto-parallelization Overview
       Parallelism: an Overview
    dynamic libraries
    OpenMP*
    PGO environment
    profile-guided optimization
    timing for an application
utilities
    profmerge
    proforder



vector copy
    options
    options for
    overview
    programming guidelines
       Key Programming Guidelines for Vectorization
       Vectorization Overview (IA-32 Architecture Only)
    reports
vectorizing
    loops
       Loop Constructs
       Profile-guided Optimizations Overview
VTune™ Performance Analyzer



whole program analysis
worker thread
workqueuing
   Intel Workqueuing Model Overview
   Workqueuing Example Function
   Workqueuing Constructs
    example function
worksharing
   Parallelism Overview
   Auto-parallelization Overview
   OpenMP* Directives and Clauses Summary
   Work-sharing Using OpenMP* (Windows*)
    using OpenMP*



xiar
   Creating a Library from IPO Objects
   IPO-Related Performance Issues
xild
   Creating a Library from IPO Objects
   Interprocedural Optimization (IPO) Overview
   IPO-Related Performance Issues
xilib
xilibtool
xilink
   Interprocedural Optimization (IPO) Overview
   IPO-Related Performance Issues
   Creating a Library from IPO Objects