Efficient Parallelization


Compiler Methodology for Intel® MIC Architecture

Overview

This chapter covers parallelization. It links to various parallelization methods and resources, and offers tips and techniques for getting optimal parallel performance.

Goals

In this chapter, you will learn techniques for the Intel OpenMP* runtime library provided with the Intel compilers, Intel® MPI, Intel® Cilk™ Plus, and Intel® Threading Building Blocks (Intel® TBB).

Topics

The following subchapters provide more information on parallelization topics. Click the links below to access these topics.

Takeaways

This chapter presented various parallelization methods. For OpenMP, it presented two major performance techniques: controlling thread affinity and controlling OpenMP scheduling.
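As a rough illustration of the scheduling technique named above, the following minimal sketch defers the loop schedule to run time with `schedule(runtime)`, so that it can be tuned through the standard `OMP_SCHEDULE` environment variable without recompiling. The function names `triangular_work` and `sum_triangular` are hypothetical and chosen only for this example; the workload is deliberately uneven so that the choice of schedule matters.

```c
#include <stddef.h>

/* Hypothetical kernel: the cost of iteration i grows with i, so an even
   static split of the iteration space would leave the threads that own
   the high indices with far more work than the others. */
static double triangular_work(size_t i) {
    double acc = 0.0;
    for (size_t k = 0; k <= i; ++k) {
        acc += (double)k;
    }
    return acc;
}

/* Sums triangular_work(i) for i in [0, n).
   schedule(runtime) tells the OpenMP runtime to read the schedule from
   OMP_SCHEDULE (for example "static", "dynamic,4", or "guided").
   If the compiler is not run with OpenMP enabled, the pragma is ignored
   and the loop simply runs serially with the same result. */
double sum_triangular(size_t n) {
    double total = 0.0;
    #pragma omp parallel for schedule(runtime) reduction(+:total)
    for (long i = 0; i < (long)n; ++i) {
        total += triangular_work((size_t)i);
    }
    return total;
}
```

Under these assumptions, one might compare runs with `OMP_SCHEDULE="static"` and `OMP_SCHEDULE="dynamic,4"` to see the effect of scheduling on load balance. Thread affinity is controlled separately from the source code: with the Intel OpenMP runtime this is typically done through the `KMP_AFFINITY` environment variable (settings such as `compact` or `scatter`), and the standard `OMP_PROC_BIND` variable serves a similar purpose. The best settings depend on the application and should be measured rather than assumed.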

Next Steps

Read this guide from start to finish, following the built-in hyperlinks, which lead you along a path to successfully porting and tuning your application(s) on Intel® Xeon Phi™ architecture. The paths provided in this guide reflect the steps necessary to get the best possible application performance.

The next chapter, Vectorization Essentials, covers techniques to help vectorize your code along with best methods for efficient vectorization.
