Compiler Methodology for Intel® MIC Architecture
Efficient ParallelizationOverview
This chapter covers topics in parallelization. There are links to various parallelization methods and resources along with tips and techniques for getting optimal parallel performance.
Goals
In this chapter, you will learn techniques for the Intel OpenMP* runtime library provided with the Intel compilers, Intel® MPI, Intel® Cilk™ Plus, and Intel® Threading Building Blocks (Intel® TBB).
Topics
The following subchapters provide more information on parallelization topics. Click the links below to access these topics.
OpenMP* - Getting started using OpenMP with the Intel compilers
Thread Affinity control with the Intel OpenMP runtime
Parallelization using Intel® Threading Building Blocks (Intel® TBB)
Take Aways
In this chapter, various parallelization methods were presented. For OpenMP, two major performance techniques were presented: controlling thread affinity and controlling OpenMP scheduling.
NEXT STEPS
It is essential that you read this guide from start to finish using the built-in hyperlinks to guide you along a path to a successful port and tuning of your application(s) on Intel® Xeon Phi™ architecture. The paths provided in this guide reflect the steps necessary to get best possible application performance.
The next chapter, Vectorization Essentials, covers techniques to help vectorize your code along with best methods for efficient vectorization.