Intel® C++ Compiler

↧

Image may be NSFW.
Clik here to view.

Fun with Intel® Transactional Synchronization Extensions

July 25, 2013, 1:32 pm

By now, many of you have heard of Intel® Transactional Synchronization Extensions (Intel® TSX). If you have not, I encourage you to check out this page (http://www.intel.com/software/tsx) before you...

View Article

Improving Discrete Cosine Transform performance using Intel(R) Cilk(TM) Plus

July 26, 2013, 1:03 pm

DCT and Quantization are the first two steps in JPEG compression standard. This article demonstrates how DCT and Quantizing stages can be implemented to run faster using Intel® Cilk™ Plus. In order to...

View Article

Hands-on Lab: Optimizing Monte Carlo on Intel(R) Xeon Phi(tm) Coprocessor

August 2, 2013, 3:41 pm

IntroductionThis lab was developed for the Intel(R) Xeon Phi(tm) Technology Conference held in May 8-9 2013 in the United Kingdom, which was attended by multiple Financial Services Institutions.In this...

View Article

Best Known Method: Avoid heterogeneous precision in control flow calculations

August 13, 2013, 5:45 pm

Best Known MethodRunning an MPI program in symmetric mode on an Intel® Xeon® host and an Intel Xeon Phi™ coprocessor may deadlock in specific cases due to the heterogeneous precision in replicated...

View Article

_mm256_hadd_pd

September 4, 2013, 1:30 am

Adds horizontal pairs of float64 elements of two vectors. The corresponding Intel® AVX instruction is VHADDPD.Syntaxextern __m256d _mm256_hadd_pd(__m256d m1, __m256d m2);Argumentsm1float64 vector used...

View Article

_mm256_addsub_ps

September 4, 2013, 1:30 am

Adds odd float32 elements and subtracts even float32 elements of vectors. The corresponding Intel® AVX instruction is VADDSUBPS.Syntaxextern __m256 _mm256_addsub_ps(__m256 m1, __m256...

View Article

_mm256_addsub_pd

September 4, 2013, 1:30 am

Adds odd float64 elements and subtracts even float64 elements of vectors. The corresponding Intel® AVX instruction is VADDSUBPD.Syntaxextern __m256d _mm256_addsub_pd(__m256d m1, __m256d...

View Article

_mm256_add_ps

September 4, 2013, 1:30 am

Adds float32 vectors. The corresponding Intel® AVX instruction is VADDPS.Syntaxextern __m256 _mm256_add_ps(__m256 m1, __m256 m2);Argumentsm1float32 vector used for the operationm2float32 vector also...

View Article

_mm256_add_pd

September 4, 2013, 1:30 am

Adds float64 vectors. The corresponding Intel® AVX instruction is VADDPD.Syntaxextern __m256d _mm256_add_pd(__m256d m1, __m256d m2);Argumentsm1float64 vector used for the operationm2float64 vector also...

View Article

Intrinsics for Arithmetic Operations

September 4, 2013, 1:30 am

Parent topic: Intrinsics for Intel® Advanced Vector Extensions_mm256_add_pd Adds float64 vectors. The corresponding Intel® AVX instruction is VADDPD._mm256_add_ps Adds float32 vectors. The...

View Article

Image may be NSFW.
Clik here to view.

Details of Intel® Advanced Vector Extensions Intrinsics

September 4, 2013, 1:30 am

Intel® Advanced Vector Extensions (Intel® AVX) intrinsics map directly to Intel® AVX instructions and other enhanced 128-bit single-instruction multiple data processing (SIMD) instructions. Intel® AVX...

View Article

Overview: Intrinsics for Intel® Advanced Vector Extensions Instructions

September 4, 2013, 1:30 am

Intel® Advanced Vector Extensions (Intel® AVX) intrinsics are assembly-coded functions that call on Intel® AVX instructions, which are new vector SIMD instruction extensions for IA-32 and Intel® 64...

View Article

Intrinsics for Intel® Advanced Vector Extensions

September 4, 2013, 1:30 am

Parent topic: IntrinsicsOverview: Intrinsics for Intel® Advanced Vector Extensions InstructionsDetails of Intel® Advanced Vector Extensions IntrinsicsIntrinsics for Arithmetic OperationsIntrinsics for...

View Article

Function Prototype and Macro Definitions

September 4, 2013, 1:30 am

Function Prototype and Macro Definitions for RTMThe following function prototypes are included in the immintrin.h header file:unsigned int _xbegin(void); void _xend(void); void _xabort(const unsigned...

View Article

HLE Release _Store Functions

September 4, 2013, 1:30 am

Stores the specified value at the specified address and releases pending active HLE transaction. This intrinsic function applies to C/C++ applications for Windows* OS only.Syntaxvoid...

View Article

HLE Release _InterlockedExchangeAdd Functions

September 4, 2013, 1:30 am

Performs an atomic addition of two values and releases pending active HLE transaction. This intrinsic function applies to C/C++ applications for Windows* OS only.Syntaxlong...

View Article

HLE Release _InterlockedCompareExchange Functions

September 4, 2013, 1:30 am

Performs an atomic compare-and-exchange operation on the specified values and releases pending active HLE transaction. This intrinsic function applies to C/C++ applications for Windows* OS...

View Article

Intrinsics for Hardware Lock Elision Operations

September 4, 2013, 1:30 am

Parent topic: Intrinsics for Intel® Transactional Synchronization Extensions (Intel® TSX)Hardware Lock Elision OverviewHLE Acquire _InterlockedCompareExchange Functions Performs an atomic...

View Article

Image may be NSFW.
Clik here to view.

Analyse the single-threaded Stream benchmark's behaviour on Intel® Xeon®...

September 6, 2013, 1:42 am

The STREAM benchmark (http://www.cs.virginia.edu/stream/) a synthetic benchmark program, written in standard Fortran 77 (with a corresponding version in C). It measures the the performance of four long...

View Article

Large Page Considerations

September 6, 2013, 3:24 pm

Compiler Methodology for Intel® MIC ArchitectureLarge Page ConsiderationsUse THP enabled by default in the MPSS Operating System:MPSS versions later than 2.1.4982-15 support “Transparent Huge Pages...

View Article