From 9f10cc2d124c275f2e7a6764e168f6b75527e78f Mon Sep 17 00:00:00 2001 From: Kelvin Li Date: Wed, 6 Nov 2019 13:25:16 -0500 Subject: [PATCH] [OPENMP] [DOCS] fix section formatting issues [NFC] Differential Revision: https://reviews.llvm.org/D69909 --- clang/docs/OpenMPSupport.rst | 23 ++++++++++------------- 1 file changed, 10 insertions(+), 13 deletions(-) diff --git a/clang/docs/OpenMPSupport.rst b/clang/docs/OpenMPSupport.rst index b47a2b6..0ef4acf 100644 --- a/clang/docs/OpenMPSupport.rst +++ b/clang/docs/OpenMPSupport.rst @@ -13,9 +13,8 @@ .. contents:: :local: -================== OpenMP Support -================== +============== Clang fully supports OpenMP 4.5. Clang supports offloading to X86_64, AArch64, PPC64[LE] and has `basic support for Cuda devices`_. @@ -30,7 +29,7 @@ Interface (OMPT) on x86, x86_64, AArch64, and PPC64 on Linux, Windows, and macOS For the list of supported features from OpenMP 5.0 see `OpenMP implementation details`_. General improvements --------------------- +==================== - New collapse clause scheme to avoid expensive remainder operations. Compute loop index variables after collapsing a loop nest via the collapse clause by replacing the expensive remainder operation with @@ -47,6 +46,13 @@ General improvements - Simplified SPMD code generation for `distribute parallel for` when the new default schedules are applicable. +- When using the collapse clause on a loop nest the default behavior + is to automatically extend the representation of the loop counter to + 64 bits for the cases where the sizes of the collapsed loops are not + known at compile time. To prevent this conservative choice and use + at most 32 bits, compile your program with the + `-fopenmp-optimistic-collapse`. + .. _basic support for Cuda devices: Cuda devices support @@ -77,15 +83,6 @@ are stored in the global memory. In `Cuda` mode local variables are not shared between the threads and it is user responsibility to share the required data between the threads in the parallel regions. -Collapsed loop nest counter ---------------------------- - -When using the collapse clause on a loop nest the default behavior is to -automatically extend the representation of the loop counter to 64 bits for -the cases where the sizes of the collapsed loops are not known at compile -time. To prevent this conservative choice and use at most 32 bits, -compile your program with the `-fopenmp-optimistic-collapse`. - Features not supported or with limited support for Cuda devices --------------------------------------------------------------- @@ -112,7 +109,7 @@ Features not supported or with limited support for Cuda devices .. _OpenMP implementation details: OpenMP 5.0 Implementation Details ---------------------------------- +================================= The following table provides a quick overview over various OpenMP 5.0 features and their implementation status. Please contact *openmp-dev* at -- 2.7.4