Retry: [llvm-profdata] Speed up merging by using a thread pool
authorVedant Kumar <vsk@apple.com>
Tue, 19 Jul 2016 01:17:20 +0000 (01:17 +0000)
committerVedant Kumar <vsk@apple.com>
Tue, 19 Jul 2016 01:17:20 +0000 (01:17 +0000)
commite3a0bf504859c95513d75df06aca1a6d38c44d60
treeb98db977f7a0aa95ee9bde587472456dc8855404
parent21ab20e0050d18185f6020a32aadd73c351a7e1d
Retry: [llvm-profdata] Speed up merging by using a thread pool

Add a "-j" option to llvm-profdata to control the number of threads used.
Auto-detect NumThreads when it isn't specified, and avoid spawning threads when
they wouldn't be beneficial.

I tested this patch using a raw profile produced by clang (147MB). Here is the
time taken to merge 4 copies together on my laptop:

  No thread pool: 112.87s user 5.92s system 97% cpu 2:01.08 total
  With 2 threads: 134.99s user 26.54s system 164% cpu 1:33.31 total

Changes since the initial commit:

  - When handling odd-length inputs, call ThreadPool::wait() before merging the
    last profile. Should fix a race/off-by-one (see r275937).

Differential Revision: https://reviews.llvm.org/D22438

llvm-svn: 275938
llvm/docs/CommandGuide/llvm-profdata.rst
llvm/include/llvm/ProfileData/InstrProfWriter.h
llvm/lib/ProfileData/InstrProfWriter.cpp
llvm/test/tools/llvm-profdata/multiple-inputs.test
llvm/tools/llvm-profdata/llvm-profdata.cpp
llvm/unittests/ProfileData/InstrProfTest.cpp