clang/docs/analyzer/developer-docs/IPA.rst

   1 Inlining
   2 ========
   3
   4 There are several options that control which calls the analyzer will consider for
   5 inlining. The major one is ``-analyzer-config ipa``:
   6
   7 * ``analyzer-config ipa=none`` - All inlining is disabled. This is the only mode
   8   available in LLVM 3.1 and earlier and in Xcode 4.3 and earlier.
   9
  10 * ``analyzer-config ipa=basic-inlining`` - Turns on inlining for C functions, C++
  11    static member functions, and blocks -- essentially, the calls that behave
  12    like simple C function calls. This is essentially the mode used in
  13    Xcode 4.4.
  14
  15 * ``analyzer-config ipa=inlining`` - Turns on inlining when we can confidently find
  16     the function/method body corresponding to the call. (C functions, static
  17     functions, devirtualized C++ methods, Objective-C class methods, Objective-C
  18     instance methods when ExprEngine is confident about the dynamic type of the
  19     instance).
  20
  21 * ``analyzer-config ipa=dynamic`` - Inline instance methods for which the type is
  22    determined at runtime and we are not 100% sure that our type info is
  23    correct. For virtual calls, inline the most plausible definition.
  24
  25 * ``analyzer-config ipa=dynamic-bifurcate`` - Same as -analyzer-config ipa=dynamic,
  26    but the path is split. We inline on one branch and do not inline on the
  27    other. This mode does not drop the coverage in cases when the parent class
  28    has code that is only exercised when some of its methods are overridden.
  29
  30 Currently, ``-analyzer-config ipa=dynamic-bifurcate`` is the default mode.
  31
  32 While ``-analyzer-config ipa`` determines in general how aggressively the analyzer
  33 will try to inline functions, several additional options control which types of
  34 functions can inlined, in an all-or-nothing way. These options use the
  35 analyzer's configuration table, so they are all specified as follows:
  36
  37     ``-analyzer-config OPTION=VALUE``
  38
  39 c++-inlining
  40 ------------
  41
  42 This option controls which C++ member functions may be inlined.
  43
  44     ``-analyzer-config c++-inlining=[none | methods | constructors | destructors]``
  45
  46 Each of these modes implies that all the previous member function kinds will be
  47 inlined as well; it doesn't make sense to inline destructors without inlining
  48 constructors, for example.
  49
  50 The default c++-inlining mode is 'destructors', meaning that all member
  51 functions with visible definitions will be considered for inlining. In some
  52 cases the analyzer may still choose not to inline the function.
  53
  54 Note that under 'constructors', constructors for types with non-trivial
  55 destructors will not be inlined. Additionally, no C++ member functions will be
  56 inlined under -analyzer-config ipa=none or -analyzer-config ipa=basic-inlining,
  57 regardless of the setting of the c++-inlining mode.
  58
  59 c++-template-inlining
  60 ^^^^^^^^^^^^^^^^^^^^^
  61
  62 This option controls whether C++ templated functions may be inlined.
  63
  64     ``-analyzer-config c++-template-inlining=[true | false]``
  65
  66 Currently, template functions are considered for inlining by default.
  67
  68 The motivation behind this option is that very generic code can be a source
  69 of false positives, either by considering paths that the caller considers
  70 impossible (by some unstated precondition), or by inlining some but not all
  71 of a deep implementation of a function.
  72
  73 c++-stdlib-inlining
  74 ^^^^^^^^^^^^^^^^^^^
  75
  76 This option controls whether functions from the C++ standard library, including
  77 methods of the container classes in the Standard Template Library, should be
  78 considered for inlining.
  79
  80     ``-analyzer-config c++-stdlib-inlining=[true | false]``
  81
  82 Currently, C++ standard library functions are considered for inlining by
  83 default.
  84
  85 The standard library functions and the STL in particular are used ubiquitously
  86 enough that our tolerance for false positives is even lower here. A false
  87 positive due to poor modeling of the STL leads to a poor user experience, since
  88 most users would not be comfortable adding assertions to system headers in order
  89 to silence analyzer warnings.
  90
  91 c++-container-inlining
  92 ^^^^^^^^^^^^^^^^^^^^^^
  93
  94 This option controls whether constructors and destructors of "container" types
  95 should be considered for inlining.
  96
  97     ``-analyzer-config c++-container-inlining=[true | false]``
  98
  99 Currently, these constructors and destructors are NOT considered for inlining
 100 by default.
 101
 102 The current implementation of this setting checks whether a type has a member
 103 named 'iterator' or a member named 'begin'; these names are idiomatic in C++,
 104 with the latter specified in the C++11 standard. The analyzer currently does a
 105 fairly poor job of modeling certain data structure invariants of container-like
 106 objects. For example, these three expressions should be equivalent:
 107
 108
 109 .. code-block:: cpp
 110
 111  std::distance(c.begin(), c.end()) == 0
 112  c.begin() == c.end()
 113  c.empty()
 114
 115 Many of these issues are avoided if containers always have unknown, symbolic
 116 state, which is what happens when their constructors are treated as opaque.
 117 In the future, we may decide specific containers are "safe" to model through
 118 inlining, or choose to model them directly using checkers instead.
 119
 120
 121 Basics of Implementation
 122 ------------------------
 123
 124 The low-level mechanism of inlining a function is handled in
 125 ExprEngine::inlineCall and ExprEngine::processCallExit.
 126
 127 If the conditions are right for inlining, a CallEnter node is created and added
 128 to the analysis work list. The CallEnter node marks the change to a new
 129 LocationContext representing the called function, and its state includes the
 130 contents of the new stack frame. When the CallEnter node is actually processed,
 131 its single successor will be an edge to the first CFG block in the function.
 132
 133 Exiting an inlined function is a bit more work, fortunately broken up into
 134 reasonable steps:
 135
 136 1. The CoreEngine realizes we're at the end of an inlined call and generates a
 137    CallExitBegin node.
 138
 139 2. ExprEngine takes over (in processCallExit) and finds the return value of the
 140    function, if it has one. This is bound to the expression that triggered the
 141    call. (In the case of calls without origin expressions, such as destructors,
 142    this step is skipped.)
 143
 144 3. Dead symbols and bindings are cleaned out from the state, including any local
 145    bindings.
 146
 147 4. A CallExitEnd node is generated, which marks the transition back to the
 148    caller's LocationContext.
 149
 150 5. Custom post-call checks are processed and the final nodes are pushed back
 151    onto the work list, so that evaluation of the caller can continue.
 152
 153 Retry Without Inlining
 154 ^^^^^^^^^^^^^^^^^^^^^^
 155
 156 In some cases, we would like to retry analysis without inlining a particular
 157 call.
 158
 159 Currently, we use this technique to recover coverage in case we stop
 160 analyzing a path due to exceeding the maximum block count inside an inlined
 161 function.
 162
 163 When this situation is detected, we walk up the path to find the first node
 164 before inlining was started and enqueue it on the WorkList with a special
 165 ReplayWithoutInlining bit added to it (ExprEngine::replayWithoutInlining).  The
 166 path is then re-analyzed from that point without inlining that particular call.
 167
 168 Deciding When to Inline
 169 ^^^^^^^^^^^^^^^^^^^^^^^
 170
 171 In general, the analyzer attempts to inline as much as possible, since it
 172 provides a better summary of what actually happens in the program.  There are
 173 some cases, however, where the analyzer chooses not to inline:
 174
 175 - If there is no definition available for the called function or method.  In
 176   this case, there is no opportunity to inline.
 177
 178 - If the CFG cannot be constructed for a called function, or the liveness
 179   cannot be computed.  These are prerequisites for analyzing a function body,
 180   with or without inlining.
 181
 182 - If the LocationContext chain for a given ExplodedNode reaches a maximum cutoff
 183   depth.  This prevents unbounded analysis due to infinite recursion, but also
 184   serves as a useful cutoff for performance reasons.
 185
 186 - If the function is variadic.  This is not a hard limitation, but an engineering
 187   limitation.
 188
 189   Tracked by: <rdar://problem/12147064> Support inlining of variadic functions
 190
 191 - In C++, constructors are not inlined unless the destructor call will be
 192   processed by the ExprEngine. Thus, if the CFG was built without nodes for
 193   implicit destructors, or if the destructors for the given object are not
 194   represented in the CFG, the constructor will not be inlined. (As an exception,
 195   constructors for objects with trivial constructors can still be inlined.)
 196   See "C++ Caveats" below.
 197
 198 - In C++, ExprEngine does not inline custom implementations of operator 'new'
 199   or operator 'delete', nor does it inline the constructors and destructors
 200   associated with these. See "C++ Caveats" below.
 201
 202 - Calls resulting in "dynamic dispatch" are specially handled.  See more below.
 203
 204 - The FunctionSummaries map stores additional information about declarations,
 205   some of which is collected at runtime based on previous analyses.
 206   We do not inline functions which were not profitable to inline in a different
 207   context (for example, if the maximum block count was exceeded; see
 208   "Retry Without Inlining").
 209
 210
 211 Dynamic Calls and Devirtualization
 212 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
 213
 214 "Dynamic" calls are those that are resolved at runtime, such as C++ virtual
 215 method calls and Objective-C message sends. Due to the path-sensitive nature of
 216 the analysis, the analyzer may be able to reason about the dynamic type of the
 217 object whose method is being called and thus "devirtualize" the call.
 218
 219 This path-sensitive devirtualization occurs when the analyzer can determine what
 220 method would actually be called at runtime.  This is possible when the type
 221 information is constrained enough for a simulated C++/Objective-C object that
 222 the analyzer can make such a decision.
 223
 224 DynamicTypeInfo
 225 ^^^^^^^^^^^^^^^
 226
 227 As the analyzer analyzes a path, it may accrue information to refine the
 228 knowledge about the type of an object.  This can then be used to make better
 229 decisions about the target method of a call.
 230
 231 Such type information is tracked as DynamicTypeInfo.  This is path-sensitive
 232 data that is stored in ProgramState, which defines a mapping from MemRegions to
 233 an (optional) DynamicTypeInfo.
 234
 235 If no DynamicTypeInfo has been explicitly set for a MemRegion, it will be lazily
 236 inferred from the region's type or associated symbol. Information from symbolic
 237 regions is weaker than from true typed regions.
 238
 239   EXAMPLE: A C++ object declared "A obj" is known to have the class 'A', but a
 240            reference "A &ref" may dynamically be a subclass of 'A'.
 241
 242 The DynamicTypePropagation checker gathers and propagates DynamicTypeInfo,
 243 updating it as information is observed along a path that can refine that type
 244 information for a region.
 245
 246   WARNING: Not all of the existing analyzer code has been retrofitted to use
 247            DynamicTypeInfo, nor is it universally appropriate. In particular,
 248            DynamicTypeInfo always applies to a region with all casts stripped
 249            off, but sometimes the information provided by casts can be useful.
 250
 251
 252 RuntimeDefinition
 253 ^^^^^^^^^^^^^^^^^
 254
 255 The basis of devirtualization is CallEvent's getRuntimeDefinition() method,
 256 which returns a RuntimeDefinition object.  When asked to provide a definition,
 257 the CallEvents for dynamic calls will use the DynamicTypeInfo in their
 258 ProgramState to attempt to devirtualize the call.  In the case of no dynamic
 259 dispatch, or perfectly constrained devirtualization, the resulting
 260 RuntimeDefinition contains a Decl corresponding to the definition of the called
 261 function, and RuntimeDefinition::mayHaveOtherDefinitions will return FALSE.
 262
 263 In the case of dynamic dispatch where our information is not perfect, CallEvent
 264 can make a guess, but RuntimeDefinition::mayHaveOtherDefinitions will return
 265 TRUE. The RuntimeDefinition object will then also include a MemRegion
 266 corresponding to the object being called (i.e., the "receiver" in Objective-C
 267 parlance), which ExprEngine uses to decide whether or not the call should be
 268 inlined.
 269
 270 Inlining Dynamic Calls
 271 ^^^^^^^^^^^^^^^^^^^^^^
 272
 273 The -analyzer-config ipa option has five different modes: none, basic-inlining,
 274 inlining, dynamic, and dynamic-bifurcate. Under -analyzer-config ipa=dynamic,
 275 all dynamic calls are inlined, whether we are certain or not that this will
 276 actually be the definition used at runtime. Under -analyzer-config ipa=inlining,
 277 only "near-perfect" devirtualized calls are inlined*, and other dynamic calls
 278 are evaluated conservatively (as if no definition were available).
 279
 280 * Currently, no Objective-C messages are not inlined under
 281   -analyzer-config ipa=inlining, even if we are reasonably confident of the type
 282   of the receiver. We plan to enable this once we have tested our heuristics
 283   more thoroughly.
 284
 285 The last option, -analyzer-config ipa=dynamic-bifurcate, behaves similarly to
 286 "dynamic", but performs a conservative invalidation in the general virtual case
 287 in *addition* to inlining. The details of this are discussed below.
 288
 289 As stated above, -analyzer-config ipa=basic-inlining does not inline any C++
 290 member functions or Objective-C method calls, even if they are non-virtual or
 291 can be safely devirtualized.
 292
 293
 294 Bifurcation
 295 ^^^^^^^^^^^
 296
 297 ExprEngine::BifurcateCall implements the ``-analyzer-config ipa=dynamic-bifurcate``
 298 mode.
 299
 300 When a call is made on an object with imprecise dynamic type information
 301 (RuntimeDefinition::mayHaveOtherDefinitions() evaluates to TRUE), ExprEngine
 302 bifurcates the path and marks the object's region (retrieved from the
 303 RuntimeDefinition object) with a path-sensitive "mode" in the ProgramState.
 304
 305 Currently, there are 2 modes:
 306
 307 * ``DynamicDispatchModeInlined`` - Models the case where the dynamic type information
 308    of the receiver (MemoryRegion) is assumed to be perfectly constrained so
 309    that a given definition of a method is expected to be the code actually
 310    called. When this mode is set, ExprEngine uses the Decl from
 311    RuntimeDefinition to inline any dynamically dispatched call sent to this
 312    receiver because the function definition is considered to be fully resolved.
 313
 314 * ``DynamicDispatchModeConservative`` - Models the case where the dynamic type
 315    information is assumed to be incorrect, for example, implies that the method
 316    definition is overridden in a subclass. In such cases, ExprEngine does not
 317    inline the methods sent to the receiver (MemoryRegion), even if a candidate
 318    definition is available. This mode is conservative about simulating the
 319    effects of a call.
 320
 321 Going forward along the symbolic execution path, ExprEngine consults the mode
 322 of the receiver's MemRegion to make decisions on whether the calls should be
 323 inlined or not, which ensures that there is at most one split per region.
 324
 325 At a high level, "bifurcation mode" allows for increased semantic coverage in
 326 cases where the parent method contains code which is only executed when the
 327 class is subclassed. The disadvantages of this mode are a (considerable?)
 328 performance hit and the possibility of false positives on the path where the
 329 conservative mode is used.
 330
 331 Objective-C Message Heuristics
 332 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
 333
 334 ExprEngine relies on a set of heuristics to partition the set of Objective-C
 335 method calls into those that require bifurcation and those that do not. Below
 336 are the cases when the DynamicTypeInfo of the object is considered precise
 337 (cannot be a subclass):
 338
 339  - If the object was created with +alloc or +new and initialized with an -init
 340    method.
 341
 342  - If the calls are property accesses using dot syntax. This is based on the
 343    assumption that children rarely override properties, or do so in an
 344    essentially compatible way.
 345
 346  - If the class interface is declared inside the main source file. In this case
 347    it is unlikely that it will be subclassed.
 348
 349  - If the method is not declared outside of main source file, either by the
 350    receiver's class or by any superclasses.
 351
 352 C++ Caveats
 353 ^^^^^^^^^^^
 354
 355 C++11 [class.cdtor]p4 describes how the vtable of an object is modified as it is
 356 being constructed or destructed; that is, the type of the object depends on
 357 which base constructors have been completed. This is tracked using
 358 DynamicTypeInfo in the DynamicTypePropagation checker.
 359
 360 There are several limitations in the current implementation:
 361
 362 * Temporaries are poorly modeled right now because we're not confident in the
 363   placement of their destructors in the CFG. We currently won't inline their
 364   constructors unless the destructor is trivial, and don't process their
 365   destructors at all, not even to invalidate the region.
 366
 367 * 'new' is poorly modeled due to some nasty CFG/design issues.  This is tracked
 368   in PR12014.  'delete' is not modeled at all.
 369
 370 * Arrays of objects are modeled very poorly right now.  ExprEngine currently
 371   only simulates the first constructor and first destructor. Because of this,
 372   ExprEngine does not inline any constructors or destructors for arrays.
 373
 374
 375 CallEvent
 376 ^^^^^^^^^
 377
 378 A CallEvent represents a specific call to a function, method, or other body of
 379 code. It is path-sensitive, containing both the current state (ProgramStateRef)
 380 and stack space (LocationContext), and provides uniform access to the argument
 381 values and return type of a call, no matter how the call is written in the
 382 source or what sort of code body is being invoked.
 383
 384   NOTE: For those familiar with Cocoa, CallEvent is roughly equivalent to
 385         NSInvocation.
 386
 387 CallEvent should be used whenever there is logic dealing with function calls
 388 that does not care how the call occurred.
 389
 390 Examples include checking that arguments satisfy preconditions (such as
 391 __attribute__((nonnull))), and attempting to inline a call.
 392
 393 CallEvents are reference-counted objects managed by a CallEventManager. While
 394 there is no inherent issue with persisting them (say, in a ProgramState's GDM),
 395 they are intended for short-lived use, and can be recreated from CFGElements or
 396 non-top-level StackFrameContexts fairly easily.