modules/video/doc/motion_analysis_and_object_tracking.rst

   1 Motion Analysis and Object Tracking
   2 ===================================
   3
   4 .. highlight:: cpp
   5
   6
   7 calcOpticalFlowPyrLK
   8 ------------------------
   9 Calculates an optical flow for a sparse feature set using the iterative Lucas-Kanade method with pyramids.
  10
  11 .. ocv:function:: void calcOpticalFlowPyrLK( InputArray prevImg, InputArray nextImg, InputArray prevPts, InputOutputArray nextPts, OutputArray status, OutputArray err, Size winSize=Size(21,21), int maxLevel=3, TermCriteria criteria=TermCriteria(TermCriteria::COUNT+TermCriteria::EPS, 30, 0.01), int flags=0, double minEigThreshold=1e-4 )
  12
  13 .. ocv:pyfunction:: cv2.calcOpticalFlowPyrLK(prevImg, nextImg, prevPts[, nextPts[, status[, err[, winSize[, maxLevel[, criteria[, flags[, minEigThreshold]]]]]]]]) -> nextPts, status, err
  14
  15 .. ocv:cfunction:: void cvCalcOpticalFlowPyrLK( const CvArr* prev, const CvArr* curr, CvArr* prev_pyr, CvArr* curr_pyr, const CvPoint2D32f* prev_features, CvPoint2D32f* curr_features, int count, CvSize win_size, int level, char* status, float* track_error, CvTermCriteria criteria, int flags )
  16 .. ocv:pyoldfunction:: cv.CalcOpticalFlowPyrLK(prev, curr, prevPyr, currPyr, prevFeatures, winSize, level, criteria, flags, guesses=None) -> (currFeatures, status, track_error)
  17
  18     :param prevImg: first 8-bit input image or pyramid constructed by :ocv:func:`buildOpticalFlowPyramid`.
  19
  20     :param nextImg: second input image or pyramid of the same size and the same type as ``prevImg``.
  21
  22     :param prevPts: vector of 2D points for which the flow needs to be found; point coordinates must be single-precision floating-point numbers.
  23
  24     :param nextPts: output vector of 2D points (with single-precision floating-point coordinates) containing the calculated new positions of input features in the second image; when ``OPTFLOW_USE_INITIAL_FLOW`` flag is passed, the vector must have the same size as in the input.
  25
  26     :param status: output status vector (of unsigned chars); each element of the vector is set to 1 if the flow for the corresponding features has been found, otherwise, it is set to 0.
  27
  28     :param err: output vector of errors; each element of the vector is set to an error for the corresponding feature, type of the error measure can be set in ``flags`` parameter; if the flow wasn't found then the error is not defined (use the ``status`` parameter to find such cases).
  29
  30     :param winSize: size of the search window at each pyramid level.
  31
  32     :param maxLevel: 0-based maximal pyramid level number; if set to 0, pyramids are not used (single level), if set to 1, two levels are used, and so on; if pyramids are passed to input then algorithm will use as many levels as pyramids have but no more than ``maxLevel``.
  33
  34     :param criteria: parameter, specifying the termination criteria of the iterative search algorithm (after the specified maximum number of iterations  ``criteria.maxCount``  or when the search window moves by less than  ``criteria.epsilon``.
  35
  36     :param flags: operation flags:
  37
  38         * **OPTFLOW_USE_INITIAL_FLOW** uses initial estimations, stored in ``nextPts``; if the flag is not set, then ``prevPts`` is copied to ``nextPts`` and is considered the initial estimate.
  39         * **OPTFLOW_LK_GET_MIN_EIGENVALS** use minimum eigen values as an error measure (see ``minEigThreshold`` description); if the flag is not set, then L1 distance between patches around the original and a moved point, divided by number of pixels in a window, is used as a error measure.
  40
  41     :param minEigThreshold: the algorithm calculates the minimum eigen value of a 2x2 normal matrix of optical flow equations (this matrix is called a spatial gradient matrix in [Bouguet00]_), divided by number of pixels in a window; if this value is less than ``minEigThreshold``, then a corresponding feature is filtered out and its flow is not processed, so it allows to remove bad points and get a performance boost.
  42
  43 The function implements a sparse iterative version of the Lucas-Kanade optical flow in pyramids. See [Bouguet00]_. The function is parallelized with the TBB library.
  44
  45 buildOpticalFlowPyramid
  46 -----------------------
  47 Constructs the image pyramid which can be passed to :ocv:func:`calcOpticalFlowPyrLK`.
  48
  49 .. ocv:function:: int buildOpticalFlowPyramid(InputArray img, OutputArrayOfArrays pyramid, Size winSize, int maxLevel, bool withDerivatives = true, int pyrBorder = BORDER_REFLECT_101, int derivBorder = BORDER_CONSTANT, bool tryReuseInputImage = true)
  50
  51 .. ocv:pyfunction:: cv2.buildOpticalFlowPyramid(img, winSize, maxLevel[, pyramid[, withDerivatives[, pyrBorder[, derivBorder[, tryReuseInputImage]]]]]) -> retval, pyramid
  52
  53     :param img: 8-bit input image.
  54
  55     :param pyramid: output pyramid.
  56
  57     :param winSize: window size of optical flow algorithm. Must be not less than ``winSize`` argument of :ocv:func:`calcOpticalFlowPyrLK`. It is needed to calculate required padding for pyramid levels.
  58
  59     :param maxLevel: 0-based maximal pyramid level number.
  60
  61     :param withDerivatives: set to precompute gradients for the every pyramid level. If pyramid is constructed without the gradients then :ocv:func:`calcOpticalFlowPyrLK` will calculate them internally.
  62
  63     :param pyrBorder: the border mode for pyramid layers.
  64
  65     :param derivBorder: the border mode for gradients.
  66
  67     :param tryReuseInputImage: put ROI of input image into the pyramid if possible. You can pass ``false`` to force data copying.
  68
  69     :return: number of levels in constructed pyramid. Can be less than ``maxLevel``.
  70
  71
  72 calcOpticalFlowFarneback
  73 ----------------------------
  74 Computes a dense optical flow using the Gunnar Farneback's algorithm.
  75
  76 .. ocv:function:: void calcOpticalFlowFarneback( InputArray prev, InputArray next, InputOutputArray flow, double pyr_scale, int levels, int winsize, int iterations, int poly_n, double poly_sigma, int flags )
  77
  78 .. ocv:cfunction:: void cvCalcOpticalFlowFarneback( const CvArr* prev, const CvArr* next, CvArr* flow, double pyr_scale, int levels, int winsize, int iterations, int poly_n, double poly_sigma, int flags )
  79
  80 .. ocv:pyfunction:: cv2.calcOpticalFlowFarneback(prev, next, pyr_scale, levels, winsize, iterations, poly_n, poly_sigma, flags[, flow]) -> flow
  81
  82     :param prev: first 8-bit single-channel input image.
  83
  84     :param next: second input image of the same size and the same type as ``prev``.
  85
  86     :param flow: computed flow image that has the same size as ``prev`` and type ``CV_32FC2``.
  87
  88     :param pyr_scale: parameter, specifying the image scale (<1) to build pyramids for each image; ``pyr_scale=0.5`` means a classical pyramid, where each next layer is twice smaller than the previous one.
  89
  90     :param levels: number of pyramid layers including the initial image; ``levels=1`` means that no extra layers are created and only the original images are used.
  91
  92     :param winsize: averaging window size; larger values increase the algorithm robustness to image noise and give more chances for fast motion detection, but yield more blurred motion field.
  93
  94     :param iterations: number of iterations the algorithm does at each pyramid level.
  95
  96     :param poly_n: size of the pixel neighborhood used to find polynomial expansion in each pixel; larger values mean that the image will be approximated with smoother surfaces, yielding more robust algorithm and more blurred  motion field, typically ``poly_n`` =5 or 7.
  97
  98     :param poly_sigma: standard deviation of the Gaussian that is used to smooth derivatives used as a basis for the polynomial expansion; for  ``poly_n=5``, you can set ``poly_sigma=1.1``, for ``poly_n=7``, a good value would be ``poly_sigma=1.5``.
  99
 100     :param flags: operation flags that can be a combination of the following:
 101
 102             * **OPTFLOW_USE_INITIAL_FLOW** uses the input  ``flow``  as an initial flow approximation.
 103
 104             * **OPTFLOW_FARNEBACK_GAUSSIAN** uses the Gaussian :math:`\texttt{winsize}\times\texttt{winsize}` filter instead of a box filter of the same size for optical flow estimation; usually, this option gives z more accurate flow than with a box filter, at the cost of lower speed; normally, ``winsize`` for a Gaussian window should be set to a larger value to achieve the same level of robustness.
 105
 106 The function finds an optical flow for each ``prev`` pixel using the [Farneback2003]_ algorithm so that
 107
 108 .. math::
 109
 110     \texttt{prev} (y,x)  \sim \texttt{next} ( y + \texttt{flow} (y,x)[1],  x + \texttt{flow} (y,x)[0])
 111
 112
 113 estimateRigidTransform
 114 --------------------------
 115 Computes an optimal affine transformation between two 2D point sets.
 116
 117 .. ocv:function:: Mat estimateRigidTransform( InputArray src, InputArray dst, bool fullAffine )
 118
 119 .. ocv:pyfunction:: cv2.estimateRigidTransform(src, dst, fullAffine) -> retval
 120
 121     :param src: First input 2D point set stored in ``std::vector`` or ``Mat``, or an image stored in ``Mat``.
 122
 123     :param dst: Second input 2D point set of the same size and the same type as ``A``, or another image.
 124
 125     :param fullAffine: If true, the function finds an optimal affine transformation with no additional restrictions (6 degrees of freedom). Otherwise, the class of transformations to choose from is limited to combinations of translation, rotation, and uniform scaling (5 degrees of freedom).
 126
 127 The function finds an optimal affine transform *[A|b]* (a ``2 x 3`` floating-point matrix) that approximates best the affine transformation between:
 128
 129   *
 130       Two point sets
 131   *
 132       Two raster images. In this case, the function first finds some features in the ``src`` image and finds the corresponding features in ``dst`` image. After that, the problem is reduced to the first case.
 133
 134 In case of point sets, the problem is formulated as follows: you need to find a 2x2 matrix *A* and 2x1 vector *b* so that:
 135
 136     .. math::
 137
 138         [A^*|b^*] = arg  \min _{[A|b]}  \sum _i  \| \texttt{dst}[i] - A { \texttt{src}[i]}^T - b  \| ^2
 139
 140     where ``src[i]`` and ``dst[i]`` are the i-th points in ``src`` and ``dst``, respectively
 141
 142     :math:`[A|b]` can be either arbitrary (when ``fullAffine=true`` ) or have a form of
 143
 144     .. math::
 145
 146         \begin{bmatrix} a_{11} & a_{12} & b_1  \\ -a_{12} & a_{11} & b_2  \end{bmatrix}
 147
 148     when ``fullAffine=false`` .
 149
 150 .. seealso::
 151
 152     :ocv:func:`getAffineTransform`,
 153     :ocv:func:`getPerspectiveTransform`,
 154     :ocv:func:`findHomography`
 155
 156
 157
 158 updateMotionHistory
 159 -----------------------
 160 Updates the motion history image by a moving silhouette.
 161
 162 .. ocv:function:: void updateMotionHistory( InputArray silhouette, InputOutputArray mhi, double timestamp, double duration )
 163
 164 .. ocv:pyfunction:: cv2.updateMotionHistory(silhouette, mhi, timestamp, duration) -> None
 165
 166 .. ocv:cfunction:: void cvUpdateMotionHistory( const CvArr* silhouette, CvArr* mhi, double timestamp, double duration )
 167 .. ocv:pyoldfunction:: cv.UpdateMotionHistory(silhouette, mhi, timestamp, duration)-> None
 168
 169     :param silhouette: Silhouette mask that has non-zero pixels where the motion occurs.
 170
 171     :param mhi: Motion history image that is updated by the function (single-channel, 32-bit floating-point).
 172
 173     :param timestamp: Current time in milliseconds or other units.
 174
 175     :param duration: Maximal duration of the motion track in the same units as  ``timestamp`` .
 176
 177 The function updates the motion history image as follows:
 178
 179 .. math::
 180
 181     \texttt{mhi} (x,y)= \forkthree{\texttt{timestamp}}{if $\texttt{silhouette}(x,y) \ne 0$}{0}{if $\texttt{silhouette}(x,y) = 0$ and $\texttt{mhi} < (\texttt{timestamp} - \texttt{duration})$}{\texttt{mhi}(x,y)}{otherwise}
 182
 183 That is, MHI pixels where the motion occurs are set to the current ``timestamp`` , while the pixels where the motion happened last time a long time ago are cleared.
 184
 185 The function, together with
 186 :ocv:func:`calcMotionGradient` and
 187 :ocv:func:`calcGlobalOrientation` , implements a motion templates technique described in
 188 [Davis97]_ and [Bradski00]_.
 189 See also the OpenCV sample ``motempl.c`` that demonstrates the use of all the motion template functions.
 190
 191
 192 calcMotionGradient
 193 ----------------------
 194 Calculates a gradient orientation of a motion history image.
 195
 196 .. ocv:function:: void calcMotionGradient( InputArray mhi, OutputArray mask, OutputArray orientation,                         double delta1, double delta2, int apertureSize=3 )
 197
 198 .. ocv:pyfunction:: cv2.calcMotionGradient(mhi, delta1, delta2[, mask[, orientation[, apertureSize]]]) -> mask, orientation
 199
 200 .. ocv:cfunction:: void cvCalcMotionGradient( const CvArr* mhi, CvArr* mask, CvArr* orientation, double delta1, double delta2, int aperture_size=3 )
 201 .. ocv:pyoldfunction:: cv.CalcMotionGradient(mhi, mask, orientation, delta1, delta2, apertureSize=3)-> None
 202
 203     :param mhi: Motion history single-channel floating-point image.
 204
 205     :param mask: Output mask image that has the type  ``CV_8UC1``  and the same size as  ``mhi`` . Its non-zero elements mark pixels where the motion gradient data is correct.
 206
 207     :param orientation: Output motion gradient orientation image that has the same type and the same size as  ``mhi`` . Each pixel of the image is a motion orientation, from 0 to 360 degrees.
 208
 209     :param delta1: Minimal (or maximal) allowed difference between  ``mhi``  values within a pixel neighborhood.
 210
 211     :param delta2: Maximal (or minimal) allowed difference between  ``mhi``  values within a pixel neighborhood. That is, the function finds the minimum ( :math:`m(x,y)` ) and maximum ( :math:`M(x,y)` )  ``mhi``  values over  :math:`3 \times 3`  neighborhood of each pixel and marks the motion orientation at  :math:`(x, y)`  as valid only if
 212
 213         .. math::
 214
 215             \min ( \texttt{delta1}  ,  \texttt{delta2}  )  \le  M(x,y)-m(x,y)  \le   \max ( \texttt{delta1}  , \texttt{delta2} ).
 216
 217     :param apertureSize: Aperture size of  the :ocv:func:`Sobel`  operator.
 218
 219 The function calculates a gradient orientation at each pixel
 220 :math:`(x, y)` as:
 221
 222 .. math::
 223
 224     \texttt{orientation} (x,y)= \arctan{\frac{d\texttt{mhi}/dy}{d\texttt{mhi}/dx}}
 225
 226 In fact,
 227 :ocv:func:`fastAtan2` and
 228 :ocv:func:`phase` are used so that the computed angle is measured in degrees and covers the full range 0..360. Also, the ``mask`` is filled to indicate pixels where the computed angle is valid.
 229
 230
 231
 232 calcGlobalOrientation
 233 -------------------------
 234 Calculates a global motion orientation in a selected region.
 235
 236 .. ocv:function:: double calcGlobalOrientation( InputArray orientation, InputArray mask, InputArray mhi, double timestamp, double duration )
 237
 238 .. ocv:pyfunction:: cv2.calcGlobalOrientation(orientation, mask, mhi, timestamp, duration) -> retval
 239
 240 .. ocv:cfunction:: double cvCalcGlobalOrientation( const CvArr* orientation, const CvArr* mask, const CvArr* mhi, double timestamp, double duration )
 241 .. ocv:pyoldfunction:: cv.CalcGlobalOrientation(orientation, mask, mhi, timestamp, duration)-> float
 242
 243     :param orientation: Motion gradient orientation image calculated by the function  :ocv:func:`calcMotionGradient` .
 244
 245     :param mask: Mask image. It may be a conjunction of a valid gradient mask, also calculated by  :ocv:func:`calcMotionGradient` , and the mask of a region whose direction needs to be calculated.
 246
 247     :param mhi: Motion history image calculated by  :ocv:func:`updateMotionHistory` .
 248
 249     :param timestamp: Timestamp passed to  :ocv:func:`updateMotionHistory` .
 250
 251     :param duration: Maximum duration of a motion track in milliseconds, passed to  :ocv:func:`updateMotionHistory` .
 252
 253 The function calculates an average
 254 motion direction in the selected region and returns the angle between
 255 0 degrees  and 360 degrees. The average direction is computed from
 256 the weighted orientation histogram, where a recent motion has a larger
 257 weight and the motion occurred in the past has a smaller weight, as recorded in ``mhi`` .
 258
 259
 260
 261
 262 segmentMotion
 263 -------------
 264 Splits a motion history image into a few parts corresponding to separate independent motions (for example, left hand, right hand).
 265
 266 .. ocv:function:: void segmentMotion(InputArray mhi, OutputArray segmask, vector<Rect>& boundingRects, double timestamp, double segThresh)
 267
 268 .. ocv:pyfunction:: cv2.segmentMotion(mhi, timestamp, segThresh[, segmask]) -> segmask, boundingRects
 269
 270 .. ocv:cfunction:: CvSeq* cvSegmentMotion( const CvArr* mhi, CvArr* seg_mask, CvMemStorage* storage, double timestamp, double seg_thresh )
 271 .. ocv:pyoldfunction:: cv.SegmentMotion(mhi, seg_mask, storage, timestamp, seg_thresh) -> boundingRects
 272
 273     :param mhi: Motion history image.
 274
 275     :param segmask: Image where the found mask should be stored, single-channel, 32-bit floating-point.
 276
 277     :param boundingRects: Vector containing ROIs of motion connected components.
 278
 279     :param timestamp: Current time in milliseconds or other units.
 280
 281     :param segThresh: Segmentation threshold that is recommended to be equal to the interval between motion history "steps" or greater.
 282
 283
 284 The function finds all of the motion segments and marks them in ``segmask`` with individual values (1,2,...). It also computes a vector with ROIs of motion connected components. After that the motion direction for every component can be calculated with :ocv:func:`calcGlobalOrientation` using the extracted mask of the particular component.
 285
 286
 287
 288
 289 CamShift
 290 --------
 291 Finds an object center, size, and orientation.
 292
 293 .. ocv:function:: RotatedRect CamShift( InputArray probImage, Rect& window, TermCriteria criteria )
 294
 295 .. ocv:pyfunction:: cv2.CamShift(probImage, window, criteria) -> retval, window
 296
 297 .. ocv:cfunction:: int cvCamShift( const CvArr* prob_image, CvRect window, CvTermCriteria criteria, CvConnectedComp* comp, CvBox2D* box=NULL )
 298
 299 .. ocv:pyoldfunction:: cv.CamShift(prob_image, window, criteria) -> (int, comp, box)
 300
 301     :param probImage: Back projection of the object histogram. See  :ocv:func:`calcBackProject` .
 302
 303     :param window: Initial search window.
 304
 305     :param criteria: Stop criteria for the underlying  :ocv:func:`meanShift` .
 306
 307     :returns: (in old interfaces) Number of iterations CAMSHIFT took to converge
 308
 309 The function implements the CAMSHIFT object tracking algorithm
 310 [Bradski98]_.
 311 First, it finds an object center using
 312 :ocv:func:`meanShift` and then adjusts the window size and finds the optimal rotation. The function returns the rotated rectangle structure that includes the object position, size, and orientation. The next position of the search window can be obtained with ``RotatedRect::boundingRect()`` .
 313
 314 See the OpenCV sample ``camshiftdemo.c`` that tracks colored objects.
 315
 316
 317
 318 meanShift
 319 ---------
 320 Finds an object on a back projection image.
 321
 322 .. ocv:function:: int meanShift( InputArray probImage, Rect& window, TermCriteria criteria )
 323
 324 .. ocv:pyfunction:: cv2.meanShift(probImage, window, criteria) -> retval, window
 325
 326 .. ocv:cfunction:: int cvMeanShift( const CvArr* prob_image, CvRect window, CvTermCriteria criteria, CvConnectedComp* comp )
 327 .. ocv:pyoldfunction:: cv.MeanShift(prob_image, window, criteria) -> comp
 328
 329     :param probImage: Back projection of the object histogram. See  :ocv:func:`calcBackProject` for details.
 330
 331     :param window: Initial search window.
 332
 333     :param criteria: Stop criteria for the iterative search algorithm.
 334
 335     :returns: Number of iterations CAMSHIFT took to converge.
 336
 337 The function implements the iterative object search algorithm. It takes the input back projection of an object and the initial position. The mass center in ``window`` of the back projection image is computed and the search window center shifts to the mass center. The procedure is repeated until the specified number of iterations ``criteria.maxCount`` is done or until the window center shifts by less than ``criteria.epsilon`` . The algorithm is used inside
 338 :ocv:func:`CamShift` and, unlike
 339 :ocv:func:`CamShift` , the search window size or orientation do not change during the search. You can simply pass the output of
 340 :ocv:func:`calcBackProject` to this function. But better results can be obtained if you pre-filter the back projection and remove the noise. For example, you can do this by retrieving connected components with
 341 :ocv:func:`findContours` , throwing away contours with small area (
 342 :ocv:func:`contourArea` ), and rendering the  remaining contours with
 343 :ocv:func:`drawContours` .
 344
 345
 346
 347 KalmanFilter
 348 ------------
 349 .. ocv:class:: KalmanFilter
 350
 351     Kalman filter class.
 352
 353 The class implements a standard Kalman filter
 354 http://en.wikipedia.org/wiki/Kalman_filter, [Welch95]_. However, you can modify ``transitionMatrix``, ``controlMatrix``, and ``measurementMatrix`` to get an extended Kalman filter functionality. See the OpenCV sample ``kalman.cpp`` .
 355
 356
 357
 358
 359 KalmanFilter::KalmanFilter
 360 --------------------------
 361 The constructors.
 362
 363 .. ocv:function:: KalmanFilter::KalmanFilter()
 364
 365 .. ocv:function:: KalmanFilter::KalmanFilter(int dynamParams, int measureParams, int controlParams=0, int type=CV_32F)
 366
 367 .. ocv:pyfunction:: cv2.KalmanFilter([dynamParams, measureParams[, controlParams[, type]]]) -> <KalmanFilter object>
 368
 369 .. ocv:cfunction:: CvKalman* cvCreateKalman( int dynam_params, int measure_params, int control_params=0 )
 370 .. ocv:pyoldfunction:: cv.CreateKalman(dynam_params, measure_params, control_params=0) -> CvKalman
 371
 372     The full constructor.
 373
 374     :param dynamParams: Dimensionality of the state.
 375
 376     :param measureParams: Dimensionality of the measurement.
 377
 378     :param controlParams: Dimensionality of the control vector.
 379
 380     :param type: Type of the created matrices that should be ``CV_32F`` or ``CV_64F``.
 381
 382 .. note:: In C API when ``CvKalman* kalmanFilter`` structure is not needed anymore, it should be released with ``cvReleaseKalman(&kalmanFilter)``
 383
 384 KalmanFilter::init
 385 ------------------
 386 Re-initializes Kalman filter. The previous content is destroyed.
 387
 388 .. ocv:function:: void KalmanFilter::init(int dynamParams, int measureParams, int controlParams=0, int type=CV_32F)
 389
 390     :param dynamParams: Dimensionalityensionality of the state.
 391
 392     :param measureParams: Dimensionality of the measurement.
 393
 394     :param controlParams: Dimensionality of the control vector.
 395
 396     :param type: Type of the created matrices that should be ``CV_32F`` or ``CV_64F``.
 397
 398
 399 KalmanFilter::predict
 400 ---------------------
 401 Computes a predicted state.
 402
 403 .. ocv:function:: const Mat& KalmanFilter::predict(const Mat& control=Mat())
 404
 405 .. ocv:pyfunction:: cv2.KalmanFilter.predict([control]) -> retval
 406
 407 .. ocv:cfunction:: const CvMat* cvKalmanPredict( CvKalman* kalman, const CvMat* control=NULL)
 408 .. ocv:pyoldfunction:: cv.KalmanPredict(kalman, control=None) -> mat
 409
 410     :param control: The optional input control
 411
 412
 413 KalmanFilter::correct
 414 ---------------------
 415 Updates the predicted state from the measurement.
 416
 417 .. ocv:function:: const Mat& KalmanFilter::correct(const Mat& measurement)
 418
 419 .. ocv:pyfunction:: cv2.KalmanFilter.correct(measurement) -> retval
 420
 421 .. ocv:cfunction:: const CvMat* cvKalmanCorrect( CvKalman* kalman, const CvMat* measurement )
 422
 423 .. ocv:pyoldfunction:: cv.KalmanCorrect(kalman, measurement) -> mat
 424
 425     :param measurement: The measured system parameters
 426
 427
 428 BackgroundSubtractor
 429 --------------------
 430
 431 .. ocv:class:: BackgroundSubtractor : public Algorithm
 432
 433 Base class for background/foreground segmentation. ::
 434
 435     class BackgroundSubtractor : public Algorithm
 436     {
 437     public:
 438         virtual ~BackgroundSubtractor();
 439         virtual void operator()(InputArray image, OutputArray fgmask, double learningRate=0);
 440         virtual void getBackgroundImage(OutputArray backgroundImage) const;
 441     };
 442
 443
 444 The class is only used to define the common interface for the whole family of background/foreground segmentation algorithms.
 445
 446
 447 BackgroundSubtractor::operator()
 448 --------------------------------
 449 Computes a foreground mask.
 450
 451 .. ocv:function:: void BackgroundSubtractor::operator()(InputArray image, OutputArray fgmask, double learningRate=0)
 452
 453 .. ocv:pyfunction:: cv2.BackgroundSubtractor.apply(image[, fgmask[, learningRate]]) -> fgmask
 454
 455     :param image: Next video frame.
 456
 457     :param fgmask: The output foreground mask as an 8-bit binary image.
 458
 459
 460 BackgroundSubtractor::getBackgroundImage
 461 ----------------------------------------
 462 Computes a background image.
 463
 464 .. ocv:function:: void BackgroundSubtractor::getBackgroundImage(OutputArray backgroundImage) const
 465
 466     :param backgroundImage: The output background image.
 467
 468 .. note:: Sometimes the background image can be very blurry, as it contain the average background statistics.
 469
 470 BackgroundSubtractorMOG
 471 -----------------------
 472
 473 .. ocv:class:: BackgroundSubtractorMOG : public BackgroundSubtractor
 474
 475 Gaussian Mixture-based Background/Foreground Segmentation Algorithm.
 476
 477 The class implements the algorithm described in P. KadewTraKuPong and R. Bowden, *An improved adaptive background mixture model for real-time tracking with shadow detection*, Proc. 2nd European Workshop on Advanced Video-Based Surveillance Systems, 2001: http://personal.ee.surrey.ac.uk/Personal/R.Bowden/publications/avbs01/avbs01.pdf
 478
 479
 480
 481
 482 BackgroundSubtractorMOG::BackgroundSubtractorMOG
 483 ------------------------------------------------
 484 The constructors.
 485
 486 .. ocv:function:: BackgroundSubtractorMOG::BackgroundSubtractorMOG()
 487
 488 .. ocv:function:: BackgroundSubtractorMOG::BackgroundSubtractorMOG(int history, int nmixtures, double backgroundRatio, double noiseSigma=0)
 489
 490 .. ocv:pyfunction:: cv2.BackgroundSubtractorMOG([history, nmixtures, backgroundRatio[, noiseSigma]]) -> <BackgroundSubtractorMOG object>
 491
 492     :param history: Length of the history.
 493
 494     :param nmixtures: Number of Gaussian mixtures.
 495
 496     :param backgroundRatio: Background ratio.
 497
 498     :param noiseSigma: Noise strength.
 499
 500 Default constructor sets all parameters to default values.
 501
 502
 503
 504
 505 BackgroundSubtractorMOG::operator()
 506 -----------------------------------
 507 Updates the background model and returns the foreground mask
 508
 509 .. ocv:function:: void BackgroundSubtractorMOG::operator()(InputArray image, OutputArray fgmask, double learningRate=0)
 510
 511 Parameters are the same as in :ocv:funcx:`BackgroundSubtractor::operator()`
 512
 513
 514 BackgroundSubtractorMOG2
 515 ------------------------
 516 Gaussian Mixture-based Background/Foreground Segmentation Algorithm.
 517
 518 .. ocv:class:: BackgroundSubtractorMOG2 : public BackgroundSubtractor
 519
 520     Here are important members of the class that control the algorithm, which you can set after constructing the class instance:
 521
 522     .. ocv:member:: int nmixtures
 523
 524         Maximum allowed number of mixture components. Actual number is determined dynamically per pixel.
 525
 526     .. ocv:member:: float backgroundRatio
 527
 528         Threshold defining whether the component is significant enough to be included into the background model ( corresponds to ``TB=1-cf`` from the paper??which paper??). ``cf=0.1 => TB=0.9`` is default. For ``alpha=0.001``, it means that the mode should exist for approximately 105 frames before it is considered foreground.
 529
 530     .. ocv:member:: float varThresholdGen
 531
 532         Threshold for the squared Mahalanobis distance that helps decide when a sample is close to the existing components (corresponds to ``Tg``). If it is not close to any component, a new component is generated. ``3 sigma => Tg=3*3=9`` is default. A smaller ``Tg`` value generates more components. A higher ``Tg`` value may result in a small number of components but they can grow too large.
 533
 534     .. ocv:member:: float fVarInit
 535
 536         Initial variance for the newly generated components. It affects the speed of adaptation. The parameter value is based on your estimate of the typical standard deviation from the images. OpenCV uses 15 as a reasonable value.
 537
 538     .. ocv:member:: float fVarMin
 539
 540         Parameter used to further control the variance.
 541
 542     .. ocv:member:: float fVarMax
 543
 544         Parameter used to further control the variance.
 545
 546     .. ocv:member:: float fCT
 547
 548         Complexity reduction parameter. This parameter defines the number of samples needed to accept to prove the component exists. ``CT=0.05`` is a default value for all the samples. By setting ``CT=0`` you get an algorithm very similar to the standard Stauffer&Grimson algorithm.
 549
 550     .. ocv:member:: uchar nShadowDetection
 551
 552         The value for marking shadow pixels in the output foreground mask. Default value is 127.
 553
 554     .. ocv:member:: float fTau
 555
 556         Shadow threshold. The shadow is detected if the pixel is a darker version of the background. ``Tau`` is a threshold defining how much darker the shadow can be. ``Tau= 0.5`` means that if a pixel is more than twice darker then it is not shadow. See Prati,Mikic,Trivedi,Cucchiarra, *Detecting Moving Shadows...*, IEEE PAMI,2003.
 557
 558
 559 The class implements the Gaussian mixture model background subtraction described in:
 560
 561   * Z.Zivkovic, *Improved adaptive Gausian mixture model for background subtraction*, International Conference Pattern Recognition, UK, August, 2004, http://www.zoranz.net/Publications/zivkovic2004ICPR.pdf. The code is very fast and performs also shadow detection. Number of Gausssian components is adapted per pixel.
 562
 563   * Z.Zivkovic, F. van der Heijden, *Efficient Adaptive Density Estimapion per Image Pixel for the Task of Background Subtraction*, Pattern Recognition Letters, vol. 27, no. 7, pages 773-780, 2006. The algorithm similar to the standard Stauffer&Grimson algorithm with additional selection of the number of the Gaussian components based on: Z.Zivkovic, F.van der Heijden, Recursive unsupervised learning of finite mixture models, IEEE Trans. on Pattern Analysis and Machine Intelligence, vol.26, no.5, pages 651-656, 2004.
 564
 565
 566 BackgroundSubtractorMOG2::BackgroundSubtractorMOG2
 567 --------------------------------------------------
 568 The constructors.
 569
 570 .. ocv:function:: BackgroundSubtractorMOG2::BackgroundSubtractorMOG2()
 571
 572 .. ocv:function:: BackgroundSubtractorMOG2::BackgroundSubtractorMOG2( int history, float varThreshold, bool bShadowDetection=true )
 573
 574     :param history: Length of the history.
 575
 576     :param varThreshold: Threshold on the squared Mahalanobis distance to decide whether it is well described by the background model (see Cthr??). This parameter does not affect the background update. A typical value could be 4 sigma, that is, ``varThreshold=4*4=16;`` (see Tb??).
 577
 578     :param bShadowDetection: Parameter defining whether shadow detection should be enabled (``true`` or ``false``).
 579
 580
 581
 582 BackgroundSubtractorMOG2::operator()
 583 ------------------------------------
 584 Updates the background model and computes the foreground mask
 585
 586 .. ocv:function:: void BackgroundSubtractorMOG2::operator()(InputArray image, OutputArray fgmask, double learningRate=-1)
 587
 588     See :ocv:funcx:`BackgroundSubtractor::operator()`.
 589
 590
 591 BackgroundSubtractorMOG2::getBackgroundImage
 592 --------------------------------------------
 593 Returns background image
 594
 595 .. ocv:function:: void BackgroundSubtractorMOG2::getBackgroundImage(OutputArray backgroundImage)
 596
 597 See :ocv:func:`BackgroundSubtractor::getBackgroundImage`.
 598
 599
 600 calcOpticalFlowSF
 601 -----------------
 602 Calculate an optical flow using "SimpleFlow" algorithm.
 603
 604 .. ocv:function:: void calcOpticalFlowSF( Mat& from, Mat& to, Mat& flow, int layers, int averaging_block_size, int max_flow )
 605
 606 .. ocv:function:: void calcOpticalFlowSF( Mat& from, Mat& to, Mat& flow, int layers, int averaging_block_size, int max_flow, double sigma_dist, double sigma_color, int postprocess_window, double sigma_dist_fix, double sigma_color_fix, double occ_thr, int upscale_averaging_radius, double upscale_sigma_dist, double upscale_sigma_color, double speed_up_thr )
 607
 608     :param prev: First 8-bit 3-channel image.
 609
 610     :param next: Second 8-bit 3-channel image
 611
 612     :param flowX: X-coordinate of estimated flow
 613
 614     :param flowY: Y-coordinate of estimated flow
 615
 616     :param layers: Number of layers
 617
 618     :param averaging_block_size: Size of block through which we sum up when calculate cost function for pixel
 619
 620     :param max_flow: maximal flow that we search at each level
 621
 622     :param sigma_dist: vector smooth spatial sigma parameter
 623
 624     :param sigma_color: vector smooth color sigma parameter
 625
 626     :param postprocess_window: window size for postprocess cross bilateral filter
 627
 628     :param sigma_dist_fix: spatial sigma for postprocess cross bilateralf filter
 629
 630     :param sigma_color_fix: color sigma for postprocess cross bilateral filter
 631
 632     :param occ_thr: threshold for detecting occlusions
 633
 634     :param upscale_averaging_radiud: window size for bilateral upscale operation
 635
 636     :param upscale_sigma_dist: spatial sigma for bilateral upscale operation
 637
 638     :param upscale_sigma_color: color sigma for bilateral upscale operation
 639
 640     :param speed_up_thr: threshold to detect point with irregular flow - where flow should be recalculated after upscale
 641
 642 See [Tao2012]_. And site of project - http://graphics.berkeley.edu/papers/Tao-SAN-2012-05/.
 643
 644
 645
 646 createOptFlow_DualTVL1
 647 ----------------------
 648 "Dual TV L1" Optical Flow Algorithm.
 649
 650 .. ocv:function:: Ptr<DenseOpticalFlow> createOptFlow_DualTVL1()
 651
 652
 653   The class implements the "Dual TV L1" optical flow algorithm described in [Zach2007]_ and [Javier2012]_ .
 654
 655   Here are important members of the class that control the algorithm, which you can set after constructing the class instance:
 656
 657     .. ocv:member:: double tau
 658
 659         Time step of the numerical scheme.
 660
 661     .. ocv:member:: double lambda
 662
 663         Weight parameter for the data term, attachment parameter. This is the most relevant parameter, which determines the smoothness of the output. The smaller this parameter is, the smoother the solutions we obtain. It depends on the range of motions of the images, so its value should be adapted to each image sequence.
 664
 665     .. ocv:member:: double theta
 666
 667         Weight parameter for (u - v)^2, tightness parameter. It serves as a link between the attachment and the regularization terms. In theory, it should have a small value in order to maintain both parts in correspondence. The method is stable for a large range of values of this parameter.
 668
 669     .. ocv:member:: int nscales
 670
 671         Number of scales used to create the pyramid of images.
 672
 673     .. ocv:member:: int warps
 674
 675         Number of warpings per scale. Represents the number of times that I1(x+u0) and grad( I1(x+u0) ) are computed per scale. This is a parameter that assures the stability of the method. It also affects the running time, so it is a compromise between speed and accuracy.
 676
 677     .. ocv:member:: double epsilon
 678
 679         Stopping criterion threshold used in the numerical scheme, which is a trade-off between precision and running time. A small value will yield more accurate solutions at the expense of a slower convergence.
 680
 681     .. ocv:member:: int iterations
 682
 683         Stopping criterion iterations number used in the numerical scheme.
 684
 685
 686
 687
 688 DenseOpticalFlow::calc
 689 --------------------------
 690 Calculates an optical flow.
 691
 692 .. ocv:function:: void DenseOpticalFlow::calc(InputArray I0, InputArray I1, InputOutputArray flow)
 693
 694     :param prev: first 8-bit single-channel input image.
 695
 696     :param next: second input image of the same size and the same type as ``prev`` .
 697
 698     :param flow: computed flow image that has the same size as ``prev`` and type ``CV_32FC2`` .
 699
 700
 701
 702 DenseOpticalFlow::collectGarbage
 703 --------------------------------
 704 Releases all inner buffers.
 705
 706 .. ocv:function:: void DenseOpticalFlow::collectGarbage()
 707
 708
 709
 710 .. [Bouguet00] Jean-Yves Bouguet. Pyramidal Implementation of the Lucas Kanade Feature Tracker.
 711
 712 .. [Bradski98] Bradski, G.R. "Computer Vision Face Tracking for Use in a Perceptual User Interface", Intel, 1998
 713
 714 .. [Bradski00] Davis, J.W. and Bradski, G.R. “Motion Segmentation and Pose Recognition with Motion History Gradients”, WACV00, 2000
 715
 716 .. [Davis97] Davis, J.W. and Bobick, A.F. “The Representation and Recognition of Action Using Temporal Templates”, CVPR97, 1997
 717
 718 .. [Farneback2003] Gunnar Farneback, Two-frame motion estimation based on polynomial expansion, Lecture Notes in Computer Science, 2003, (2749), , 363-370.
 719
 720 .. [Horn81] Berthold K.P. Horn and Brian G. Schunck. Determining Optical Flow. Artificial Intelligence, 17, pp. 185-203, 1981.
 721
 722 .. [Lucas81] Lucas, B., and Kanade, T. An Iterative Image Registration Technique with an Application to Stereo Vision, Proc. of 7th International Joint Conference on Artificial Intelligence (IJCAI), pp. 674-679.
 723
 724 .. [Welch95] Greg Welch and Gary Bishop “An Introduction to the Kalman Filter”, 1995
 725
 726 .. [Tao2012] Michael Tao, Jiamin Bai, Pushmeet Kohli and Sylvain Paris. SimpleFlow: A Non-iterative, Sublinear Optical Flow Algorithm. Computer Graphics Forum (Eurographics 2012)
 727
 728 .. [Zach2007] C. Zach, T. Pock and H. Bischof. "A Duality Based Approach for Realtime TV-L1 Optical Flow", In Proceedings of Pattern Recognition (DAGM), Heidelberg, Germany, pp. 214-223, 2007
 729
 730 .. [Javier2012] Javier Sanchez, Enric Meinhardt-Llopis and Gabriele Facciolo. "TV-L1 Optical Flow Estimation".