inference-engine/samples/validation_app/README.md

   1 # Validation Application
   2
   3 Inference Engine Validation Application is a tool that allows to infer deep learning models with
   4 standard inputs and outputs configuration and to collect simple
   5 validation metrics for topologies. It supports **top-1** and **top-5** metric for Classification networks and
   6 11-points **mAP** metric for Object Detection networks.
   7
   8 > **NOTE**: Before running the application with trained models, make sure the models are converted to the Inference Engine format (\*.xml + \*.bin) using the [Model Optimizer tool](./docs/MO_DG/Deep_Learning_Model_Optimizer_DevGuide.md).
   9
  10 Possible use cases of the tool:
  11 * Check if the Inference Engine infers the public topologies well (the engineering team uses the Validation Application for
  12   regular testing)
  13 * Verify if a custom model is compatible with the default input/output configuration and compare its
  14   accuracy with the public models
  15 * Use Validation Application as another sample: although the code is much more complex than in classification and object
  16   detection samples, the source code is open and can be re-used.
  17
  18 ## Validation Application Options
  19
  20 The Validation Application provides the following command-line interface (CLI):
  21 ```sh
  22 Usage: validation_app [OPTION]
  23
  24 Available options:
  25
  26     -h                        Print a help message
  27     -t <type>                 Type of an inferred network ("C" by default)
  28       -t "C" for classification
  29       -t "OD" for object detection
  30     -i <path>                 Required. Folder with validation images. Path to a directory with validation images. For Classification models, the directory must contain folders named as labels with images inside or a .txt file with a list of images. For Object Detection models, the dataset must be in VOC format.
  31     -m <path>                 Required. Path to an .xml file with a trained model
  32     -lbl <path>               Labels file path. The labels file contains names of the dataset classes
  33     -l <absolute_path>        Required for CPU custom layers. Absolute path to a shared library with the kernel implementations
  34     -c <absolute_path>        Required for GPU custom kernels.Absolute path to an .xml file with the kernel descriptions.
  35     -d <device>               Target device to infer on: CPU (default), GPU, FPGA, or MYRIAD. The application looks for a suitable plugin for the specified device.
  36     -b N                      Batch size value. If not specified, the batch size value is taken from IR
  37     -ppType <type>            Preprocessing type. Options: "None", "Resize", "ResizeCrop"
  38     -ppSize N                 Preprocessing size (used with ppType="ResizeCrop")
  39     -ppWidth W                Preprocessing width (overrides -ppSize, used with ppType="ResizeCrop")
  40     -ppHeight H               Preprocessing height (overrides -ppSize, used with ppType="ResizeCrop")
  41     --dump                    Dump file names and inference results to a .csv file
  42
  43     Classification-specific options:
  44       -Czb true               "Zero is a background" flag. Some networks are trained with a modified dataset where the class IDs  are enumerated from 1, but 0 is an undefined "background" class (which is never detected)
  45
  46     Object detection-specific options:
  47       -ODkind <kind>          Type of an Object Detection model. Options: SSD
  48       -ODa <path>             Required for Object Detection models. Path to a directory containing an .xml file with annotations for images.
  49       -ODc <file>             Required for Object Detection models. Path to a file containing a list of classes
  50       -ODsubdir <name>        Directory between the path to images (specified with -i) and image name (specified in the .xml file). For VOC2007 dataset, use JPEGImages.
  51 ```
  52 The tool options are divided into two categories:
  53 1. **Common options** named with a single letter or a word, such as `-b` or `--dump`.
  54    These options are the same in all Validation Application modes.
  55 2. **Network type-specific options** named as an acronym of the network type (`C` or `OD`)
  56    followed by a letter or a word.
  57
  58 ## General Workflow
  59
  60 When executed, the Validation Application perform the following steps:
  61
  62 1. Loads a model to an Inference Engine plugin
  63 2. Reads validation set (specified with the `-i` option):
  64     - if you specified a directory, the application tries to load labels first. To do this, it searches for the file
  65       with the same name as a model, but with `.labels` extension (instead of `.xml`).
  66       Then it searches for the specified folder, detects its sub-folders named as known labels, and adds all images from these sub-folders to the validation set. When there are no such sub-folders, validation set is considered empty.
  67
  68     - if you specified a `.txt` file, the application reads this file expecting every line to be in the correct format.
  69       For more information about the format, refer to the <a href="#preparing">Preparing the Dataset</a> section below.
  70
  71 3. Reads the batch size value specified with the `-b` option and loads this number of images to the plugin
  72    > **NOTE**: Images loading time is not a part of inference time reported by the application.
  73
  74 4. The plugin infers the model, and the Validation Application collects the statistics.
  75
  76 You can also retrieve infer result by specifying the `--dump` option, however it generates a report only
  77 for Classification models. This CLI option enables creation (if possible) of an inference report in
  78 the `.csv` format.
  79
  80 The structure of the report is a set of lines, each of them contains semicolon-separated values:
  81 * image path
  82 * a flag representing correctness of prediction
  83 * ID of Top-1 class
  84 * probability that the image belongs to Top-1 class in per cents
  85 * ID of Top-2 class
  86 * probability that the image belongs to Top-2 class in per cents
  87 *
  88
  89 This is an example line from such report:
  90 ```bash
  91 "ILSVRC2012_val_00002138.bmp";1;1;8.5;392;6.875;123;5.875;2;5.5;396;5;
  92 ```
  93 It means that the given image was predicted correctly. The most probable prediction is that this image
  94 represents class *1* with the probability *0.085*.
  95
  96 ## <a name="preparing"></a>Prepare a Dataset
  97
  98 You must prepare the dataset before running the Validation Application. The format of dataset depends on
  99 a type of the model you are going to validate. Make sure that the dataset is format is applicable
 100 for the chosen model type.
 101
 102 ### Dataset Format for Classification: Folders as Classes
 103
 104 In this case, a dataset has the following structure:
 105 ```sh
 106 |-- <path>/dataset
 107     |-- apron
 108         |-- apron1.bmp
 109         |-- apron2.bmp
 110     |-- collie
 111         |-- a_big_dog.jpg
 112     |-- coral reef
 113         |-- reef.bmp
 114     |-- Siamese
 115         |-- cat3.jpg
 116 ```
 117
 118 This structure means that each folder in dataset directory must have the name of one of the classes and contain all images of this class. In the given example, there are two images that represent the class `apron`, while three other classes have only one image
 119 each.
 120
 121 > **NOTE:** A dataset can contain images of both `.bmp` and `.jpg` formats.
 122
 123 The correct way to use such dataset is to specify the path as `-i <path>/dataset`.
 124
 125 ### Dataset Format for Classification: List of Images (ImageNet-like)
 126
 127 If you want to use this dataset format, create a single file with a list of images. In this case, the correct set of files must be similar to the following:
 128 ```bash
 129 |-- <path>/dataset
 130     |-- apron1.bmp
 131     |-- apron2.bmp
 132     |-- a_big_dog.jpg
 133     |-- reef.bmp
 134     |-- cat3.jpg
 135     |-- labels.txt
 136 ```
 137
 138 Where `labels.txt` looks like:
 139 ```bash
 140 apron1.bmp 411
 141 apron2.bmp 411
 142 cat3.jpg 284
 143 reef.bmp 973
 144 a_big_dog.jpg 231
 145 ```
 146
 147 Each line of the file must contain the name of the image and the ID of the class
 148 that it represents in the format `<image_name> tabulation <class_id>`. For example, `apron1.bmp` represents the class with ID `411`.
 149
 150 > **NOTE:** A dataset can contain images of both `.bmp` and `.jpg` formats.
 151
 152 The correct way to use such dataset is to specify the path as `-i <path>/dataset/labels.txt`.
 153
 154 ### Dataset Format for Object Detection (VOC-like)
 155
 156 Object Detection SSD models can be inferred on the original dataset that was used as a testing dataset during the model training.
 157 To prepare the VOC dataset, follow the steps below :
 158
 159 1. Download the pre-trained SSD-300 model from the SSD GitHub* repository at
 160    [https://github.com/weiliu89/caffe/tree/ssd](https://github.com/weiliu89/caffe/tree/ssd).
 161
 162 2. Download VOC2007 testing dataset:
 163   ```bash
 164   $wget http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtest_06-Nov-2007.tar
 165   tar -xvf VOCtest_06-Nov-2007.tar
 166   ```
 167 3. Convert the model with the [Model Optimizer](docs/MO_DG/prepare_model/convert_model/Convert_Model_From_Caffe.md).
 168
 169 4. Create a proper `.txt` class file from the original `labelmap_voc.prototxt`. The new file must be in
 170 the following format:
 171 ```sh
 172         none_of_the_above 0
 173         aeroplane 1
 174         bicycle 2
 175         bird 3
 176         boat 4
 177         bottle 5
 178         bus 6
 179         car 7
 180         cat 8
 181         chair 9
 182         cow 10
 183         diningtable 11
 184         dog 12
 185         horse 13
 186         motorbike 14
 187         person 15
 188         pottedplant 16
 189         sheep 17
 190         sofa 18
 191         train 19
 192         tvmonitor 20
 193 ```
 194 Save this file as `VOC_SSD_Classes.txt`.
 195
 196 ## Validate Classification Models
 197
 198 Once you have prepared the dataset (refer to the <a href="#preparing">Preparing the Dataset</a> section above),
 199 run the following command to infer a classification model on the selected dataset:
 200 ```bash
 201 ./validation_app -t C -i <path_to_images_directory_or_txt_file> -m <path_to_classification_model>/<model_name>.xml -d <CPU|GPU>
 202 ```
 203
 204 ## Validate Object Detection Models
 205
 206 > **NOTE**: Validation Application was validated with SSD CNN. Any network that can be inferred by the Inference Engine
 207 > and has the same input and output format as one of these should be supported as well.
 208
 209 Once you have prepared the dataset (refer to the <a href="#preparing">Preparing the Dataset</a> section above),
 210 run the following command to infer an Object Detection model on the selected dataset:
 211 ```bash
 212 ./validation_app -d CPU -t OD -ODa "<path_to_VOC_dataset>/VOCdevkit/VOC2007/Annotations" -i "<path_to_VOC_dataset>/VOCdevkit" -m "<path_to_model>/vgg_voc0712_ssd_300x300.xml" -ODc "<path_to_classes_file>/VOC_SSD_Classes.txt" -ODsubdir JPEGImages
 213 ```
 214
 215 ## Understand Validation Application Output
 216
 217 During the validation process, you can see the interactive progress bar that represents the current validation stage. When it is
 218 full, the validation process is over, and you can analyze the output.
 219
 220 Key data from the output:
 221 * **Network loading time** - time spent on topology loading in ms
 222 * **Model** - path to a chosen model
 223 * **Model Precision** - precision of the chosen model
 224 * **Batch size** - specified batch size
 225 * **Validation dataset** - path to a validation set
 226 * **Validation approach** - type of the model: Classification or Object Detection
 227 * **Device** - device type
 228
 229 Below you can find the example output for Classification models, which reports average infer time and
 230 **Top-1** and **Top-5** metric values:
 231 ```bash
 232 Average infer time (ms): 588.977 (16.98 images per second with batch size = 10)
 233
 234 Top1 accuracy: 70.00% (7 of 10 images were detected correctly, top class is correct)
 235 Top5 accuracy: 80.00% (8 of 10 images were detected correctly, top five classes contain required class)
 236 ```
 237
 238 Below you can find the example output for Object Detection models:
 239
 240 ```bash
 241 Progress: [....................] 100.00% done
 242 [ INFO ] Processing output blobs
 243 Network load time: 27.70ms
 244 Model: /home/user/models/ssd/withmean/vgg_voc0712_ssd_300x300/vgg_voc0712_ssd_300x300.xml
 245 Model Precision: FP32
 246 Batch size: 1
 247 Validation dataset: /home/user/Data/SSD-data/testonly/VOCdevkit
 248 Validation approach: Object detection network
 249
 250 Average infer time (ms): 166.49 (6.01 images per second with batch size = 1)
 251 Average precision per class table:
 252
 253 Class   AP
 254 1       0.796
 255 2       0.839
 256 3       0.759
 257 4       0.695
 258 5       0.508
 259 6       0.867
 260 7       0.861
 261 8       0.886
 262 9       0.602
 263 10      0.822
 264 11      0.768
 265 12      0.861
 266 13      0.874
 267 14      0.842
 268 15      0.797
 269 16      0.526
 270 17      0.792
 271 18      0.795
 272 19      0.873
 273 20      0.773
 274
 275 Mean Average Precision (mAP): 0.7767
 276 ```
 277
 278 This output shows the resulting `mAP` metric value for the SSD300 model used to prepare the
 279 dataset. This value repeats the result stated in the
 280 [SSD GitHub* repository](https://github.com/weiliu89/caffe/tree/ssd) and in the
 281 [original arXiv paper](http://arxiv.org/abs/1512.02325).
 282
 283
 284
 285 ## See Also
 286
 287 * [Using Inference Engine Samples](./docs/IE_DG/Samples_Overview.md)