dlib

dlib is an open-source, C++ toolkit containing machine learning algorithms and other tools.

The dlib implementation in Viseron provides face recognition capabilities.

Configuration

Configuration example

dlibmap required

dlib configuration.

face_recognitionmap (optional)

Face recognition domain config.

camerasmap required

Camera-specific configuration. All subordinate keys corresponds to the camera_identifier of a configured camera.

<CAMERA_IDENTIFIER>map required

Camera identifier. Valid characters are lowercase a-z, numbers and underscores.

labelslist (optional)

A list of labels that when detected will be sent to the post processor. Applies only to this specific camera.

masklist (optional)

A mask is used to exclude certain areas in the image from post processing.

coordinateslist required

List of X and Y coordinates to form a polygon

Minimum items: 3

xinteger required

X-coordinate (horizontal axis).

yinteger required

Y-coordinate (vertical axis).

labelslist (optional)

A list of labels that when detected will be sent to the post processor. Applies to all cameras defined under cameras.

face_recognition_pathstring (optional, default: /config/face_recognition/faces)

Path to folder which contains subdirectories with images for each face to track.

save_unknown_facesboolean (optional, default: true)

If set to true, any unrecognized faces will be stored in the database, as well as having a snapshot saved. You can then move this image to the folder of the correct person to improve accuracy.

unknown_faces_pathstring deprecated

DEPRECATED. Config option 'unknown_faces_path' is deprecated and will be removed in a future version.

Path to folder where unknown faces will be stored.

expire_afterfloat (optional, default: 5)

Time in seconds before a detected face is no longer considered detected.

Lowest value: 0

save_facesboolean (optional, default: true)

If set to true, detected faces will be stored in the database, as well as having a snapshot saved.

modelselect (optional, default: hog)

Which face recognition model to run. See models for more information on this.

Valid values:

hog
cnn

Face recognition

Face recognition runs as a post processor when a specific object is detected.

Labels

Labels are used to tell Viseron when to run a post processor.

Any label configured under the object_detector for your camera can be added to the post processors labels section.

note

Only objects that are tracked by an object_detector can be sent to a post_processor. The object also has to pass all of its filters (confidence, height, width etc).

Train

On startup images are read from face_recognition_path and a model is trained to recognize these faces.
The folder structure of the faces folder is very strict. Here is an example of the default one:

/config
|── face_recognition
|   └── faces
|       ├── person1
|       |   ├── image_of_person1_1.jpg
|       |   ├── image_of_person1_2.png
|       |   └── image_of_person1_3.jpg
|       └── person2
|       |   ├── image_of_person2_1.jpeg
|       |   └── image_of_person2_2.jpg

warning

You need to follow this folder structure, otherwise training will not be possible.

Models

dlib implements two different models for face recognition, hog and cnn.
hog is less accurate but faster on CPUs.
cnn is a more accurate deep-learning model which is GPU/CUDA accelerated (if available).

If you have a CUDA compatible GPU, dlib will run the cnn model by default. Otherwise the hog model is used.

Troubleshooting

To enable debug logging for dlib, add the following to your config.yaml

/config/config.yaml
logger:
  logs:
    viseron.components.dlib: debug

dlib

Configuration​

Face recognition​

Labels​

Train​

Models​

Troubleshooting​