WIDER FACE dataset is a face detection benchmark dataset, of which images are selected from the publicly available WIDER dataset. We choose 32,203 images and label 393,703 faces with a high degree of variability in scale, pose and occlusion as depicted in the sample images. WIDER FACE dataset is organized based on 61 event classes. For each event class, we randomly select 40%, 10%, 50% data as training, validation and testing sets. We adopt the same evaluation metric employed in the PASCAL VOC dataset. Similar to MALF and Caltech datasets, we do not release bounding box ground truth for the test images. Users are required to submit final prediction files, which we shall proceed to evaluate.
Please contact us to evaluate your detection results. An evaluation server will be available soon.
The detection result for each image should be a text file, with the same name of the image.
The detection results are organized by the event categories. For example, if the directory
of a testing image is "./0--Parade/0_Parade_marchingband_1_5.jpg"
, the detection result should
be writtern in the text file in "./0--Parade/0_Parade_marchingband_1_5.txt"
. The detection
output is expected in the follwing format:
< image name i >
< number of faces in this image = im >
< face i1 >
< face i2 >
...
< face im >
Each text file should contain 1 row per detected
bounding box, in the format "[left, top, width, height, score]"
. Please see the output example
files and the README if the above descriptions are unclear.
Please use the following citation when referencing the dataset:
@inproceedings{yang2016wider,
Author = {Yang, Shuo and Luo, Ping and Loy, Chen Change and Tang, Xiaoou},
Bootitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
Title = {WIDER FACE: A Face Detection Benchmark},
Year = {2016}}