Pascal VOC
Folder structure
train
images
image1.jpg
image2.jpg
image(N).jpg
labels
image3.xml
image4.xml
image(N+1).xml
valid
images
image5.jpg
image6.jpg
image(N+2).jpg
labels
image5.xml
image6.xml
image(N+2).xml
test
images
image7.jpg
image8.jpg
image(N+3).jpg
labels
image7.xml
image8.xml
image(N+3).xml
The label file
The object field is composed of the following 2 fields.
name is the name of the bounding box class.
bndbox is the bounding box values.
xmin is the top left point X value.
ymin is the top left point Y value.
xmax is the bottom right point X value.
ymax is the bottom right point Y value.
einstein.jpg
<annotation>
<filename>einstein.jpg</filename>
<object>
<name>eye</name>
<bndbox>
<xmin>687</xmin>
<ymin>579</ymin>
<xmax>752</xmax>
<ymax>617</ymax>
</bndbox>
</object>
<object>
<name>eye</name>
<bndbox>
<xmin>498</xmin>
<ymin>575</ymin>
<xmax>579</xmax>
<ymax>617</ymax>
</bndbox>
</object>
<object>
<name>nose</name>
<bndbox>
<xmin>586</xmin>
<ymin>579</ymin>
<xmax>716</xmax>
<ymax>777</ymax>
</bndbox>
</object>
<object>
<name>mouth</name>
<bndbox>
<xmin>563</xmin>
<ymin>835</ymin>
<xmax>716</xmax>
<ymax>833</ymax>
</bndbox>
</object>
<object>
<name>face</name>
<bndbox>
<xmin>347</xmin>
<ymin>324</ymin>
<xmax>800</xmax>
<ymax>984</ymax>
</bndbox>
</object>
</annotation>
Last updated
Was this helpful?