
Pascal VOC

Folder structure

  • train
    • images
      • image1.jpg
      • image2.jpg
      • image(N).jpg
    • labels
      • image1.xml
      • image2.xml
      • image(N).xml
  • valid
    • images
      • image5.jpg
      • image6.jpg
      • image(N+2).jpg
    • labels
      • image5.xml
      • image6.xml
      • image(N+2).xml
  • test
    • images
      • image7.jpg
      • image8.jpg
      • image(N+3).jpg
    • labels
      • image7.xml
      • image8.xml
      • image(N+3).xml
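
The layout above pairs each image with a label file of the same base name (image5.jpg with image5.xml). A minimal sketch of a check for that pairing in one split folder follows. The helper name find_unpaired is hypothetical and not part of Theos, and the sketch assumes that every image uses the .jpg extension.

```python
from pathlib import Path


def find_unpaired(split_dir):
    """Return (images_without_label, labels_without_image) for one split.

    Hypothetical helper: assumes <split_dir>/images/*.jpg and
    <split_dir>/labels/*.xml, matched by base name.
    """
    split = Path(split_dir)
    image_stems = {p.stem for p in (split / "images").glob("*.jpg")}
    label_stems = {p.stem for p in (split / "labels").glob("*.xml")}
    # Set differences give the names present on one side only.
    return sorted(image_stems - label_stems), sorted(label_stems - image_stems)
```

Calling find_unpaired("train") on a complete split should return two empty lists; any name in either list points at a missing image or label.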

The label file

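Each label file is an XML document with a root annotation field. Its filename field holds the name of the image being labeled, and each object field describes one bounding box. An object field is composed of the following 2 fields.

  • name is the class name of the bounding box.
  • bndbox holds the bounding box coordinates, in pixels.
    • xmin is the X value of the top left point.
    • ymin is the Y value of the top left point.
    • xmax is the X value of the bottom right point.
    • ymax is the Y value of the bottom right point.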
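
As an illustration of the four bndbox values, the sketch below models one box and derives its width and height. The BndBox class is hypothetical and not part of Theos. It assumes the usual image convention that the origin is the top left corner and Y grows downward, so a well-formed box has xmin < xmax and ymin < ymax.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BndBox:
    """Hypothetical container for the four bndbox values."""
    xmin: int
    ymin: int
    xmax: int
    ymax: int

    @property
    def width(self):
        return self.xmax - self.xmin

    @property
    def height(self):
        return self.ymax - self.ymin

    def is_valid(self):
        # The top left point must lie strictly above and left of the bottom right point.
        return self.xmin < self.xmax and self.ymin < self.ymax


box = BndBox(xmin=687, ymin=579, xmax=752, ymax=617)
print(box.width, box.height, box.is_valid())  # 65 38 True
```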

einstein.xml (the label file for the image einstein.jpg)
<annotation>
  <filename>einstein.jpg</filename>
  <object>
    <name>eye</name>
    <bndbox>
      <xmin>687</xmin>
      <ymin>579</ymin>
      <xmax>752</xmax>
      <ymax>617</ymax>
    </bndbox>
  </object>
  <object>
    <name>eye</name>
    <bndbox>
      <xmin>498</xmin>
      <ymin>575</ymin>
      <xmax>579</xmax>
      <ymax>617</ymax>
    </bndbox>
  </object>
  <object>
    <name>nose</name>
    <bndbox>
      <xmin>586</xmin>
      <ymin>579</ymin>
      <xmax>716</xmax>
      <ymax>777</ymax>
    </bndbox>
  </object>
  <object>
    <name>mouth</name>
    <bndbox>
      <xmin>563</xmin>
      <ymin>835</ymin>
      <xmax>716</xmax>
      <ymax>833</ymax>
    </bndbox>
  </object>
  <object>
    <name>face</name>
    <bndbox>
      <xmin>347</xmin>
      <ymin>324</ymin>
      <xmax>800</xmax>
      <ymax>984</ymax>
    </bndbox>
  </object>
</annotation>
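
A label file with this shape can be read with Python's standard xml.etree.ElementTree module. The following is a minimal sketch, not an official Theos loader. The function name parse_label is hypothetical, and the sample string reuses only the first object from the file above.

```python
import xml.etree.ElementTree as ET


def parse_label(xml_text):
    """Return (filename, boxes) from a label file's XML text.

    Hypothetical helper: assumes the annotation/object/bndbox
    layout shown in einstein.xml.
    """
    root = ET.fromstring(xml_text)
    boxes = []
    for obj in root.findall("object"):
        bnd = obj.find("bndbox")
        box = {"name": obj.findtext("name")}
        # Coordinates are stored as text, so convert each one to int.
        for key in ("xmin", "ymin", "xmax", "ymax"):
            box[key] = int(bnd.findtext(key))
        boxes.append(box)
    return root.findtext("filename"), boxes


SAMPLE = """<annotation>
  <filename>einstein.jpg</filename>
  <object>
    <name>eye</name>
    <bndbox>
      <xmin>687</xmin>
      <ymin>579</ymin>
      <xmax>752</xmax>
      <ymax>617</ymax>
    </bndbox>
  </object>
</annotation>"""

filename, boxes = parse_label(SAMPLE)
print(filename, boxes)
```

To read a file from disk instead of a string, ET.parse(path).getroot() yields the same root element.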

Last updated 2 years ago
