LogoLogo
  • What is Theos AI?
  • Get Started
    • Object Detection
    • Pose Estimation
  • Library
    • Computer Vision
      • Object Detection
      • Semantic Segmentation
      • Image Classification
      • Pose Estimation
      • Face Recognition
    • Natural Language Processing
      • Language Translation
      • Question Answering
      • Sentiment Analysis
      • Text Generation
      • Text Summarization
    • Speech
      • Speech Recognition
      • Voice Cloning
      • Emotion Recognition
      • Speaker Verification
      • Speech Synthesis
  • Datasets
    • Image
      • Upload
      • Classes
      • Labels
        • Bounding Box
          • Labeling
          • Autolabeling
          • Formats
            • Theos JSON
            • COCO JSON
            • Darknet TXT
            • Pascal VOC
    • Text
    • Audio
  • Machines
    • Theos Cloud
    • Google Colab
    • On-Premise
  • Train
  • Deploy
    • OCR Languages
  • Rest API
    • Datasets
    • Machines
    • Train
    • Deploy
Powered by GitBook
On this page
  • Folder structure
  • Labels file

Was this helpful?

  1. Datasets
  2. Image
  3. Labels
  4. Bounding Box
  5. Formats

COCO JSON

Folder structure

  • train

    • annotations.json

    • image1.jpg

    • image2.jpg

    • image(N).jpg

  • valid

    • annotations.json

    • image3.jpg

    • image4.jpg

    • image(N+1).jpg

  • test

    • annotations.json

    • image5.jpg

    • image(N+2).jpg

Labels file

The bbox field is composed of the following 4 numbers in order.

  1. X value of the top left point of the bounding box.

  2. Y value of the top left point of the bounding box.

  3. Width of the bounding box.

  4. Height of the bounding box.

einstein.jpg

annotations.json
{
  "categories":[
    {
      "id":0,
      "name":"eye",
      "supercategory":"none"
    },
    {
      "id":1,
      "name":"nose",
      "supercategory":"none"
    },
    {
      "id":2,
      "name":"mouth",
      "supercategory":"none"
    },
    {
      "id":3,
      "name":"face",
      "supercategory":"none"
    }
  ],
  "images":[
    {
      "id":0,
      "file_name":"einstein.jpg",
      "height":416,
      "width":416
    }
  ],
  "annotations":[
    {
      "id":0,
      "image_id":0,
      "category_id":0,
      "bbox":[
        687,
        579,
        65,
        38
      ]
    },
    {
      "id":1,
      "image_id":0,
      "category_id":0,
      "bbox":[
        498,
        575,
        81,
        42
      ]
    },
    {
      "id":2,
      "image_id":0,
      "category_id":1,
      "bbox":[
        586,
        579,
        122,
        198
      ]
    },
    {
      "id":3,
      "image_id":0,
      "category_id":2,
      "bbox":[
        563,
        835,
        153,
        48
      ]
    },
    {
      "id":4,
      "image_id":0,
      "category_id":3,
      "bbox":[
        347,
        324,
        453,
        660
      ]
    }
  ]
}
PreviousTheos JSONNextDarknet TXT

Last updated 2 years ago

Was this helpful?

einstein.jpg