LogoLogo
  • What is Theos AI?
  • Get Started
    • Object Detection
    • Pose Estimation
  • Library
    • Computer Vision
      • Object Detection
      • Semantic Segmentation
      • Image Classification
      • Pose Estimation
      • Face Recognition
    • Natural Language Processing
      • Language Translation
      • Question Answering
      • Sentiment Analysis
      • Text Generation
      • Text Summarization
    • Speech
      • Speech Recognition
      • Voice Cloning
      • Emotion Recognition
      • Speaker Verification
      • Speech Synthesis
  • Datasets
    • Image
      • Upload
      • Classes
      • Labels
        • Bounding Box
          • Labeling
          • Autolabeling
          • Formats
            • Theos JSON
            • COCO JSON
            • Darknet TXT
            • Pascal VOC
    • Text
    • Audio
  • Machines
    • Theos Cloud
    • Google Colab
    • On-Premise
  • Train
  • Deploy
    • OCR Languages
  • Rest API
    • Datasets
    • Machines
    • Train
    • Deploy
Powered by GitBook
On this page

Was this helpful?

  1. Datasets

Audio

The modality used in all subfields of Speech.

PreviousTextNextMachines

Last updated 2 years ago

Was this helpful?

Coming soon

Our mission at is to democratize AI. To achieve this, we have to create a development platform for all the possible modalities an AI can receive as input. We started with and plan to release more modalities soon. Please, contact us at if you think we should tackle this modality next, or another modality we missed.

Theos
Images
contact@theos.ai