Speech

Speech is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable machines to perform cognitive auditory tasks.


Last updated 2 years ago


Subfields

Theos supports the following subfields of Speech.

  • Speech Recognition is the task of recognising speech within audio and converting it into text.

  • Voice Cloning is the task of creating speech that's indistinguishable from the original speaker.

  • Emotion Recognition is the task of detecting human emotions using speech signals.

  • Speaker Verification is the task of verifying a person's identity from the characteristics of his or her voice.

  • Speech Synthesis is the task of generating speech from text.
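
One way to keep the five subfields apart is to note what each task consumes and what it produces. The following is a minimal sketch that only restates the definitions above as data. The `SpeechTask` class, the `tasks_producing` helper, and the input/output labels are hypothetical names chosen for illustration and are not part of any Theos API. The input listed for Voice Cloning (reference audio of the speaker) is an assumption, since the definition above does not state it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpeechTask:
    """Illustrative record for one Speech subfield (not a Theos type)."""
    name: str
    input_kind: str   # what the task consumes
    output_kind: str  # what the task produces


# Labels are read off the definitions above; they are illustrative only.
SPEECH_TASKS = [
    SpeechTask("Speech Recognition", "audio", "text"),
    # Assumption: cloning starts from reference audio of the original speaker.
    SpeechTask("Voice Cloning", "audio", "audio"),
    SpeechTask("Emotion Recognition", "audio", "label"),
    SpeechTask("Speaker Verification", "audio", "decision"),
    SpeechTask("Speech Synthesis", "text", "audio"),
]


def tasks_producing(output_kind: str) -> list[str]:
    """Return the names of the tasks whose output matches output_kind."""
    return [t.name for t in SPEECH_TASKS if t.output_kind == output_kind]


if __name__ == "__main__":
    for task in SPEECH_TASKS:
        print(f"{task.name}: {task.input_kind} -> {task.output_kind}")
```

Seen this way, Speech Recognition and Speech Synthesis are inverse mappings between audio and text, while Emotion Recognition and Speaker Verification reduce audio to a label or a yes/no decision.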