LogoLogo
  • What is Theos AI?
  • Get Started
    • Object Detection
    • Pose Estimation
  • Library
    • Computer Vision
      • Object Detection
      • Semantic Segmentation
      • Image Classification
      • Pose Estimation
      • Face Recognition
    • Natural Language Processing
      • Language Translation
      • Question Answering
      • Sentiment Analysis
      • Text Generation
      • Text Summarization
    • Speech
      • Speech Recognition
      • Voice Cloning
      • Emotion Recognition
      • Speaker Verification
      • Speech Synthesis
  • Datasets
    • Image
      • Upload
      • Classes
      • Labels
        • Bounding Box
          • Labeling
          • Autolabeling
          • Formats
            • Theos JSON
            • COCO JSON
            • Darknet TXT
            • Pascal VOC
    • Text
    • Audio
  • Machines
    • Theos Cloud
    • Google Colab
    • On-Premise
  • Train
  • Deploy
    • OCR Languages
  • Rest API
    • Datasets
    • Machines
    • Train
    • Deploy
Powered by GitBook
On this page

Was this helpful?

  1. Library

Speech

Speech is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enables cognitive auditory tasks in machines.

Subfields

Following are the subfields of Speech supported by Theos.

  • Speech Recognition is the task of recognising speech within audio and converting it into text.

  • Voice Cloning is the task of creating speech that's indistinguishable from the original speaker.

  • Emotion Recognition is the task of detecting human emotions using speech signals.

  • Speaker Verification is the task of verifying a person's identity from the characteristics of his or her voice.

  • Speech Synthesis is the task of generating speech from text.

PreviousText SummarizationNextSpeech Recognition

Last updated 2 years ago

Was this helpful?