Audio Classification

This API allows you to distinguish audio sounds to one of the 500+ classes available

Overview

Audio Classification model is used to detect around 500+ pre-trained audio commonly occurring sounds such as door opening, car moving sound, dog barking, etc.

Audio Classification Types

  • Speech

  • Noise

  • Music

  • Hold Sound

  • Silence

  • 500+ other sounds (contact [email protected] for more info)

This is a Beta API - Undergoing further development. Please reach us at [email protected]

Input Type Supported: Audio

Response Object Fields

Fields

Description

start_time

Starting time of the chunk in milliseconds

end_time

Ending time of the chunk in milliseconds

sentence

The transcribed sentence from Marsview STT

audio_type

Audio type label for the Sentence/Chunk

confidence

Confidence of the speech type label (ranges from 0 to 1). Higher the better

‚Äč