Audio Classification model is used to detect around 500+ pre-trained audio commonly occurring sounds such as door opening, car moving sound, dog barking, etc.
500+ other sounds (contact [email protected] for more info)
Starting time of the chunk in milliseconds
Ending time of the chunk in milliseconds
The transcribed sentence from Marsview STT
Audio type label for the Sentence/Chunk
Confidence of the speech type label (ranges from 0 to 1). Higher the better