Speech Insights (BETA)
Enable this model configuration to get useful conversational insights that can measure or help measure many of your KPIs.
This API is in BETA and will be provided on request. Please contact support@marsview.ai to enable this API.
Overview
Perform in-depth analysis of conversational data to visualize trends on topics, sentiments, keywords, and behaviors to achieve better outcomes.
Marsview provides a way to capture the engagement level of speakers in real-time. Additionally, you can track user sentiment and emotions along with engagement data.
Insights
For each conversation/file uploaded it returns the following
Insight | Description |
Talk-to-listen Ratio | Speaker’s talk and listen ratio and time |
Speech Insights | Insights based on speakers such as- Longest monologue, filler words used, speech clarity, etc. |
Call Sentiment Score | Gives an overall assessment of the conversation sentiment based on the sentiments, emotions, and tone used in the conversation |
Call Engagement Score | Gives an overall assessment of the conversation engagement based on the talk-time, dead air, and other factors. |
Call Score | Scores the call based on different quantitative and qualitative measurements of the conversation. This can be further customized to the business need. |
Avg. Speech Speed | Get speech speed by the speaker in terms of WPM (words per minute) |
Sentiment vs Time | Capture variations in sentiment over the course of the call by each speaker individually and combined. |
Phrase Cloud (by Topics Type) | Captures salient topics found or spoken in the conversation. |
Topic Sentiment over Time | Capture variations in sentiment over the course of the call by each speaker individually and combined along with the corresponding topics mentioned. |
Speaker Emotions over Time | Capture variations in emotions over the course of the call by each speaker individually and combined. |
Dead Air | timestamps of dead air (silence) found during the conversation |
modelType
Configuration
modelType
Configuration Key | Value |
|
|
| Model Configuration object for |
modelConfig
Parameters
modelConfig
ParametersmodelConfig | Description | Defaults |
| The time threshold(in milliseconds) beyond which silence in a meeting should be considered as dead air time. | 3000 |
Example Request
Example Metadata Response
Response Objects
Field | Description |
| Data insights object containing all the insights of the given audio.video |
| List of trabscript insight objects for each sentence identified by the model |
| Object containing all the insights of the meeting |
| Object containing all the insights of the speaker in the meeting |
transcriptInsights
List<Objects>
transcriptInsights
List<Objects>Field | Description |
| Sentence Identified in the given time frame |
| Start time of the sentence in the input Video/Audio in milliseconds |
| End time of the sentence in the input Video/Audio in milliseconds |
| Speaker id whose voice is identified in the given time frame |
| List of topic object identified in the given time frame |
| List of keywords found in the given sentence |
| The type of speech best representing the sentence identified in the given time frame eg: Statement, Question, |
| The models confidence in the predicted |
| Sentiment of the speaker during the given time frame . |
| Integer representation of the sentiment of the speaker. Can have values between -1 and 1. -1 being very negative and 1 being very positive. |
| A scale of how much the sentence is based on facts and figures. A high subjectivity indicates that the information given by the speaker is not based on facts and that it is highly subjective. |
| Tone of the speaker in the given time frame |
| Value indicating the models confidence in the predicted tone value |
| Emotion of the speaker in the given time frame. |
| Value indicating the models confidence in the predicted emotion value. |
| Average words per minute spoken by the speaker in the given time frame. |
meetingInsights
Object
meetingInsights
ObjectKey | Description |
| List of meeting sentiment objects |
| A specific sentiment identified in the meeting |
| Value specifying the presence if the given sentiment in the meeting. This value ranges from 0 to 1, 0 meaning it wasn't present and 1 meaning only that sentiment was present. Multiplying this with 100 will give you a percentage representation of the same. |
| List of meeting emotion objects |
| A specific emotion identified in the meeting |
| Value specifying the presence if the given emotion in the meeting. This value ranges from 0 to 1, 0 meaning it wasn't present and 1 meaning only that emotion was present. Multiplying this with 100 will give you a percentage representation of the same. |
| Point of time at which the first conversation was initiated in the meeting. Time given is in milliseconds |
| Value indicating how active the meeting was. This value can range between 0 and 1, 0 being no activity at all and 1 being active throughout. |
| List of keyword objects identified in the meeting |
| A specific keyword identified in the meeting |
| Frequency of the given keyword in the meeting |
| The calculated inactive time in the meeting. This will vary depending upon the dead air threshold given |
speakerInsights
Object
speakerInsights
ObjectKey | Value |
| List of speakers in present in the meeting |
| Object representing the talk time ratio of each user in the meeting |
| Object representing the talk time in milliseconds of each user in the meeting |
| |
| Different emotions and their ratios for all users in the meeting. This can help identify the emotion of specific users during the meeting. |
| A specific emotin of a specific user during the meeting |
| Value specifying the presence if the given emotion for a specific user in the meeting. This value ranges from 0 to 1, 0 meaning it wasn't present and 1 meaning only that sentiment was present. Multiplying this with 100 will give you a percentage representation of the same. |
| The average words per minute spoken by the speaker. |
Last updated