Score Task/Task Achievement
Last updated
Last updated
The Speechace Task Achievement API supports following task types:
Describe-Image: The speaker is presented with an image and asked to describe the details, relationships, and conclusion to be drawn from elements of the image.
Retell-Lecture: The speaker listens to a 1-2 minute lecture and is asked to summarize the lecture focusing on key elements, concepts and conclusions from the lecture.
Answer-Question: The speaker is presented with a short question which typically requires a one or two word answer.
Each task type has particular input and outputs:
describe-image
task_context
: A model description of the image which is presented to the speaker.
Max length: 1024 chars.
Task score on scale of 0-5.
retell-lecture
task_context
: A model summary of the lecture which is presented to the speaker.
Max length: 1024 chars.
Task score on scale of 0-5.
answer-question
task-question
: The question presented to the user.
Task score on scale of 0-1 where 0 is incorrect and 1 is correct.
The API supports different modes in combining task scores and language scores in assessment:
user_audio_file
or user_audio_text
: The speaker's response can be submitted as either audio or text, allowing task scoring to be used with written responses as well.
include_speech_score
: Speech scoring can be included or excluded along with the task score. Note that if user_audio_text
is used, the include_speech_score
will always be zero. Therefore, in written responses, only task scores are provided.
All tasks are available in the following languages:
English (en-us, en-gb)
Spanish (es-es, es-mx)
French (fr-fr, fr-ca)
key
String
dialect
String
user_id
String
Optional: A unique anonymized identifier (generated by your applications) for the end-user who spoke the audio.
task_type
String
The task_type to score. Supported types are:
describe-image
retell-lecture
answer-question.
task_context
String
The context or model or model answer for the task presented to the speaker.
Used in the following task-types:
describe-image: a model description of the image
retell-lecture: a model description of the lecture
This must be provided in the same language as the one being assessed.
task_question
String
The task question presented to the speaker, used in task-type = answer-question.
This must be provided in the same language as the one being assessed.
user_audio_file
File
file with user audio (wav, mp3, m4a, webm, ogg, aiff)
include_speech_score
String
Set to 1
, to include scoring other aspects of the speech: Pronunciation, Fluency, Grammar, Vocab, Coherence.
Set to 0
if you only want to receive the task score only.
user_audio_text
String
A text transcript of the speaker's response.
Use this field instead of user_audio_file
if you already have a transcript of the user's response and do not wish to re-transcribe an audio.
Note: In this case, you will only be able to receive an overall task_score.
Notice the task_score.score
key for the overall task achievement score in the response below:
The new addition is the task score parameters, which indicate the extent to which the task has been achieved.
The endpoint that is to be used will depend on the of your subscription. For example, for US West, the endpoint is .
POST
API issued by Speechace.
This is the in which the speaker will be assessed. Supported values are: en-us, en-gb, fr-fr, fr-ca, es-es, es-mx.
The and interpretation of the key elements in the response of the spoken word or sentence remains the same.
is binary and is higher level. It evaluates whether the response is on-topic or not (True or False)
is more nuanced and scores how well the response addresses the task