Score Task/Task Achievement
The Speechace Task Achievement API supports following task types:
Describe-Image: The speaker is presented with an image and asked to describe the details, relationships, and conclusion to be drawn from elements of the image.
Retell-Lecture: The speaker listens to a 1-2 minute lecture and is asked to summarize the lecture focusing on key elements, concepts and conclusions from the lecture.
Answer-Question: The speaker is presented with a short question which typically requires a one or two word answer.
Each task type has particular input and outputs:
describe-image
task_context: A model description of the image which is presented to the speaker.
Max length: 1024 chars.
Task score on scale of 0-5.
retell-lecture
task_context: A model summary of the lecture which is presented to the speaker.
Max length: 1024 chars.
Task score on scale of 0-5.
answer-question
task-question: The question presented to the user.
Task score on scale of 0-1 where 0 is incorrect and 1 is correct.
The API supports different modes in combining task scores and language scores in assessment:
user_audio_fileoruser_audio_text: The speaker's response can be submitted as either audio or text, allowing task scoring to be used with written responses as well.include_speech_score: Speech scoring can be included or excluded along with the task score. Note that ifuser_audio_textis used, theinclude_speech_scorewill always be zero. Therefore, in written responses, only task scores are provided.
All tasks are available in the following languages:
English (en-us, en-gb)
Spanish (es-es, es-mx)
French (fr-fr, fr-ca)
Request Format
The endpoint that is to be used will depend on the region of your subscription. For example, for US West, the endpoint is https://api.speechace.co.
POST https://api.speechace.co/api/scoring/task/v9/json
Query Parameters
dialect
String
This is the dialect in which the speaker will be assessed. Supported values are: en-us, en-gb, fr-fr, fr-ca, es-es, es-mx.
user_id
String
Optional: A unique anonymized identifier (generated by your applications) for the end-user who spoke the audio.
task_type
String
The task_type to score. Supported types are:
describe-image
retell-lecture
answer-question.
Request Body
task_context
String
The context or model or model answer for the task presented to the speaker.
Used in the following task-types:
describe-image: a model description of the image
retell-lecture: a model description of the lecture
This must be provided in the same language as the one being assessed.
task_question
String
The task question presented to the speaker, used in task-type = answer-question.
This must be provided in the same language as the one being assessed.
user_audio_file
File
file with user audio (wav, mp3, m4a, webm, ogg, aiff)
include_speech_score
String
Set to
1, to include scoring other aspects of the speech: Pronunciation, Fluency, Grammar, Vocab, Coherence.Set to
0if you only want to receive the task score only.
user_audio_text
String
A text transcript of the speaker's response.
Use this field instead of
user_audio_fileif you already have a transcript of the user's response and do not wish to re-transcribe an audio.Note: In this case, you will only be able to receive an overall
task_score.
Response Example
Notice the task_score.score key for the overall task achievement score in the response below:
The pronunciation and fluency interpretation of the key elements in the response of the spoken word or sentence remains the same.
The new addition is the task score parameters, which indicate the extent to which the task has been achieved.

Last updated