Speechace
  • Introduction
    • Overview
    • Use-Cases
  • Getting Started
    • Pre-requisites
      • API Features
      • Getting the API Key
      • API Regions and endpoints
      • API Limits
    • API Samples
    • Supported Languages
    • API Versioning
    • Authentication
    • Try the Speechace API
    • Error Handling
      • Common Errors
      • Retry Strategies
  • Solutions
    • Speaking Practice for Language Learning
    • Automated Language Assessment with AI
    • Voice AI for Early Literacy
    • Test Prep for Standardized tests
      • PTE Speaking Questions
      • IELTS Speaking Questions
      • TOEFL Speaking Questions
      • CEFR Speaking Questions
      • TOEIC Speaking Questions
    • Speaking Practice in Spanish and French
  • Features
    • Introduction
    • Scripted activities
      • Pronunciation Scoring
        • Word and Sentence pronunciation
        • Multiple choice
        • Custom pronunciations
        • Phoneme list
      • Fluency scoring
        • Passage scoring
      • Lexical stress and intonation
    • Spontaneous activities
      • Open-ended scoring
        • Language scoring
        • Relevance scoring
        • Language detection
      • Task achievement scoring
        • Describe Image
        • Re-tell Lecture
        • Answer Question
  • API Reference
    • Postman API reference
    • Score Text/Pronunciation
      • Handling overall scores
      • Handling word scores
      • Handling phoneme and syllable scores
    • Score Text/Multiple choice
      • Handling multiple choice response
    • Score Text/Markup Language
      • Handling Markup Response
    • Score Text/Stress & Intonation
      • Handing stress and intonation response
    • Score Text/Phoneme list
      • Handling phoneme list response
    • Score Text/Fluency
      • Handling fluency response
      • Fidelity detection
    • Score Text/Validate Text
    • Score Speech/Open-ended
      • Handling language scores
      • Per metric feedback
        • Grammar metrics
        • Vocabulary metrics
        • Coherence metrics
    • Score Speech/Relevance
      • Handling relevance response
    • Score Speech/Language Detection
    • Score Task/Task Achievement
  • Guides on common topics
    • Intepreting quality score
    • Interpreting overall scores
      • Pronunciation Bands
      • Fluency Bands
      • Vocabulary Bands
      • Grammar Bands
      • Coherence Bands
    • Scoring rubrics
    • Interpreting fidelity class
    • Phonetic notation
      • US English (en-us)
      • UK English (en-gb)
      • French (fr-fr, fr-ca)
      • Spanish (es-es, es-mx)
    • Getting word timestamps in audio
    • Automatic handling of unknown words
    • Phoneme to letter mapping
    • Markup Language
  • Other Resources
    • Requesting Support
    • Rate Limiting
    • Data Retention
    • FAQs
    • Appendices
Powered by GitBook
On this page
  1. API Reference
  2. Score Text/Fluency

Handling fluency response

PreviousScore Text/FluencyNextFidelity detection

Last updated 1 month ago

The pronunciation interpretation of the spoken word or sentence remains the same as in the function. To interpret the fluency quality, refer to the below key elements:

Overall Fluency Scores

These scores assist test creators in evaluating the overall fluency of spoken responses, offering insights into the quality of the test-taker's speech. Below is an example of how the fluency score is presented. For detailed interpretation, please refer to the , which includes scales from systems such as IELTS, PTE, and Speechace.

Fluency Metrics

The API returns the following feedback metrics under the fluency node:

Field
Description

duration

total length of speech in seconds

articulation

total length of articulation (speech minus pauses, hesitations and non-speech events such as laughter). Excludes beginning silence on very first segment and ending silence on very last segment.

speech_rate

speaking rate in syllables per second.

syllable_count

Count of syllables in this segment

word_count

Count of words in this segment

correct_syllable_count

Count of correctly spoken syllables in this segment

correct_word_count

Count of correctly spoken words in this segment

syllable_correct_per_minute

correct_syllable_count / duration in mins

word_correct_per_minute

correct_word_count / duration in mins

all_pause_count

count of all pauses (filled and unfilled) which are longer than the minimum pause threshold

all_pause_duration

total duration of all pauses (filled and unfilled) in seconds

all_pause_list[]

a list of all the pauses with the begin/end markers for each in extents of 10 msecs

mean_length_run

mean length of run in syllables between pauses

max_length_run

max length of run in syllables between pauses

segment_metrics_list[]

A list of segments within the overall text/audio with the fluency metrics for each segment.

The following are the most commonly used metrics to provide feedback to the user:

  1. word_correct_per_minute: This measures the count of words per minute. You can color-code the test-taker's rate and compare it to the standard rate of 120 words per minute, which is widely considered the minimum fluent speaking rate.

  2. all_pause_list []: This is a list of all pauses, with each pause marked by begin and end times, accurate to within 10 milliseconds. Identify and display the locations of medium pause duration (≥500 milliseconds) and long pause duration (>1 second) based on the length and positions of entries in the all_pause_list[].

  3. duration and articulation: Display the duration and articulation length to show how much time the user spent speaking compared to pausing or using fillers.

    a. duration : The total length of the speech in seconds, including all pauses, fillers, and non-speech events.

    b. articulation: The total length of actual articulation, calculated as the total speech duration minus pauses, hesitations, non-speech events (such as laughter), and excluding any silence at the very beginning and very end of the speech.

Score Text/Pronunciation
overall score guide