Chrome Extension
WeChat Mini Program
Use on ChatGLM

Improving Oral Reading Fluency Assessment Through Sub-Sequence Matching of Acoustic Word Embeddings

ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)(2024)

Cited 0|Views8
No score
Abstract
Oral reading fluency assessment is a process where a student reads a passage aloud and is scored against words read correctly by a human listener. Current automatic reading fluency systems match these words read using speech recognition models trained with clean speech data from native adult speakers. This mismatch in training and deployment, compounded by numerous background noises from the classroom, means that student speech is often not correctly recognized. This paper describes a deep learning model that employs text-to-speech and contrastive learning to create acoustic word embeddings of student speech. This embedding is trained with unlabeled data of students reading known passages. Our model then uses sub-sequence matching in the acoustic embedding space to estimate words read correctly per minute, a common criterion in oral reading fluency. Our model’s words read correctly per minute is significantly closer to human listeners compared to systems that use automatic speech recognition only, reducing error of words correct per minute from 15.1 to 8.4, on average.
More
Translated text
Key words
Reading Fluency,Deep Learning,Subsequence Matching,Acoustic Word Embedding
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined