- Princeton NJ, US Derrick Higgins - Chicago IL, US Klaus Zechner - Princeton NJ, US Shasha Xie - Sunnyvale CA, US Je Hun Jeon - Woburn MA, US Keelan Evanini - Pennington NJ, US Guangming Ling - Lawrenceville NJ, US
International Classification:
G10L 15/08 G10L 15/00
Abstract:
A method for scoring non-native speech includes receiving a speech sample spoken by a non-native speaker and performing automatic speech recognition and metric extraction on the speech sample to generate a transcript of the speech sample and a speech metric associated with the speech sample. The method further includes determining whether the speech sample is scorable or non-scorable based upon the transcript and speech metric, where the determination is based on an audio quality of the speech sample, an amount of speech of the speech sample, a degree to which the speech sample is off-topic, whether the speech sample includes speech from an incorrect language, or whether the speech sample includes plagiarized material. When the sample is determined to be non-scorable, an indication of non-scorability is associated with the speech sample. When the sample is determined to be scorable, the sample is provided to a scoring model for scoring.
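The filtering step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the field names, the threshold values, the order of the checks, and the `route` helper are all assumptions chosen for illustration, and the automatic speech recognition and metric extraction stage is assumed to have already produced the values in `AsrResult`. The abstract lists the criteria as alternatives; this sketch simply checks all five.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class AsrResult:
    """Hypothetical output of ASR plus metric extraction for one sample."""
    transcript: str
    snr_db: float               # assumed audio-quality metric
    speech_seconds: float       # assumed measure of the amount of speech
    topic_similarity: float     # 0..1, similarity of transcript to the prompt
    language_confidence: float  # 0..1, confidence speech is in the expected language
    plagiarism_overlap: float   # 0..1, overlap with known source material


def non_scorable_reason(r: AsrResult) -> Optional[str]:
    """Return a reason label if the sample is non-scorable, else None.

    All thresholds are illustrative placeholders, not values from the source.
    """
    if r.snr_db < 10.0:
        return "audio_quality"
    if r.speech_seconds < 5.0 or not r.transcript.strip():
        return "insufficient_speech"
    if r.language_confidence < 0.5:
        return "incorrect_language"
    if r.topic_similarity < 0.2:
        return "off_topic"
    if r.plagiarism_overlap > 0.8:
        return "plagiarized"
    return None


def route(r: AsrResult, scoring_model: Callable[[AsrResult], float]) -> dict:
    """Attach a non-scorability indication, or pass the sample to the scoring model."""
    reason = non_scorable_reason(r)
    if reason is not None:
        return {"scorable": False, "reason": reason, "score": None}
    return {"scorable": True, "reason": None, "score": scoring_model(r)}
```

Here `scoring_model` is any callable that maps a scorable sample to a score; a non-scorable sample never reaches it and instead carries the reason label as its indication of non-scorability.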