Investigating the effect of auxiliary objectives for the automated grading of learner English speech transcriptions.

Hannah Craighead, Andrew Caines, Paula Buttery, Helen Yannakoudakis

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review

121 Downloads (Pure)

Abstract

We address the task of automatically grading the language proficiency of spontaneousspeech based on textual features from automatic speech recognition transcripts. Motivated by recent advances in multi-task learning, we develop neural networks trained in a multi-task fashion that learn to predict the proficiency level of non-native English speakers by taking advantage of inductive transfer between the main task (grading) and auxiliary prediction tasks: morpho-syntactic labeling,language modeling, and native language identification (L1). We encode the transcriptions with both bi-directional recurrent neural networks and with bi-directional representations from transformers, compare against a feature-rich baseline, and analyse performance at different proficiency levels and with transcriptions of varying error rates. Our best performance comes from a transformer encoder with L1 prediction as an auxiliary task. We discuss areas for improvement and potential applications for text-only speech scoring.
Original languageEnglish
Title of host publication Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.
PublisherAssociation for Computational Linguistics (ACL)
Publication statusAccepted/In press - 3 Apr 2020

Fingerprint

Dive into the research topics of 'Investigating the effect of auxiliary objectives for the automated grading of learner English speech transcriptions.'. Together they form a unique fingerprint.

Cite this