A Method for Extracting Formulaic Sequences from a Student Corpus


Abstract

type:Article

Use of appropriate formulaic sequences can add fluency, accuracy and appropriacy to written English, and one important place these sequences occur is as sentence starters such as “Needless to say” or “At the same time”. This article describes, and provides open-source code for, a tool created using Python and the Natural Language Toolkit that can help identify formulaic sentence starters in an untagged corpus of student writing, for use in progress measurement and course design. Example results from two corpora are presented and discussed.
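To make the idea of extracting sentence starters concrete, the following is a minimal sketch and not the article's published code. It assumes that a "starter" is simply the first n words of a sentence and that a starter counts as formulaic when it recurs across the corpus. The function names `sentence_starters` and `count_starters`, the `min_count` threshold, and the sample essays are all hypothetical. A naive regular-expression splitter stands in for the Natural Language Toolkit's sentence and word tokenizers so that the example runs with the standard library alone.

```python
import re
from collections import Counter


def sentence_starters(text, n=3):
    """Yield the first n lowercased word tokens of each sentence in text."""
    # Naive sentence split: break after ., ! or ? followed by whitespace.
    # (Assumption: a real tool would use a proper sentence tokenizer.)
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    for sentence in sentences:
        # Keep alphabetic tokens (and apostrophes); drop punctuation and digits.
        tokens = re.findall(r"[A-Za-z']+", sentence.lower())
        if len(tokens) >= n:
            yield tuple(tokens[:n])


def count_starters(texts, n=3, min_count=2):
    """Return (starter, frequency) pairs seen at least min_count times."""
    counts = Counter()
    for text in texts:
        counts.update(sentence_starters(text, n))
    return [
        (" ".join(gram), freq)
        for gram, freq in counts.most_common()
        if freq >= min_count
    ]


# Hypothetical two-essay corpus, for illustration only.
essays = [
    "Needless to say, exams are stressful. At the same time, they motivate us.",
    "Needless to say, I was late. At the same time, my friend was early.",
]

if __name__ == "__main__":
    for starter, freq in count_starters(essays, n=3):
        print(freq, starter)
```

Running `count_starters` with several values of `n` (for example 2 through 5) and comparing the lists is one simple way to surface starters of different lengths, since a fixed window truncates “At the same time” when `n=3` but captures it when `n=4`.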

