Abstract
© 2017 Universidade de Santiago de Compostela. All Rights Reserved. This paper aims to provide a comprehensive review of the different methods used to measure the lexical richness of texts and make a proposal for their application to text corpora. Firstly, it presents an overview of the main existing metrics to quantify lexical richness, explaining how these are defined and evaluating their strengths and weaknesses by conducting experimental activities. Secondly, it proposes a methodology for measuring lexical richness that can be used across a complete text corpus so that we can both draw comparisons between texts and create a patterned rating of the degree of lexical richness for each of the texts within the whole corpus.
Original language | English |
---|---|
Pages (from-to) | 347-408 |
Journal | Verba |
Volume | 44 |
DOIs | |
Publication status | Published - 1 Jan 2017 |
Keywords
- Corpus linguistics
- Lexical richness
- Lexical statistics
- Quantitative linguistics
- Stylometry
- Word frequency distributions