The SenSem corpus: An annotated corpus for Spanish and Catalan with information about aspectuality, modality, polarity and factuality

Ana Fernández-Montraveta, Gloria Vázquez*

*Corresponding author for this work

Research output: Contribution to journal; Article; Research; peer-review

7 Citations (Scopus)

Abstract

In this paper, we present the annotation scheme used in the SenSem corpora (SSC), for Spanish and Catalan, to codify information regarding aspectuality, modality, polarity and factuality. As regards aspectuality, the most relevant contribution is the codification of information about dynamicity, telicity and iterativity. Regarding factuality, we present a more fine-grained annotation of uncertainty as applied to the identification of impossible events, completely uncertain events and neutral uncertain events. Although information about factuality in Spanish has been provided elsewhere, the Catalan SSC is the only corpus to do so for Catalan.

Original language: English
Pages (from-to): 273-288
Number of pages: 16
Journal: Corpus Linguistics and Linguistic Theory
Volume: 10
Issue number: 2
Publication status: Published - 1 Oct 2014

Keywords

  • aspectuality
  • assertivity
  • certainty
  • corpus annotation
  • dynamicity
  • factuality
  • impossibility
  • modality
  • polarity
  • telicity
