Tractament automàtic de la informació en espanyol: processament lèxic, sintàctic i semàntic

  • Subirats Ruggeberg, Carlos (Principal Investigator)
  • Torres Hernández, Maria Judit (Scholar)
  • Colás Pasamontes, José (Collaborator)
  • Cristóbal de Gregorio, María (Collaborator)
  • Emilia Victoria Enríquez Carrasco (Collaborator)
  • Mairal Usón, Ricardo (Collaborator)
  • Bobes Soler, Eulalia De (Researcher on contract)
  • Fuentes Morán, Teresa (Researcher on contract)
  • Garcia De Quesada, Mercedes (Researcher on contract)
  • Garrido Íñigo, Paloma (Researcher on contract)
  • Moya Sariñena, Lidia (Researcher on contract)
  • Ortega Gil, Marc (Researcher on contract)
  • Pascual Dominguez, Blanca (Researcher on contract)
  • Rios Mestre, Antonio (Investigator)

Project Details

Description

1. The first aim of this project is to enhance the lexical coverage as well as to develop the tools of a computational system which allows the automatic processing of lexical and syntactic information in Spanish by means of an electronic dictionary, a computational syntax, and automata intersection algorithms. These algorithms will intersect a text, previously converted into deterministic finite automata (DFA) whose transitions have been tagged using information from an electronic dictionary, with p-subsequential transducers. These transducers will be automatically generated from a computional syntax which formalizes the syntactic constructions associated with Spanish predicates. The output of the intersection of the DFA (text) and the transducers (syntactic constructions) will be a transduced DFA where the predicate relations will be restored in an unambiguous way.
2. The second objective of this project is to develop a semantic tagged corpus with applications for the automatic processing of textual information in Spanish. Following syntactic criteria, the corpus sentence selection will be carried out automatically with the lexical and syntactic analysis tools from 1. The semantic tagging will be developed within the frame semantics theory with the collaboration of the FrameNet\super 2\nosuper project that will allow us the following tools:
(1) a corpus database,
(2) semiautomatic tagging tools, and
(3) corpus exploitation tools for the automatic semantic processing of texts.
StatusFinished
Effective start/end date1/12/021/12/05

Funding

  • Ministerio de Ciencia y Tecnología: €160,080.00

Fingerprint Explore the research topics touched on by this project. These labels are generated based on the underlying awards/grants. Together they form a unique fingerprint.