- Inici
- Departament
- Nous estudiants
- Docència
- Grau
- Postgraus, Màsters i Doctorat
- Mobilitat i intercanvi
- Pla d’acció tutorial (PAT), Filologia Catalana
- Pla d'acció tutorial (PAT), Lingüística General
- Sortides professionals (Filologia Catalana)
- Sortides professionals (Lingüística General)
- Estudiants amb necessitats específiques
- Aula de Literatura i Meditació
- Horaris de visita
- Recerca
- Publicacions
- Actualitat
Extending Automatic Discourse Segmentation for Texts in Spanish to Catalan
Any
2016
Lloc
Proceedings of the First Workshop on Modeling, Learning and Mining for Cross/Multilinguality (MultiLingMine 2016), p. 36-45
At present, automatic discourse analysis is a relevant research topic in the field of NLP. However, discourse is one of the phenomena most difficult to process. Although discourse parsers have been already developed for several languages, this tool does not exist for Catalan. In order to implement this kind of parser, the first step is to develop a discourse segmenter. In this article we present the first discourse segmenter for texts in Catalan. This segmenter is based on Rhetorical Structure Theory (RST) for Spanish, and uses lexical and syntactic information to translate rules valid for Spanish into rules for Catalan. We have evaluated the system by using a gold standard corpus including manually segmented texts and results are promising.
dimarts, 4 juliol, 2017 - 11:06