Iria da Cunha, Eric SanJuan, Juan-Manuel Torres-Moreno, Irene Castellón, Marina Lloberes
Proceedings of the First Workshop on Modeling, Learning and Mining for Cross/Multilinguality (MultiLingMine 2016), p. 36-45
At present, automatic discourse analysis is a relevant research topic in the field of NLP. However, discourse is one of the phenomena most difficult to process. Although discourse parsers have been already developed for several languages, this tool does not exist for Catalan. In order to implement this kind of parser, the first step is to develop a discourse segmenter. In this article we present the first discourse segmenter for texts in Catalan.
Treballs de sociolingüística catalana 26, pp. 285-308
ISSN (ed. impresa): 0211-0784
En aquest article, es posen de manifest els fenòmens més de tipus micro que intervenen en l’organització lingüística de les famílies lingüísticament mixtes. La majoria de les dades provenen d’entrevistes semidirigides a progenitors que tenien fills a la xarxa municipal d’escoles bressol de Barcelona.
This study investigates how epistemic stance is encoded and perceived in face-toface communication when language is regarded as comprised by speech and gesture. Two studies were conducted with this goal in mind. The first study consisted of a production task in which participants performed opinion reports.
Jiménez-Zafra Salud María , M Teresa Martín-Valdivia, Luis Alfonso Ureña-López, M. Antònia Martí, Mariona Taulé
Proceedings of the Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics
This paper presents the main sources of disagreement found during the annotation of the Spanish SFU Review Corpus with negation (SFU ReviewSP -NEG). Negation detection is a challenge in most of the task related to NLP, so the availability of corpora annotated with this phenomenon is essential in order to advance in tasks related to this area. A thorough analysis of the problems found during the annotation could help in the study of this phenomenon.
The present study approaches the indexing of epistemicity and evidentiality from the point of view of register by analyzing a total of 30 oral and 30 written productions of two opinion reports (one dealing with a debatable issue and the other with a nondebatable issue) produced by 15 Catalan speakers. The main aim of the paper is to test the potential effects of register (i.e., oral vs. written discourse) and debatability (i.e., debatable vs. nondebatable issue) on the discourse marking of evidentiality and epistemicity.
Kovatchev, Venelin , Maria Salamó, M. Antònia Martí
Procesamiento del Lenguaje Natural, Vol. 57: 109-116
Distributional Semantic Models (DSM) are growing in popularity in Computational Linguistics. DSM use corpora of language use to automatically induce formal representations of word meaning. This article focuses on one of the applications of DSM: identifying groups of semantically related words. We compare two models for obtaining formal representations: a well known approach (CLUTO) and a more recently introduced one (Word2Vec). We compare the two models with respect to the PoS coherence and the semantic relatedness of the words within the obtained groups.