Thursday , March 28 2024

Towards the Development of a Romanian Lexicon for the Analysis of Emotions in the Literary Works of Canonical Authors

Veronica GAVRILĂ, Lidia BĂJENARU*, Ciprian DOBRE, Mihaela TOMESCU
National Institute for Research and Development in Informatics – ICI Bucharest,
8-10 Mareşal Averescu Avenue, 011455, Bucharest, Romania
veronica.gavrila@ici.ro, lidia.bajenaru@ici.ro (*Corresponding author),
ciprian.dobre@ici.ro, mihaela.tomescu@ici.ro

Abstract: The analysis of emotions is still considered a rather difficult procedure and it is constrained by the limitations related to the languages available for this analysis. The Romanian language is one of the richest and most complex languages with different dialects, regionalisms, archaisms and a powerful and expressive poetic language. Starting from the original vision of the poets, the expressiveness of the Romanian language has given birth to special metaphors that often lose their meaning if one tries to translate them. For this reason, the need arose to develop a lexicon specific to the Romanian language that would allow the analysis of emotions from different poems and to successfully identify the emotions presented by an author through the artistic images created in his work. This paper presents the first steps in the development of a Romanian lexicon based on methodologies employed in other languages, mainly the English language. The data set was subject to a filtering and refining process after which it was enriched by using specific tailored web crawlers for adding all the forms and conjugations of the respective words.

Keywords: Lexicon, Natural language processing, Emotion analysis, Crawlers, INTELLIT, Romanian language.

>>FULL TEXT: PDF

CITE THIS PAPER AS:
Veronica GAVRILĂ, Lidia BĂJENARU, Ciprian DOBRE, Mihaela TOMESCU, Towards the Development of a Romanian Lexicon for the Analysis of Emotions in the Literary Works of Canonical Authors, Studies in Informatics and Control, ISSN 1220-1766, vol. 30(2), pp. 111-120, 2021. https://doi.org/10.24846/v30i2y202110