Por favor, use este identificador para citar o enlazar este ítem: http://repositoriodigital.ipn.mx/handle/123456789/14869
Título : Evaluating n-gram Models for a Bilingual Word Sense Disambiguation Task
Otros títulos : Evaluación de modelos de n-gramas para la tarea de desambiguación bilingüe del sentido de las palabras
Autor : Pinto, David
Vilariño, Darnes
Balderas, Balderas
Tovar, Mireya
Beltrán, Beatriz
Palabras clave : Keywords. Bilingual word sense disambiguation, machine translation, parallel corpus, Naïve Bayes classifier.
Fecha de publicación : 13-dic-2011
Editorial : Revista Computación y Sistemas; Vol. 15 No. 2
Citación : Revista Computación y Sistemas; Vol. 15 No. 2
Citación : Revista Computación y Sistemas;Vol. 15 No.2
Resumen : Abstract. The problem of Word Sense Disambiguation (WSD) is about selecting the correct sense of an ambiguous word in a given context. However, even if the problem of WSD is difficult, when we consider its bilingual version, this problem becomes much more complex. In this case, it is necessary not only to find the correct translation, but such translation must consider the contextual senses of the original sentence (in the source language), in order to find the correct sense (in the target language) of the source word. In this paper we present a probabilistic model for bilingual WSD based on n-grams (2-grams, 3-grams, 5-grams and k-grams, for a sentence S of a length k). The aim is to analyze the behavior of the system with different representations of a given sentence containing an ambiguous word. We use a Naïve Bayes classifier for determining the probability of the target sense (in the target language) given a sentence which contains an ambiguous word (in the source language). For this purpose, we use a bilingual statistical dictionary, which is calculated with Giza++ by using the EUROPARL parallel corpus. On the average, the representation model based on 5-grams with mutual information demonstrated the best performance.
URI : http://www.repositoriodigital.ipn.mx/handle/123456789/14869
ISSN : 1405-5546
Aparece en las colecciones: Revistas

Ficheros en este ítem:
Fichero Descripción Tamaño Formato  
209_ART 6_CyS_109.pdf670.86 kBAdobe PDFVisualizar/Abrir


Los ítems de DSpace están protegidos por copyright, con todos los derechos reservados, a menos que se indique lo contrario.