Evaluación del efecto en el algoritmo de Análisis Semántico Latente al utilizar colecciones de datos cada vez más grandes para la detección y extracción de sinónimos y su independencia respecto al lenguaje, por medio de su implementación distribuida

Alfaro-Flores, Rafael

View/Open

Evaluacion_efecto_algoritmo_analisis_semantico_latente.pdf (961.4Kb)

Date

2014

Author

Alfaro-Flores, Rafael

Metadata

Show full item record

Abstract

Access to large data, especially for text processing applications, results in more eﬀec tive algorithms and therefore becomes transcendental to take advantage of these large amounts of data. Latent Semantic Analysis (LSA) is an unsupervised machine learning algorithm which beneﬁts from these features and can be used for synonym detection and extraction. LSA takes advantage of the implicit semantic structure that exists in the association between documents and the terms they contain to statistically analyze the relationships between the terms of the collection of text documents; and because it uses a strictly mathematical approach, it is inherently independent of language. This is a thesis for the Masters in Computing degree that analyzes the LSA algorithm in a distributed environment, in order to evaluate its eﬀect for synonym detection and extraction on larger collections of data.

Description

Proyecto de Graduación (Maestría en Computación) Instituto Tecnológico de Costa Rica, Escuela de Ingeniería en Computación, 2014.

URI

https://hdl.handle.net/2238/6674

Collections

Maestría en Computación [119]