Notice détaillée

Semantic text similarity using corpus-based word similarity and string similarity

Article Ecrit par: Islam, Aminul ; Inkpen, Diana ;

Résumé: We present a method for measuring the semantic similarity of texts using a corpus-based measure of semantic word similarity and a normalized and modified version of the Longest Common Subsequence (LCS) string matching algorithm. Existing methods for computing text similarity have focused mainly on either large documents or individual words.We focus on computing the similarity between two sentences or two short paragraphs. The proposed method can be exploited in a variety of applications involving textual knowledge representation and knowledge discovery. Evaluation results on two different data sets show that our method outperforms several competing methods.

Langue: Anglais

FAQ

Quelles sont les types de documents recensés dans le catalogue de la bibliothèque CERIST?

Les documents recensés dans le catalogue sont : Les périodiques, Articles de périodiques, les livres, les thèses de post-graduation (magister et doctorat), Rapport de recherche, documents Audiovisuels.

Quels sont les différents horaires de la bibliothèque durant l’année ?

La bibliothèque vous accueille de Dimanche à jeudi de 8h30 à 16h30. Notez que la bibliothèque peut être réquisitionnée pour des raisons administratives.

Où se situe la bibliothèque du Cerist ?

La Bibliothèque se situe au réez de chaussée du bloc B Voir Google Maps du site web pour localiser l’adresse.