The UvA-LINKER offers further options for finding the full text of a publication, including a direct link to the full text if it is held in another online database.

Search results

Query: faculty: "FNWI" and publication year: "2000"

Author: Sjaak Verbeek
Title: An Information Theoretic Approach to Finding Word Groups for Text Classification
Year: 2000
Faculty: Faculteit der Natuurwetenschappen, Wiskunde en Informatica
Institute/dept.: FNWI/FGw: Institute for Logic, Language and Computation (ILLC)
Series: ILLC Master of Logic Theses / ILLC ; MoL-2000-03
Abstract: This thesis concerns finding the 'optimal' number of (non-overlapping) word groups for text classification. We present a method to select _which_ words to cluster into word groups and _how many_ such word groups to use, on the basis of a set of pre-classified texts. The method involves a greedy search through the space of possible word groups. The criterion used to navigate this space is based on 'mutual information' and is known as 'Jensen-Shannon divergence'. The criterion for deciding _which number_ of word groups to use is based on Rissanen's MDL Principle. We present empirical results indicating that the proposed method performs well at its task. The prediction model is based on the Naive Bayes model, and the data set used for the experiments is a subset of the '20 Newsgroup Dataset'.
Document type: Preprint
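
The abstract names Jensen-Shannon divergence as the criterion that guides the search over word groups. As a rough illustration only, the sketch below computes the standard weighted Jensen-Shannon divergence between two discrete distributions. This record does not give the thesis's exact formulation, so the function names, the default equal weights, and the reading of the inputs as class distributions for two words are all assumptions.

```python
import math


def kl_divergence(p, q):
    """Kullback-Leibler divergence D(p || q) in bits.

    Terms with p_i == 0 contribute nothing, by the usual convention.
    """
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def js_divergence(p, q, w_p=0.5, w_q=0.5):
    """Weighted Jensen-Shannon divergence between distributions p and q.

    w_p and w_q are positive mixture weights that sum to 1. Equal weights
    are an assumption made here; a clustering method might instead weight
    each word by its relative frequency.
    """
    # Mixture distribution m = w_p * p + w_q * q
    m = [w_p * pi + w_q * qi for pi, qi in zip(p, q)]
    return w_p * kl_divergence(p, m) + w_q * kl_divergence(q, m)


# Hypothetical class distributions P(class | word) for two words
# over three classes (made-up numbers, not taken from the thesis).
word_a = [0.7, 0.2, 0.1]
word_b = [0.6, 0.3, 0.1]
score = js_divergence(word_a, word_b)
```

Under this reading, two words whose class distributions have a small divergence would be candidates for the same word group, since merging them discards little information about the class.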