Zoekopdracht:
faculteit: "FGw" en publicatiejaar: "2005"
| Auteurs | J. Kamps, G.A. Mishne, M. de Rijke | | Titel | Language Models for Searching in Web Corpora |
| Boek/bron titel | The Thirteenth Text Retrieval Conference (TREC 2004) |
| Auteurs/Editors | E.M. Voorhees, L.P. Buckland |
| Jaar | 2005 |
| Pagina's | 11 |
| Faculteit | Faculteit der Natuurwetenschappen, Wiskunde en Informatica Faculteit der Geesteswetenschappen |
| Instituut/afd. | FNWI: Institute for Logic, Language and Computation (ILLC) FNWI: Informatics Institute (II) |
| Trefwoorden | Information retrieval |
| Basisclassificatie | 54.69 computer science: information systems: other |
| Samenvatting | We describe our participation in the TREC 2004 Web and Terabyte tracks. For the web track, we employ mixture language models based on document full-text, incoming anchortext, and documents titles, with a range of webcentric priors. We provide a detailed analysis of the effect on relevance of document length, URL structure, and link topology. The resulting web-centric priors are applied to three types of topics?distillation, home page, and named page?and improve effectiveness for all topic types, as well as for the mixed query set. For the terabyte track, we experimented with building an index just based on the document titles, or on the incoming anchor texts. Very selective indexing leads to a compact index that is effective in terms of early precision, catering for the typical web searcher behavior. |
| Soort document | Hoofdstuk |
| Download bestand | |
| Document finder |
|
Gebruik dit adres om naar deze pagina te linken: http://dare.uva.nl/record/165575
Vraag/opmerking over dit recordMail aan een collega
Toevoegen aan bewaarset
|