| Authors | B. Sigurbjornsson, J. Kamps, M. de Rijke |
| Title | EuroGOV: Engineering a Multilingual Web Corpus |
| Book/source title | Working Notes for the CLEF 2005 Workshop |
| Year | 2005 |
| Faculty | Faculty of Science |
| Institute/dept. | FNWI: Informatics Institute (II) |
| Keywords | Multilingual web corpus; Web retrieval; Multilingual retrieval |
| Classification | 54.81 computer science: applications of information systems |
| Abstract | EuroGOV is a multilingual web corpus that was created to serve as the document collection for WebCLEF, the CLEF 2005 web retrieval task. EuroGOV is a collection of web pages crawled from the European Union portal, European Union member state governmental web sites, and Russian government web sites. The corpus contains over 3 million documents written in more than 20 different European languages. In this paper we provide a detailed description of the EuroGOV collection. |
| Document type | Chapter |
Use this URL to link to this page: http://dare.uva.nl/en/record/165568