| Authors |
- C. Callison-Burch
- P. Koehn
- C. Monz
- J. Schroeder
|
| Publication date |
2009
|
| Host editors |
- C. Callison-Burch
- P. Koehn
- C. Monz
- J. Schroeder
|
| Book title |
Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece
|
| Event |
4th Workshop on Statistical Machine Translation, Athens, Greece
|
| Pages (from-to) |
1-28
|
| Publisher |
Morristown, NJ: Association for Computational Linguistics
|
| Organisations |
- Faculty of Science (FNWI) - Informatics Institute (IVI)
|
| Abstract |
This paper presents the results of the WMT09 shared tasks, which included a translation task, a system combination task, and an evaluation task. We conducted a large-scale manual evaluation of 87 machine translation systems and 22 system combination entries. We used the ranking of these systems to measure how strongly automatic metrics correlate with human judgments of translation quality, for more than 20 metrics. We present a new evaluation technique whereby system output is edited and judged for correctness.
|
| Document type |
Conference contribution
|
| Language |
English
|
| Published at |
http://portal.acm.org/citation.cfm?id=1626431.1626433
|