HOANG Cong Duy Vu's research logs: 2012-11-04

Monday, 5 November 2012

Multeval

Link: https://github.com/jhclark/multeval

Intro: MultEval takes machine translation hypotheses from several runs of an optimizer and provides 3 popular metric scores, as well as, standard deviations (via bootstrap resampling) and p-values (via approximate randomization). This allows researchers to mitigate some of the risk of using unstable optimizers such as MERT, MIRA, and MCMC. It is intended to help in evaluating the impact of in-house experimental variations on translation quality; it is currently not setup to do bake-off style comparisons (bake-offs can't require multiple optimizer runs nor a standard tokenization).

Related: http://www.ark.cs.cmu.edu/MT/ (Code for Statistical Significance Testing for MT Evaluation Metrics)

HOANG Cong Duy Vu's research logs

Monday, 5 November 2012

Multeval

Pages

My Blog List