... of the hypothesis. We assign the same weight to each system. To select the backbone, only the top hypothesis from each system is considered as a candidate. Concerning ... single systems’ scores; row MBR gives the scores of the backbone; GIZA++, TER, CLA, and IHMM give the scores of the systems for the four word alignment methods. MBR decoding slightly improves the performance ... 1: Statistics of training, dev and test data for IWSLT and NIST tasks. In both experiments, we used four systems, listed in Table 2: the phrase-based system Moses (Koehn et al., 2007),...
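
The backbone selection mentioned above (top hypothesis per system, equal system weights, MBR decision) can be illustrated with a minimal sketch. It rests on assumptions that are not in the text: the loss is a length-normalized word-level edit distance used as a simple stand-in for TER (it has no block shifts), and the function names `word_edit_distance` and `select_backbone` are hypothetical.

```python
def word_edit_distance(hyp, ref):
    """Word-level Levenshtein distance (assumed stand-in for TER; no shifts)."""
    h, r = hyp.split(), ref.split()
    prev = list(range(len(r) + 1))  # distances from the empty hypothesis prefix
    for i, hw in enumerate(h, start=1):
        curr = [i]
        for j, rw in enumerate(r, start=1):
            cost = 0 if hw == rw else 1
            curr.append(min(prev[j] + 1,          # delete a hypothesis word
                            curr[j - 1] + 1,      # insert a reference word
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def select_backbone(top_hyps, weights=None):
    """Return the candidate with the lowest expected loss against all candidates.

    top_hyps: the 1-best hypothesis of each system (one string per system).
    weights:  per-system weights; uniform if omitted, as in the text.
    """
    if weights is None:
        weights = [1.0 / len(top_hyps)] * len(top_hyps)

    def expected_loss(candidate):
        return sum(
            w * word_edit_distance(candidate, other) / max(len(other.split()), 1)
            for w, other in zip(weights, top_hyps)
        )

    # min() keeps the first candidate on ties
    return min(top_hyps, key=expected_loss)


if __name__ == "__main__":
    hyps = [
        "the cat sat on the mat",
        "the cat sat on a mat",
        "a cat is on the mat",
        "the cat sat on the mat",
    ]
    print(select_backbone(hyps))
```

With uniform weights, the chosen backbone is simply the hypothesis closest on average to the other systems' outputs; a real setup would substitute an actual TER implementation for the edit distance.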