BLEU Has Measured Machine Translation Quality Since 2002. It’s Fast Becoming Useless.
As machine translation quality improves, precision-based metrics struggle to identify the best systems. Why have newer metrics not ousted longtime standard BLEU?