Towards Neural Similarity Evaluators

Muhammed Yusuf Kocyigit, Hassan Kane

July 2019

Abstract

We review the limitations of BLEU and ROUGE, the most popular metrics used to assess reference summaries against hypothesis summaries. We then derive criteria for how a good metric should behave and propose concrete ways to use and test recent Transformer-based language models for the same assessment.

Type

Conference paper

Publication

In Document Intelligence Workshop NeurIPS'19

Muhammed Yusuf Kocyigit

PhD Student at Boston University