Added a way to evaluate overall performance of our model based on exact match and F1-score. 2827202 Robert commited on Mar 14, 2022