BERT
blogs
publications

Perturbation CheckLists for Evaluating NLG Evaluation Metrics
https://arxiv.org/abs/2109.05771

The heads hypothesis: A unifying statistical approach towards understanding multi-headed attention in BERT
https://ojs.aaai.org/index.php/AAAI/article/view/17605

Improving Dialog Evaluation with a Multi-reference Adversarial Dataset and Large Scale Pretraining
https://doi.org/10.1162/tacl_a_00347

On the weak link between importance and prunability of attention heads
http://dx.doi.org/10.18653/v1/2020.emnlp-main.260

Towards Interpreting BERT for Reading Comprehension Based QA
http://dx.doi.org/10.18653/v1/2020.emnlp-main.261

On Incorporating Structural Information to improve Dialogue Response Generation
https://arxiv.org/pdf/2005.14315.pdf