BERT
blogs
publications
Perturbation CheckLists for Evaluating NLG Evaluation Metrics
https://arxiv.org/abs/2109.05771
The heads hypothesis: A unifying statistical approach towards understanding multi-headed attention in BERT
https://ojs.aaai.org/index.php/AAAI/article/view/17605
Improving Dialog Evaluation with a Multi-reference Adversarial Dataset and Large Scale Pretraining
https://doi.org/10.1162/tacl_a_00347
On the weak link between importance and prunability of attention heads
http://dx.doi.org/10.18653/v1/2020.emnlp-main.260
Towards Interpreting BERT for Reading Comprehension Based QA
http://dx.doi.org/10.18653/v1/2020.emnlp-main.261
On Incorporating Structural Information to improve Dialogue Response Generation
https://arxiv.org/pdf/2005.14315.pdf