Mitesh Khapra
The heads hypothesis: A unifying statistical approach towards understanding multi-headed attention in BERT
On Controllable Sparse Alternatives to Softmax
An autoencoder approach to learning bilingual word representations