Rethinking embedding coupling in pre-trained language models Paper • 2010.12821 • Published Oct 24, 2020 • 1