Tanmay Bakshi - What do language models really learn

Swift TensorFlow Machine Learning AI Frameworks

Tanmay Bakshi - 5 years ago

Deep Neural Networks, and the Language Models built on top of them, have seen a lot of hype over the past couple of months, especially around the GPT family of models. But is this hype really deserved? What relationships and patterns do these models end up learning? Do they really reason and internally represent abstract concepts? Most importantly, how does this generalize to applications of Deep Neural Networks outside of Language Modelling? We'll work through these questions with a bottom-up approach.
