Memory-Efficient Embeddings

Creating smaller models with a new kind of embedding layer

Dr. Robert Kübler
Towards Data Science
13 min read · Jan 1, 2024


Photo by Kostiantyn Vierkieiev on Unsplash

When dealing with categorical data, beginners often resort to one-hot encoding. This is usually fine, but with thousands or even millions of categories, the approach becomes infeasible, for the following reasons:

  1. Increased dimensionality: For each category, you get an…
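
To make the dimensionality point concrete, here is a minimal sketch in plain Python. It is not taken from the article: the helper names (`one_hot`, `one_hot_cells`, `dense_cells`), the toy categories, and the sizes (1,000 rows, one million categories, a 16-dimensional dense representation) are all assumptions chosen for illustration.

```python
# Illustrative sketch only: helper names and sizes are assumptions,
# not the article's code.

def one_hot(index: int, num_categories: int) -> list[int]:
    """Return a vector of zeros with a single 1 at position `index`."""
    if not 0 <= index < num_categories:
        raise ValueError("index out of range")
    vec = [0] * num_categories
    vec[index] = 1
    return vec


def one_hot_cells(num_rows: int, num_categories: int) -> int:
    """Number of values stored when every row is one-hot encoded."""
    return num_rows * num_categories


def dense_cells(num_rows: int, embedding_dim: int) -> int:
    """Number of values stored when every row is a dense vector instead."""
    return num_rows * embedding_dim


# Toy example: three categories, so each row becomes a length-3 vector.
categories = ["red", "green", "blue"]
category_to_index = {c: i for i, c in enumerate(categories)}
encoded = one_hot(category_to_index["green"], len(categories))

# Hypothetical large-scale case: 1,000 rows, 1,000,000 categories.
big_one_hot = one_hot_cells(1_000, 1_000_000)
# The same rows as 16-dimensional dense vectors (16 is an arbitrary choice).
big_dense = dense_cells(1_000, 16)
```

The one-hot width grows linearly with the number of categories, whereas the width of a dense vector is a free choice that does not depend on it.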



Studied Mathematics, PhD in Cryptanalysis, working as a Data Scientist.