Matrix Linear Algebra
-
Matrix algebra for a data scientist
24 min read -
A comprehensive and detailed formalization of multi-head attention.
10 min read
Matrix algebra for a data scientist
A comprehensive and detailed formalization of multi-head attention.