Post
586
Running billion parameter models, sometimes we forget what it all is! ๐ค๐ก
Matrix multiplication ๐งฎโจ
While there are multiple plays on memory management and caching to speed it up! ๐๏ธ๐พโก
The naive way of Matrix multiplication becomes even more fascinating the bigger these models get! ๐คฏ๐
QKV for the win! ๐๐๐
GitHub: https://github.com/wentasah/mmul-anim
Slides: https://cw.fel.cvut.cz/wiki/_media/courses/b4m36esw/esw09_2019.pdf ๐๐
Matrix multiplication ๐งฎโจ
While there are multiple plays on memory management and caching to speed it up! ๐๏ธ๐พโก
The naive way of Matrix multiplication becomes even more fascinating the bigger these models get! ๐คฏ๐
QKV for the win! ๐๐๐
GitHub: https://github.com/wentasah/mmul-anim
Slides: https://cw.fel.cvut.cz/wiki/_media/courses/b4m36esw/esw09_2019.pdf ๐๐