Mixture-of-Depths: Dynamically allocating compute in transformer-based language models Paper • 2404.02258 • Published Apr 2 • 103
Textbooks Are All You Need II: phi-1.5 technical report Paper • 2309.05463 • Published Sep 11, 2023 • 86
LLM papers Collection A collection of papers useful for studying LLMs. • 14 items • Updated Apr 3 • 12