-
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 141 -
ReFT: Reasoning with Reinforced Fine-Tuning
Paper • 2401.08967 • Published • 27 -
Tuning Language Models by Proxy
Paper • 2401.08565 • Published • 20 -
TrustLLM: Trustworthiness in Large Language Models
Paper • 2401.05561 • Published • 63
Collections
Discover the best community collections!
Collections including paper arxiv:2407.01449
-
Masked Autoencoders Are Scalable Vision Learners
Paper • 2111.06377 • Published • 2 -
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling
Paper • 2311.00430 • Published • 56 -
distil-whisper/distil-large-v2
Automatic Speech Recognition • Updated • 82.4k • 502 -
Seven Failure Points When Engineering a Retrieval Augmented Generation System
Paper • 2401.05856 • Published • 2
-
Chain-of-Verification Reduces Hallucination in Large Language Models
Paper • 2309.11495 • Published • 38 -
Adapting Large Language Models via Reading Comprehension
Paper • 2309.09530 • Published • 75 -
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
Paper • 2309.09400 • Published • 82 -
Language Modeling Is Compression
Paper • 2309.10668 • Published • 82