GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI Paper • 2408.03361 • Published Aug 6 • 85
Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition Paper • 2405.15216 • Published May 24 • 12
FIFO-Diffusion: Generating Infinite Videos from Text without Training Paper • 2405.11473 • Published May 19 • 53
Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching Paper • 2405.11252 • Published May 18 • 12
Towards Modular LLMs by Building and Reusing a Library of LoRAs Paper • 2405.11157 • Published May 18 • 25
SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization Paper • 2405.11582 • Published May 19 • 12
Imp: Highly Capable Large Multimodal Models for Mobile Devices Paper • 2405.12107 • Published May 20 • 25
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning Paper • 2405.12130 • Published May 20 • 45
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework Paper • 2405.11143 • Published May 20 • 33
Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training Paper • 2405.06932 • Published May 11 • 16
SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts Paper • 2405.07518 • Published May 13 • 24
view article Article Advancing Open-source Large Language Models in the Medical & Healthcare Domain By aaditya • May 10 • 5
Exploiting Reasoning Chains for Multi-hop Science Question Answering Paper • 2109.02905 • Published Sep 7, 2021 • 1
Answering Questions by Meta-Reasoning over Multiple Chains of Thought Paper • 2304.13007 • Published Apr 25, 2023 • 1
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation Paper • 2305.14251 • Published May 23, 2023 • 2
L-Eval: Instituting Standardized Evaluation for Long Context Language Models Paper • 2307.11088 • Published Jul 20, 2023 • 4
ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems Paper • 2311.09476 • Published Nov 16, 2023 • 4
Can Large Language Models Be an Alternative to Human Evaluations? Paper • 2305.01937 • Published May 3, 2023 • 2
Better & Faster Large Language Models via Multi-token Prediction Paper • 2404.19737 • Published Apr 30 • 73
A Latent Space Theory for Emergent Abilities in Large Language Models Paper • 2304.09960 • Published Apr 19, 2023 • 3
The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction Paper • 2312.13558 • Published Dec 21, 2023 • 5
LLM-AD: Large Language Model based Audio Description System Paper • 2405.00983 • Published May 2 • 16
Retentive Network: A Successor to Transformer for Large Language Models Paper • 2307.08621 • Published Jul 17, 2023 • 170
Parameter-Efficient Mixture-of-Experts Architecture for Pre-trained Language Models Paper • 2203.01104 • Published Mar 2, 2022 • 2
The Impact of Depth and Width on Transformer Language Model Generalization Paper • 2310.19956 • Published Oct 30, 2023 • 9
ReAct: Synergizing Reasoning and Acting in Language Models Paper • 2210.03629 • Published Oct 6, 2022 • 14
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published Apr 22 • 250
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper • 2404.14619 • Published Apr 22 • 124
Neural Circuit Diagrams: Robust Diagrams for the Communication, Implementation, and Analysis of Deep Learning Architectures Paper • 2402.05424 • Published Feb 8 • 17
Hypothesis Search: Inductive Reasoning with Language Models Paper • 2309.05660 • Published Sep 11, 2023 • 1
Knowledge Sheaves: A Sheaf-Theoretic Framework for Knowledge Graph Embedding Paper • 2110.03789 • Published Oct 7, 2021 • 2
RARR: Researching and Revising What Language Models Say, Using Language Models Paper • 2210.08726 • Published Oct 17, 2022 • 1
BlenderAlchemy: Editing 3D Graphics with Vision-Language Models Paper • 2404.17672 • Published Apr 26 • 18
view article Article The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare Apr 19 • 99