Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models Jun 24 • 168
MagpieLM Collection Aligning LMs with Fully Open Recipe (data+training configs+logs) • 9 items • Updated 3 days ago • 12
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated 7 days ago • 181
YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models Paper • 2409.13592 • Published 5 days ago • 39
Qwen2-Audio Collection Audio-language model series based on Qwen2 • 4 items • Updated 7 days ago • 40
Qwen2-VL Collection Vision-language model series based on Qwen2 • 15 items • Updated 7 days ago • 121
Imagine yourself: Tuning-Free Personalized Image Generation Paper • 2409.13346 • Published 5 days ago • 57
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published 6 days ago • 107
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution Paper • 2409.12191 • Published 7 days ago • 63
Qwen2.5 Collection Qwen2.5 language models, pretrained and instruction-tuned, in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 7 days ago • 188
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 14 items • Updated about 2 hours ago • 57
Qwen2.5-Math Collection Math-specific model series based on Qwen2.5 • 9 items • Updated 2 days ago • 30
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems Paper • 2402.12875 • Published Feb 20 • 12
Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning Paper • 2407.10718 • Published Jul 15 • 17
Video Generation models Collection The domain of video generation is booming. Here is a list of selected open-access video generation (T2V) models. • 14 items • Updated 29 days ago • 12
Article Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models By isidentical • 30 days ago • 34
Article Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap Jun 19 • 11
Minitron Collection A family of compressed models obtained via pruning and knowledge distillation • 7 items • Updated 8 days ago • 54
Top LLM Collection Collection of top open-source LLMs, sorted with the best on top • 6 items • Updated Jul 26 • 9
Article Introducing HelpingAI-Flash: Emotionally Intelligent Conversational AI for All Devices By Abhaykoul • Jul 19 • 2
Article Introducing HelpingAI-15B: Emotionally Intelligent Conversational AI By Abhaykoul • Jul 12 • 3
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation Paper • 2403.04692 • Published Mar 7 • 40
Article Introducing NPC-Playground, a 3D playground to interact with LLM-powered NPCs Jun 5 • 17
Article Decoding GPT-4'o': In-Depth Exploration of Its Mechanisms and Creating Similar AI. By KingNish • May 21 • 30
llama 3 self-align experiments Collection Replicating the pipeline for StarCoder-2 Instruct on Llama-3-8B with some tweaks https://huggingface.co./blog/sc2-instruct • 4 items • Updated May 9 • 6
FIFO-Diffusion: Generating Infinite Videos from Text without Training Paper • 2405.11473 • Published May 19 • 53
Edit Your Image! Collection Find all the trending and useful Gradio demos that you can use to edit your images. • 21 items • Updated Apr 26 • 23