KingNish (Nishith Jain)

upvoted an article about 5 hours ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24

• 168

upvoted 2 collections about 19 hours ago

MagpieLM

Collection

Aligning LMs with Fully Open Recipe (data+training configs+logs) • 9 items • Updated 3 days ago • 12

Moshi v0.1 Release

Collection

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated 7 days ago • 181

upvoted a collection 1 day ago

Paper-to-Read

Collection

5 items • Updated 1 day ago • 1

upvoted a paper 1 day ago

YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models

Paper • 2409.13592 • Published 5 days ago • 39

upvoted a collection 1 day ago

RealFlux (Flux)

Collection

2 items • Updated 2 days ago • 10

upvoted a paper 1 day ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published 7 days ago • 109

upvoted 3 collections 2 days ago

upvoted a paper 2 days ago

Imagine yourself: Tuning-Free Personalized Image Generation

Paper • 2409.13346 • Published 5 days ago • 57

upvoted 2 collections 4 days ago

Realistic Vision (SD1.5)

Collection

8 items • Updated Dec 4, 2023 • 31

RealVisXL (SDXL)

Collection

14 items • Updated 23 days ago • 61

upvoted 2 papers 5 days ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published 6 days ago • 107

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published 8 days ago • 75

upvoted a collection 6 days ago

Collection Zero & Demo

Collection

Image Gen - Text-to-Image • 22 items • Updated 17 days ago • 10

upvoted a paper 6 days ago

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published 7 days ago • 63

upvoted 3 collections 7 days ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 7 days ago • 188

Qwen2.5-Coder

Collection

Code-specific model series based on Qwen2.5 • 14 items • Updated about 2 hours ago • 57

Qwen2.5-Math

Collection

Math-specific model series based on Qwen2.5 • 9 items • Updated 2 days ago • 30

upvoted a paper 8 days ago

Chain of Thought Empowers Transformers to Solve Inherently Serial Problems

Paper • 2402.12875 • Published Feb 20 • 12

upvoted an article 9 days ago

Article

Introducing Community Tools on HuggingChat

9 days ago

• 26

upvoted an article 11 days ago

Article

"Diffusers Image Fill" guide

12 days ago

• 22

upvoted a paper 13 days ago

Agent Workflow Memory

Paper • 2409.07429 • Published 14 days ago • 26

upvoted a collection 13 days ago

Agents

Collection

7 items • Updated 7 days ago • 1

upvoted a paper 23 days ago

Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning

Paper • 2407.10718 • Published Jul 15 • 17

upvoted 2 collections 28 days ago

Video generation models (Image-to-Video)

Collection

4 items • Updated 29 days ago • 1

Video Generation models

Collection

The domain of video generation is booming. Here is the list of selected Open Access video generation (T2V) models. • 14 items • Updated 29 days ago • 12

upvoted an article 28 days ago

Article

quanto: a pytorch quantization toolkit

Mar 18

• 28

upvoted an article 29 days ago

Article

Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models

30 days ago

• 34

upvoted an article 30 days ago

Article

Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap

Jun 19

• 11

upvoted 3 articles about 1 month ago

Article

Student Ambassador Program's call for applications is open!

May 13, 2022

• 2

Article

Announcing the Hugging Face Fellowship Program

May 17, 2022

• 5

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12

• 98

upvoted a collection about 1 month ago

Minitron

Collection

A family of compressed models obtained via pruning and knowledge distillation • 7 items • Updated 8 days ago • 54

upvoted a paper about 1 month ago

Imagen 3

Paper • 2408.07009 • Published Aug 13 • 60

upvoted 4 articles about 2 months ago

Article

XetHub is joining Hugging Face!

Aug 8

• 77

Article

Memory-efficient Diffusion Transformers with Quanto and Diffusers

Jul 30

• 51

Article

Our Transformers Code Agent beats the GAIA benchmark!

Jul 1

• 45

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23

• 196

upvoted a collection 2 months ago

Top LLM

Collection

Collection of top open-source LLMs, sorted with the best on top • 6 items • Updated Jul 26 • 9

upvoted 2 articles 2 months ago

Article

Train a Llama model from scratch

Jul 29

• 40

Article

Introducing HelpingAI-Flash: Emotionally Intelligent Conversational AI for All Devices

Jul 19

• 2

upvoted a paper 2 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15 • 153

upvoted an article 3 months ago

Article

Introducing HelpingAI-15B: Emotionally Intelligent Conversational AI

Jul 12

• 3

upvoted a paper 3 months ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18 • 141

upvoted 2 articles 3 months ago

Article

🧨 Diffusers welcomes Stable Diffusion 3

Jun 12

• 86

Article

Thoughts on LoRA Training #1

Jun 18

• 31

upvoted a paper 3 months ago

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Paper • 2403.04692 • Published Mar 7 • 40

upvoted 5 articles 4 months ago

Article

Uncensor any LLM with abliteration

Jun 13

• 322

Article

Fine-tune Llama 3 with ORPO

Apr 22

• 221

Article

Introducing NPC-Playground, a 3D playground to interact with LLM-powered NPCs

Jun 5

• 17

Article

BrAIn: next generation neurons?

Jun 5

• 15

Article

HelpingAI 9B: Cutting Edge Emotionally Intelligent AI

May 31

• 3

upvoted a collection 4 months ago

Emotional Intelligence

Collection

Models are listed based on EQ • 16 items • Updated Aug 16 • 8

upvoted an article 4 months ago

Article

Decoding GPT-4'o': In-Depth Exploration of Its Mechanisms and Creating Similar AI.

May 21

• 30

upvoted a collection 4 months ago

llama 3 self-align experiments

Collection

Replicating the pipeline for StarCoder-2 Instruct on Llama-3-8B with some tweaks https://huggingface.co./blog/sc2-instruct • 4 items • Updated May 9 • 6

upvoted an article 4 months ago

Article

How OpenGPT 4o works

Jul 17

• 30

upvoted a paper 4 months ago

FIFO-Diffusion: Generating Infinite Videos from Text without Training

Paper • 2405.11473 • Published May 19 • 53

upvoted a collection 4 months ago

Edit Your Image!

Collection

Find all the trending and useful Gradio demos that you can use to edit your images. • 21 items • Updated Apr 26 • 23

Nishith Jain

Articles

How OpenGPT 4o works

HelpingAI 9B: Cutting Edge Emotionally Intelligent AI

Decoding GPT-4'o': In-Depth Exploration of Its Mechanisms and Creating Similar AI.
