1660 631 1577

Julien Chaumond PRO

julien-c

https://huggingface.co.

AI & ML interests

<3 ML/AI for everyone, building products to propel communities fwd

Articles

Hugging Face Selected for the French Data Protection Agency Enhanced Support Program

May 15, 2023

How to train a new language model from scratch using Transformers and Tokenizers

Feb 14, 2020

• 16

Organizations

julien-c's activity

upvoted a collection about 24 hours ago

Wonder Tools picks

Collection

Notable demo apps for exploring useful ways to capitalize on AI • 12 items • Updated 12 days ago • 8

upvoted 3 papers 2 days ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published 7 days ago • 109

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published 6 days ago • 107

Imagine yourself: Tuning-Free Personalized Image Generation

Paper • 2409.13346 • Published 5 days ago • 57

upvoted an article 2 days ago

Article

Exploring the Daily Papers Page on Hugging Face

2 days ago

• 16

upvoted a collection 3 days ago

Core ML Segment Anything 2

Collection

4 items • Updated 12 days ago • 18

upvoted a collection 7 days ago

Moshi v0.1 Release

Collection

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated 7 days ago • 181

upvoted 2 articles 7 days ago

Article

Introducing Community Tools on HuggingChat

9 days ago

• 26

Article

Introducing the SQL Console on Datasets

8 days ago

• 13

upvoted a paper 8 days ago

ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration

Paper • 2409.09506 • Published 11 days ago • 2

upvoted a paper 15 days ago

Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published 29 days ago • 120

upvoted an article 15 days ago

Article

The Environmental Impacts of AI -- Primer

•

22 days ago

• 26

upvoted a paper 16 days ago

ClimDetect: A Benchmark Dataset for Climate Change Detection and Attribution

Paper • 2408.15993 • Published 28 days ago • 7

upvoted an article 21 days ago

Article

Hugging Face partners with TruffleHog to Scan for Secrets

21 days ago

• 9

upvoted 5 papers 21 days ago

Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming

Paper • 2408.16725 • Published 27 days ago • 50

VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters

Paper • 2408.17253 • Published 26 days ago • 35

LongRecipe: Recipe for Efficient Long Context Generalization in Large Languge Models

Paper • 2409.00509 • Published 25 days ago • 38

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published 22 days ago • 76

Kvasir-VQA: A Text-Image Pair GI Tract Dataset

Paper • 2409.01437 • Published 23 days ago • 70

upvoted an article 23 days ago

Article

Scaling robotics datasets with video encoding

29 days ago

• 33

upvoted 3 articles about 1 month ago

Article

Introduction to ggml

Aug 13

• 92

Article

Parquet in Action: A Beginners Guide

•

Aug 14

• 3

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12

• 98

upvoted an article about 2 months ago

Article

XetHub is joining Hugging Face!

Aug 8

• 77

upvoted a paper about 2 months ago

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3 • 74

upvoted an article about 2 months ago

Article

Gradio joins Hugging Face!

Dec 21, 2021

• 3

upvoted 3 papers about 2 months ago

Medical SAM 2: Segment medical images as video via Segment Anything Model 2

Paper • 2408.00874 • Published Aug 1 • 40

Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31 • 73

Meltemi: The first open Large Language Model for Greek

Paper • 2407.20743 • Published Jul 30 • 67

upvoted 3 collections about 2 months ago

Gemma Scope Release

Collection

A comprehensive, open suite of sparse autoencoders for Gemma 2 2B and 9B. • 10 items • Updated Aug 11 • 13

ShieldGemma Release

Collection

A series of safety classifiers, trained on top of Gemma 2, for developers to filter inputs and outputs of their applications. • 3 items • Updated Jul 31 • 11

Gemma 2 2B Release

Collection

The 2.6B parameter version of Gemma 2. • 6 items • Updated Jul 31 • 76

upvoted an article about 2 months ago

Article

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Jul 31

• 58

upvoted a collection about 2 months ago

Research projects on top of vLLM

Collection

Papers cited in https://blog.vllm.ai/2024/07/25/lfai-perf.html • 6 items • Updated Jul 29 • 12

upvoted an article about 2 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

•

Jul 29

• 197

upvoted an article 2 months ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23

• 196

upvoted a collection 2 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Meta Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Aug 2 • 579

upvoted 3 articles 2 months ago

Article

WWDC 24: Running Mistral 7B with Core ML

Jul 22

• 54

Article

Querying Datasets with the Datasets Explorer Chrome Extension

•

Jul 19

• 6

Article

Announcing Finance Commons and the Bad Data Toolbox: Pioneering Open Data and Advanced Document Processing

•

Jul 19

• 17

upvoted 5 papers 2 months ago

E-BATCH: Energy-Efficient and High-Throughput RNN Batching

Paper • 2009.10656 • Published Sep 22, 2020 • 1

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15 • 153

DataComp-LM: In search of the next generation of training sets for language models

Paper • 2406.11794 • Published Jun 17 • 48

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12 • 123

Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs

Paper • 2407.07775 • Published Jul 10 • 3

upvoted an article 2 months ago

Article

The Rise of Agentic Data Generation

•

Jul 15

• 74

upvoted 7 papers 2 months ago

LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs

Paper • 2407.03963 • Published Jul 4 • 15

Vision language models are blind

Paper • 2407.06581 • Published Jul 9 • 80

Inference Performance Optimization for Large Language Models on CPUs

Paper • 2407.07304 • Published Jul 10 • 52

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10 • 64

Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model

Paper • 2407.07053 • Published Jul 9 • 41

Video Diffusion Alignment via Reward Gradients

Paper • 2407.08737 • Published Jul 11 • 47

Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On

Paper • 2407.08348 • Published Jul 11 • 51

upvoted an article 2 months ago

Article

How to run Gemini Nano locally in your browser

•

Jul 11

• 42

upvoted an article 3 months ago

Article

Announcing New Hugging Face and Keras NLP integration

Jul 10

• 29

upvoted a collection 3 months ago

AIMO Progress Prize

Collection

Models and datasets used in the winning solution to the AIMO 1st Progress Prize • 7 items • Updated Jul 19 • 9

upvoted 3 articles 3 months ago

Article

Experimenting with Automatic PII Detection on the Hub using Presidio

Jul 10

• 23

Article

Google Cloud TPUs made available to Hugging Face users

Jul 9

• 19

Article

Announcing New Dataset Search Features

Jul 8

• 22

upvoted a paper 3 months ago

Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models

Paper • 2407.01906 • Published Jul 2 • 34

Julien Chaumond PRO

AI & ML interests

Articles

XetHub is joining Hugging Face!

Hugging Face partners with Wiz Research to Improve AI Security

Introducing Storage Regions on the HF Hub

Hugging Face Selected for the French Data Protection Agency Enhanced Support Program

How to train a new language model from scratch using Transformers and Tokenizers

Organizations

julien-c's activity

Exploring the Daily Papers Page on Hugging Face

Introducing Community Tools on HuggingChat

Introducing the SQL Console on Datasets

The Environmental Impacts of AI -- Primer

Hugging Face partners with TruffleHog to Scan for Secrets

Scaling robotics datasets with video encoding

Introduction to ggml

Parquet in Action: A Beginners Guide

Welcome FalconMamba: The first strong attention-free 7B model

XetHub is joining Hugging Face!

Gradio joins Hugging Face!

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

WWDC 24: Running Mistral 7B with Core ML

Querying Datasets with the Datasets Explorer Chrome Extension

Announcing Finance Commons and the Bad Data Toolbox: Pioneering Open Data and Advanced Document Processing

The Rise of Agentic Data Generation

How to run Gemini Nano locally in your browser

Announcing New Hugging Face and Keras NLP integration

Experimenting with Automatic PII Detection on the Hub using Presidio

Google Cloud TPUs made available to Hugging Face users

Announcing New Dataset Search Features