Yacine Jernite

yjernite

AI & ML interests

Technical, community, and regulatory tools of AI governance @HuggingFace

Articles

Organizations

yjernite's activity

upvoted an article 19 days ago
view article
Article

Getty Images Brings High-Quality, Commercially Safe Dataset to Hugging Face

15
upvoted an article 22 days ago
view article
Article

The Environmental Impacts of AI -- Primer

By sasha
26
upvoted an article 30 days ago
view article
Article

The 5 Most Under-Rated Tools on Hugging Face

79
upvoted an article about 2 months ago
upvoted 4 articles 2 months ago
view article
Article

How NuminaMath Won the 1st AIMO Progress Prize

92
view article
Article

Docmatix - a huge dataset for Document Visual Question Answering

64
view article
Article

Structured Harm Reporting in AI: New Research Paper at AIES and DEFCON event!

By evijit
3
upvoted an article 2 months ago
upvoted an article 2 months ago
view article
Article

SmolLM - blazingly fast and remarkably powerful

244
upvoted 12 articles 3 months ago
view article
Article

Experimenting with Automatic PII Detection on the Hub using Presidio

23
view article
Article

Announcing New Dataset Search Features

22
view article
Article

EU Training Data Transparency: A Proposal for a Sufficiently Detailed Summary 📑📚🖼️🇪🇺

By yjernite
8
view article
Article

BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡

By xhluca
35
view article
Article

AI Policy @🤗: Open ML Considerations in the EU AI Act

2
view article
Article

📚 Training Data Transparency in AI: Tools, Trends, and Policy Recommendations 🗳️

By yjernite
1
view article
Article

Ethics and Society Newsletter #6: Building Better AI: The Importance of Data Quality

30
view article
Article

Introducing Synthetic Data Workshop: Your Gateway to Easy Synthetic Dataset Creation

12
view article
Article

Data Is Better Together: A Look Back and Forward

18
view article
Article

Open-source embeddings and LLMs outperform Gemini and OpenAI for Web Navigation while being faster and cheaper

By dhuynh95
5
view article
Article

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

35
view article
Article

Unveiling CIVICS: A New Dataset for Examining Cultural Values in Language Models

By giadap
7
upvoted an article 3 months ago
upvoted an article 3 months ago
view article
Article

Reports on the Hub: A First Look at Self-governance in Open Source AI Development

By frimelle
7
upvoted an article 4 months ago
view article
Article

How to build an interactive HF Space to visualize an Image Dataset

3
upvoted 2 articles 4 months ago
view article
Article

How to directly access 150k+ Hugging Face Datasets with DuckDB and query using GPT-4o

By chilijung
10
view article
Article

Wikipedia's Treasure Trove: Advancing Machine Learning with Diverse Data

By frimelle
13
upvoted 3 articles 4 months ago
view article
Article

Space secrets security update

50
upvoted an article 4 months ago
upvoted 2 articles 5 months ago
view article
Article

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

28
view article
Article

Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data

21
upvoted 3 articles 6 months ago
view article
Article

Vision Language Models Explained

177
view article
Article

Policy Questions Blog 1: AI Data Transparency Remarks for NAIAC Panel 📚🔍⚖️

By yjernite
2
view article
Article

Public Policy at Hugging Face

19