gagan3012 (Gagan Bhatia)

upvoted a paper 1 day ago

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published 7 days ago • 63

upvoted 2 papers 2 months ago

Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic

Paper • 2407.18129 • Published Jul 25 • 11

Qalam : A Multimodal LLM for Arabic Optical Character and Handwriting Recognition

Paper • 2407.13559 • Published Jul 18 • 12

upvoted a paper 3 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20 • 85

upvoted an article 4 months ago

Article

Introducing the Open Arabic LLM Leaderboard

May 14

• 62

upvoted an article 5 months ago

Article

Custom architectures with HuggingFace 🤗

By

•

Apr 22

• 21

upvoted a paper 7 months ago

Design2Code: How Far Are We From Automating Front-End Engineering?

Paper • 2403.03163 • Published Mar 5 • 93

upvoted a collection 7 months ago

Finance

Collection

12 items • Updated Jun 8 • 3

upvoted 5 papers 7 months ago

upvoted a paper 11 months ago

OtterHD: A High-Resolution Multi-modality Model

Paper • 2311.04219 • Published Nov 7, 2023 • 31

upvoted a paper about 1 year ago

Robust Distortion-free Watermarks for Language Models

Paper • 2307.15593 • Published Jul 28, 2023 • 8

Gagan Bhatia

AI & ML interests

Organizations

gagan3012's activity

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic

Qalam : A Multimodal LLM for Arabic Optical Character and Handwriting Recognition

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Introducing the Open Arabic LLM Leaderboard

Custom architectures with HuggingFace 🤗

Design2Code: How Far Are We From Automating Front-End Engineering?

Finance

StarCoder 2 and The Stack v2: The Next Generation

FuseChat: Knowledge Fusion of Chat Models

The FinBen: An Holistic Financial Benchmark for Large Language Models

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

SPAR: Personalized Content-Based Recommendation via Long Engagement Attention

OtterHD: A High-Resolution Multi-modality Model

Robust Distortion-free Watermarks for Language Models