Aritra Roy Gosthipaty's picture

Building on HF

Aritra Roy Gosthipaty PRO

ariG23498

huggingface

·

https://arig23498.github.io/

AI & ML interests

Deep Representation Learning

Recent Activity

updated a bucket 1 day ago

ariG23498/traces

upvoted an article 1 day ago

How to Comply with SOC 2 and ISO 27001 with Hugging Face: A Practical Guide to AI Model Supply Chain Governance

upvoted an article 1 day ago

Unlocking asynchronicity in continuous batching

View all activity

Organizations

liked a model 5 days ago

google/gemma-4-E2B-it-assistant

Any-to-Any • 78M • Updated 5 days ago • 13.3k • 50

liked a Space 10 days ago

The ultimate guide to RL environments: building and scaling them in the LLM era

Building and scaling RL environments for LLM training

liked 6 models 16 days ago

ibm-granite/granite-vision-4.1-4b

Image-Text-to-Text • 4B • Updated 10 days ago • 31.4k • 74

nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16

Any-to-Any • 33B • Updated 8 days ago • 257k • 291

lewtun/talkie-1930-13b-it-hf

Text Generation • 13B • Updated 18 days ago • 6.93k • 23

RedHatAI/Qwen3.6-35B-A3B-NVFP4

Updated 26 days ago • 1.92M • 137

talkie-lm/talkie-1930-13b-it

Updated 23 days ago • 269

inclusionAI/LLaDA2.0-Uni

Any-to-Any • 16B • Updated 4 days ago • 2.15k • 246

liked a model 24 days ago

google/gemma-4-31B

Image-Text-to-Text • 33B • Updated Apr 2 • 364k • 372

liked a model 29 days ago

Qwen/Qwen3.6-35B-A3B

Image-Text-to-Text • 36B • Updated 22 days ago • 5.26M • • 1.78k

liked a Space about 1 month ago

Transformers PR Dashboard

PR triage dashboard with cluster and signal analysis

liked 6 models about 1 month ago

google/gemma-4-E2B

Any-to-Any • 5B • Updated Apr 2 • 777k • 265

microsoft/harrier-oss-v1-0.6b

Feature Extraction • 0.6B • Updated Mar 30 • 206k • • 233

google/gemma-4-26B-A4B-it

Image-Text-to-Text • 27B • Updated 9 days ago • 8.22M • • 957

google/gemma-4-E4B-it

Any-to-Any • 8B • Updated 9 days ago • 6.1M • 1.01k

google/gemma-4-31B-it

Image-Text-to-Text • 33B • Updated 9 days ago • 9.85M • • 2.65k

google/gemma-4-E2B-it

Any-to-Any • 5B • Updated 9 days ago • 3.38M • 619

liked a Space about 1 month ago

Distilling 100B+ Models 40x Faster with TRL

TRL distillation for 100B+ teachers, 40x faster

liked 2 models about 1 month ago

Qwen/Qwen3-VL-Embedding-2B

Sentence Similarity • 2B • Updated about 1 month ago • 1.86M • 402

arcee-ai/Trinity-Large-Thinking

Text Generation • 399B • Updated 1 day ago • 21.2k • • 169