Hunter Heidenreich's picture

4 9 26

Hunter Heidenreich

hheiden-roots

·

hunterheiden

AI & ML interests

None yet

Recent Activity

liked a dataset 21 days ago

docling-project/DocLayNet

new activity 21 days ago

docling-project/DocLayNet:Fix license metadata: cdla-permissive-1.0

commented on a paper 21 days ago

GutenOCR: A Grounded Vision-Language Front-End for Documents

View all activity

Organizations

upvoted 3 papers 26 days ago

CommonForms: A Large, Diverse Dataset for Form Field Detection

Paper • 2509.16506 • Published Sep 20, 2025 • 22

Large Language Models for Page Stream Segmentation

Paper • 2408.11981 • Published Aug 21, 2024 • 3

GutenOCR: A Grounded Vision-Language Front-End for Documents

Paper • 2601.14490 • Published 28 days ago • 37

upvoted 3 collections 26 days ago

GutenOCR

3 items • Updated 26 days ago • 6

OCR

Data and models for optical character recognition • 6 items • Updated 26 days ago • 5

RICO

A collection of RICO screenshot-based datasets for training and evaluation. We've attempted to compile all surrounding metadata for the relevant tasks • 8 items • Updated Jan 16 • 5

upvoted a paper 28 days ago

PubMed-OCR: PMC Open Access OCR Annotations

Paper • 2601.11425 • Published Jan 16 • 12

upvoted a collection 11 months ago

Gemma 3

All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 55 items • Updated 1 day ago • 105

upvoted a collection almost 2 years ago

LLaVA-1.6

A collection of LLaVA-1.6 checkpoints • 4 items • Updated Jan 31, 2024 • 75