Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
tuandunghcmut
's Collections
MT-LLM
Agentic Benchmarks
Safety SFT
Tool Calling dataset for search domain
Document Layout Analysis Dataset
Post-training Dataset
RL-Papers
Visual Chain-of-Thought Reasoning Benchmarks
LLM for Security Benchmarks/Datasets
Visual-CoT/GCoT related
Text Embedding Papers
EMPTY A
Quantized versions of LLMs/MLLMs
Multilingual Sentiment Analysis Dataset
LLM Series
LLM/MLLM (20B - 80B, fit on 1-2 A100/H100)
SLM
MLLM (100B - 300B)
Benchmarks for evaluating LLMs/MLLMs
Conversation Dataset
Multilingual Parallel Text Corpus
Multilingual Pretraining Corpus for Southeast Asian Language
Multilingual Parallel Text Corpus
updated
Mar 26
Upvote
-
vietgpt/opus100_envi
Viewer
•
Updated
Jul 3, 2023
•
1M
•
158
•
4
tuandunghcmut/PhoMT-MTet-Mixture
Viewer
•
Updated
Aug 11, 2025
•
7.62M
•
119
•
2
airesearch/scb_mt_enth_2020
Updated
Jan 18, 2024
•
279
•
9
Helsinki-NLP/opus_paracrawl
Viewer
•
Updated
Feb 22, 2024
•
27.3M
•
701
•
6
Helsinki-NLP/opus_books
Viewer
•
Updated
Mar 29, 2024
•
1.25M
•
13.3k
•
88
Helsinki-NLP/open_subtitles
Updated
Jan 18, 2024
•
1.11k
•
75
Helsinki-NLP/OpenSubtitles2024
Viewer
•
Updated
Mar 11
•
570M
•
7.43k
•
7
Upvote
-
Share collection
View history
Collection guide
Browse collections