luel
/

gemma-3-4b-tigrinya

Text Generation

sequence-modeling

Eval Results (legacy)

Model card Files Files and versions

luel commited on Jun 14, 2025

Commit

c220d03

·

verified ·

1 Parent(s): c64d109

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ model-index:
           name: "Language Modeling"
           type: "text-generation"
         dataset:
-          name: "Tigrinya News Corpus (59 M tokens)"
           type: "text"
           split: "validation"
         metrics:
@@ -40,7 +40,7 @@ This model demonstrates good generation and completion capabilities for Tigrinya
 ## Model Details
-- **Model Type:** Causal Language Model (Autoregressive)
 - **Base Model:** [google/gemma-3-4b-pt](https://huggingface.co/google/gemma-3-4b-pt)
 - **Parameters:** 4 billion
 - **Architecture:** Gemma 3 with `Gemma3ForCausalLM`
@@ -157,7 +157,7 @@ The following examples demonstrate the model's capabilities across different con
 | Training loss   | train      | 0.48     |
 | Validation loss | validation | 0.91     |
-*Validation corpus* – held-out 3 % of a 59 M-tokens.
 ## Limitations

           name: "Language Modeling"
           type: "text-generation"
         dataset:
+          name: "Tigrinya News Corpus (~60 M tokens)"
           type: "text"
           split: "validation"
         metrics:
 ## Model Details
+- **Model Type:** Causal Language Model
 - **Base Model:** [google/gemma-3-4b-pt](https://huggingface.co/google/gemma-3-4b-pt)
 - **Parameters:** 4 billion
 - **Architecture:** Gemma 3 with `Gemma3ForCausalLM`
 | Training loss   | train      | 0.48     |
 | Validation loss | validation | 0.91     |
+*Validation corpus* – held-out 3 % of a ~60 M-tokens.
 ## Limitations