Update README.md
README.md
CHANGED
@@ -20,7 +20,7 @@ model-index:
 name: "Language Modeling"
 type: "text-generation"
 dataset:
-name: "Tigrinya News Corpus (
+name: "Tigrinya News Corpus (~60 M tokens)"
 type: "text"
 split: "validation"
 metrics:
@@ -40,7 +40,7 @@ This model demonstrates good generation and completion capabilities for Tigrinya

 ## Model Details

-- **Model Type:** Causal Language Model
+- **Model Type:** Causal Language Model
 - **Base Model:** [google/gemma-3-4b-pt](https://huggingface.co/google/gemma-3-4b-pt)
 - **Parameters:** 4 billion
 - **Architecture:** Gemma 3 with `Gemma3ForCausalLM`
@@ -157,7 +157,7 @@ The following examples demonstrate the model's capabilities across different con
 | Training loss | train | 0.48 |
 | Validation loss | validation | 0.91 |

-*Validation corpus* – held-out 3 % of a
+*Validation corpus* – held-out 3 % of a ~60 M-tokens.

 ## Limitations
