luel committed on
Commit c220d03 · verified · 1 Parent(s): c64d109

Update README.md

Files changed (1)
  1. README.md +3 -3
README.md CHANGED
@@ -20,7 +20,7 @@ model-index:
  name: "Language Modeling"
  type: "text-generation"
  dataset:
- name: "Tigrinya News Corpus (59 M tokens)"
+ name: "Tigrinya News Corpus (~60 M tokens)"
  type: "text"
  split: "validation"
  metrics:
@@ -40,7 +40,7 @@ This model demonstrates good generation and completion capabilities for Tigrinya
 
  ## Model Details
 
- - **Model Type:** Causal Language Model (Autoregressive)
+ - **Model Type:** Causal Language Model
  - **Base Model:** [google/gemma-3-4b-pt](https://huggingface.co/google/gemma-3-4b-pt)
  - **Parameters:** 4 billion
  - **Architecture:** Gemma 3 with `Gemma3ForCausalLM`
@@ -157,7 +157,7 @@ The following examples demonstrate the model's capabilities across different con
  | Training loss | train | 0.48 |
  | Validation loss | validation | 0.91 |
 
- *Validation corpus* – held-out 3 % of a 59 M-tokens.
+ *Validation corpus* – held-out 3 % of a ~60 M-tokens.
 
  ## Limitations
 
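
As a rough sanity check on the figures this commit touches, the sketch below works out the approximate size of the held-out split and the perplexity implied by the validation loss. It assumes the corpus is exactly 60 M tokens (the README only says ~60 M) and that the reported loss is mean per-token cross-entropy in nats, which the README does not state. The variable names are illustrative and do not come from the repository.

```python
import math

# Figures quoted in the README diff.
corpus_tokens = 60_000_000   # "~60 M tokens" (approximate; assumed exact here)
held_out_percent = 3         # "held-out 3 %"
validation_loss = 0.91       # "Validation loss" row of the metrics table

# Approximate size of the validation split, using integer arithmetic.
validation_tokens = corpus_tokens * held_out_percent // 100

# Assumption: the loss is mean per-token cross-entropy in nats,
# in which case perplexity is exp(loss).
validation_perplexity = math.exp(validation_loss)

print(f"validation tokens ~ {validation_tokens:,}")
print(f"validation perplexity ~ {validation_perplexity:.2f}")
```

Under those assumptions the held-out split is about 1.8 M tokens and the implied validation perplexity is about 2.48. If the loss were reported in a different unit, for example bits per token, the perplexity figure would change.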