Instructions to use ridwanFatur98/TinyStories-1M-Indonesia with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use ridwanFatur98/TinyStories-1M-Indonesia with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="ridwanFatur98/TinyStories-1M-Indonesia")

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("ridwanFatur98/TinyStories-1M-Indonesia")
model = AutoModelForCausalLM.from_pretrained("ridwanFatur98/TinyStories-1M-Indonesia")

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use ridwanFatur98/TinyStories-1M-Indonesia with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "ridwanFatur98/TinyStories-1M-Indonesia"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ridwanFatur98/TinyStories-1M-Indonesia",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker

docker model run hf.co/ridwanFatur98/TinyStories-1M-Indonesia

SGLang

How to use ridwanFatur98/TinyStories-1M-Indonesia with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "ridwanFatur98/TinyStories-1M-Indonesia" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ridwanFatur98/TinyStories-1M-Indonesia",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "ridwanFatur98/TinyStories-1M-Indonesia" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ridwanFatur98/TinyStories-1M-Indonesia",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Docker Model Runner
How to use ridwanFatur98/TinyStories-1M-Indonesia with Docker Model Runner:
```
docker model run hf.co/ridwanFatur98/TinyStories-1M-Indonesia
```

TinyStories-1M Indonesian Fine-tune (Experimental)

An experimental model fine-tuned from TinyStories-1M using Indonesian language dataset for exploration and learning purposes.

⚠️ Note: This is an experimental model for testing purposes only. Performance is far from optimal.

Model Details

Model Description

This model is a fine-tuned version of TinyStories-1M using an Indonesian language dataset. The primary purpose is for experimentation and learning, not for production use.

Model Sources

Base Model: TinyStories-1M
Training Dataset: Lyon28/Corpus-Indonesia

Performance Metrics

⚠️ Warning: This model is still in early stages and performance is not optimal.

Training Loss & Perplexity

Training loss remains quite high and perplexity shows the model has not converged well:

Rank	Train Loss	Perplexity
1	5.092371	162.775409
2	5.710950	302.158057
3	9.836301	18,700.406340
4	11.639643	113,509.674623
5	11.639969	113,546.630401

Training Details

Training Time: >3 hours
Hardware: T4 GPU
Training Regime: Full fine-tuning

Uses

Direct Use

This model can be used for:

Experimentation with Indonesian language modeling
Learning about model fine-tuning
Research and development

Out-of-Scope Use

NOT recommended for:

Production applications
Critical tasks requiring high accuracy
Professional text generation

This model is still experimental with very high perplexity, indicating poor prediction quality.

Bias, Risks, and Limitations

High Perplexity: Model shows very high perplexity (>100k on some checkpoints), indicating highly uncertain predictions
Training Loss: High loss indicates the model has not learned optimally
Experimental Status: This model was created for experimentation, not for serious applications
Data Bias: Model may inherit biases from the Lyon28/Corpus-Indonesia dataset

Recommendations

Use only for learning and experimentation purposes
Not recommended for production use
Requires further training with hyperparameter tuning for better results
Consider increasing epochs, adjusting learning rate, or using a larger dataset

Evaluation

Results

The model shows suboptimal performance:

Highest training loss: 11.639969
Lowest perplexity: 162.775409

All metrics indicate that the model requires:

Further training
Hyperparameter tuning
Possibly better architecture or more suitable dataset

Downloads last month: 542

Safetensors

Model size

3.75M params

Tensor type

F32