Instructions to use ridwanFatur98/TinyStories-1M-Indonesia with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use ridwanFatur98/TinyStories-1M-Indonesia with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="ridwanFatur98/TinyStories-1M-Indonesia")# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("ridwanFatur98/TinyStories-1M-Indonesia") model = AutoModelForCausalLM.from_pretrained("ridwanFatur98/TinyStories-1M-Indonesia") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use ridwanFatur98/TinyStories-1M-Indonesia with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "ridwanFatur98/TinyStories-1M-Indonesia" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "ridwanFatur98/TinyStories-1M-Indonesia", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/ridwanFatur98/TinyStories-1M-Indonesia
- SGLang
How to use ridwanFatur98/TinyStories-1M-Indonesia with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "ridwanFatur98/TinyStories-1M-Indonesia" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "ridwanFatur98/TinyStories-1M-Indonesia", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "ridwanFatur98/TinyStories-1M-Indonesia" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "ridwanFatur98/TinyStories-1M-Indonesia", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use ridwanFatur98/TinyStories-1M-Indonesia with Docker Model Runner:
docker model run hf.co/ridwanFatur98/TinyStories-1M-Indonesia
TinyStories-1M Indonesian Fine-tune (Experimental)
An experimental model fine-tuned from TinyStories-1M using Indonesian language dataset for exploration and learning purposes.
⚠️ Note: This is an experimental model for testing purposes only. Performance is far from optimal.
Model Details
Model Description
This model is a fine-tuned version of TinyStories-1M using an Indonesian language dataset. The primary purpose is for experimentation and learning, not for production use.
Model Sources
- Base Model: TinyStories-1M
- Training Dataset: Lyon28/Corpus-Indonesia
Performance Metrics
⚠️ Warning: This model is still in early stages and performance is not optimal.
Training Loss & Perplexity
Training loss remains quite high and perplexity shows the model has not converged well:
| Rank | Train Loss | Perplexity |
|---|---|---|
| 1 | 5.092371 | 162.775409 |
| 2 | 5.710950 | 302.158057 |
| 3 | 9.836301 | 18,700.406340 |
| 4 | 11.639643 | 113,509.674623 |
| 5 | 11.639969 | 113,546.630401 |
Training Details
- Training Time: >3 hours
- Hardware: T4 GPU
- Training Regime: Full fine-tuning
Uses
Direct Use
This model can be used for:
- Experimentation with Indonesian language modeling
- Learning about model fine-tuning
- Research and development
Out-of-Scope Use
NOT recommended for:
- Production applications
- Critical tasks requiring high accuracy
- Professional text generation
This model is still experimental with very high perplexity, indicating poor prediction quality.
Bias, Risks, and Limitations
- High Perplexity: Model shows very high perplexity (>100k on some checkpoints), indicating highly uncertain predictions
- Training Loss: High loss indicates the model has not learned optimally
- Experimental Status: This model was created for experimentation, not for serious applications
- Data Bias: Model may inherit biases from the Lyon28/Corpus-Indonesia dataset
Recommendations
- Use only for learning and experimentation purposes
- Not recommended for production use
- Requires further training with hyperparameter tuning for better results
- Consider increasing epochs, adjusting learning rate, or using a larger dataset
Evaluation
Results
The model shows suboptimal performance:
- Highest training loss: 11.639969
- Lowest perplexity: 162.775409
All metrics indicate that the model requires:
- Further training
- Hyperparameter tuning
- Possibly better architecture or more suitable dataset
- Downloads last month
- 542