Instructions to use shreecloud/llama3.1-8b-lora-kaggle with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use shreecloud/llama3.1-8b-lora-kaggle with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("NousResearch/Meta-Llama-3.1-8B") model = PeftModel.from_pretrained(base_model, "shreecloud/llama3.1-8b-lora-kaggle") - Notebooks
- Google Colab
- Kaggle
π¦ Llama-3.1-8B LoRA Fine-tuned (Kaggle)
Fine-tuned Llama-3.1-8B using LoRA β trained on Kaggle's free T4 GPUs.
Model Details
- Base Model:
NousResearch/Meta-Llama-3.1-8B - Method: LoRA (PEFT)
- Rank (
r): 8β16 - Trainable Parameters: ~0.04β0.12% of total
- Hardware: Kaggle Free Tier (Tesla T4)
Quick Inference
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
import torch
model_id = "shreecloud/llama3.1-8b-lora-kaggle"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
model_id,
device_map="auto",
torch_dtype=torch.float16,
load_in_4bit=True,
)
prompt = "### Instruction:\nWrite a professional email about...\n\n### Response:\n"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=512, temperature=0.7, do_sample=True)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
- Downloads last month
- 24
Inference Providers NEW
This model isn't deployed by any Inference Provider. π Ask for provider support