Instructions to use siddham0909/trace-rca-qwen3-1.7b with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use siddham0909/trace-rca-qwen3-1.7b with PEFT:
Task type is invalid.
- Notebooks
- Google Colab
- Kaggle
Trace RCA β Qwen/Qwen3-1.7B (GRPO + LoRA)
Trained on the Trace RCA Gym environment.
Training evidence
reward_plot.pngβ reward curves (regenerated every 5 episodes during training)metrics.jsonlβ per-episode rich metrics (fault_type, tier, difficulty, reward components)fault_type_metrics.jsonβ per-fault-type rolling successcompare.png,compare.mdβ before/after comparison on pinned scenariosadapter/β the LoRA adapter (load with PEFT)tb_episodes/β TensorBoard episode scalars
Usage
from peft import PeftModel
from transformers import AutoModelForCausalLM
base = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-1.7B")
model = PeftModel.from_pretrained(base, "siddham0909/trace-rca-qwen3-1.7b", subfolder="adapter")
- Downloads last month
- -
Inference Providers NEW
This model isn't deployed by any Inference Provider. π Ask for provider support