Instructions to use OpenReasonAI/Graphite1.0-4B with libraries, inference providers, notebooks, and local apps.

- Libraries
- PEFT
How to use OpenReasonAI/Graphite1.0-4B with PEFT:

    from peft import PeftModel
    from transformers import AutoModelForCausalLM

    base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3.5-4B-Base")
    model = PeftModel.from_pretrained(base_model, "OpenReasonAI/Graphite1.0-4B")

- Notebooks
- Google Colab
- Kaggle
- Local Apps
- Unsloth Studio
How to use OpenReasonAI/Graphite1.0-4B with Unsloth Studio:

Install Unsloth Studio (macOS, Linux, WSL)

    curl -fsSL https://unsloth.ai/install.sh | sh
    # Run unsloth studio
    unsloth studio -H 0.0.0.0 -p 8888
    # Then open http://localhost:8888 in your browser
    # Search for OpenReasonAI/Graphite1.0-4B to start chatting

Install Unsloth Studio (Windows)

    irm https://unsloth.ai/install.ps1 | iex
    # Run unsloth studio
    unsloth studio -H 0.0.0.0 -p 8888
    # Then open http://localhost:8888 in your browser
    # Search for OpenReasonAI/Graphite1.0-4B to start chatting

Using HuggingFace Spaces for Unsloth

    # No setup required
    # Open https://huggingface.co/spaces/unsloth/studio in your browser
    # Search for OpenReasonAI/Graphite1.0-4B to start chatting

Load model with FastModel

    pip install unsloth

    from unsloth import FastModel

    model, tokenizer = FastModel.from_pretrained(
        model_name="OpenReasonAI/Graphite1.0-4B",
        max_seq_length=2048,
    )
Starred committed on
Training in progress, step 2256, checkpoint
last-checkpoint/adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:56a3911c8195f07e7729206b22c78ea087b1f7bcd68b544aea4342793450bd94
 size 84972248
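The checkpoint files in this commit are stored through Git LFS, so the diff only shows a three-line pointer (spec version, content hash, byte size) rather than the binary itself. As a minimal sketch of how such a pointer can be read, the following parses the new `adapter_model.safetensors` pointer shown in this diff. The function name `parse_lfs_pointer` and the returned dictionary layout are illustrative choices, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each pointer line is "<key> <value>", separated by the first space.
        key, _, value = line.partition(" ")
        fields[key] = value

    # The oid has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields.get("oid", "").partition(":")
    return {
        "version": fields.get("version"),
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer content copied from the "+" side of the diff.
pointer_text = """version https://git-lfs.github.com/spec/v1
oid sha256:56a3911c8195f07e7729206b22c78ea087b1f7bcd68b544aea4342793450bd94
size 84972248
"""

pointer = parse_lfs_pointer(pointer_text)
print(pointer["hash_algo"], pointer["size"])
```

The size is unchanged between the old and new pointers, which is what one would expect when only the adapter weights, not their shapes, change between checkpoints.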
last-checkpoint/optimizer.pt CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:62d491b9b99145a34586738989b18f1697bce28374cbfc95c2b44ca1eecbf851
 size 43434405
last-checkpoint/scheduler.pt CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:83a70477d2c42e292df2c1adf148bed9afdcc2369cdfb05931fd61094ec7740f
 size 1465
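Because each LFS pointer records a SHA-256 digest and a byte size, a downloaded checkpoint file can be checked against the values in this commit. The sketch below shows one way to do that with the standard library. The helper name `matches_pointer` is hypothetical, and the demonstration runs on a small temporary file rather than the real `scheduler.pt`, which is not available here.

```python
import hashlib
import os
import tempfile


def matches_pointer(path, expected_sha256, expected_size, chunk_size=1 << 20):
    """Return True if the file at `path` has the given size and SHA-256 digest."""
    # Cheap check first: a size mismatch means the hash cannot match either.
    if os.path.getsize(path) != expected_size:
        return False
    hasher = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in chunks so large checkpoint files do not need to fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == expected_sha256


# Demonstration on a stand-in file (assumption: placeholder bytes, not a real checkpoint).
payload = b"scheduler state placeholder"
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(payload)
    demo_path = tmp.name

demo_digest = hashlib.sha256(payload).hexdigest()
ok = matches_pointer(demo_path, demo_digest, len(payload))
bad = matches_pointer(demo_path, demo_digest, len(payload) + 1)
os.remove(demo_path)
print(ok, bad)
```

For the real files, the expected values would be the `oid sha256:` digest and `size` from the corresponding pointer, for example size 1465 for `last-checkpoint/scheduler.pt`.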
last-checkpoint/trainer_state.json CHANGED
@@ -2,9 +2,9 @@
 "best_global_step": 2250,
 "best_metric": 0.18876151740550995,
 "best_model_checkpoint": "/kaggle/working/obsidian_critic_qwen35_t4x2_unsloth/runs/obsidian_critic_full_epoch/checkpoint-2250",
-"epoch":
+"epoch": 1.0,
 "eval_steps": 125,
-"global_step":
+"global_step": 2256,
 "is_hyper_param_search": false,
 "is_local_process_zero": true,
 "is_world_process_zero": true,
@@ -817,12 +817,12 @@
 "should_evaluate": false,
 "should_log": false,
 "should_save": true,
-"should_training_stop":
+"should_training_stop": true
 },
 "attributes": {}
 }
 },
-"total_flos": 1.
+"total_flos": 1.666058653049815e+17,
 "train_batch_size": 1,
 "trial_name": null,
 "trial_params": null
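The `trainer_state.json` changes record the end of the run: `global_step` reaches 2256 at `epoch` 1.0 and `should_training_stop` becomes true, while the best checkpoint remains step 2250. A small sketch of how these fields relate is given below. The JSON is an excerpt containing only top-level fields visible in this diff, the `summarize` helper is hypothetical, and the count of evaluations assumes one evaluation every `eval_steps` steps, which the diff does not confirm.

```python
import json

# Excerpt of trainer_state.json with only top-level fields visible in the diff.
state_json = """
{
  "best_global_step": 2250,
  "best_metric": 0.18876151740550995,
  "epoch": 1.0,
  "eval_steps": 125,
  "global_step": 2256,
  "total_flos": 1.666058653049815e+17,
  "train_batch_size": 1
}
"""
state = json.loads(state_json)


def summarize(state):
    """Derive a few run-level figures from the trainer state (hypothetical helper)."""
    return {
        # Steps trained after the checkpoint that holds the best metric.
        "steps_since_best": state["global_step"] - state["best_global_step"],
        # Assumption: evaluation ran once every `eval_steps` optimizer steps.
        "scheduled_evals": state["global_step"] // state["eval_steps"],
        # Average floating-point operations per step over the whole run.
        "flos_per_step": state["total_flos"] / state["global_step"],
    }


summary = summarize(state)
print(summary["steps_since_best"], summary["scheduled_evals"])
```

Under that assumption, step 2250 is the last scheduled evaluation before the epoch ends at step 2256, which is consistent with it being recorded as `best_global_step`.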