piyawudk/PhishMe-R1-8B-GRPO
Text Classification
Transformers
Safetensors
piyawudk/spam-ham-reasoning-dataset-small
English
qwen3
text-generation
text-generation-inference
unsloth
text-embeddings-inference
License: apache-2.0
main
PhishMe-R1-8B-GRPO
Commit History
Update README.md
71c8882
verified
piyawudk
committed on
Sep 6, 2025
Update README.md
cb92e97
verified
piyawudk
committed on
Sep 6, 2025
Upload tuning_script_PhishMe_R1_8B_GRPO.ipynb
517a962
verified
piyawudk
committed on
Jun 23, 2025
Upload model trained with Unsloth
194e81c
verified
piyawudk
committed on
Jun 18, 2025
(Trained with Unsloth)
264d816
verified
piyawudk
committed on
Jun 18, 2025
(Trained with Unsloth)
25b3306
verified
piyawudk
committed on
Jun 18, 2025
(Trained with Unsloth)
f6989b7
verified
piyawudk
committed on
Jun 18, 2025
(Trained with Unsloth)
617003e
verified
piyawudk
committed on
Jun 18, 2025
(Trained with Unsloth)
844a0e8
verified
piyawudk
committed on
Jun 18, 2025
(Trained with Unsloth)
8cd439c
verified
piyawudk
committed on
Jun 18, 2025
Unsloth Model Card
4b6cdd9
verified
piyawudk
committed on
Jun 18, 2025
initial commit
11bb237
verified
piyawudk
committed on
Jun 18, 2025