Saif10
/

sft-model

Text Generation

sentiment-analysis

text-generation-inference

Model card Files Files and versions

Saif10 commited on Jul 19, 2025

Commit

e98565a

·

verified ·

1 Parent(s): ce3461f

cool

Files changed (1) hide show

README.md +15 -23

README.md CHANGED Viewed

@@ -13,15 +13,16 @@ datasets:
 - stanfordnlp/sst2
 base_model:
 - openai-community/gpt2
 ---
-# 🧠 GPT-2 SFT Model – Supervised Fine-Tuning for Positive Sentiment
 This model is the **first stage** in a 3-step RLHF (Reinforcement Learning from Human Feedback) pipeline using **GPT-2**. It has been fine-tuned on the **Stanford Sentiment Treebank v2 (SST2)** dataset, focusing on generating sentences with a positive sentiment tone.
 ---
-## 📌 Context
 This model is part of the following RLHF project structure:
@@ -33,7 +34,7 @@ You are currently viewing the **SFT model**.
 ---
-## ✅ Model Objective
 Train GPT-2 on sentiment-labeled sentences to mimic human-like, sentiment-aware generation.
@@ -41,37 +42,28 @@ Train GPT-2 on sentiment-labeled sentences to mimic human-like, sentiment-aware
 - **Output:** GPT-2 completes it with a positively-toned sentence.
 ---
-## 📚 Training Details
-### 🔧 Dataset
 - **Source:** `stanfordnlp/sst2`
 - **Type:** Movie review sentences
 - **Labels:** Positive and Negative
 - **Preprocessing:** Only positive samples retained for SFT
-### ⚙️ Configuration
-- **Model Base:** `gpt2`
-- **Max Sequence Length:** 128
-- **Batch Size:** 8
-- **Epochs:** 3
-- **Optimizer:** AdamW
-- **Learning Rate:** 5e-5
-- **Precision:** FP16
----
-## 🚀 Usage
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
-model = AutoModelForCausalLM.from_pretrained("your-hf-username/gpt2-sft-positive")
-tokenizer = AutoTokenizer.from_pretrained("your-hf-username/gpt2-sft-positive")
 prompt = "The movie was"
 inputs = tokenizer(prompt, return_tensors="pt")
 outputs = model.generate(**inputs, max_new_tokens=30)
-print(tokenizer.decode(outputs[0]))

 - stanfordnlp/sst2
 base_model:
 - openai-community/gpt2
+pipeline_tag: text-generation
 ---
+# GPT-2 SFT Model – Supervised Fine-Tuning for Positive Sentiment
 This model is the **first stage** in a 3-step RLHF (Reinforcement Learning from Human Feedback) pipeline using **GPT-2**. It has been fine-tuned on the **Stanford Sentiment Treebank v2 (SST2)** dataset, focusing on generating sentences with a positive sentiment tone.
 ---
+## Context
 This model is part of the following RLHF project structure:
 ---
+## Model Objective
 Train GPT-2 on sentiment-labeled sentences to mimic human-like, sentiment-aware generation.
 - **Output:** GPT-2 completes it with a positively-toned sentence.
 ---
+### Dataset
 - **Source:** `stanfordnlp/sst2`
 - **Type:** Movie review sentences
 - **Labels:** Positive and Negative
 - **Preprocessing:** Only positive samples retained for SFT
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
+model = AutoModelForCausalLM.from_pretrained("Saif10/sft-model")
+tokenizer = AutoTokenizer.from_pretrained("Saif10/sft-model")
 prompt = "The movie was"
 inputs = tokenizer(prompt, return_tensors="pt")
 outputs = model.generate(**inputs, max_new_tokens=30)
+print(tokenizer.decode(outputs[0]))
+```
+## Author
+Saif Rathod
+- Hugging Face: Saif10
+- GitHub: Saif-rathod