prernac1
/

pretendparentai

@@ -18,18 +18,18 @@ pipeline_tag: text-generation
 # Model Card for Model ID
 **PretendParentAI** is a fine-tuned variant of `mistralai/Mistral-7B-Instruct-v0.3`, adapted using **Quantized Low-Rank Adaptation (QLoRA)** on Reddit parenting discussions.
-It produces more *empathetic, warm, and relatable* parenting advice — though occasionally at the cost of clarity and precision.
 ## Model Details
 ### Model Description
-Goal: Explore how instruction fine-tuning can enhance warmth, relatability, and storytelling in parenting advice LLMs, while assessing trade-offs with factual precision.
-Action: Fine-tuned Mistral-7B-Instruct on ~40K curated Reddit parenting Q&A pairs (Alpaca format), using Supervised Fine Tuning (SFT) with Parameter-Efficient Fine Tuning (PEFT) i.e. Quantized Low-Rank Adaptation (QLoRA). Built a full instruction-tuning pipeline including Reddit data curation, efficient training/inference using QLoRA, and LLM-as-a-Judge evaluation across empathy, relatability, and other metrics.
-Result: Produced highly human-like, narrative responses that excelled in empathy (30% to 70%) and relatability (2% to 98%), though often over-personalized or hallucinated personal anecdotes—yielding key insights into the tension between emotional alignment and factual grounding in instruction tuning when using human-generated data (e.g. from reddit).
 - **Developed by:** Prerna Chikersal
 - **Model type:** PEFT
@@ -76,13 +76,13 @@ This model is not suitable for:
 - Content moderation, bias-free generation, or factual question answering — the Reddit dataset may contain noisy or biased language.
 ## Bias, Risks, and Limitations
-PretendParentAI was fine-tuned on Reddit parenting discussions, which reflect the biases and tone of online Western, English-speaking communities. As a result, the model may adopt informal, opinionated, or culturally specific parenting perspectives. It can also **hallucinate personal details** — such as referring to imaginary “sons,” “daughters,” or “partners” — because it imitates how Reddit users often share personal anecdotes. These outputs should not be interpreted as factual or autobiographical.
 The model should **not** be used for real parenting, psychological, or medical guidance. Instead, it serves as a research tool for exploring empathy and tone in language models, and all outputs should be reviewed critically before use.
 ### Recommendations
-- Always pair this adapter with the base model mistralai/Mistral-7B-Instruct-v0.3 for best performance.
 - Use bfloat16 precision and FlashAttention 2 on A100 or H100 GPUs for optimal speed.
 - Evaluate generations qualitatively for empathy, clarity, and factual accuracy before any downstream use.
 - For production or sensitive domains, fine-tune further using curated, high-quality data or Direct Preference Optimization (DPO) to balance warmth and helpfulness.
@@ -91,26 +91,15 @@ The model should **not** be used for real parenting, psychological, or medical g
 ## How to Get Started with the Model
-# 🧸 PretendParentAI
-**PretendParentAI** is a fine-tuned variant of `mistralai/Mistral-7B-Instruct-v0.3`, adapted using **Quantized Low-Rank Adaptation (QLoRA)** on Reddit parenting discussions.
-It produces more *empathetic, warm, and relatable* parenting advice — though occasionally at the cost of clarity and precision.
-> ⚠️ This repository only contains **PEFT adapter weights** — not the full 7B model.
-> To use the model, you must load the base Mistral model and apply this adapter.
----
-## 🧠 Model Details
 - **Base model:** `mistralai/Mistral-7B-Instruct-v0.3`
 - **Fine-tuning method:** QLoRA (PEFT)
 - **Training data:** Curated Reddit parenting discussions (r/Parenting, r/Mommit, r/Daddit)
 - **Goal:** Explore how instruction tuning on real-world parenting dialogue affects empathy and warmth in responses.
----
-## 🚀 How to Load the Model
 ```python
 ## Load the base model

 # Model Card for Model ID
 **PretendParentAI** is a fine-tuned variant of `mistralai/Mistral-7B-Instruct-v0.3`, adapted using **Quantized Low-Rank Adaptation (QLoRA)** on Reddit parenting discussions.
+It produces more *empathetic, warm, and relatable* parenting advice.
 ## Model Details
 ### Model Description
+**Goal:** Explore how instruction fine-tuning can enhance warmth, relatability, and storytelling in parenting advice LLMs, while assessing trade-offs with factual precision.
+**Action:** Fine-tuned Mistral-7B-Instruct on ~40K curated Reddit parenting Q&A pairs (Alpaca format), using Supervised Fine Tuning (SFT) with Parameter-Efficient Fine Tuning (PEFT) i.e. Quantized Low-Rank Adaptation (QLoRA). Built a full instruction-tuning pipeline including Reddit data curation, efficient training/inference using QLoRA, and LLM-as-a-Judge evaluation across empathy, relatability, and other metrics.
+**Result:** Produced highly human-like, narrative responses that excelled in empathy (30% to 70%) and relatability (2% to 98%), though often over-personalized or hallucinated personal anecdotes—yielding key insights into the tension between emotional alignment and factual grounding in instruction tuning when using human-generated data (e.g. from reddit).
 - **Developed by:** Prerna Chikersal
 - **Model type:** PEFT
 - Content moderation, bias-free generation, or factual question answering — the Reddit dataset may contain noisy or biased language.
 ## Bias, Risks, and Limitations
+PretendParentAI can **hallucinate personal details** — such as referring to imaginary “sons,” “daughters,” or “partners” — because it imitates how Reddit users often share personal anecdotes. These outputs should not be interpreted as factual or autobiographical.
 The model should **not** be used for real parenting, psychological, or medical guidance. Instead, it serves as a research tool for exploring empathy and tone in language models, and all outputs should be reviewed critically before use.
 ### Recommendations
+- Always pair this adapter with the base model mistralai/Mistral-7B-Instruct-v0.3.
 - Use bfloat16 precision and FlashAttention 2 on A100 or H100 GPUs for optimal speed.
 - Evaluate generations qualitatively for empathy, clarity, and factual accuracy before any downstream use.
 - For production or sensitive domains, fine-tune further using curated, high-quality data or Direct Preference Optimization (DPO) to balance warmth and helpfulness.
 ## How to Get Started with the Model
+This repository only contains **PEFT adapter weights** — not the full 7B model.
+To use the model, you must load the base Mistral model and apply this adapter.
 - **Base model:** `mistralai/Mistral-7B-Instruct-v0.3`
 - **Fine-tuning method:** QLoRA (PEFT)
 - **Training data:** Curated Reddit parenting discussions (r/Parenting, r/Mommit, r/Daddit)
 - **Goal:** Explore how instruction tuning on real-world parenting dialogue affects empathy and warmth in responses.
+### How to Load the Model
 ```python
 ## Load the base model