pankajmathur
/

RenCoder-Devstral-Small-2507

Text Generation

text-generation-inference

Model card Files Files and versions

pankajmathur commited on Dec 18, 2025

Commit

6cd7c2b

·

verified ·

1 Parent(s): df75442

Update README.md

Files changed (1) hide show

README.md +0 -14

README.md CHANGED Viewed

@@ -20,23 +20,9 @@ This model is a merged version of [mistralai/Devstral-Small-2507](https://huggin
 ## Model Details
 - **Base Model:** [mistralai/Devstral-Small-2507](https://huggingface.co/mistralai/Devstral-Small-2507)
-- **LoRA Adapter:** [pankajmathur/Devstral-Small-2507-sft-v1-adapter](https://huggingface.co/pankajmathur/Devstral-Small-2507-sft-v1-adapter)
-- **Training Dataset:** [pankajmathur/OpenThoughts-Agent-v1-SFT](https://huggingface.co/datasets/pankajmathur/OpenThoughts-Agent-v1-SFT)
 - **Parameters:** ~24B
 - **Precision:** bfloat16
-## Training Configuration
-The LoRA adapter was trained with the following configuration:
-- **LoRA Rank (r):** 32
-- **LoRA Alpha:** 16
-- **LoRA Dropout:** 0.05
-- **Target Modules:** q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
-- **Sequence Length:** 8192
-- **Learning Rate:** 0.0001
-- **Optimizer:** AdamW 8-bit
-- **Epochs:** 3
 ## Usage

 ## Model Details
 - **Base Model:** [mistralai/Devstral-Small-2507](https://huggingface.co/mistralai/Devstral-Small-2507)
 - **Parameters:** ~24B
 - **Precision:** bfloat16
 ## Usage