h34v7 committed on
Commit 09718c4 · verified · 1 Parent(s): 01b90ef

Update README.md

Files changed (1):
  1. README.md +13 -4
README.md CHANGED
@@ -7,13 +7,22 @@ library_name: transformers
 tags:
 - mergekit
 - merge
-
 ---
-# .

-This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

 ## Merge Details
 ### Merge Method

 This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [ZeroAgency/Mistral-Small-3.1-24B-Instruct-2503-hf](https://huggingface.co/ZeroAgency/Mistral-Small-3.1-24B-Instruct-2503-hf) as a base.
@@ -47,4 +56,4 @@ parameters:
 dtype: bfloat16
 tokenizer:
   source: ZeroAgency/Mistral-Small-3.1-24B-Instruct-2503-hf
-```
 
 tags:
 - mergekit
 - merge
+license: apache-2.0
 ---
+# DXP-Zero-V1.0-24b-Small-Instruct

+So I was browsing for a Mistral finetune and found this base [model](https://huggingface.co/ZeroAgency/Mistral-Small-3.1-24B-Instruct-2503-hf) by [ZeroAgency](https://huggingface.co/ZeroAgency), and oh boy, it was perfect! Here are a few notable improvements I observed.
+
+Pros:
+- Longer outputs for storytelling or roleplay.
+- Dynamic output length: shorter prompts yield shorter responses, and longer prompts yield longer ones.
+- Less repetitive (though this depends on your prompt and sampler settings).
+- Tested at 49444/65536 tokens with no degradation. It actually learns the context better, which strongly shapes the output; the downside is that it picks up patterns from previous turns too quickly and treats them as the new standard.

 ## Merge Details
+
+This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+
 ### Merge Method

 This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [ZeroAgency/Mistral-Small-3.1-24B-Instruct-2503-hf](https://huggingface.co/ZeroAgency/Mistral-Small-3.1-24B-Instruct-2503-hf) as a base.
 
 dtype: bfloat16
 tokenizer:
   source: ZeroAgency/Mistral-Small-3.1-24B-Instruct-2503-hf
+```
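The TIES method named in the Merge Method section works in three steps: trim each fine-tuned model's task vector to its largest-magnitude entries, elect a per-parameter sign, then average only the values that agree with the elected sign. Here is a minimal NumPy sketch of that idea on toy tensors; it is an illustration of the paper's procedure under assumed simplifications (a single flat weight array, a hypothetical `density` trim ratio), not mergekit's actual implementation.

```python
import numpy as np

def ties_merge(base, finetuned, density=0.5):
    """Toy TIES merge: trim task vectors, elect signs, disjoint-mean the survivors.

    base: np.ndarray of base-model weights
    finetuned: list of np.ndarray, each the same shape as `base`
    density: fraction of task-vector entries to keep, by magnitude (assumed knob)
    """
    task_vectors = [ft - base for ft in finetuned]

    # 1. Trim: zero out all but the top-`density` fraction of entries by magnitude.
    trimmed = []
    for tv in task_vectors:
        k = max(1, int(density * tv.size))
        threshold = np.sort(np.abs(tv), axis=None)[-k]
        trimmed.append(np.where(np.abs(tv) >= threshold, tv, 0.0))

    # 2. Elect sign: per parameter, the sign with the larger summed magnitude wins.
    elected = np.sign(sum(trimmed))

    # 3. Disjoint merge: average only entries whose sign matches the elected sign.
    agreeing = [np.where(np.sign(tv) == elected, tv, 0.0) for tv in trimmed]
    counts = sum((a != 0).astype(float) for a in agreeing)
    merged_tv = sum(agreeing) / np.maximum(counts, 1.0)

    return base + merged_tv
```

Note how a parameter where the two donors pull in opposite directions ends up unchanged: the conflicting updates cancel at the sign-election step instead of being averaged into a compromise value.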