metadata
base_model:
  - Undi95/MistralThinker-v1.1
license: other
license_name: public-domain
license_link: https://www.loc.gov

Undi95/MistralThinker-v1.1, but with the DavidAU/Mistral-Small-3-Reasoner-s1-24B-LORA-256-RANK LoRA slapped over it.

It seems pretty nice. Here are the settings I happened to have sitting in SillyTavern (ST) when I tried it:

t=1.25
topk=45
rep.pen=1.03
rep.pen.range=384
rep.pen.slope=0.7
smoothing=1.7
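
If you're wondering what the first two knobs actually do, here's a rough stdlib-only sketch of top-k filtering followed by temperature scaling. This is not ST's or any backend's real code: the key names in `SETTINGS` and both function names are made up for illustration, the sampler order (top-k first, then temperature) is an assumption that varies between backends, and the repetition penalty and smoothing settings are only recorded here, not implemented.

```python
import math
import random

# Values copied from the settings above. The key names are hypothetical;
# every backend spells these differently.
SETTINGS = {
    "temperature": 1.25,
    "top_k": 45,
    "rep_pen": 1.03,
    "rep_pen_range": 384,
    "rep_pen_slope": 0.7,
    "smoothing": 1.7,
}


def top_k_temperature_probs(logits, temperature, top_k):
    """Keep the top_k highest logits, then softmax them at the given temperature."""
    # Indices of the top_k largest logits (all of them if the list is shorter).
    keep = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:top_k]
    # Dividing by a temperature above 1 flattens the distribution.
    scaled = {i: logits[i] / temperature for i in keep}
    peak = max(scaled.values())  # subtract the max for numerical stability
    exps = {i: math.exp(v - peak) for i, v in scaled.items()}
    total = sum(exps.values())
    return {i: e / total for i, e in exps.items()}


def sample_token(logits, temperature, top_k, rng=random):
    """Draw one token id from the filtered, temperature-scaled distribution."""
    probs = top_k_temperature_probs(logits, temperature, top_k)
    ids = list(probs)
    return rng.choices(ids, weights=[probs[i] for i in ids], k=1)[0]
```

With `top_k=45`, only the 45 most likely tokens can ever be picked, and `temperature=1.25` spreads probability a bit more evenly among them than the default of 1.0 would.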

Personally, I liked the output more than what I got from plain MistralThinker, whose phrasing felt kinda model-y.