Request: DOI

#18

by alesvigne - opened Mar 17, 2025

Discussion

alesvigne

Mar 17, 2025

I want to try this model

odds-get-evened

Mar 7

i wouldn't. i trained this a lot, and it seemed very static, and resistant to updating memory. my training set may not have been appropriate, but I've moved onto Qwen-2.5, and it trains well in contrast to distilbert2, understands context better, and responses are meaty and relevant.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment