@ereniko on Hugging Face: "I don't know why, but lately there's been a growing problem on HuggingFace:…"

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

posted an update May 1

Post

334

I don't know why, but lately there's been a growing problem on HuggingFace: the platform is filled to the brim with datasets of reasoning traces from large AI models. I feel like someone should address this, but the dataset tab is just full of reasoning and output traces from models like Claude Opus and the model tab is full of fine-tunes trained on these.
What scares me most are the legal consequences of this, and the possibility that all models will start converging on the same tone because everyone is just fine-tuning on everyone else.

juiceb0xc0de

May 1

Yo this is the redundancy no one is talking about. I'm glad someone is saying something. It seems like everything is so benchmark and parameter driven that no one is looking at the way models are all becoming more and more alike rather than unique entities with one another.

In this post

ereniko Eren Ekşi
juiceb0xc0de R