Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
ereniko 
posted an update 10 days ago
Post
269
I don't know why, but lately there's been a growing problem on HuggingFace: the platform is filled to the brim with datasets of reasoning traces from large AI models. I feel like someone should address this, but the dataset tab is just full of reasoning and output traces from models like Claude Opus and the model tab is full of fine-tunes trained on these.
What scares me most are the legal consequences of this, and the possibility that all models will start converging on the same tone because everyone is just fine-tuning on everyone else.

Yo this is the redundancy no one is talking about. I'm glad someone is saying something. It seems like everything is so benchmark and parameter driven that no one is looking at the way models are all becoming more and more alike rather than unique entities with one another.

In this post