AI & ML interests

administrate by ai/ml.

AtAndDev posted an update 10 months ago
Qwen 3 Coder is a personal attack on K2, and I love it.
It achieves near-SOTA on LCB without any reasoning.
Finally people are understanding that reasoning isn't necessary for high benchmark scores...

Qwen ftw!

DECENTRALIZE DECENTRALIZE DECENTRALIZE
AtAndDev posted an update 11 months ago
deepseek-ai/DeepSeek-R1-0528

This is the end
  • 1 reply
AtAndDev posted an update about 1 year ago
Llama 4 is out...
  • 3 replies
AtAndDev posted an update about 1 year ago
There seem to be multiple paid apps shared here that are based on models on HF, and some people sell their wrappers as "products" and promote them here. For a long time, HF was the best and only platform for OSS model work, but with the recent AI website builders anyone can create a product (really crappy ones, btw) and try to sell it with no contribution to OSS. Please don't do this, or at least try fine-tuning the models you use...
Sorry for filling y'all's feed with this bs, but yk...
  • 6 replies
AtAndDev posted an update about 1 year ago
Gemma 3 seems to be really good at human preference. Just waiting for ppl to see it.
AtAndDev posted an update about 1 year ago
@nroggendorff is that you sama?
  • 2 replies
AtAndDev posted an update over 1 year ago
everywhere i go i see his face
21world updated a Space over 1 year ago
AtAndDev posted an update over 1 year ago
Deepseek gang on fire fr fr
AtAndDev posted an update over 1 year ago
R1 is out! Along with a lot of other R1-related models...
AtAndDev posted an update over 1 year ago
@s3nh Hey man check your discord! Got some news.
  • 4 replies
21world updated a Space over 2 years ago