We should really have a release date range slider on the /models page. Tired of "trending/most downloaded" being the best way to sort and still seeing models from 2023 on the first page just because they're embedded in enterprise pipelines and get downloaded repeatedly. "Recently Created/Recently Updated" don't solve the discovery problem considering the amount of noise to sift through.
Slight caveat: Trending actually does have some recency bias, but it's not strong/precise enough.
Poll: Will 2026 be the year of subquadratic attention?
The transformer architecture is cursed by its computational complexity: attention scales quadratically with sequence length. It's why you run out of tokens and have to compact. But some would argue that this is a feature, not a bug, and that it's also why these models are so good. We've been doing a lot of research on trying to make equally good models that are computationally cheaper, but so far none of the approaches have stood the test of time. Or so it seems.
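To make the quadratic-cost point concrete, here's a minimal sketch (function name and dimensions are illustrative, not from any real model) of why vanilla self-attention blows up with context length: the score matrix has one entry per pair of tokens.

```python
import numpy as np

def attention_score_elements(n: int, d: int = 64) -> int:
    """Count the entries in the attention score matrix for n tokens."""
    rng = np.random.default_rng(0)
    q = rng.standard_normal((n, d))  # queries, one row per token
    k = rng.standard_normal((n, d))  # keys, one row per token
    scores = q @ k.T  # (n, n): every token attends to every other token
    return scores.size

# Doubling the context quadruples the score matrix,
# which is the O(n^2) wall the poll is about.
```

So going from a 128k to a 256k context doesn't double the attention cost, it quadruples it, regardless of how clever the kernels are, as long as the full score matrix is materialized.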
Please vote, don't be shy. Remember that the Dunning-Kruger effect is very real, so the person who knows less about transformers than you is going to vote anyway. We want everyone's opinion, no matter how confident you are.
👍 if you think at least one frontier model* will have no O(n^2) attention by the end of 2026. 🔥 if you disagree.
* Frontier models: models that match or outperform the flagship Claude, Gemini, or ChatGPT of the time on multiple popular benchmarks.