1 57

Pawel

Pwlot

Pwlot
Pwlot

AI & ML interests

AGI

Recent Activity

liked a model about 2 months ago

Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice

liked a Space 3 months ago

microsoft/TRELLIS.2

liked a Space 9 months ago

nanotron/ultrascale-playbook

View all activity

Organizations

liked a model about 2 months ago

Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice

Text-to-Speech • 2B • Updated Jan 29 • 1.14M • 1.28k

liked a Space 3 months ago

TRELLIS.2

🏢

1.24k

High-fidelity 3D Generation from images

liked a Space 9 months ago

The Ultra-Scale Playbook

🌌

3.74k

The ultimate guide to training LLM on large GPU Clusters

liked a model about 1 year ago

lerobot/pi0_old

Robotics • 4B • Updated Sep 19, 2025 • 3.39k • 307

liked a dataset about 1 year ago

HuggingFaceTB/finemath

Viewer • Updated Feb 6, 2025 • 48.3M • 11.4k • 354

liked a model almost 2 years ago

jasperai/flash-sdxl

Text-to-Image • Updated Jul 3, 2024 • 79 • • 35

liked a dataset almost 2 years ago

HuggingFaceFW/fineweb-edu

Viewer • Updated Jul 11, 2025 • 3.5B • 223k • 987

liked a model almost 2 years ago

stabilityai/stable-zero123

Text-to-3D • Updated Jul 10, 2024 • 756

reacted to Sentdex's post with 👍 almost 2 years ago

Post

10340

Okay, first pass over KAN: Kolmogorov–Arnold Networks, it looks very interesting!

Interpretability of KAN model:
May be considered mostly as a safety issue these days, but it can also be used as a form of interaction between the user and a model, as this paper argues and I think they make a valid point here. With MLP, we only interact with the outputs, but KAN is an entirely different paradigm and I find it compelling.

Scalability:
KAN shows better parameter efficiency than MLP. This likely translates also to needing less data. We're already at the point with the frontier LLMs where all the data available from the internet is used + more is made synthetically...so we kind of need something better.

Continual learning:
KAN can handle new input information w/o catastrophic forgetting, which helps to keep a model up to date without relying on some database or retraining.

Sequential data:
This is probably what most people are curious about right now, and KANs are not shown to work with sequential data yet and it's unclear what the best approach might be to make it work well both in training and regarding the interpretability aspect. That said, there's a rich long history of achieving sequential data in variety of ways, so I don't think getting the ball rolling here would be too challenging.

Mostly, I just love a new paradigm and I want to see more!

KAN: Kolmogorov-Arnold Networks (2404.19756)

5 replies

liked a Space almost 2 years ago

StoryDiffusion

👁

610

Generate consistent story images from prompts and reference photos

liked a dataset almost 2 years ago

HuggingFaceFW/fineweb

Viewer • Updated Jul 11, 2025 • 52.5B • 169k • 2.7k

liked 4 Spaces almost 2 years ago

InstantMesh

📚

1.57k

Create a 3D model from an image in 10 seconds!

Repo duplicator

😻

327

Duplicate Hugging Face repositories

DragGan - Drag Your GAN

👆

1.03k

Manipulate images by dragging points

Open VLM Leaderboard

🌎

VLMEvalKit Evaluation Results Collection

reacted to clem's post with 👍 about 2 years ago

Post

Is synthetic data the future of AI? 🔥🔥🔥

@HugoLaurencon @Leyo & @VictorSanh are introducing HuggingFaceM4/WebSight , a multimodal dataset featuring 823,000 pairs of synthetically generated HTML/CSS codes along with screenshots of the corresponding rendered websites to train GPT4-V-like models 🌐💻

While crafting their upcoming foundation vision language model, they faced the challenge of converting website screenshots into usable HTML/CSS codes. Most VLMs suck at this and there was no public dataset available for this specific task, so they decided to create their own.

They prompted existing LLMs to generate 823k HTML/CSS codes of very simple websites. Through supervised fine-tuning of a vision language model on WebSight, they were able to generate the code to reproduce a website component, given a screenshot.

You can explore the dataset here: HuggingFaceM4/WebSight

What do you think?