PhysiQuanty (PhysiQuanty)

reacted to andyolivers's post with 🔥 14 days ago

Post

199

→ Try it live: build-small-hackathon/lolaby
→ Watch the demo: https://youtu.be/eY_JnijT62E
→ Read the build journal: https://huggingface.co/blog/build-small-hackathon/lolaby-blog

Every parent, teacher, or babysitter knows the moment. The lights go dim. Blankets come out. Your child asks for a song. Then another. Then suddenly you’re improvising lyrics about dinosaurs, fire trucks, and princesses while trying to convince a little one that it’s actually bedtime.

That’s exactly the problem my partner’s sister faces as a kindergarten teacher. Every day she runs nap time for fifteen 4-year-olds, and ever since they learned about music and instruments in class, it starts the same way: "sing a song for me." She'd love to give each child their own song, built from whatever they love that week, but she doesn't have the time, the musical training, or a tool that could do it. So @volivers and I built one.

Introducing Lolaby — our submission to the Hugging Face Build Small Hackathon 2026, hosted by Gradio and backed by OpenBMB, OpenAI, NVIDIA, Modal, Cohere, JetBrains, and Black Forest Labs.

A child draws something they love (on screen or on paper), a name is entered, and a tiny AI watches the drawing, writes a personalised lullaby, and sings it back.

Everything runs locally. No cloud LLMs. No per-song API cost. No child's drawing or name ever leaves the device.

The full pipeline:
🖼️ MiniCPM-V 4.6 (1.3B) reads the drawing.
✍️ A fine-tuned Llama 3.2 3B writes the lyrics — trained on 1,500 lullabies with strict anti-boilerplate gates.
🎵 Kokoro 82M sings the result over custom DSP instruments.

Drop a like, upvote or comment. Feedback is welcome! 🙏

reacted to SeaWolf-AI's post with 😎 16 days ago

Post

4892

Darwin V9 — GPQA Diamond 90.9%, #1 on the leaderboard, with pure greedy decoding
Darwin-398B-JGOS reaches 90.9% (180/198) on GPQA Diamond, the PhD-level scientific reasoning benchmark, ranking #1 on the Hugging Face GPQA Diamond leaderboard. No self-consistency, no test-time compute scaling — this was achieved with a single greedy decode (temperature 0, single sample, max 16,384 tokens). The full eval config is published in the model card, so anyone can reproduce it. Raw reasoning, no score inflation.
The result comes from Darwin V9, a patented evolutionary model-development platform. Its core idea: it never trains a model from scratch.
Why Darwin V9 beats training from scratch

Cost & speed: no trillion-token pretraining run, no months of compute — a purpose-built, high-performance model is produced in a fraction of the time.
Reuse of proven intelligence: instead of re-learning every capability from a blank slate, it selects and combines only the strengths of already-trained, already-validated models, so results are stable and predictable.
Surgical transplantation: it identifies which neural region of which model holds which capability — at the FFN (Feed Forward Network) layer level — and grafts in only the segments that contribute to the target skill.

How it works: a large model (Qwen 3.5 397B) serves as the mother model (the substrate); several father models specialized in reasoning, coding, and language are analyzed layer-by-layer across their FFN regions; the segments that contribute to the target performance are extracted and transplanted into the mother model to produce a new child model. The result is a ~400B MoE that activates only ~17B parameters per token at inference — large-model capacity with efficient inference.
If training from scratch means rebuilding everything from a blank page, Darwin V9 means precisely recombining intelligence that has already been proven. GPQA Diamond #1 is the proof.
Model: FINAL-Bench/Darwin-398B-JGOS

reacted to danielhanchen's post with 👍 23 days ago

Post

4219

Google releases Gemma 4 QAT. ✨
You can now run Gemma 4 at 3x less memory with near original performance.

QAT makes it possible to run Gemma 4 26B-A4B on 16GB RAM.

GGUFs: https://huggingface.co/collections/unsloth/gemma-4-qat
QAT Guide: https://unsloth.ai/docs/models/gemma-4/qat

1 reply

·

reacted to SeaWolf-AI's post with 🧠 about 1 month ago

Post

4261

Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀

We're excited to release Darwin-60B-DUO, the Darwin family's first DUO model. Take two domain-verified specialists, hide them behind a single OpenAI-compatible endpoint, and let a router decide which one (or both) answers. You see one model, one API — but get the best of both.

The number that matters: on the full 198-question GPQA Diamond, Darwin-60B-DUO hits 88.38%. The constituents alone land at 69.70% (Darwin-28B-REASON) and 77.27% (AWAXIS-Think-31B); a naive cascade only reaches 83.84%. The DUO clears them all. Two small specialists, intelligently routed, beat one big generalist on cost and quality. Both are independently verified — Darwin-28B-REASON is #3 on the HF GPQA Diamond leaderboard, AWAXIS-Think-31B is #1 on Korea's national K-AI Leaderboard (MSIT).

The brains is a Hybrid-A router picking one of five strategies on the fly. Korean → AWAXIS, English/STEM → Darwin (single-backend, ~70% of traffic at 1× cost). When a Korean answer needs rigorous English reasoning, split_refine fires — Darwin drafts, AWAXIS polishes; MCQ/short-answer runs both with self-consistency + cross-verify. Net effective cost: only ~1.3× a single 30B model.

The part the community will care about: the gateway is model-agnostic and Apache-2.0. Point it at any two OpenAI-compatible backends and you've got a DUO in minutes — teach router.py when to use which, and parallel calls, response merging, and routing transparency via _duo_route are handled for you. Fork it and tell us what you built.

Painless deploy: docker compose up for both vLLM backends + gateway; FP8 ~30GB colocates on a single B200/H100. One git clone (~120GB). Text-only for now, streaming in v1.1.
Two SOTAs, one endpoint. Come build your own on the Community tab.

👇
🔗 FINAL-Bench/Darwin-60B-DUO

replied to their post about 1 month ago

Thanks !

replied to their post about 1 month ago

Thanks anyway 🙏

replied to their post about 1 month ago

@John6666 , @prithivMLmods , @thomwolf , @lewtun

Would it be possible to batch an API request, please: api/users/$user/overview?
I know that repository requests can be batched, but I can’t find anything similar for user requests.

Thank you in advance for your help!

reacted to their post with 🚀🔥 about 1 month ago

Post

4759

🌐 We crawled the entirety of Hugging Face to help the community! Huge thanks to the Hugging Face API 🌐
🤖 2.91M model repos (file names included), 📚 1.02M dataset repos, 🚀 1.31M Space repos
🤗 617,501 committers (datasets and models), we’ll share Hugging Face statistics with you in the coming days..

We also identified 61,398 users with “AI/ML Interests”, and NOW we can find each other through our “AI/ML Interests”🤗
HF-Collab-Center/Searching-For-HuggingFace-Users
HF-Collab-Center/All-Model-Repos
HF-Collab-Center/All-Dataset-Repos
HF-Collab-Center/All-Space-Repos

HF-Collab-Center/HF-Users
HF-Collab-Center/HF-Users-with-last-seen
HF-Collab-Center/HF-Users-With-AI-ML-Interests-Only

Made By @QuantaSparkLabs and @PhysiQuanty
C'est français, bon.. en anglais.. mais c'est français ;)

5 replies

·

posted an update about 1 month ago

Post

4759

🌐 We crawled the entirety of Hugging Face to help the community! Huge thanks to the Hugging Face API 🌐
🤖 2.91M model repos (file names included), 📚 1.02M dataset repos, 🚀 1.31M Space repos
🤗 617,501 committers (datasets and models), we’ll share Hugging Face statistics with you in the coming days..

We also identified 61,398 users with “AI/ML Interests”, and NOW we can find each other through our “AI/ML Interests”🤗
HF-Collab-Center/Searching-For-HuggingFace-Users
HF-Collab-Center/All-Model-Repos
HF-Collab-Center/All-Dataset-Repos
HF-Collab-Center/All-Space-Repos

HF-Collab-Center/HF-Users
HF-Collab-Center/HF-Users-with-last-seen
HF-Collab-Center/HF-Users-With-AI-ML-Interests-Only

Made By @QuantaSparkLabs and @PhysiQuanty
C'est français, bon.. en anglais.. mais c'est français ;)

5 replies

·

reacted to their post with 🚀🔥 about 1 month ago

Post

4597

🧬 You can now find out whether your cognitive soulmate has already existed among 50k anonymized profiles ✨

SpiceeChat/Check-If-Your-Soulmate-Has-Already-Existed
SpiceeChat/OkCupid-59k-Anonymized-Profiles
https://dating-fatigue.com/

You seek them: 79.7% | They may seek you: 84.1% (coming soon)

🔥 Powered by open source and too much coffee 🔥

2 replies

·

posted an update about 1 month ago

Post

4597

🧬 You can now find out whether your cognitive soulmate has already existed among 50k anonymized profiles ✨

SpiceeChat/Check-If-Your-Soulmate-Has-Already-Existed
SpiceeChat/OkCupid-59k-Anonymized-Profiles
https://dating-fatigue.com/

You seek them: 79.7% | They may seek you: 84.1% (coming soon)

🔥 Powered by open source and too much coffee 🔥

2 replies

·

reacted to espejelomar's post with 🚀❤️ about 1 month ago

Post

4735

Sharing WorldForge with @abdelstark

It's an open-source Python project for evaluating and replaying robotics and world-model workflows.

The useful part is not only calling a model. WorldForge records the run, validates action shapes, translates outputs into actions, and keeps replay artifacts you can inspect later.

The current demo uses LeRobot + LeWorldModel on PushT through the official loader:

stable_worldmodel.policy.AutoCostModel("pusht/lewm")

The harness also has replay-only paths for Cosmos-Policy and GR00T-style outputs, so you can inspect the provider contract from saved artifacts without keeping a GPU server online.

Try it:

pip install worldforge-ai
uv run --extra harness worldforge-harness --flow robotics-compare

Repo: https://github.com/AbdelStark/worldforge
Docs: https://abdelstark.github.io/worldforge/

Pre-1.0, MIT, and actively looking for contributors. Good areas:
- robotics provider adapters
- replay artifacts
- eval flows
- docs & first-run demos

Good first issues: https://github.com/AbdelStark/worldforge/contribute

If you're building robot policy evals or model adapters, would love a PR — or an issue describing what's missing.

reacted to fffiloni's post with 🚀🔥 about 1 month ago

Post

3843

I built HF Radio on Hugging Face Spaces 📻
fffiloni/HF-Radio

A live community radio for AI-generated songs, powered by tracks created with ACE-Step.

You can tune in, discover community-made songs in many languages, vote on what sounds good, and mark your real favorites as Bangers.

The more people listen, vote, and create, the better the station gets.

Under the hood, it connects a few Hugging Face pieces together:

Spaces for the live app, HF buckets for community tracks, OAuth for signed-in listeners, server-side streaming with ffmpeg, hourly playlist refreshes, moderation, jingles, and community feedback loops.

It’s not just a playlist.

It’s a shared taste experiment:
new songs get a shot every hour, and the community helps decide what deserves another spin.

Come listen.
Find weird gems.
Support the Bangers.
Shape the radio.

—> fffiloni/HF-Radio

replied to their post about 2 months ago

Hahaha, MatchGroup wants to do machine learning, it's going to be 💀

replied to their post about 2 months ago

Thank you !

replied to their post about 2 months ago

To explain it better ;)

A_Search = B_Profil
&&
B_Search = A_Profil

&& = inclusive and to fulfill both conditions simultaneously

What causes ghosting on dating apps? It's that the person on the other end is a good match for you, but you're not a good match for them :(

PhysiQuanty PRO

AI & ML interests

Recent Activity

Organizations

PhysiQuanty PRO

AI & ML interests

Recent Activity

Organizations

PhysiQuanty's activity