SFEvent (Open-Source AI Meetup)

posted an update about 1 month ago

Post

210

Are Large Language Models actually becoming more intelligent, or just better at seeming intelligent?

There is a noticeable shift happening in the LLM space.

Models today can:

Generate cleaner and more structured code.
Explain complex topics in simpler ways.
Maintain longer and more coherent conversations.

Yet at the same time, they still:

Produce confident hallucinations.
Fail in multi-step reasoning tasks.
Break under slightly unfamiliar or challenging inputs.

This raises a critical question.

Are we advancing intelligence, or optimizing presentation?

Most improvements so far seem driven by:

Larger datasets.
Increased scale.
Alignment techniques like RLHF.

But these do not necessarily lead to genuine reasoning ability.

What still appears fundamentally missing:

Persistent memory across interactions.
True reasoning rather than pattern completion.
Grounded understanding connected to real-world context.

Reliable self-correction and verification mechanisms.

If current scaling trends start to plateau, the next breakthrough will not come from doing more of the same.

So the real question for the community is:

If you were designing the next generation of AI systems, where would you focus?

A. Larger models and compute
B. Higher-quality and structured data
C. Agent-based systems with tool use and memory
D. New architectures beyond transformers

This is not just a technical discussion. It defines where AI is actually heading over the next few years.

I am interested to hear how others are thinking about this.

albertvillanova

posted an update 2 months ago

Post

2567

🚀 TRL v0.29.0 introduces trl-training: an agent-native training skill.

This makes the TRL CLI a structured, agent-readable capability, allowing AI agents to reliably execute training workflows such as:
- Supervised Fine-Tuning (SFT)
- Direct Preference Optimization (DPO)
- Group Relative Policy Optimization (GRPO)

We’re excited to see what the community builds on top of this.

If you’re working on AI agents, alignment research, or scalable RL training infrastructure: give TRL v0.29.0 a try! 🤗

The future of ML tooling is agent-native.
🔗 https://github.com/huggingface/trl/releases/tag/v0.29.0

albertvillanova

posted an update 3 months ago

Post

1903

5 years already working in democratizing AI 🤗
Grateful to be part of such an awesome team making it happen every day.

pcuenq

posted an update 4 months ago

Post

4742

👉 What happened in AI in 2025? 👈

We prepared the 2025 version of the HF AI Timeline Grid, highlighting open vs API-based model releases, and allowing you to browse and filter by access, modality, and release type!

Play with it here:
2025-ai-timeline/2025-ai-timeline

Here's my personal quarterly TL;DR:

1️⃣ Q1 — Learning to Reason
Deepseek not only releases a top-notch reasoning model, but shows how to train them and compete with closed frontier models. OpenAI debuts Deep Research.

Significant milestones: DeepSeek R1 & R1-Zero, Qwen 2.5 VL, OpenAI Deep Research, Gemini 2.5 Pro (experimental)

2️⃣ Q2 — Multimodality and Coding
More LLMs embrace multimodality by default, and there's a surge in coding agents. Strong vision, audio, and generative models emerge.

Significant milestones: Llama 4, Qwen 3, Imagen 4, OpenAI Codex, Google Jules, Claude 4

3️⃣ Q3 — "Gold" rush, OpenAI opens up, the community goes bananas
Flagship models get gold in Math olympiads and hard benchmarks. OpenAI releases strong open source models and Google releases the much anticipated nano-banana for image generation and editing. Agentic workflows become commonplace.

Significant milestones: Gemini and OpenAI IMO Gold, gpt-oss, Gemini 2.5 Flash Image, Grok 4, Claude Sonnet 4.5

4️⃣ Q4 — Mistral returns, leaderboard hill-climbing
Mistral is back with updated model families. All labs release impressive models to wrap up the year!

Significant milestones: Claude Opus 4.5, DeepSeek Math V2, FLUX 2, GPT 5.1, Kimi K2 Thinking, Nano Banana Pro, GLM 4.7, Gemini 3, Mistral 3, MiniMax M2.1 🤯

Credits
🙏 NHLOCAL for the source data https://github.com/NHLOCAL/AiTimeline

🫡 @reach-vb for the original idea, design and recipe

🙌 @ariG23498 and yours truly for compiling and verifying the 2025 edition

🥳 Here's to 2026, wishing it becomes the best year ever for open releases and on-device-first use-cases! 🥂

3 replies

·

DavidVivancos

posted an update 5 months ago

Post

399

Need a new challenging Dataset? Now that #NeurIPS2025 is almost over.

DavidVivancos/NeuraxonLife2-1M

1 Million #Neuraxon Artificial Lives, from almost 10000 Research Game runs, with more than 21 Million Neurons and almost 4 years of Simulated Life.

Read the preprint here https://www.researchgate.net/publication/397331336_Neuraxon

And here you have all the code: https://github.com/DavidVivancos/Neuraxon

DavidVivancos

posted an update 6 months ago

Post

443

Hi all!,

Neuraxon Game of Life is also live in demo at HuggingFace
DavidVivancos/NeuraxonLife

Preprint Paper: https://www.researchgate.net/publication/397331336_Neuraxon

Source Code of the Research verision: https://github.com/DavidVivancos/Neuraxon

HuggingFace Models are in the oven!

Hope you like it!
@DavidVivancos

DavidVivancos

posted an update 6 months ago

Post

997

Hi all!,

Neuraxon ( a novel Neural Growth & Computation Blueprint) is live in demo at HuggingFace DavidVivancos/Neuraxon

Paper: https://www.researchgate.net/publication/397331336_Neuraxon (on its way to arxiv too)

Code: https://github.com/DavidVivancos/Neuraxon

HuggingFace Model in the oven!

Hope you like it!
@DavidVivancos

2 replies

·

multimodalart

posted an update 7 months ago

Post

26683

Want to iterate on a Hugging Face Space with an LLM?

Now you can easily convert any HF entire repo (Model, Dataset or Space) to a text file and feed it to a language model!

multimodalart/repo2txt

1 reply

·

ehristoforu

posted an update 8 months ago

Post

2578

🚀Hello from the Project Fluently team!

✨ We are happy to share with you our new universal LLM models based on Qwen3 1.7B and 4B — powerful, multilingual and ready to solve a wide range of problems!

🛠️ We have conducted additional training and carefully merged them to achieve even better results and maximize the potential of the models.

🆓 And most importantly — the models are completely open and free under the Apache-2.0 license!

🔗 Links to repositories:
- FluentlyQwen3-4B: fluently/FluentlyQwen3-4B
- FluentlyQwen3-1.7B: fluently/FluentlyQwen3-1.7B

😍 We will be very glad to hear your feedback and impressions! Your opinion is very important to us!

albertvillanova

posted an update 9 months ago

Post

4617

Latest smolagents release supports GPT-5: build agents that think, plan, and act.
⚡ Upgrade now and put GPT-5 to work!

1 reply

·

BrigitteTousi

posted an update 9 months ago

Post

1213

On Wednesday, August 13 at 11am EDT, join @clem for a no bullshit AMA on Discord. Prep all your HF questions and meet us there! 🤗☄️⚡️

https://discord.com/invite/6r5TEXyk?event=1404451892179763311

albertvillanova

posted an update 9 months ago

Post

712

🚀 smolagents v1.21.0 is here!
Now with improved safety in the local Python executor: dunder calls are blocked!
⚠️ Still, not fully isolated: for untrusted code, use a remote executor instead: Docker, E2B, Wasm.
✨ Many bug fixes: more reliable code.
👉 https://github.com/huggingface/smolagents/releases/tag/v1.21.0

1024m

authored a paper 9 months ago

Query Attribute Modeling: Improving search relevance with Semantic Search and Meta Data Filtering

Paper • 2508.04683 • Published Aug 6, 2025

BrigitteTousi

posted an update 9 months ago

Post

688

New interactive viz from AI World showing OpenAI's new open model gpt-oss-120b breaking into the top 50 most liked models of all time on the Hub in under a day! ☄️☄️☄️

1024m

authored a paper 9 months ago

DSBC : Data Science task Benchmarking with Context engineering

Paper • 2507.23336 • Published Jul 31, 2025 • 2

merterbak

posted an update 9 months ago

Post

5209

OpenAI is now open again! Check out OpenAI’s brand new gpt‑oss‑20b model hosted on ZeroGPU 🤗

merterbak/gpt-oss-20b-demo

BrigitteTousi

posted an update 10 months ago

Post

707

This is what Hugging Face is all about. We want everyone, hobbyists, researchers and industry alike, to be able to contribute to AI because everyone is affected by it. Kudos to HF's @irenesolaiman for spreading the word!🔥🤗

albertvillanova

posted an update 10 months ago

Post

831

🚀 New in smolagents v1.20.0: Remote Python Execution via WebAssembly (Wasm)

We've just merged a major new capability into the smolagents framework: the CodeAgent can now execute Python code remotely in a secure, sandboxed WebAssembly environment!

🔧 Powered by Pyodide and Deno, this new WasmExecutor lets your agent-generated Python code run safely: without relying on Docker or local execution.

Why this matters:
✅ Isolated execution = no host access
✅ No need for Python on the user's machine
✅ Safer evaluation of arbitrary code
✅ Compatible with serverless / edge agent workloads
✅ Ideal for constrained or untrusted environments

This is just the beginning: a focused initial implementation with known limitations. A solid MVP designed for secure, sandboxed use cases. 💡

💡 We're inviting the open-source community to help evolve this executor:
• Tackle more advanced Python features
• Expand compatibility
• Add test coverage
• Shape the next-gen secure agent runtime

🔗 Check out the PR: https://github.com/huggingface/smolagents/pull/1261

Let's reimagine what agent-driven Python execution can look like: remote-first, wasm-secure, and community-built.

This feature is live in smolagents v1.20.0!
Try it out.
Break things. Extend it. Give us feedback.
Let's build safer, smarter agents; together 🧠⚙️

👉 https://github.com/huggingface/smolagents/releases/tag/v1.20.0

#smolagents #WebAssembly #Python #AIagents #Pyodide #Deno #OpenSource #HuggingFace #AgenticAI

albertvillanova

posted an update 11 months ago

Post

1852

🚀 SmolAgents v1.19.0 is live!
This release brings major improvements to agent flexibility, UI usability, streaming architecture, and developer experience: making it easier than ever to build smart, interactive AI agents. Here's what's new:

🔧 Agent Upgrades
- Support for managed agents in ToolCallingAgent
- Context manager support for cleaner agent lifecycle handling
- Output formatting now uses XML tags for consistency

🖥️ UI Enhancements
- GradioUI now supports reset_agent_memory: perfect for fresh starts in dev & demos.

🔄 Streaming Refactor
- Streaming event aggregation moved off the Model class
- ➡️ Better architecture & maintainability

📦 Output Tracking
- CodeAgent outputs are now stored in ActionStep
- ✅ More visibility and structure to agent decisions

🐛 Bug Fixes
- Smarter planning logic
- Cleaner Docker logs
- Better prompt formatting for additional_args
- Safer internal functions and final answer matching

📚 Docs Improvements
- Added quickstart examples with tool usage
- One-click Colab launch buttons
- Expanded reference docs (AgentMemory, GradioUI docstrings)
- Fixed broken links and migrated to .md format

🔗 Full release notes:
https://github.com/huggingface/smolagents/releases/tag/v1.19.0

💬 Try it out, explore the new features, and let us know what you build!

#smolagents #opensource #AIagents #LLM #HuggingFace

multimodalart

posted an update 11 months ago

Post

18367

Self-Forcing - a real-time video distilled model from Wan 2.1 by @adobe is out, and they open sourced it 🐐

I've built a live real time demo on Spaces 📹💨

multimodalart/self-forcing

6 replies

·

AI & ML interests

Team members 603

SFEvent's activity