AI & ML interests

Open science and open source

oncody
posted an update about 1 month ago
Are Large Language Models actually becoming more intelligent, or just better at seeming intelligent?

There is a noticeable shift happening in the LLM space.

Models today can:

Generate cleaner and more structured code.
Explain complex topics in simpler ways.
Maintain longer and more coherent conversations.

Yet at the same time, they still:

Produce confident hallucinations.
Fail at multi-step reasoning tasks.
Break under slightly unfamiliar or challenging inputs.

This raises a critical question.

Are we advancing intelligence, or optimizing presentation?

Most improvements so far seem driven by:

Larger datasets.
Increased scale.
Alignment techniques like RLHF.

But these do not necessarily lead to genuine reasoning ability.

What still appears fundamentally missing:

Persistent memory across interactions.
True reasoning rather than pattern completion.
Grounded understanding connected to real-world context.
Reliable self-correction and verification mechanisms.

If current scaling trends start to plateau, the next breakthrough will not come from doing more of the same.

So the real question for the community is:

If you were designing the next generation of AI systems, where would you focus?

A. Larger models and compute
B. Higher-quality and structured data
C. Agent-based systems with tool use and memory
D. New architectures beyond transformers

This is not just a technical discussion. It defines where AI is actually heading over the next few years.

I am interested to hear how others are thinking about this.
albertvillanova
posted an update 2 months ago
🚀 TRL v0.29.0 introduces trl-training: an agent-native training skill.

This makes the TRL CLI a structured, agent-readable capability, allowing AI agents to reliably execute training workflows such as:
- Supervised Fine-Tuning (SFT)
- Direct Preference Optimization (DPO)
- Group Relative Policy Optimization (GRPO)
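For readers who have not used the TRL CLI, here is a minimal sketch of how the three workflows above map to command lines. It only builds the argument list and does not describe the trl-training skill format itself. The flags `--model_name_or_path`, `--dataset_name`, and `--output_dir` follow the TRL CLI documentation as far as I know; the model id, dataset id, and output path are placeholders, and `build_trl_command` is a hypothetical helper, not part of TRL.

```python
import shlex

# Training methods named in the post, used here as TRL CLI subcommands.
SUPPORTED_METHODS = ("sft", "dpo", "grpo")


def build_trl_command(method, model, dataset, output_dir, extra_args=()):
    """Return the argv list for a `trl <method>` invocation (hypothetical helper)."""
    if method not in SUPPORTED_METHODS:
        raise ValueError(f"unsupported method: {method!r}")
    return [
        "trl", method,
        "--model_name_or_path", model,
        "--dataset_name", dataset,
        "--output_dir", output_dir,
        *extra_args,  # method-specific flags, e.g. a reward setup for GRPO
    ]


cmd = build_trl_command(
    "sft",
    model="Qwen/Qwen2.5-0.5B",   # placeholder model id
    dataset="trl-lib/Capybara",  # placeholder dataset id
    output_dir="sft-output",     # hypothetical output path
)
print(shlex.join(cmd))
# To actually launch training: subprocess.run(cmd, check=True)
```

Keeping the command as a list rather than a shell string avoids quoting problems when an agent fills in the arguments. GRPO runs typically need extra reward-related flags, which is what `extra_args` is for; check the release notes for the exact names.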

We're excited to see what the community builds on top of this.

If you're working on AI agents, alignment research, or scalable RL training infrastructure: give TRL v0.29.0 a try! 🤗

The future of ML tooling is agent-native.
🔗 https://github.com/huggingface/trl/releases/tag/v0.29.0
albertvillanova
posted an update 3 months ago
5 years already working on democratizing AI 🤗
Grateful to be part of such an awesome team making it happen every day.
pcuenq
posted an update 4 months ago
👉 What happened in AI in 2025? 👈

We prepared the 2025 version of the HF AI Timeline Grid, highlighting open vs API-based model releases, and allowing you to browse and filter by access, modality, and release type!

Play with it here:
2025-ai-timeline/2025-ai-timeline

Here's my personal quarterly TL;DR:

1️⃣ Q1: Learning to Reason
DeepSeek not only releases a top-notch reasoning model, but also shows how to train one and compete with closed frontier models. OpenAI debuts Deep Research.

Significant milestones: DeepSeek R1 & R1-Zero, Qwen 2.5 VL, OpenAI Deep Research, Gemini 2.5 Pro (experimental)

2️⃣ Q2: Multimodality and Coding
More LLMs embrace multimodality by default, and there's a surge in coding agents. Strong vision, audio, and generative models emerge.

Significant milestones: Llama 4, Qwen 3, Imagen 4, OpenAI Codex, Google Jules, Claude 4

3️⃣ Q3: "Gold" rush, OpenAI opens up, the community goes bananas
Flagship models get gold in math olympiads and hard benchmarks. OpenAI releases strong open source models and Google releases the much-anticipated nano-banana for image generation and editing. Agentic workflows become commonplace.

Significant milestones: Gemini and OpenAI IMO Gold, gpt-oss, Gemini 2.5 Flash Image, Grok 4, Claude Sonnet 4.5

4️⃣ Q4: Mistral returns, leaderboard hill-climbing
Mistral is back with updated model families. All labs release impressive models to wrap up the year!

Significant milestones: Claude Opus 4.5, DeepSeek Math V2, FLUX 2, GPT 5.1, Kimi K2 Thinking, Nano Banana Pro, GLM 4.7, Gemini 3, Mistral 3, MiniMax M2.1 🤯

Credits
🙏 NHLOCAL for the source data https://github.com/NHLOCAL/AiTimeline

🫡 @reach-vb for the original idea, design and recipe

🙌 @ariG23498 and yours truly for compiling and verifying the 2025 edition

🥳 Here's to 2026, wishing it becomes the best year ever for open releases and on-device-first use-cases! 🥂
DavidVivancos
posted an update 5 months ago
DavidVivancos
posted an update 6 months ago
DavidVivancos
posted an update 6 months ago
multimodalart
posted an update 7 months ago
Want to iterate on a Hugging Face Space with an LLM?

Now you can easily convert any entire HF repo (Model, Dataset or Space) to a single text file and feed it to a language model!

multimodalart/repo2txt
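To make the idea concrete, here is a minimal sketch of flattening a repository into one text blob. It is not the implementation behind the Space: it assumes the repo is already on disk (for example, fetched with `huggingface_hub.snapshot_download`), and the function name `repo_to_text`, the suffix list, and the `=====` separator format are all hypothetical choices.

```python
import tempfile
from pathlib import Path


def repo_to_text(repo_dir, suffixes=(".py", ".md", ".txt", ".json", ".yaml", ".yml")):
    """Concatenate the text files under repo_dir into one prompt-ready string."""
    root = Path(repo_dir)
    parts = []
    for path in sorted(root.rglob("*")):
        # Keep only regular files with a text-like suffix.
        if not path.is_file() or path.suffix not in suffixes:
            continue
        # Skip version-control metadata.
        if ".git" in path.relative_to(root).parts:
            continue
        rel = path.relative_to(root).as_posix()
        body = path.read_text(encoding="utf-8", errors="replace")
        # Prefix each file with its relative path so the model can tell files apart.
        parts.append(f"===== {rel} =====\n{body.rstrip()}\n")
    return "\n".join(parts)


# Small demo on a throwaway directory standing in for a downloaded Space.
with tempfile.TemporaryDirectory() as demo_dir:
    Path(demo_dir, "app.py").write_text("print('hello')\n", encoding="utf-8")
    Path(demo_dir, "README.md").write_text("# Demo Space\n", encoding="utf-8")
    print(repo_to_text(demo_dir))
```

Filtering by suffix keeps binary assets such as images and model weights out of the prompt; a real tool would likely also cap file sizes.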
ehristoforu
posted an update 8 months ago
🚀 Hello from the Project Fluently team!

✨ We are happy to share our new general-purpose LLMs based on Qwen3 1.7B and 4B: powerful, multilingual, and ready to solve a wide range of problems!

🛠️ We have conducted additional training and carefully merged the models to achieve even better results and maximize their potential.

🆓 And most importantly, the models are completely open and free under the Apache-2.0 license!

🔗 Links to repositories:
- FluentlyQwen3-4B: fluently/FluentlyQwen3-4B
- FluentlyQwen3-1.7B: fluently/FluentlyQwen3-1.7B

😍 We will be very glad to hear your feedback and impressions! Your opinion is very important to us!
albertvillanova
posted an update 9 months ago
Latest smolagents release supports GPT-5: build agents that think, plan, and act.
⚡ Upgrade now and put GPT-5 to work!
BrigitteTousi
posted an update 9 months ago
albertvillanova
posted an update 9 months ago
🚀 smolagents v1.21.0 is here!
Now with improved safety in the local Python executor: dunder calls are blocked!
⚠️ Still, it is not fully isolated: for untrusted code, use a remote executor instead (Docker, E2B, or Wasm).
✨ Many bug fixes: more reliable code.
👉 https://github.com/huggingface/smolagents/releases/tag/v1.21.0
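To illustrate what blocking dunder access means, here is a minimal sketch that statically rejects code touching double-underscore attributes or names. It is not how smolagents implements the check; `reject_dunder_access` and `DunderAccessError` are hypothetical names, and the approach (walking the tree from Python's `ast` module) is an assumption chosen for brevity.

```python
import ast


class DunderAccessError(ValueError):
    """Raised when code touches a double-underscore attribute or name."""


def _is_dunder(name):
    # True for names such as __class__, False for plain or single-underscore names.
    return len(name) > 4 and name.startswith("__") and name.endswith("__")


def reject_dunder_access(source):
    """Parse source and raise if any attribute or bare name is a dunder."""
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Attribute) and _is_dunder(node.attr):
            raise DunderAccessError(f"blocked attribute: {node.attr}")
        if isinstance(node, ast.Name) and _is_dunder(node.id):
            raise DunderAccessError(f"blocked name: {node.id}")


reject_dunder_access("total = sum([1, 2, 3])")  # ordinary code passes silently

try:
    # A classic sandbox-escape pattern: climb from a tuple to every loaded class.
    reject_dunder_access("().__class__.__bases__[0].__subclasses__()")
except DunderAccessError as exc:
    print(exc)
```

A static check like this is easy to sidestep, for example by calling `getattr` with a name assembled at runtime. That is consistent with the warning above: in-process filtering reduces risk but is not isolation.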
BrigitteTousi
posted an update 9 months ago
New interactive viz from AI World showing OpenAI's new open model gpt-oss-120b breaking into the top 50 most liked models of all time on the Hub in under a day! ☄️☄️☄️
merterbak
posted an update 9 months ago
OpenAI is now open again! Check out OpenAI's brand new gpt-oss-20b model hosted on ZeroGPU 🤗

merterbak/gpt-oss-20b-demo
BrigitteTousi
posted an update 10 months ago
This is what Hugging Face is all about. We want everyone, hobbyists, researchers and industry alike, to be able to contribute to AI because everyone is affected by it. Kudos to HF's @irenesolaiman for spreading the word! 🔥🤗
albertvillanova
posted an update 10 months ago
🚀 New in smolagents v1.20.0: Remote Python Execution via WebAssembly (Wasm)

We've just merged a major new capability into the smolagents framework: the CodeAgent can now execute Python code remotely in a secure, sandboxed WebAssembly environment!

🔧 Powered by Pyodide and Deno, this new WasmExecutor lets your agent-generated Python code run safely, without relying on Docker or local execution.

Why this matters:
✅ Isolated execution = no host access
✅ No need for Python on the user's machine
✅ Safer evaluation of arbitrary code
✅ Compatible with serverless / edge agent workloads
✅ Ideal for constrained or untrusted environments
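As a rough sketch of how an application might opt into the sandbox, the snippet below picks an executor and passes it to `CodeAgent`. It assumes that `executor_type="wasm"` is the value that selects the WasmExecutor (check the release notes to confirm) and that Deno is installed. The selection policy and the names `pick_executor_type` and `build_agent` are hypothetical, not part of smolagents.

```python
def pick_executor_type(code_is_trusted, docker_available=False):
    """Choose an executor_type string; run locally only when the code is trusted."""
    if code_is_trusted:
        return "local"
    # For untrusted code, prefer an isolated executor (hypothetical policy).
    return "docker" if docker_available else "wasm"


def build_agent(model, code_is_trusted=False, docker_available=False):
    """Create a CodeAgent whose generated code runs in the chosen executor."""
    # Imported lazily so this sketch can be read and tested without smolagents installed.
    from smolagents import CodeAgent

    return CodeAgent(
        tools=[],
        model=model,
        # Assumption: "wasm" routes execution to the WasmExecutor in v1.20.0+.
        executor_type=pick_executor_type(code_is_trusted, docker_available),
    )


print(pick_executor_type(code_is_trusted=False))
```

Centralizing the choice in one function makes it harder to run agent-generated code locally by accident.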

This is just the beginning: a focused initial implementation with known limitations, and a solid MVP designed for secure, sandboxed use cases. 💡

💡 We're inviting the open-source community to help evolve this executor:
• Tackle more advanced Python features
• Expand compatibility
• Add test coverage
• Shape the next-gen secure agent runtime

🔗 Check out the PR: https://github.com/huggingface/smolagents/pull/1261

Let's reimagine what agent-driven Python execution can look like: remote-first, wasm-secure, and community-built.

This feature is live in smolagents v1.20.0!
Try it out.
Break things. Extend it. Give us feedback.
Let's build safer, smarter agents together 🧠⚙️

👉 https://github.com/huggingface/smolagents/releases/tag/v1.20.0

#smolagents #WebAssembly #Python #AIagents #Pyodide #Deno #OpenSource #HuggingFace #AgenticAI
albertvillanova
posted an update 11 months ago
🚀 smolagents v1.19.0 is live!
This release brings major improvements to agent flexibility, UI usability, streaming architecture, and developer experience, making it easier than ever to build smart, interactive AI agents. Here's what's new:

🔧 Agent Upgrades
- Support for managed agents in ToolCallingAgent
- Context manager support for cleaner agent lifecycle handling
- Output formatting now uses XML tags for consistency

🖥️ UI Enhancements
- GradioUI now supports reset_agent_memory: perfect for fresh starts in dev & demos.

🔄 Streaming Refactor
- Streaming event aggregation moved off the Model class
- ➡️ Better architecture & maintainability

📦 Output Tracking
- CodeAgent outputs are now stored in ActionStep
- ✅ More visibility and structure to agent decisions

🐛 Bug Fixes
- Smarter planning logic
- Cleaner Docker logs
- Better prompt formatting for additional_args
- Safer internal functions and final answer matching

📚 Docs Improvements
- Added quickstart examples with tool usage
- One-click Colab launch buttons
- Expanded reference docs (AgentMemory, GradioUI docstrings)
- Fixed broken links and migrated to .md format

🔗 Full release notes:
https://github.com/huggingface/smolagents/releases/tag/v1.19.0

💬 Try it out, explore the new features, and let us know what you build!

#smolagents #opensource #AIagents #LLM #HuggingFace
multimodalart
posted an update 11 months ago
Self-Forcing, a real-time video model distilled from Wan 2.1 by @adobe, is out, and they open-sourced it 🐐

I've built a live real-time demo on Spaces 📹💨

multimodalart/self-forcing