AI & ML interests

Large corpora

albertvillanovaย 
posted an update 3 months ago
view post
Post
2803
๐Ÿš€ TRL v0.29.0 introduces trl-training: an agent-native training skill.

This makes the TRL CLI a structured, agent-readable capability, allowing AI agents to reliably execute training workflows such as:
- Supervised Fine-Tuning (SFT)
- Direct Preference Optimization (DPO)
- Group Relative Policy Optimization (GRPO)

Weโ€™re excited to see what the community builds on top of this.

If youโ€™re working on AI agents, alignment research, or scalable RL training infrastructure: give TRL v0.29.0 a try! ๐Ÿค—

The future of ML tooling is agent-native.
๐Ÿ”— https://github.com/huggingface/trl/releases/tag/v0.29.0
albertvillanovaย 
posted an update 4 months ago
view post
Post
1979
5 years already working in democratizing AI ๐Ÿค—
Grateful to be part of such an awesome team making it happen every day.
albertvillanovaย 
posted an update 10 months ago
view post
Post
4627
Latest smolagents release supports GPT-5: build agents that think, plan, and act.
โšก Upgrade now and put GPT-5 to work!
  • 1 reply
ยท
albertvillanovaย 
posted an update 10 months ago
view post
Post
720
๐Ÿš€ smolagents v1.21.0 is here!
Now with improved safety in the local Python executor: dunder calls are blocked!
โš ๏ธ Still, not fully isolated: for untrusted code, use a remote executor instead: Docker, E2B, Wasm.
โœจ Many bug fixes: more reliable code.
๐Ÿ‘‰ https://github.com/huggingface/smolagents/releases/tag/v1.21.0
albertvillanovaย 
posted an update 11 months ago
view post
Post
841
๐Ÿš€ New in smolagents v1.20.0: Remote Python Execution via WebAssembly (Wasm)

We've just merged a major new capability into the smolagents framework: the CodeAgent can now execute Python code remotely in a secure, sandboxed WebAssembly environment!

๐Ÿ”ง Powered by Pyodide and Deno, this new WasmExecutor lets your agent-generated Python code run safely: without relying on Docker or local execution.

Why this matters:
โœ… Isolated execution = no host access
โœ… No need for Python on the user's machine
โœ… Safer evaluation of arbitrary code
โœ… Compatible with serverless / edge agent workloads
โœ… Ideal for constrained or untrusted environments

This is just the beginning: a focused initial implementation with known limitations. A solid MVP designed for secure, sandboxed use cases. ๐Ÿ’ก

๐Ÿ’ก We're inviting the open-source community to help evolve this executor:
โ€ข Tackle more advanced Python features
โ€ข Expand compatibility
โ€ข Add test coverage
โ€ข Shape the next-gen secure agent runtime

๐Ÿ”— Check out the PR: https://github.com/huggingface/smolagents/pull/1261

Let's reimagine what agent-driven Python execution can look like: remote-first, wasm-secure, and community-built.

This feature is live in smolagents v1.20.0!
Try it out.
Break things. Extend it. Give us feedback.
Let's build safer, smarter agents; together ๐Ÿง โš™๏ธ

๐Ÿ‘‰ https://github.com/huggingface/smolagents/releases/tag/v1.20.0

#smolagents #WebAssembly #Python #AIagents #Pyodide #Deno #OpenSource #HuggingFace #AgenticAI
albertvillanovaย 
posted an update 11 months ago
view post
Post
1868
๐Ÿš€ SmolAgents v1.19.0 is live!
This release brings major improvements to agent flexibility, UI usability, streaming architecture, and developer experience: making it easier than ever to build smart, interactive AI agents. Here's what's new:

๐Ÿ”ง Agent Upgrades
- Support for managed agents in ToolCallingAgent
- Context manager support for cleaner agent lifecycle handling
- Output formatting now uses XML tags for consistency

๐Ÿ–ฅ๏ธ UI Enhancements
- GradioUI now supports reset_agent_memory: perfect for fresh starts in dev & demos.

๐Ÿ”„ Streaming Refactor
- Streaming event aggregation moved off the Model class
- โžก๏ธ Better architecture & maintainability

๐Ÿ“ฆ Output Tracking
- CodeAgent outputs are now stored in ActionStep
- โœ… More visibility and structure to agent decisions

๐Ÿ› Bug Fixes
- Smarter planning logic
- Cleaner Docker logs
- Better prompt formatting for additional_args
- Safer internal functions and final answer matching

๐Ÿ“š Docs Improvements
- Added quickstart examples with tool usage
- One-click Colab launch buttons
- Expanded reference docs (AgentMemory, GradioUI docstrings)
- Fixed broken links and migrated to .md format

๐Ÿ”— Full release notes:
https://github.com/huggingface/smolagents/releases/tag/v1.19.0

๐Ÿ’ฌ Try it out, explore the new features, and let us know what you build!

#smolagents #opensource #AIagents #LLM #HuggingFace