If you've just found my work for the first time, these models are a great start.
Tyler Williams PRO
unmodeled-tyler
AI & ML interests
AI research engineer & solo operator of VANTA Research/Quanta Intellect
Recent Activity
liked a model about 19 hours ago: deepseek-ai/DeepSeek-V4-Pro
reacted to sergiopaniego's post with 🔥 about 24 hours ago
OpenEnv already ships 🚢 with a ready-to-deploy RLM environment on free HF Spaces
Drop "Attention Is All You Need", write code that spawns parallel LLM calls → ✅ correct answer, reward 1.0, in 4.2s
Run GRPO (TRL) → model learns to write that search strategy itself
test it yourself → https://huggingface.co/spaces/sergiopaniego/repl-env
check out OpenEnv → https://github.com/meta-pytorch/OpenEnv