AKSARA Demo
AKSARA CLM: model bahasa Indonesia memahami makna
Totally agree, safety constraints are really the core challenge here.
What makes it tricky is that “unsafe” isn’t just about specific commands, but about how they’re composed and the context they run in. Two syntactically valid commands can have very different risk profiles depending on scope, permissions, and recursion.
I think the interesting direction is combining:
Datasets like this are great for learning the mapping, but the real gap is teaching models when not to execute or when to ask for confirmation.
That’s probably where smaller, practical terminal agents will differentiate the most.
Nice dataset—this kind of NL ↔ Bash pairing is genuinely useful for grounding LLMs in real system actions.
The interesting part will be how well it handles:
compositional commands
edge cases and flags
safety constraints (destructive ops, permissions)
Quality and diversity probably matter more than size here, especially for terminal use.
Still, a solid direction for making smaller models more practically useful.