AI & ML interests
Length-aware reinforcement learning fine-tuning, reasoning models, efficient inference, post-training, controllable generation, LLM alignment.
laconic-llm's datasets
None public yet