Haruka Takahashi
harukatakahashi
AI & ML interests
None yet
Recent Activity
upvoted a paper 1 day ago
ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning upvoted a paper 1 day ago
Heterogeneous Agent Collaborative Reinforcement Learning upvoted a paper 9 months ago
Saffron-1: Towards an Inference Scaling Paradigm for LLM Safety
Assurance Organizations
None yet