Post
99
π§ Just shipped ML Research Assistant on HuggingFace Spaces!
An AI research assistant powered by Qwen2.5-7B with 17 auto-routing tools. Ask in plain English β it figures out the right tool automatically.
π§ What it can do:
β’ Compare LLMs on real benchmarks (MMLU, GSM8K, HumanEval, HellaSwag)
β’ Search ArXiv papers and find GitHub implementations
β’ Explore model benchmarks with real F1 and Accuracy scores
β’ Live AI news feed β click any headline to ask the assistant about it
β’ Web search, code generation, HuggingFace model and dataset search, and more
π‘ Try these:
β compare gpt-4o vs claude 3.5 sonnet on coding
β show me the overall llm leaderboard
β find papers with code for transformer attention
β trending ml repos on github
βοΈ Built without LangChain β I wrote the router from scratch so the model only gets called once per query. That matters on ZeroGPU where GPU time is limited.
β οΈ Still a v1 β handles one query at a time, cold start delay on first load. Multi-query support and streaming are coming next.
π Abdourakib/ml-research-assistant
π https://github.com/Abdrakib/ml-research-assistant
Would love feedback from the community β what would you add?
An AI research assistant powered by Qwen2.5-7B with 17 auto-routing tools. Ask in plain English β it figures out the right tool automatically.
π§ What it can do:
β’ Compare LLMs on real benchmarks (MMLU, GSM8K, HumanEval, HellaSwag)
β’ Search ArXiv papers and find GitHub implementations
β’ Explore model benchmarks with real F1 and Accuracy scores
β’ Live AI news feed β click any headline to ask the assistant about it
β’ Web search, code generation, HuggingFace model and dataset search, and more
π‘ Try these:
β compare gpt-4o vs claude 3.5 sonnet on coding
β show me the overall llm leaderboard
β find papers with code for transformer attention
β trending ml repos on github
βοΈ Built without LangChain β I wrote the router from scratch so the model only gets called once per query. That matters on ZeroGPU where GPU time is limited.
β οΈ Still a v1 β handles one query at a time, cold start delay on first load. Multi-query support and streaming are coming next.
π Abdourakib/ml-research-assistant
π https://github.com/Abdrakib/ml-research-assistant
Would love feedback from the community β what would you add?