Rajesh Kumar
rajeshkhannapall
ยท
AI & ML interests
Ai
Recent Activity
reacted to DedeProGames's post with ๐ about 13 hours ago
Introducing GRM2, a powerful 3 billion parameter model designed for long-term reasoning and high performance in complex tasks.
Even with only 3 billion parameters, it outperforms qwen3-32b in several benchmarks and complex reasoning tasks.
With just 3 billion parameters, it can also generate extensive and complex code with over 1000 lines, utilize tools comparable to larger models, and is perfect for agentic tasks.
GRM2 is licensed under Apache 2.0, making it ideal as a base for FineTune in other tasks.
You can see more here: https://huggingface.co/OrionLLM/GRM2-3b upvoted a paper about 17 hours ago
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation upvoted a paper about 17 hours ago
VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding