I really appreciate the effort you put into explaining this so well. Just one doubt: what exactly is being cached?
- The QK^T dot-product results and the value vectors of the already generated tokens, or
- just the key vectors and the value vectors of the already generated tokens?
Also, is this done for each transformer block in an LLM?
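To make the question concrete, here's a minimal sketch of what I currently think happens, with a separate (K, V) cache per transformer block and the attention scores recomputed each step. All names, shapes, and the toy projections are my own assumptions, not from your post:

```python
import numpy as np

# Minimal sketch of per-layer KV caching during autoregressive decoding.
# Assumption: one cache per transformer block, holding only K and V vectors;
# the QK^T scores are NOT stored and are recomputed at every step.

d_model, n_layers = 8, 2
rng = np.random.default_rng(0)
# Hypothetical per-layer projection matrices (single head, no output proj).
Wq = [rng.standard_normal((d_model, d_model)) for _ in range(n_layers)]
Wk = [rng.standard_normal((d_model, d_model)) for _ in range(n_layers)]
Wv = [rng.standard_normal((d_model, d_model)) for _ in range(n_layers)]

# One (K, V) cache per layer.
cache = [{"K": [], "V": []} for _ in range(n_layers)]

def decode_step(x):
    """Process one new token embedding x through all layers."""
    for layer in range(n_layers):
        q = x @ Wq[layer]                        # query for the new token only
        cache[layer]["K"].append(x @ Wk[layer])  # append this token's key
        cache[layer]["V"].append(x @ Wv[layer])  # append this token's value
        K = np.stack(cache[layer]["K"])          # (seq_len, d_model)
        V = np.stack(cache[layer]["V"])
        scores = q @ K.T / np.sqrt(d_model)      # QK^T recomputed each step
        weights = np.exp(scores - scores.max())
        weights /= weights.sum()                 # softmax over cached keys
        x = weights @ V                          # attention output feeds next layer
    return x

for _ in range(3):
    decode_step(rng.standard_normal(d_model))

print([len(c["K"]) for c in cache])  # each layer's cache grew by one per token: [3, 3]
```

Is this roughly the right mental model, i.e., only K and V are kept per block?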