Just published a hands-on guide on building a Kubernetes cluster from scratch on AWS EC2 using kubeadm: no managed services, no shortcuts.
If you want to truly understand how the control plane and workers communicate, how pod networking works with Flannel, and how to lock down access with security groups, then this is the kind of exercise that makes it click.
The guide covers a full 3-node setup (1 control plane + 2 workers) on Amazon Linux 2023, from instance provisioning all the way to deploying your first workload.
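To give a flavor of the security-group step, here's a minimal boto3 sketch. It's not taken from the guide; the region, group ID, and port list are illustrative, following kubeadm's documented port requirements plus Flannel's VXLAN port.

```python
import boto3

# Hypothetical values: replace with your own region and security group ID.
REGION = "us-east-1"
SG_ID = "sg-0123456789abcdef0"

# Standard kubeadm ports, plus UDP 8472 for Flannel's VXLAN overlay.
RULES = [
    ("tcp", 6443, 6443),      # Kubernetes API server
    ("tcp", 2379, 2380),      # etcd server client API
    ("tcp", 10250, 10250),    # kubelet API
    ("tcp", 30000, 32767),    # NodePort services (workers)
    ("udp", 8472, 8472),      # Flannel VXLAN
]

ec2 = boto3.client("ec2", region_name=REGION)
ec2.authorize_security_group_ingress(
    GroupId=SG_ID,
    IpPermissions=[
        {
            "IpProtocol": proto,
            "FromPort": lo,
            "ToPort": hi,
            # Self-referencing rule: only instances in this same
            # security group can reach these ports (node-to-node only).
            "UserIdGroupPairs": [{"GroupId": SG_ID}],
        }
        for proto, lo, hi in RULES
    ],
)
```

Keeping cluster traffic on a self-referencing rule like this means only the SSH and API-server ports ever need to be opened to your own IP.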
The latest release of the Haystack OSS LLM framework adds a long-requested feature: image support!
📓 Notebooks below
This isn't just about passing images to an LLM. We built several features to enable practical multimodal use cases.
What's new?
🧠 Support for multiple LLM providers: OpenAI, Amazon Bedrock, Google Gemini, Mistral, NVIDIA, OpenRouter, Ollama, and more (Hugging Face API support coming 🔜)
🎛️ A prompt template language that handles structured inputs, including images
📄 PDF and image converters
🔍 Image embedders using CLIP-like models
🧾 An LLM-based extractor to pull text from images
🧩 Components to build multimodal RAG pipelines and Agents
I had the chance to lead this effort with @sjrhuschlee (great collab).
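Here's roughly what the new image support looks like in code. This is a minimal sketch assuming the ImageContent dataclass and its from_file_path helper from this release; "invoice.jpg" is a placeholder, and the notebooks have the canonical versions.

```python
from haystack.components.generators.chat import OpenAIChatGenerator
from haystack.dataclasses import ChatMessage, ImageContent

# Load a local image ("invoice.jpg" is a placeholder path);
# ImageContent handles base64 encoding of the file.
image = ImageContent.from_file_path("invoice.jpg")

# A user message can now mix text and image parts in one prompt.
message = ChatMessage.from_user(content_parts=["Summarize this document.", image])

generator = OpenAIChatGenerator(model="gpt-4o-mini")
result = generator.run(messages=[message])
print(result["replies"][0].text)
```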
Build something cool with Nano Banana (aka Gemini 2.5 Flash Image) AIO [All-in-One]. Draw and transform on canvas, edit images, and generate images, all in one place! 🍌
✦︎ Built with the Gemini API (GCP). Try it here: prithivMLmods/Nano-Banana-AIO (Space added Sep 18 '25)
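Under the hood, the generation step boils down to a single Gemini API call. Here's a minimal sketch using the google-genai Python SDK rather than the Space's actual code; the prompt and output filename are placeholders.

```python
from google import genai

client = genai.Client()  # reads the API key from the environment

# "Nano Banana" is the preview image model: prompts in, generated/edited images out.
response = client.models.generate_content(
    model="gemini-2.5-flash-image-preview",
    contents=["A hand-drawn banana rocket launching over a pixel-art city"],
)

# The reply can interleave text and image parts; save any inline image bytes.
for part in response.candidates[0].content.parts:
    if part.inline_data is not None:
        with open("banana.png", "wb") as f:
            f.write(part.inline_data.data)
```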
Low-Rank Adaptation (LoRA) is the go-to method for efficient fine-tuning: instead of retraining the full model, it trains small low-rank matrices added on top of the frozen weights. The field isn't standing still, and new LoRA variants keep pushing the limits of efficiency, generalization, and personalization. So we're sharing 10 of the latest LoRA approaches you should know about (with a bare-bones sketch of a vanilla LoRA layer after the list for orientation):
4. aLoRA (Activated LoRA) → Activated LoRA: Fine-tuned LLMs for Intrinsics (2504.12397)
Activates the LoRA weights only on tokens generated after the adapter is invoked, so the model can reuse the base model's KV cache for the preceding context instead of recomputing the full turn's KV cache. Efficient in multi-turn conversations.
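For orientation, here is the vanilla LoRA update that these variants build on, as a minimal PyTorch sketch. It isn't from any of the papers above; the class name, init scale, and hyperparameters are illustrative.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen linear layer with a trainable low-rank update: y = Wx + (alpha/r) * B(Ax)."""

    def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # only the adapter is trained

        # A starts random, B starts at zero, so training begins exactly at the base model.
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))
        self.scaling = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.scaling * (x @ self.A.T @ self.B.T)

# Wrapping a 4096x4096 projection: r * (in + out) = 65K trainable params
# instead of ~16.8M, i.e. under 0.4% of the original layer.
layer = LoRALinear(nn.Linear(4096, 4096), r=8, alpha=16)
```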