Trained a Swin-T from scratch on NWPU-RESISC45: no pretrained weights, no fine-tuning. Every component hand-coded in PyTorch: window partitioning, shifted window attention with relative positional bias, patch merging across 4 stages, ~28M parameters.

Architecture:
- embed_dim=96, window_size=7, depths=[2, 2, 6, 2]
- heads=[3, 6, 12, 24] across stages
- Patch embed via Conv2d (4×4, stride 4) → 56×56 feature map
- PatchMerging downsamples by concatenating 2×2 neighbors + linear projection
- Global average pooling → linear classifier

Training:
- AdamW (lr=3e-4, weight_decay=0.05)
- Cosine annealing with 3-epoch linear warmup over 20 epochs
- Mixed precision (autocast + GradScaler)
- Gradient clipping (max_norm=1.0)
- Label smoothing (0.1)
- ImageNet normalization, batch size 32
- 80/20 train/test split, seed=42

Result: 82% test accuracy on 45 land-use categories, 31,500 images.

🔗 Sathya77/swin-transformer-satellite

What accuracy do you think is achievable on NWPU-RESISC45 with Swin-T trained from scratch, without any pretraining?
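For readers who haven't hand-coded window partitioning before, here is a minimal sketch of the partition/reverse pair the post mentions. This is not the repo's exact code, just the standard reshape-and-permute formulation; the shapes match the post's stage-1 setup (56×56 map, window_size=7, embed_dim=96).

```python
import torch

def window_partition(x, window_size=7):
    """Split a (B, H, W, C) feature map into non-overlapping windows.

    Returns (num_windows * B, window_size, window_size, C).
    """
    B, H, W, C = x.shape
    x = x.view(B, H // window_size, window_size, W // window_size, window_size, C)
    windows = x.permute(0, 1, 3, 2, 4, 5).contiguous()
    return windows.view(-1, window_size, window_size, C)

def window_reverse(windows, window_size, H, W):
    """Inverse of window_partition: stitch windows back into (B, H, W, C)."""
    B = windows.shape[0] // ((H // window_size) * (W // window_size))
    x = windows.view(B, H // window_size, W // window_size,
                     window_size, window_size, -1)
    x = x.permute(0, 1, 3, 2, 4, 5).contiguous()
    return x.view(B, H, W, -1)

# On a 56x56 stage-1 map with window_size=7: 8*8 = 64 windows per image.
feat = torch.randn(2, 56, 56, 96)
wins = window_partition(feat, 7)            # (2*64, 7, 7, 96)
restored = window_reverse(wins, 7, 56, 56)  # back to (2, 56, 56, 96)
```

Shifted window attention then becomes a `torch.roll` of the map before partitioning, plus a mask for the wrapped-around windows.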
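The PatchMerging step the post describes (concatenate 2×2 neighbors, then linearly project) can be sketched like this. Again a generic formulation, not the author's exact module; the 4C → 2C projection and pre-norm follow the original Swin design.

```python
import torch
import torch.nn as nn

class PatchMerging(nn.Module):
    """Downsample a (B, H, W, C) map to (B, H/2, W/2, 2C):
    concatenate each 2x2 neighborhood (4C channels), normalize,
    then project 4C -> 2C with a linear layer."""

    def __init__(self, dim):
        super().__init__()
        self.norm = nn.LayerNorm(4 * dim)
        self.reduction = nn.Linear(4 * dim, 2 * dim, bias=False)

    def forward(self, x):  # x: (B, H, W, C), H and W even
        x0 = x[:, 0::2, 0::2, :]  # top-left of each 2x2 block
        x1 = x[:, 1::2, 0::2, :]  # bottom-left
        x2 = x[:, 0::2, 1::2, :]  # top-right
        x3 = x[:, 1::2, 1::2, :]  # bottom-right
        x = torch.cat([x0, x1, x2, x3], dim=-1)  # (B, H/2, W/2, 4C)
        return self.reduction(self.norm(x))      # (B, H/2, W/2, 2C)

merge = PatchMerging(96)
out = merge(torch.randn(2, 56, 56, 96))  # (2, 28, 28, 192)
```

Applied after each of the first three stages, this yields the 96 → 192 → 384 → 768 channel progression implied by depths=[2, 2, 6, 2].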
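The training recipe (AdamW, warmup + cosine schedule, AMP, clipping, label smoothing) can be wired together as below. This is a minimal sketch under stated assumptions: the `nn.Linear` stand-in and the random batch are placeholders for the real Swin-T and RESISC45 loader, and one scheduler step per epoch is assumed.

```python
import torch
import torch.nn as nn

device = "cuda" if torch.cuda.is_available() else "cpu"
model = nn.Linear(96, 45).to(device)  # placeholder for the ~28M-param Swin-T
opt = torch.optim.AdamW(model.parameters(), lr=3e-4, weight_decay=0.05)
# 3-epoch linear warmup, then cosine annealing over the remaining 17 of 20 epochs
sched = torch.optim.lr_scheduler.SequentialLR(
    opt,
    [torch.optim.lr_scheduler.LinearLR(opt, start_factor=0.01, total_iters=3),
     torch.optim.lr_scheduler.CosineAnnealingLR(opt, T_max=17)],
    milestones=[3],
)
scaler = torch.cuda.amp.GradScaler(enabled=(device == "cuda"))
criterion = nn.CrossEntropyLoss(label_smoothing=0.1)  # smoothing 0.1

for epoch in range(20):
    # placeholder batch; a real DataLoader would yield (image, label) pairs
    x = torch.randn(32, 96, device=device)
    y = torch.randint(0, 45, (32,), device=device)
    opt.zero_grad(set_to_none=True)
    with torch.autocast(device_type=device, enabled=(device == "cuda")):
        loss = criterion(model(x), y)
    scaler.scale(loss).backward()
    scaler.unscale_(opt)  # so clipping sees true gradient magnitudes
    torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
    scaler.step(opt)
    scaler.update()
    sched.step()
```

Unscaling before `clip_grad_norm_` matters: otherwise the clip threshold would be compared against loss-scaled gradients.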