Harsh Tester

Harsh15aug

AI & ML interests

None yet

Recent Activity

upvoted an article 9 days ago

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

upvoted a paper 15 days ago

LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR

liked a Space 28 days ago

multimodalart/qwen-image-multiple-angles-3d-camera

View all activity

Organizations

None yet

upvoted an article 9 days ago

Article

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

18 days ago

•

upvoted a paper 15 days ago

LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR

Paper • 2601.14251 • Published 17 days ago • 23

liked a Space 28 days ago

Qwen Image Multiple Angles 3D Camera

🎥

1.36k

Adjust camera angles in images using 3D controls or sliders

liked a model 6 months ago

docling-project/SmolDocling-256M-preview

Image-Text-to-Text • 0.3B • Updated Sep 17, 2025 • 36.1k • 1.61k

upvoted an article 7 months ago

Article

Efficient MultiModal Data Pipeline

Jul 8, 2025

•

upvoted a collection 8 months ago

V-JEPA 2

Collection

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13, 2025 • 191

upvoted an article 8 months ago

Article

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

Jun 3, 2025

•

liked 2 models 8 months ago

nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1

Image-Text-to-Text • Updated Dec 4, 2025 • 951k • 175

facebook/KernelLLM

Text Generation • 8B • Updated 23 days ago • 1.83k • • 189

upvoted an article 8 months ago

Article

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

May 23, 2025

•

171

upvoted a paper 9 months ago

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published May 23, 2025 • 81

upvoted 6 articles 9 months ago

Article

TinyAgents: A Minimal Experiment with Code Agents and MCP Tools

May 16, 2025

•

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

May 21, 2025

•

251

Article

The Transformers Library: standardizing model definitions

May 15, 2025

•

121

Article

Vision Language Models (Better, faster, stronger)

May 12, 2025

•

593

Article

How to Build an MCP Server with Gradio

Apr 30, 2025

•

202

Article

Tiny Agents: an MCP-powered agent in 50 lines of code

Apr 25, 2025

•

306

upvoted an article 10 months ago

Article

An Introduction to AI Model Optimization Techniques

Apr 18, 2025

•

liked a Space 10 months ago

DeepSite v4

🐳

16.4k

Generate any application by Vibe Coding it

liked a model 10 months ago

deepseek-ai/DeepSeek-V3-0324

Text Generation • 685B • Updated Mar 27, 2025 • 262k • • 3.09k

Harsh Tester

AI & ML interests

Recent Activity

Organizations

Harsh15aug's activity

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

Qwen Image Multiple Angles 3D Camera

Efficient MultiModal Data Pipeline

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

TinyAgents: A Minimal Experiment with Code Agents and MCP Tools

nanoVLM: The simplest repository to train your VLM in pure PyTorch

The Transformers Library: standardizing model definitions

Vision Language Models (Better, faster, stronger)

How to Build an MCP Server with Gradio

Tiny Agents: an MCP-powered agent in 50 lines of code

An Introduction to AI Model Optimization Techniques

DeepSite v4