Joe Hirst

joehirstdev

AI & ML interests

Multimodal AI

upvoted 2 articles 5 months ago

Article

Open Responses: What you need to know

evalstate, burtenshaw, merve, pcuenq • Jan 15 • 112

Article

Building Deep Research: How we Achieved State of the Art

Tavily • Nov 24, 2025 • 36

upvoted an article 10 months ago

Article

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

kuotient • Aug 9, 2025 • 59

upvoted a collection over 1 year ago

Gemma 3 Release

28 items • Updated Mar 12 • 643

upvoted 2 articles over 1 year ago

Article

Remote VAEs for decoding with Inference Endpoints 🤗

hlky, sayakpaul • Feb 24, 2025 • 41

Article

Welcome to Inference Providers on the Hub 🔥

burkaygur, zeke, aton2006, hassanelmghari, sbrandeis, kramp, julien-c • Jan 28, 2025 • 494

upvoted a collection over 1 year ago

DeepSeek-R1

10 items • Updated Nov 27, 2025 • 854

upvoted a paper almost 2 years ago

End-to-end speaker segmentation for overlap-aware resegmentation

Paper • 2104.04045 • Published Apr 8, 2021 • 2

upvoted a collection almost 2 years ago

Training Datasets

A collection of pseudo-labelled datasets used to train the Distil-Whisper model. • 9 items • Updated Mar 21, 2024 • 14

upvoted 4 articles almost 2 years ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

philschmid, osanseviero, alvarobartt, lvwerra, dvilasuero, reach-vb, marcsun13, pcuenq • Jul 23, 2024 • 241

Article

Introducing TextImage Augmentation for Document Images

danaaubakirova, Molbap, Ternaus • Aug 6, 2024 • 33

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

ybelkada, timdettmers, artidoro, sgugger, smangrul • May 24, 2023 • 180

Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

ybelkada, timdettmers • Aug 17, 2022 • 136

upvoted 2 collections almost 2 years ago

Llama 3.1 GPTQ, AWQ, and BNB Quants

Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & VLLM 🤗 • 9 items • Updated Sep 26, 2024 • 57

Gemma 2 2B Release

The 2.6B parameter version of Gemma 2. • 6 items • Updated Mar 12 • 85

upvoted an article almost 2 years ago

Article

TGI Multi-LoRA: Deploy Once, Serve 30 Models

derek-thomas, dmaniloff, drbh • Jul 18, 2024 • 63

upvoted a paper about 2 years ago

SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions

Paper • 2403.16627 • Published Mar 25, 2024 • 22

upvoted a collection about 2 years ago

DBRX

DBRX is a mixture-of-experts (MoE) large language model trained from scratch by Databricks. • 3 items • Updated Mar 27, 2024 • 96

upvoted 2 papers over 2 years ago

Orca 2: Teaching Small Language Models How to Reason

Paper • 2311.11045 • Published Nov 18, 2023 • 78

Efficient Few-Shot Learning Without Prompts

Paper • 2209.11055 • Published Sep 22, 2022 • 7