MicrowaveJack (Trevor Miller)

liked 3 Spaces 7 months ago

The Ultra-Scale Playbook

🌌

3.88k

The ultimate guide to training LLM on large GPU Clusters

FineWeb: decanting the web for the finest text data at scale

🍷

1.36k

Explore and download the FineWeb web‑scale text dataset

The Smol Training Playbook

📚

3.2k

The secrets to building world-class LLMs

liked a model 8 months ago

microsoft/UserLM-8b

Text Generation • 8B • Updated 1 day ago • 621 • • 371

liked a Space over 1 year ago

Qwen2.5 Coder Artifacts

🐢

1.73k

Generate and preview app code from a text description

liked a model over 1 year ago

BAAI/bge-small-en-v1.5

Feature Extraction • 33.4M • Updated Feb 22, 2024 • 52.7M • • 486

liked a dataset over 1 year ago

gretelai/gretel-math-gsm8k-v1

Viewer • Updated Oct 16, 2024 • 24.9k • 482 • 38

liked a dataset almost 2 years ago

TIGER-Lab/SKGInstruct

Preview • Updated Apr 9, 2024 • 62 • 28

liked 2 models almost 2 years ago

google/gemma-scope

Updated Aug 29, 2024 • 202

meta-llama/Llama-3.1-8B-Instruct

Text Generation • 8B • Updated Sep 25, 2024 • 9.89M • • 6.05k

liked 2 models about 2 years ago

VAGOsolutions/Kraken-LoRA

Updated May 28, 2024 • 5 • 38

failspy/Llama-3-8B-Instruct-MopeyMule

Text Generation • 8B • Updated May 30, 2024 • 16 • • 86

liked a dataset about 2 years ago

TIGER-Lab/MMLU-Pro

Benchmark • Updated May 2 • 12.1k • 164k • 482

liked 4 models over 2 years ago

liked 2 models almost 3 years ago

NumbersStation/nsql-llama-2-7B

Text Generation • Updated Mar 10 • 1.32k • • 82

stabilityai/stablecode-instruct-alpha-3b

Text Generation • 3B • Updated Aug 8, 2023 • 9 • 303

liked a model about 3 years ago

MrHup/coloring-book

Updated May 21, 2023 • 40

Trevor Miller

AI & ML interests

Organizations

The Ultra-Scale Playbook

FineWeb: decanting the web for the finest text data at scale

The Smol Training Playbook

microsoft/UserLM-8b

Qwen2.5 Coder Artifacts

BAAI/bge-small-en-v1.5

gretelai/gretel-math-gsm8k-v1

TIGER-Lab/SKGInstruct

google/gemma-scope

meta-llama/Llama-3.1-8B-Instruct

VAGOsolutions/Kraken-LoRA

failspy/Llama-3-8B-Instruct-MopeyMule

TIGER-Lab/MMLU-Pro

TheBloke/CodeLlama-70B-hf-GGUF

mistralai/Mixtral-8x7B-Instruct-v0.1

microsoft/phi-2

thesephist/contra-bottleneck-t5-large-wikipedia

NumbersStation/nsql-llama-2-7B

stabilityai/stablecode-instruct-alpha-3b

MrHup/coloring-book

Trevor Miller

AI & ML interests

Organizations

MicrowaveJack's activity

The Ultra-Scale Playbook

FineWeb: decanting the web for the finest text data at scale

The Smol Training Playbook

Qwen2.5 Coder Artifacts