tm23

tm23hgf

AI & ML interests

None yet

Recent Activity

updated a model 2 days ago

tm23hgf/wordle-qwen3-4b-sft

published a model 2 days ago

tm23hgf/wordle-qwen3-4b-sft

updated a Space 22 days ago

tm23hgf/rust_algo_reasoning

Organizations

None yet

updated a model 2 days ago

tm23hgf/wordle-qwen3-4b-sft

Updated 2 days ago

published a model 2 days ago

tm23hgf/wordle-qwen3-4b-sft

Updated 2 days ago

updated a Space 22 days ago

Algo Reasoning Environment

Score Rust algorithm solutions on correctness, reasoning, complexity

liked a Space about 1 month ago

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

Visualize synthetic‑data experiments as an interactive bookshelf

updated a model about 2 months ago

tm23hgf/anime-sdxl-lora

Updated May 8 • 2

published a model about 2 months ago

tm23hgf/anime-sdxl-lora

Updated May 8 • 2

commented on Strand-Rust-Coder-v1: Rust Coding Model Fine-Tuned on Peer-Ranked Synthetic Data about 2 months ago

Awesome work. I am going to start some research on reasoning SLMs for Rust and wanted to know: is the dataset publicly released?

liked 2 Spaces about 2 months ago

The ultimate guide to RL environments: building and scaling them in the LLM era

Building and scaling RL environments for LLM training

GPU Budget Negotiation Arena

Simulate GPU budget negotiations and view results

updated a Space 2 months ago

Social Network Env

Simulate a social network to detect coordinated inauthentic behavior

updated a dataset 2 months ago

tm23hgf/socialnet-sft

Viewer • Updated Apr 25 • 14.6k • 30

published a dataset 2 months ago

tm23hgf/socialnet-sft

Viewer • Updated Apr 25 • 14.6k • 30

published a Space 2 months ago

Social Network Env

Simulate a social network to detect coordinated inauthentic behavior

published a Space 3 months ago

Algo Reasoning Environment

Score Rust algorithm solutions on correctness, reasoning, complexity

New activity in BibbyResearch/3blue1brown-manim 7 months ago

Not a good dataset

#2 opened 7 months ago by

commented on Mixture of Experts Explained 7 months ago

The Chinchilla paper actually shows that, for a fixed compute budget, it is better to train a smaller model on more data than to train a larger model for fewer steps.
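
The trade-off in this comment can be made concrete with a rough sketch. It assumes two common rules of thumb that are not stated in the comment itself: training compute is approximately C = 6 * N * D FLOPs for N parameters and D tokens, and the compute-optimal ratio is about 20 tokens per parameter. The function name `compute_optimal` and the default ratio are illustrative choices, not anything taken from the paper's code.

```python
import math


def compute_optimal(flops, tokens_per_param=20.0):
    """Split a FLOP budget into (params, tokens).

    Assumes C ~= 6 * N * D and D ~= tokens_per_param * N.
    Both are approximations, not exact laws.
    """
    # Substituting D = k * N into C = 6 * N * D gives N = sqrt(C / (6 * k)).
    params = math.sqrt(flops / (6.0 * tokens_per_param))
    tokens = tokens_per_param * params
    return params, tokens


# Budget equal to a 70B-parameter model trained on 1.4T tokens.
budget = 6 * 70e9 * 1.4e12
n, d = compute_optimal(budget)
print(f"params ~ {n:.3g}, tokens ~ {d:.3g}")

# For comparison: a 280B-parameter model on 300B tokens costs a similar
# amount of compute under the same approximation, but sits far from the
# 20 tokens-per-parameter ratio (about 1 token per parameter).
larger_model_flops = 6 * 280e9 * 300e9
print(f"ratio of budgets ~ {larger_model_flops / budget:.2f}")
```

Under these assumptions, the same budget that trains a 280B model for few tokens is better spent on a model roughly four times smaller trained on several times more data, which is the point the comment makes.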

upvoted an article 7 months ago

Article

Mixture of Experts Explained

+4

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 1.15k