Together

company

Verified

https://together.ai

togethercompute

togethercomputer

Inference Provider

3,210,856 monthly requests

AI & ML interests

Foundation Models, Decentralized Computing, Open Source AI.

Recent Activity

JamesSand authored a paper 2 days ago

No Hidden Prompts Needed! You Can Game AI Peer Review with Presentation-Only Revisions

JamesSand submitted a paper 9 days ago

No Hidden Prompts Needed! You Can Game AI Peer Review with Presentation-Only Revisions

KaiserWhoLearns authored a paper about 2 months ago

What Is Seen Cannot Be Unseen: The Disruptive Effect of Knowledge Conflict on Large Language Models

View all activity

Papers

Taylor-Calibrate: Principled Initialization for Hybrid Linear Attention Distillation

OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization

View all Papers

Articles

Fine-tune Any LLM from the Hugging Face Hub with Together AI

submitted a paper to Daily Papers 5 days ago

Taylor-Calibrate: Principled Initialization for Hybrid Linear Attention Distillation

Paper • 2606.16429 • Published 10 days ago • 5

submitted a paper to Daily Papers about 1 month ago

OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization

Paper • 2605.17757 • Published May 18 • 65

KaiserWhoLearns

authored a paper about 2 months ago

What Is Seen Cannot Be Unseen: The Disruptive Effect of Knowledge Conflict on Large Language Models

Paper • 2506.06485 • Published Jun 6, 2025 • 5

KaiserWhoLearns

authored a paper 2 months ago

What do Language Models Learn and When? The Implicit Curriculum Hypothesis

Paper • 2604.08510 • Published Apr 9 • 4

KaiserWhoLearns

submitted a paper to Daily Papers 2 months ago

What do Language Models Learn and When? The Implicit Curriculum Hypothesis

Paper • 2604.08510 • Published Apr 9 • 4

KaiserWhoLearns

authored a paper 4 months ago

Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs

Paper • 2603.09095 • Published Mar 10 • 29

KaiserWhoLearns

submitted a paper to Daily Papers 4 months ago

Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs

Paper • 2603.09095 • Published Mar 10 • 29

KaiserWhoLearns

authored a paper 5 months ago

FIRE-Bench: Evaluating Agents on the Rediscovery of Scientific Insights

Paper • 2602.02905 • Published Feb 2 • 5

authored a paper 10 months ago

Cartridges: Lightweight and general-purpose long context representations via self-study

Paper • 2506.06266 • Published Jun 6, 2025 • 8

authored a paper about 1 year ago

MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering

Paper • 2505.07782 • Published May 12, 2025 • 19

authored 2 papers over 1 year ago

Language Models Prefer What They Know: Relative Confidence Estimation via Confidence Preferences

Paper • 2502.01126 • Published Feb 3, 2025 • 4

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31, 2025 • 126

authored a paper over 1 year ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 59

authored a paper over 1 year ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 59

authored 5 papers over 1 year ago

DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models

Paper • 2306.11698 • Published Jun 20, 2023 • 13

Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT

Paper • 2402.07440 • Published Feb 12, 2024 • 1

Simple linear attention language models balance the recall-throughput tradeoff

Paper • 2402.18668 • Published Feb 28, 2024 • 20

Just read twice: closing the recall gap for recurrent language models

Paper • 2407.05483 • Published Jul 7, 2024

LoLCATs: On Low-Rank Linearizing of Large Language Models

Paper • 2410.10254 • Published Oct 14, 2024 • 1

authored a paper over 1 year ago

Instruction Following without Instruction Tuning

Paper • 2409.14254 • Published Sep 21, 2024 • 29