Model Card for FLOPS-Squared/Llama-Baseline-V3-Instruct-B

An extended trained baseline model without using KeystoneFuse data efficient pretraining.

Research supported with Cloud TPUs from Google's TPU Research Cloud (TRC)

Downloads last month
1
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support