How to use from
vLLM
Install from pip and serve model
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "TomGrc/FusionNet_linear"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "TomGrc/FusionNet_linear",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
Use Docker
docker model run hf.co/TomGrc/FusionNet_linear
Quick Links

FusionNet_linear

Fine-tuned model on English language using linear Fusion method.

Model description

This is an experiment with the linear Fusion method of FusionNet. This model has 10.7B parameters, and this model is fine-tuned. Enjoy!

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 74.43
AI2 Reasoning Challenge (25-Shot) 71.25
HellaSwag (10-Shot) 88.44
MMLU (5-Shot) 66.35
TruthfulQA (0-shot) 71.94
Winogrande (5-shot) 83.27
GSM8k (5-shot) 65.35
Downloads last month
186
Safetensors
Model size
11B params
Tensor type
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for TomGrc/FusionNet_linear

Quantizations
3 models

Evaluation results