How to use from
vLLM
Install from pip and serve model
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "TomGrc/FusionNet"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "TomGrc/FusionNet",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
Use Docker
docker model run hf.co/TomGrc/FusionNet
Quick Links

FusionNet

Fine-tuned model on English language using Fusion method.

Model description

The FusionNet is a model to experiment with the "Fusion" method, which could significantly increase the performance of the original model. The FusionNet has 10.7B parameters, and this model is fine-tuned. Enjoy!

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 74.38
AI2 Reasoning Challenge (25-Shot) 71.25
HellaSwag (10-Shot) 88.42
MMLU (5-Shot) 66.36
TruthfulQA (0-shot) 71.95
Winogrande (5-shot) 83.27
GSM8k (5-shot) 65.05
Downloads last month
210
Safetensors
Model size
11B params
Tensor type
F16
·
Inference Providers NEW
Input a message to start chatting with TomGrc/FusionNet.

Model tree for TomGrc/FusionNet

Quantizations
3 models

Evaluation results