How to use from
SGLang
Install from pip and serve model
# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "TomGrc/FusionNet_linear" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "TomGrc/FusionNet_linear",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
Use Docker images
docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "TomGrc/FusionNet_linear" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "TomGrc/FusionNet_linear",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
Quick Links

FusionNet_linear

Fine-tuned model on English language using linear Fusion method.

Model description

This is an experiment with the linear Fusion method of FusionNet. This model has 10.7B parameters, and this model is fine-tuned. Enjoy!

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 74.43
AI2 Reasoning Challenge (25-Shot) 71.25
HellaSwag (10-Shot) 88.44
MMLU (5-Shot) 66.35
TruthfulQA (0-shot) 71.94
Winogrande (5-shot) 83.27
GSM8k (5-shot) 65.35
Downloads last month
224
Safetensors
Model size
11B params
Tensor type
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for TomGrc/FusionNet_linear

Quantizations
3 models

Evaluation results