Image-Text-to-Text
Transformers
Safetensors
English
qwen2_5_vl
Merge
mergekit
slerp
agent
gui-automation
vision
multimodal
far-7b
ui-tars
text-generation-inference
Instructions to use yasserrmd/Fara-TARS-7B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use yasserrmd/Fara-TARS-7B with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-text-to-text", model="yasserrmd/Fara-TARS-7B")# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("yasserrmd/Fara-TARS-7B") model = AutoModelForImageTextToText.from_pretrained("yasserrmd/Fara-TARS-7B") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use yasserrmd/Fara-TARS-7B with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "yasserrmd/Fara-TARS-7B" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "yasserrmd/Fara-TARS-7B", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/yasserrmd/Fara-TARS-7B
- SGLang
How to use yasserrmd/Fara-TARS-7B with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "yasserrmd/Fara-TARS-7B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "yasserrmd/Fara-TARS-7B", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "yasserrmd/Fara-TARS-7B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "yasserrmd/Fara-TARS-7B", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use yasserrmd/Fara-TARS-7B with Docker Model Runner:
docker model run hf.co/yasserrmd/Fara-TARS-7B
Update README.md
Browse files
README.md
CHANGED
|
@@ -5,7 +5,7 @@ license: apache-2.0
|
|
| 5 |
tags:
|
| 6 |
- merge
|
| 7 |
- mergekit
|
| 8 |
-
-
|
| 9 |
- agent
|
| 10 |
- gui-automation
|
| 11 |
- vision
|
|
@@ -181,19 +181,20 @@ This model was merged using **Mergekit**.
|
|
| 181 |
models:
|
| 182 |
- model: microsoft/Fara-7B
|
| 183 |
- model: ByteDance-Seed/UI-TARS-1.5-7B
|
| 184 |
-
|
| 185 |
-
density: 0.53
|
| 186 |
-
weight: 0.5
|
| 187 |
-
merge_method: dare_ties
|
| 188 |
base_model: microsoft/Fara-7B
|
| 189 |
-
parameters:
|
| 190 |
-
normalize: true
|
| 191 |
-
int8_mask: true
|
| 192 |
dtype: bfloat16
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 193 |
```
|
| 194 |
-
*(Note: While `
|
| 195 |
|
| 196 |
-
##
|
| 197 |
|
| 198 |
1. **Strict Prompting:** The model expects the specific System Prompt defined in the usage class. Without it, it may hallucinate tool names.
|
| 199 |
2. **Repetition:** In extremely long lists (100+ items), the model may repeat. The recommended `repetition_penalty=1.15` fixes this for 99% of cases.
|
|
|
|
| 5 |
tags:
|
| 6 |
- merge
|
| 7 |
- mergekit
|
| 8 |
+
- slerp
|
| 9 |
- agent
|
| 10 |
- gui-automation
|
| 11 |
- vision
|
|
|
|
| 181 |
models:
|
| 182 |
- model: microsoft/Fara-7B
|
| 183 |
- model: ByteDance-Seed/UI-TARS-1.5-7B
|
| 184 |
+
merge_method: slerp
|
|
|
|
|
|
|
|
|
|
| 185 |
base_model: microsoft/Fara-7B
|
|
|
|
|
|
|
|
|
|
| 186 |
dtype: bfloat16
|
| 187 |
+
parameters:
|
| 188 |
+
t:
|
| 189 |
+
# 5-point gradient:
|
| 190 |
+
# 0.1 (Start): Mostly Fara -> Ensures input understanding and English grammar.
|
| 191 |
+
# 0.3 -> 0.5 (Middle): Blends TARS capability for reasoning and logic.
|
| 192 |
+
# 0.1 (End): Mostly Fara -> Ensures the output stops correctly and doesn't loop.
|
| 193 |
+
- value: [0.1, 0.3, 0.5, 0.3, 0.1]
|
| 194 |
```
|
| 195 |
+
*(Note: While `slerp` was used, specific inference parameters (temp=0.4, rep_penalty=1.15) are required to stabilize the output, as documented in the Usage section).*
|
| 196 |
|
| 197 |
+
## Limitations
|
| 198 |
|
| 199 |
1. **Strict Prompting:** The model expects the specific System Prompt defined in the usage class. Without it, it may hallucinate tool names.
|
| 200 |
2. **Repetition:** In extremely long lists (100+ items), the model may repeat. The recommended `repetition_penalty=1.15` fixes this for 99% of cases.
|