Instructions to use TobyYang7/MFFM-8B-finma-v8 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use TobyYang7/MFFM-8B-finma-v8 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="TobyYang7/MFFM-8B-finma-v8")# Load model directly from transformers import AutoModelForCausalLM model = AutoModelForCausalLM.from_pretrained("TobyYang7/MFFM-8B-finma-v8", dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use TobyYang7/MFFM-8B-finma-v8 with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "TobyYang7/MFFM-8B-finma-v8" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "TobyYang7/MFFM-8B-finma-v8", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/TobyYang7/MFFM-8B-finma-v8
- SGLang
How to use TobyYang7/MFFM-8B-finma-v8 with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "TobyYang7/MFFM-8B-finma-v8" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "TobyYang7/MFFM-8B-finma-v8", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "TobyYang7/MFFM-8B-finma-v8" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "TobyYang7/MFFM-8B-finma-v8", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use TobyYang7/MFFM-8B-finma-v8 with Docker Model Runner:
docker model run hf.co/TobyYang7/MFFM-8B-finma-v8
YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
Installation
Install Package
conda create -n llava python=3.10 -y conda activate llava pip install --upgrade pip # enable PEP 660 support pip install -e .Install additional packages for training cases
pip install -e ".[train]" pip install flash-attn --no-build-isolation
Interface
from llava_llama3.serve.cli import chat_llava
from llava_llama3.model.builder import load_pretrained_model
import argparse
import os
import glob
import pandas as pd
from tqdm import tqdm
import json
root_path = os.path.dirname(os.path.abspath(__file__))
print(f'\033[92m{root_path}\033[0m')
parser = argparse.ArgumentParser()
parser.add_argument("--model-path", type=str, default="TobyYang7/MFFM-8B-finma-v8")
parser.add_argument("--device", type=str, default="cuda")
parser.add_argument("--conv-mode", type=str, default="llama_3")
parser.add_argument("--temperature", type=float, default=0)
parser.add_argument("--max-new-tokens", type=int, default=512)
parser.add_argument("--load-8bit", action="store_true")
parser.add_argument("--load-4bit", action="store_true")
args = parser.parse_args()
# load model
tokenizer, llava_model, image_processor, context_len = load_pretrained_model(args.model_path, None, 'llava_llama3', args.load_8bit, args.load_4bit, device=args.device)
print('\033[92mRunning chat\033[0m')
output = chat_llava(args=args,
image_file=root_path+'/data/llava_logo.png',
text='What is this?',
tokenizer=tokenizer,
model=llava_model,
image_processor=image_processor, # todo: input model name or path
context_len=context_len)
print('\033[94m', output, '\033[0m')
If you encounter the error No module named 'llava_llama3', set the PYTHONPATH as follows:
export PYTHONPATH=$PYTHONPATH:{$your_dir}/llava_llama3
- Downloads last month
- -