Text Generation
Transformers
PyTorch
Uzbek
English
mistral
text-generation-inference
summarization
translation
question-answering
conversational
Instructions to use behbudiy/Mistral-7B-Instruct-Uz with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use behbudiy/Mistral-7B-Instruct-Uz with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="behbudiy/Mistral-7B-Instruct-Uz") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("behbudiy/Mistral-7B-Instruct-Uz") model = AutoModelForCausalLM.from_pretrained("behbudiy/Mistral-7B-Instruct-Uz") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use behbudiy/Mistral-7B-Instruct-Uz with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "behbudiy/Mistral-7B-Instruct-Uz" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "behbudiy/Mistral-7B-Instruct-Uz", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/behbudiy/Mistral-7B-Instruct-Uz
- SGLang
How to use behbudiy/Mistral-7B-Instruct-Uz with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "behbudiy/Mistral-7B-Instruct-Uz" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "behbudiy/Mistral-7B-Instruct-Uz", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "behbudiy/Mistral-7B-Instruct-Uz" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "behbudiy/Mistral-7B-Instruct-Uz", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use behbudiy/Mistral-7B-Instruct-Uz with Docker Model Runner:
docker model run hf.co/behbudiy/Mistral-7B-Instruct-Uz
Commit History
Update README.md 8fcc501 verified
Update README.md e182078 verified
Update tokenizer_config.json 64971b2 verified
Update README.md 87a7e5b verified
Update README.md 3d5dd86 verified
Update README.md cfa72b8 verified
Update README.md 8318dde verified
Update README.md ebbb4aa verified
Update README.md 0746748 verified
Update README.md 1ba50b1 verified
Update README.md 7d63fa1 verified
Update README.md 2887241 verified
Update README.md 9c801a2 verified
Update README.md 3285301 verified
Update README.md 6b68494 verified
Update README.md ff46d87 verified
revert readme 4cffec8
me commited on