Instructions to use zhengchenphd/Mistral-Plus-7B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use zhengchenphd/Mistral-Plus-7B with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="zhengchenphd/Mistral-Plus-7B")# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("zhengchenphd/Mistral-Plus-7B") model = AutoModelForCausalLM.from_pretrained("zhengchenphd/Mistral-Plus-7B") - Inference
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use zhengchenphd/Mistral-Plus-7B with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "zhengchenphd/Mistral-Plus-7B" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "zhengchenphd/Mistral-Plus-7B", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/zhengchenphd/Mistral-Plus-7B
- SGLang
How to use zhengchenphd/Mistral-Plus-7B with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "zhengchenphd/Mistral-Plus-7B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "zhengchenphd/Mistral-Plus-7B", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "zhengchenphd/Mistral-Plus-7B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "zhengchenphd/Mistral-Plus-7B", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use zhengchenphd/Mistral-Plus-7B with Docker Model Runner:
docker model run hf.co/zhengchenphd/Mistral-Plus-7B
Mistral-Plus Model Card
Model Details
Mistral-Plus is a chat assistant trained by Reinforcement Learning from Human Feedback (RLHF) using the Mistral-7B base model as the backbone.
- Mistral-Plus adopts an innovative approach by completely bypassing Supervised Fine-Tuning (SFT) and directly implementing Harmless Reinforcement Learning from Human Feedback (RLHF).
- Mistral-Plus uses the mistralai/Mistral-7B-v0.1 model as its backbone.
- License: Mistral-Plus is licensed under the same license as the mistralai/Mistral-7B-v0.1 model.
Model Sources
Paper (Mistral-Plus): https://arxiv.org/abs/2403.02513
Uses
Mistral-Plus is primarily utilized for research in the areas of large language models and chatbots. It is intended chiefly for use by researchers and hobbyists specializing in natural language processing, machine learning, and artificial intelligence.
Mistral-Plus not only preserves the Mistral base model's general capabilities, but also significantly enhances its conversational abilities and notably reduces the generation of toxic outputs as human preference.
Goal: Empower researchers worldwide!
To the best of knowledge, this is the first academic endeavor to bypass supervised fine-tuning and directly apply reinforcement learning from human feedback. More importantly, Mistral-Plus is publicly available through HuggingFace for promoting collaborative research and innovation. This initiative to open-source Mistral-Plus seeks to empower researchers worldwide, enabling them to delve deeper into and build upon Mistral-Plus work, with a particular focus on conversational tasks, such as customer service, intelligent assistant, etc.
Model Performance on 11 general Language Tasks
Mistral-Plus on General language Understanding and Reasoning
Enhancing Conversational Safety in the Mistral-Plus
Bad word generation probablity on Mistral-Instruct and Mistral-Plus. The x-axis represents different intermittent layers, y-axis shows token probability.
Case Study
Multiple Round Dialogue
Case Study from Different Prompts
- Downloads last month
- 659










docker model run hf.co/zhengchenphd/Mistral-Plus-7B