---
license: apache-2.0
datasets:
- liyucheng/zhihu_rlhf_3k
language:
- zh
base_model:
- lucky2me/Dorami-Instruct
---
|

# Dorami-Chat

Dorami-Chat is a Direct Preference Optimization (DPO) model trained on top of the Supervised Fine-Tuning (SFT) model [lucky2me/Dorami-Instruct](https://huggingface.co/lucky2me/Dorami-Instruct).

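For reference, DPO fine-tunes the policy directly on preference pairs, with the SFT model serving as the frozen reference. A standard form of the objective (as in Rafailov et al., 2023; the notation below is generic, not taken from this repository's training code) is:

$$
\mathcal{L}_{\mathrm{DPO}}(\pi_\theta;\pi_{\mathrm{ref}}) = -\,\mathbb{E}_{(x,\,y_w,\,y_l)\sim\mathcal{D}}\left[\log \sigma\!\left(\beta \log \frac{\pi_\theta(y_w \mid x)}{\pi_{\mathrm{ref}}(y_w \mid x)} - \beta \log \frac{\pi_\theta(y_l \mid x)}{\pi_{\mathrm{ref}}(y_l \mid x)}\right)\right]
$$

where $\pi_\theta$ is the model being trained, $\pi_{\mathrm{ref}}$ is the SFT reference model, $(y_w, y_l)$ are the preferred and rejected responses for prompt $x$, $\sigma$ is the sigmoid, and $\beta$ controls how far the policy may drift from the reference.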
|
## Model description

### Training data

- [liyucheng/zhihu_rlhf_3k](https://huggingface.co/datasets/liyucheng/zhihu_rlhf_3k)

### Training code

- [dorami](https://github.com/6zeus/dorami.git)

## How to use
|
| |
|
### 1. Download the model from the Hugging Face Hub

```shell
git lfs install
git clone https://huggingface.co/lucky2me/Dorami-Chat
```
|
| |
|
### 2. Use the downloaded model

```python
from transformers import AutoTokenizer, AutoModelForCausalLM, GenerationConfig

model_path = "The path of the model downloaded above"

# Load the tokenizer and model weights from the local directory.
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(model_path)

prompt = "fill in any prompt you like."
inputs = tokenizer(prompt, return_tensors="pt")

# Sample up to 64 new tokens, restricting sampling to the top-2 candidates at each step.
generation_config = GenerationConfig(
    max_new_tokens=64,
    do_sample=True,
    top_k=2,
    eos_token_id=model.config.eos_token_id,
)
outputs = model.generate(**inputs, generation_config=generation_config)
decoded_text = tokenizer.batch_decode(outputs, skip_special_tokens=True)
print(decoded_text)
```