DippyResearch
/

super-cot-v1-step-20

Model card Files Files and versions

super-cot-v1-step-20 / README.md

DippyResearch's picture

Upload folder using huggingface_hub

6b29e3d verified 3 months ago

|

history blame contribute delete

943 Bytes

	# DippyResearch/super-cot-v1-step-20

	This is a fine-tuned model for roleplay tasks, created by merging a LoRA adapter with the base model.

	## Base Model
	- Model: Cydonia-24B-v4.1
	- Path: /workspace/models/Cydonia-24B-v4.1

	## Training Details
	- Framework: VERL (GRPO)
	- Training Type: Roleplay fine-tuning
	- Checkpoint: actor
	- LoRA Adapter: DippyResearch/super-cot-v1-step-20-lora

	## Usage

	### With Transformers
	```python
	from transformers import AutoModelForCausalLM, AutoTokenizer

	model = AutoModelForCausalLM.from_pretrained("DippyResearch/super-cot-v1-step-20")
	tokenizer = AutoTokenizer.from_pretrained("DippyResearch/super-cot-v1-step-20")
	```

	### With vLLM
	```python
	from vllm import LLM

	llm = LLM(model="DippyResearch/super-cot-v1-step-20")
	```

	## Alternative: Use LoRA Adapter
	For more flexibility, you can use the LoRA adapter directly:
	- LoRA Adapter: DippyResearch/super-cot-v1-step-20-lora
	- Base Model: Cydonia-24B-v4.1