```python
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("solidrust/Chimera-8B-AWQ")
model = AutoModelForCausalLM.from_pretrained("solidrust/Chimera-8B-AWQ")

messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
```
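Loading an AWQ checkpoint through plain transformers requires the autoawq package to be installed. Alternatively, the checkpoint can be loaded with the AutoAWQ API directly; a minimal sketch, assuming a recent AutoAWQ release (the fuse_layers flag is optional):

```python
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

# from_quantized loads the pre-quantized AWQ weights as-is,
# without re-running quantization.
model = AutoAWQForCausalLM.from_quantized(
    "solidrust/Chimera-8B-AWQ",
    fuse_layers=True,  # fuse attention/MLP layers for faster inference
)
tokenizer = AutoTokenizer.from_pretrained("solidrust/Chimera-8B-AWQ")
```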
DARE-TIES merge method.

A list of all models and the merging path is coming soon.
Merging the "thickest" model weights from Mistral models, using amazing training methods like Direct Preference Optimization (DPO) and reinforcement learning.
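For intuition, here is a minimal sketch of what a DARE-TIES merge does per weight tensor: each fine-tuned model's delta from the base is randomly sparsified and rescaled (DARE), then a TIES-style sign election keeps only the contributions that agree with the dominant update direction. This is an illustrative sketch, not mergekit's actual implementation; the function names and the density value are assumptions.

```python
import torch

def dare_sparsify(delta: torch.Tensor, density: float) -> torch.Tensor:
    # DARE: randomly keep a `density` fraction of the task delta and
    # rescale the survivors by 1/density to preserve the expected update.
    mask = torch.bernoulli(torch.full_like(delta, density))
    return delta * mask / density

def dare_ties_merge(base, finetuned, density=0.5):
    deltas = torch.stack([dare_sparsify(ft - base, density) for ft in finetuned])
    # TIES: elect a per-parameter sign from the summed deltas, then
    # average only the contributions whose sign agrees with it.
    elected = torch.sign(deltas.sum(dim=0))
    agree = torch.sign(deltas) == elected
    merged_delta = (deltas * agree).sum(dim=0) / agree.sum(dim=0).clamp(min=1)
    return base + merged_delta
```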
I have spent countless hours studying the latest research papers, attending conferences, and networking with experts in the field. I experimented with different algorithms and tactics, fine-tuned hyperparameters and optimizers, and optimized code until I achieved the best possible results.
Thank you, OpenChat 3.5, for showing me the way.
Here is my contribution.
Replace {system} with your system prompt and {prompt} with your prompt instruction; a filled-in example follows the template.
```
### System:
{system}
### User:
{prompt}
### Assistant:
```
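For example, a quick way to build the full prompt string in Python (the system and user strings below are placeholders, and the single-newline separators between sections are an assumption):

```python
system = "You are a helpful assistant."  # placeholder system prompt
prompt = "Who are you?"                  # placeholder instruction

text = (
    "### System:\n"
    f"{system}\n"
    "### User:\n"
    f"{prompt}\n"
    "### Assistant:\n"
)
```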
Base model: mlabonne/ChimeraLlama-3-8B
```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="solidrust/Chimera-8B-AWQ")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)
```
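Since this is an AWQ-quantized checkpoint, it can also be served with an AWQ-aware runtime. A minimal sketch using vLLM (assuming a vLLM build with AWQ kernel support):

```python
from vllm import LLM, SamplingParams

# quantization="awq" selects vLLM's AWQ kernels for this checkpoint.
llm = LLM(model="solidrust/Chimera-8B-AWQ", quantization="awq")
params = SamplingParams(max_tokens=40)

outputs = llm.generate(["Who are you?"], params)
print(outputs[0].outputs[0].text)
```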