prithivMLmods
/

Calme-Ties-78B

Text Generation

text-generation-inference

Model card Files Files and versions

prithivMLmods commited on Jan 29, 2025

Commit

efedbe2

·

verified ·

1 Parent(s): e573446

Update README.md

Files changed (1) hide show

README.md +6 -0

README.md CHANGED Viewed

@@ -11,6 +11,12 @@ tags:
 Calme-Ties-78B is a 78-billion-parameter model merged using the TIES methodology, based on the Qwen2 architecture. It integrates two sub-base models: *calme-3.2-instruct-78B* by MaziyarPanahi and *CalmeRys-78B-Orpo-v0.1* by dfurman, which serves as the base model. The merging process assigns equal weight and density to both models, with additional parameters enabling normalization and int8 masking. The model operates using the *bfloat16* data type.
 # **Merged Models**
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

 Calme-Ties-78B is a 78-billion-parameter model merged using the TIES methodology, based on the Qwen2 architecture. It integrates two sub-base models: *calme-3.2-instruct-78B* by MaziyarPanahi and *CalmeRys-78B-Orpo-v0.1* by dfurman, which serves as the base model. The merging process assigns equal weight and density to both models, with additional parameters enabling normalization and int8 masking. The model operates using the *bfloat16* data type.
+| Model    | Model Name                     | Model Link |
+|----------|--------------------------------|------------|
+| Base Model | CalmeRys-78B-Orpo-v0.1       | [CalmeRys-78B-Orpo-v0.1](https://huggingface.co/dfurman/CalmeRys-78B-Orpo-v0.1) |
+| Model 1  | calme-3.2-instruct-78B        | [calme-3.2-instruct-78B](https://huggingface.co/MaziyarPanahi/calme-3.2-instruct-78b) |
+| Model 2  | CalmeRys-78B-Orpo-v0.1        | [CalmeRys-78B-Orpo-v0.1](https://huggingface.co/dfurman/CalmeRys-78B-Orpo-v0.1) |
 # **Merged Models**
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).