RLHF-And-Friends
/

TLDR-Mistral-7B-SmallSFT-PPO

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

TLDR-Mistral-7B-SmallSFT-PPO

14.5 GB

Ctrl+K

Ctrl+K

2 contributors

History: 3 commits

evgurov's picture

Update config.json

f55f0d0 verified about 1 year ago

.gitattributes

1.52 kB
initial commit about 1 year ago
README.md

1.83 kB
Upload folder using huggingface_hub about 1 year ago
config.json

615 Bytes
Update config.json about 1 year ago
generation_config.json

111 Bytes
Upload folder using huggingface_hub about 1 year ago
model-00001-of-00003.safetensors

4.94 GB
xet

Upload folder using huggingface_hub about 1 year ago
model-00002-of-00003.safetensors

5 GB
xet

Upload folder using huggingface_hub about 1 year ago
model-00003-of-00003.safetensors

4.54 GB
xet

Upload folder using huggingface_hub about 1 year ago
model.safetensors.index.json

23.9 kB
Upload folder using huggingface_hub about 1 year ago
original_repo_id.json

62 Bytes
Upload folder using huggingface_hub about 1 year ago
special_tokens_map.json

414 Bytes
Upload folder using huggingface_hub about 1 year ago
tokenizer.json

3.51 MB
Upload folder using huggingface_hub about 1 year ago
tokenizer.model

493 kB
xet

Upload folder using huggingface_hub about 1 year ago
tokenizer_config.json

990 Bytes
Upload folder using huggingface_hub about 1 year ago