yuansui
/

TinyLLama-v0-PPO-tuned

Reinforcement Learning

Model card Files Files and versions

TinyLLama-v0-PPO-tuned

3.29 MB

Ctrl+K

Ctrl+K

1 contributor

History: 2 commits

yuansui's picture

Push model using huggingface_hub.

62bbcda verified over 1 year ago

.gitattributes

1.52 kB
initial commit over 1 year ago
README.md

1.26 kB
Push model using huggingface_hub. over 1 year ago
adapter_config.json

720 Bytes
Push model using huggingface_hub. over 1 year ago
adapter_model.safetensors

391 kB
xet

Push model using huggingface_hub. over 1 year ago
config.json

1.26 kB
Push model using huggingface_hub. over 1 year ago
pytorch_model.bin
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "torch.BFloat16Storage",
- "collections.OrderedDict"
What is a pickle import?
378 kB
xet

Push model using huggingface_hub. over 1 year ago
special_tokens_map.json

434 Bytes
Push model using huggingface_hub. over 1 year ago
tokenizer.json

1.98 MB
Push model using huggingface_hub. over 1 year ago
tokenizer.model

534 kB
xet

Push model using huggingface_hub. over 1 year ago
tokenizer_config.json

890 Bytes
Push model using huggingface_hub. over 1 year ago