Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
nfsrulesFR
/
mega-grpo
like
0
Text Generation
TensorBoard
Safetensors
English
molecular-optimization
chemistry
llama-3
grpo
rlhf
conversational
License:
apache-2.0
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
main
mega-grpo
353 MB
Ctrl+K
Ctrl+K
1 contributor
History:
10 commits
nfsrulesFR
Upload README.md with huggingface_hub
860fac4
verified
6 months ago
runs
Upload folder using huggingface_hub
6 months ago
.gitattributes
Safe
1.57 kB
Upload folder using huggingface_hub
6 months ago
README.md
3.41 kB
Upload README.md with huggingface_hub
6 months ago
adapter_config.json
Safe
916 Bytes
Upload folder using huggingface_hub
6 months ago
adapter_model.safetensors
336 MB
xet
Upload folder using huggingface_hub
6 months ago
special_tokens_map.json
Safe
459 Bytes
Upload folder using huggingface_hub
6 months ago
tokenizer.json
Safe
17.2 MB
xet
Upload folder using huggingface_hub
6 months ago
tokenizer_config.json
Safe
51.1 kB
Upload folder using huggingface_hub
6 months ago
training_args.bin
5.88 kB
xet
Upload folder using huggingface_hub
6 months ago