jang1563/BioRLHF
main
BioRLHF/src/biorlhf/training
24 kB
2 contributors
History: 4 commits
Latest commit: 2145d80 by jang1563, about 2 months ago
Phase 4: V1-aware calibration verifier, eval tools, cleanup
File         Scan  Size     Last commit                                                         Updated
__init__.py  Safe  1.05 kB  Add BioGRPO training pipeline with composable biological verifiers  about 2 months ago
dpo.py       Safe  6.6 kB   Initial commit: BioRLHF v0.1.0                                      4 months ago
grpo.py      Safe  10.7 kB  Phase 4: V1-aware calibration verifier, eval tools, cleanup         about 2 months ago
sft.py       Safe  5.7 kB   Initial commit: BioRLHF v0.1.0                                      4 months ago