Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

shahidul034
/
readctrl

Safetensors
Model card Files Files and versions
xet
Community
readctrl / code /RL_model /unsloth_rl
72.9 kB
  • 1 contributor
History: 1 commit
shahidul034's picture
shahidul034
Add files using upload-large-folder tool
c7a6fe6 verified 27 days ago
  • RL_code.py
    5.57 kB
    Add files using upload-large-folder tool 27 days ago
  • RL_training.ipynb
    16.8 kB
    Add files using upload-large-folder tool 27 days ago
  • claim_verifier.py
    7.47 kB
    Add files using upload-large-folder tool 27 days ago
  • finetune.py
    2.74 kB
    Add files using upload-large-folder tool 27 days ago
  • health_classifier.py
    1.78 kB
    Add files using upload-large-folder tool 27 days ago
  • highlighter.py
    4.01 kB
    Add files using upload-large-folder tool 27 days ago
  • inference.py
    6.39 kB
    Add files using upload-large-folder tool 27 days ago
  • prompt
    2.53 kB
    Add files using upload-large-folder tool 27 days ago
  • reward_mock.py
    4.47 kB
    Add files using upload-large-folder tool 27 days ago
  • test_reward_mock_unittest.py
    4.66 kB
    Add files using upload-large-folder tool 27 days ago
  • testing.py
    10 kB
    Add files using upload-large-folder tool 27 days ago
  • testing_v2.py
    6.54 kB
    Add files using upload-large-folder tool 27 days ago