Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

divyajot5005
/
Instruction-Tuning-and-DPO-Models

Safetensors
Model card Files Files and versions
xet
Community
Instruction-Tuning-and-DPO-Models
96.2 GB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 2 commits
divyajot5005's picture
divyajot5005
Upload folder using huggingface_hub
65c7950 verified about 1 month ago
  • llama3p1_DPO_merged
    Upload folder using huggingface_hub about 1 month ago
  • llama3p1_SFT_merged
    Upload folder using huggingface_hub about 1 month ago
  • ministral3_DPO_merged
    Upload folder using huggingface_hub about 1 month ago
  • ministral3_SFT_merged
    Upload folder using huggingface_hub about 1 month ago
  • qwen2p5_DPO_merged
    Upload folder using huggingface_hub about 1 month ago
  • qwen2p5_SFT_merged
    Upload folder using huggingface_hub about 1 month ago
  • .gitattributes
    176 Bytes
    Upload folder using huggingface_hub about 1 month ago
  • Qwen2.5_SFT.py
    10.1 kB
    Upload folder using huggingface_hub about 1 month ago
  • README.md
    24 Bytes
    initial commit about 1 month ago
  • dpo.py
    22.9 kB
    Upload folder using huggingface_hub about 1 month ago
  • llama3_SFT.py
    10 kB
    Upload folder using huggingface_hub about 1 month ago
  • merge.py
    6.04 kB
    Upload folder using huggingface_hub about 1 month ago
  • qwen_dpo.py
    22.8 kB
    Upload folder using huggingface_hub about 1 month ago
  • requirements.txt
    251 Bytes
    Upload folder using huggingface_hub about 1 month ago