PaperAudit SFT/RL Model Collection

This repo aggregates all SFT/RL fine-tuned models for the PaperAudit project.

Model List

Model Name Hugging Face Link Description
Qwen3-8B-sft-rl mayiwen/PaperAudit_Qwen3_8B_sft_rl Qwen3 8B model fine-tuned with SFT + RL for PaperAudit
Qwen3-14B-sft-rl mayiwen/PaperAudit_Qwen3_14B_sft_rl Qwen3 14B model fine-tuned with SFT + RL for PaperAudit
Llama3.2-3B-sft-rl mayiwen/PaperAudit_Llama3.2_3B_sft_rl Llama3.2 3B model fine-tuned with SFT + RL for PaperAudit

Usage

Refer to each model's repo for detailed usage instructions (training code, inference examples, etc.).

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support