arxiv:2411.16341
Rania hossam elbadry PRO
Raniahossam33
AI & ML interests
None yet
Recent Activity
updated a Space 12 days ago
Raniahossam33/k2v3-error-scorecard published a Space 12 days ago
Raniahossam33/k2v3-error-scorecard upvoted a paper about 1 month ago
CEPO: RLVR Self-Distillation using Contrastive Evidence Policy Optimization