Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
1
Mohamed Ihjas
mlmihjaz
Follow
0 followers
ยท
10 following
AI & ML interests
AI ML
Recent Activity
reacted
to
salma-remyx
's
post
with ๐
9 days ago
Just trained a 2B coding model to rank candidate AI/ML research ideas against the implicit preferences in a code repository's merge history. The training data comes from a Gaussian Process fit on the accumulated dispositions in VQASynth, where each PR against a deployed project yields a pairwise comparison between the feature branch preferred and the baseline at main. The GP scores candidate papers to synthesize preference pairs, and DPO with LoRA bakes the ranking pipeline into the model's weights. After 1 epoch the model reaches 87.4% reward accuracy on the held-out eval split against 92.3% on training, consistent with learning the task without overfitting. Now, I'm scaling the pipeline to thousands of repos for a generalization test. Dataset: https://huggingface.co/datasets/remyxai/mhpd-dpo-v0 Model: https://huggingface.co/remyxai/mhpd-dpo-qwen3.5-2b-vqasynth Substack: https://remyxai.substack.com/p/the-ai-pm
liked
a Space
9 days ago
mlmihjaz/LTM
updated
a Space
9 days ago
mlmihjaz/LTM
View all activity
Organizations
None yet
mlmihjaz
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a Space
9 days ago
Running
1
ImgForge
๐ผ
1
Create and edit custom graphics with templates and shapes