Hugging Face
Bolian Li
lblaoke
1 follower · 4 following
https://lblaoke.github.io/
bolian-li-554001297
AI & ML interests
None yet
Recent Activity
authored a paper about 21 hours ago: More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment
authored a paper about 21 hours ago: DRIFT: Learning from Abundant User Dissatisfaction in Real-World Preference Learning
authored a paper about 21 hours ago: Learning Self-Correction in Vision-Language Models via Rollout Augmentation
lblaoke's models (44)
lblaoke/mistral-v0.1-7b-ppo-self • 7B • Updated Feb 4, 2025 • 1
lblaoke/mistral-v0.1-7b-ppo-human • 7B • Updated Feb 4, 2025 • 1
lblaoke/llama2-7b-ppo-self-human • 7B • Updated Feb 3, 2025 • 2
lblaoke/llama2-7b-ppo-self • 7B • Updated Feb 3, 2025 • 2
lblaoke/llama2-7b-ppo-human • 7B • Updated Feb 3, 2025 • 1
lblaoke/mistral-v0.3-7b-rm-human • Text Classification • 7B • Updated Jan 14, 2025 • 1
lblaoke/mistral-v0.3-7b-rm-self-human • Text Classification • 7B • Updated Jan 14, 2025 • 3
lblaoke/mistral-v0.3-7b-rm-self • Text Classification • 7B • Updated Jan 14, 2025 • 1
lblaoke/mistral-v0.1-7b-rm-self-human • Text Classification • 7B • Updated Jan 14, 2025
lblaoke/mistral-v0.1-7b-rm-self • Text Classification • 7B • Updated Jan 14, 2025 • 1
lblaoke/llama2-7b-rm-self • Text Classification • 7B • Updated Jan 14, 2025 • 3
lblaoke/mistral-v0.1-7b-rm-human • Text Classification • 7B • Updated Jan 14, 2025 • 1
lblaoke/llama2-7b-rm-human • Text Classification • 7B • Updated Jan 14, 2025 • 1
lblaoke/llama2-7b-rm-self-human • Text Classification • 7B • Updated Jan 13, 2025 • 2