Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
bethgelab
's Collections
Concept-Aware Batch Sampling
Delta Belief RL
Delta Belief RL
updated
Feb 13
Collection of the models for our paper "Intrinsic Credit Assignment for Long Horizon Interaction".
Upvote
1
iaa01/CIA-1.7B
2B
•
Updated
Feb 13
•
15
•
1
iaa01/CIA-4B
4B
•
Updated
Feb 13
•
37
•
3
Klingspor/Qwen3-1.7B-SFT
Text Generation
•
2B
•
Updated
Feb 13
•
1.16k
•
1
Klingspor/Qwen3-4B-SFT
Text Generation
•
4B
•
Updated
Feb 13
•
5.87k
Klingspor/StarPO-1.7B
Text Generation
•
2B
•
Updated
Feb 13
•
16
Klingspor/StarPO-4B
Text Generation
•
4B
•
Updated
Feb 13
•
18
•
2
Upvote
1
Share collection
View history
Collection guide
Browse collections