Hugging Face
DAEHEEKIM
andreaKIM
4 followers · 4 following
daehuikim
AI & ML interests
LLM interactive chatbot
Organizations
None yet
andreaKIM's activity
commented
a paper
8 months ago
Agentic Reinforced Policy Optimization
Paper • 2507.19849 • Published Jul 26, 2025 • 158 • 9
New activity in
google/gemma-7b-it
about 2 years ago
Instruct Training Dataset languages
1
5
#12 opened about 2 years ago by
Deniaud
New activity in
upstage/SOLAR-10.7B-Instruct-v1.0
about 2 years ago
This model ranked 1st on the Open LLM Leaderboard; however, it performs worse in supervised fine-tuning.
2
#8 opened about 2 years ago by
andreaKIM
New activity in
berkeley-nest/Starling-LM-7B-alpha
over 2 years ago
What could be instruction fine tuning prompt for this model?
5
#22 opened over 2 years ago by
andreaKIM
New activity in
mistralai/Mistral-7B-v0.1
over 2 years ago
Why does adaptor_model.bin become much larger than for the Llama families?
#34 opened over 2 years ago by
andreaKIM
New activity in
hyunseoki/ko-en-llama2-13b
over 2 years ago
Problem occurred with long context
1
#3 opened over 2 years ago by
Se-Hun