Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
👋
Open to Work
Rijusmit Biswas
Phantomcloak19
1
1
Follow
Bulannel28's profile picture
1 follower
·
6 following
https://rijusmit.vercel.app/
Phantom_Cloak16
riju-talk
rijusmit-biswas
AI & ML interests
Data Science, Machine Learning, Deep Learning
Recent Activity
updated
a model
about 19 hours ago
Phantomcloak19/qwen3-4b-dpo
updated
a model
about 19 hours ago
Phantomcloak19/qwen2.5-3b-dpo
updated
a model
about 19 hours ago
Phantomcloak19/sequntial-sft-dpo-grpo
View all activity
Organizations
None yet
Phantomcloak19
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
4 models
about 19 hours ago
Phantomcloak19/qwen3-4b-dpo
Text Generation
•
4B
•
Updated
about 7 hours ago
•
12
Phantomcloak19/qwen2.5-3b-dpo
Text Generation
•
3B
•
Updated
about 7 hours ago
•
15
Phantomcloak19/sequntial-sft-dpo-grpo
Updated
about 7 hours ago
Phantomcloak19/gemma2-2b-dpo
Text Generation
•
3B
•
Updated
about 7 hours ago
•
24
published
a model
2 days ago
Phantomcloak19/qwen3-4b-dpo
Text Generation
•
4B
•
Updated
about 7 hours ago
•
12
updated
a model
2 days ago
Phantomcloak19/qwen2.5-3b-dpo-grpo
Text Generation
•
3B
•
Updated
2 days ago
•
15
published
a model
2 days ago
Phantomcloak19/qwen2.5-3b-dpo-grpo
Text Generation
•
3B
•
Updated
2 days ago
•
15
updated
a model
2 days ago
Phantomcloak19/TV-CGRPO-reward_soup_gemma-2-2b-it-QLoRA-TRL
Updated
2 days ago
published
a model
2 days ago
Phantomcloak19/TV-CGRPO-reward_soup_gemma-2-2b-it-QLoRA-TRL
Updated
2 days ago
updated
a model
2 days ago
Phantomcloak19/TV-CGRPO-reward_soup_Qwen2-5-3B-Instruct-QLoRA-TRL
Updated
2 days ago
published
3 models
2 days ago
Phantomcloak19/qwen2.5-3b-dpo
Text Generation
•
3B
•
Updated
about 7 hours ago
•
15
Phantomcloak19/TV-CGRPO-reward_soup_Qwen2-5-3B-Instruct-QLoRA-TRL
Updated
2 days ago
Phantomcloak19/sequntial-sft-dpo-grpo
Updated
about 7 hours ago
published
a model
3 days ago
Phantomcloak19/gemma2-2b-dpo
Text Generation
•
3B
•
Updated
about 7 hours ago
•
24
updated
a model
3 days ago
Phantomcloak19/gemma2-2b-dpo-grpo
Text Generation
•
3B
•
Updated
3 days ago
•
15
published
a model
3 days ago
Phantomcloak19/gemma2-2b-dpo-grpo
Text Generation
•
3B
•
Updated
3 days ago
•
15
updated
a model
6 days ago
Phantomcloak19/TV-CGRPO-gemma-2-2b-it_two_obj_scalar-QLoRA-TRL
3B
•
Updated
6 days ago
•
17
published
a model
6 days ago
Phantomcloak19/TV-CGRPO-gemma-2-2b-it_two_obj_scalar-QLoRA-TRL
3B
•
Updated
6 days ago
•
17
updated
a model
6 days ago
Phantomcloak19/TV-CGRPO-gemma-2-2b-it_uniform_scalar-QLoRA-TRL
3B
•
Updated
6 days ago
•
16
published
a model
6 days ago
Phantomcloak19/TV-CGRPO-gemma-2-2b-it_uniform_scalar-QLoRA-TRL
3B
•
Updated
6 days ago
•
16
Load more