Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
🏗️
Building on HF
1
3
Chris Lo
chrisluo5311
Follow
0 followers
·
4 following
https://chris-luo.me/
chrisluo5311
chris-lo-cs
AI & ML interests
RL, PPO, GRPO, SFT, NLP, RAG, vLLM, FAISS, Data Pipelines, Prompt Engineering, Fine-Tuning, Model Evaluation, Benchmarking
Recent Activity
updated
a model
8 days ago
chrisluo5311/dqn-SpaceInvadersNoFrameskip-v4-2
published
a model
8 days ago
chrisluo5311/dqn-SpaceInvadersNoFrameskip-v4-2
updated
a model
9 days ago
chrisluo5311/dqn-SpaceInvadersNoFrameskip-v4
View all activity
Organizations
None yet
chrisluo5311
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a model
8 days ago
chrisluo5311/dqn-SpaceInvadersNoFrameskip-v4-2
Reinforcement Learning
•
Updated
8 days ago
•
31
published
a model
8 days ago
chrisluo5311/dqn-SpaceInvadersNoFrameskip-v4-2
Reinforcement Learning
•
Updated
8 days ago
•
31
updated
a model
9 days ago
chrisluo5311/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
9 days ago
•
48
published
a model
9 days ago
chrisluo5311/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
9 days ago
•
48
updated
a model
11 days ago
chrisluo5311/taxi-v4-q-learning-gamma99-ep1000000
Reinforcement Learning
•
Updated
11 days ago
published
2 models
11 days ago
chrisluo5311/taxi-v4-q-learning-gamma99-ep1000000
Reinforcement Learning
•
Updated
11 days ago
chrisluo5311/taxi-v4-q-learning-ep1000000
Updated
11 days ago
updated
a model
11 days ago
chrisluo5311/taxi-v4-q-learning
Reinforcement Learning
•
Updated
11 days ago
published
a model
11 days ago
chrisluo5311/taxi-v4-q-learning
Reinforcement Learning
•
Updated
11 days ago
updated
a model
11 days ago
chrisluo5311/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
11 days ago
published
a model
11 days ago
chrisluo5311/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
11 days ago
updated
a model
19 days ago
chrisluo5311/ppo-Huggy
Reinforcement Learning
•
Updated
19 days ago
•
171
published
a model
19 days ago
chrisluo5311/ppo-Huggy
Reinforcement Learning
•
Updated
19 days ago
•
171
updated
a model
21 days ago
chrisluo5311/ppo-lunarlander-v3
Reinforcement Learning
•
Updated
21 days ago
•
41
published
a model
21 days ago
chrisluo5311/ppo-lunarlander-v3
Reinforcement Learning
•
Updated
21 days ago
•
41
updated
a collection
5 months ago
Reasoning Dataset
Collection
11 items
•
Updated
Feb 6
published
a model
5 months ago
chrisluo5311/Qwen2.5-1.5B-Instruct-SFT-MetaMath-Merged-ROI
Updated
Jan 29
updated
a model
5 months ago
chrisluo5311/Qwen2.5-7B-Instruct-SFT-GRPO-Merged-ROI
8B
•
Updated
Jan 29
•
2
Load more