Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
🏗️
Building on HF
1
3
Chris Lo
chrisluo5311
Follow
0 followers
·
4 following
https://chris-luo.me/
chrisluo5311
chris-lo-cs
AI & ML interests
RL, PPO, GRPO, SFT, NLP, RAG, vLLM, FAISS, Data Pipelines, Prompt Engineering, Fine-Tuning, Model Evaluation, Benchmarking
Recent Activity
updated
a model
7 days ago
chrisluo5311/dqn-SpaceInvadersNoFrameskip-v4-2
published
a model
7 days ago
chrisluo5311/dqn-SpaceInvadersNoFrameskip-v4-2
updated
a model
8 days ago
chrisluo5311/dqn-SpaceInvadersNoFrameskip-v4
View all activity
Organizations
None yet
chrisluo5311
's models
14
Sort: Recently updated
chrisluo5311/dqn-SpaceInvadersNoFrameskip-v4-2
Reinforcement Learning
•
Updated
7 days ago
•
31
chrisluo5311/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
8 days ago
•
48
chrisluo5311/taxi-v4-q-learning-gamma99-ep1000000
Reinforcement Learning
•
Updated
10 days ago
chrisluo5311/taxi-v4-q-learning-ep1000000
Updated
10 days ago
chrisluo5311/taxi-v4-q-learning
Reinforcement Learning
•
Updated
10 days ago
chrisluo5311/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
10 days ago
chrisluo5311/ppo-Huggy
Reinforcement Learning
•
Updated
18 days ago
•
170
chrisluo5311/ppo-lunarlander-v3
Reinforcement Learning
•
Updated
20 days ago
•
41
chrisluo5311/Qwen2.5-1.5B-Instruct-SFT-MetaMath-Merged-ROI
Updated
Jan 29
chrisluo5311/Qwen2.5-7B-Instruct-SFT-GRPO-Merged-ROI
8B
•
Updated
Jan 29
•
4
chrisluo5311/Qwen2.5-1.5B-Instruct-SFT-ServiceNow-ROI
Updated
Jan 29
•
1
chrisluo5311/Qwen2.5-3B-Instruct-SFT-MetaMath-Merged-ROI
3B
•
Updated
Jan 29
•
1
chrisluo5311/Qwen2.5-1.5B-Instruct-GRPO-Calx-ROI
Updated
Jan 27
•
2
chrisluo5311/Qwen2.5-7B-Instruct-SFT-MetaMath-Merged-ROI
8B
•
Updated
Jan 27
•
18