Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Kavya Malhotra
kavmal7
Follow
0 followers
·
1 following
https://kavmal7.github.io/Online-CV/
kavmal7
kavya-malhotra
AI & ML interests
None yet
Recent Activity
updated
a collection
about 2 months ago
COMP0087 DPO models and data
updated
a collection
about 2 months ago
COMP0087 DPO models and data
updated
a collection
about 2 months ago
COMP0087 DPO models and data
View all activity
Organizations
kavmal7
's models
19
Sort: Recently updated
kavmal7/lr_3e6_beta_0.1
Text Generation
•
Updated
Apr 17
kavmal7/lr_5e6_beta_0.2
Text Generation
•
Updated
Apr 17
kavmal7/lr_5e6_beta_0.1
Text Generation
•
Updated
Apr 17
kavmal7/lr_1e6_beta_0.1
Text Generation
•
Updated
Apr 17
•
1
kavmal7/mv-final-assignment
Updated
Jan 6
kavmal7/mv-final-assignment2
Updated
Jan 3
kavmal7/poca-SoccerTwos
Reinforcement Learning
•
Updated
Apr 18, 2025
kavmal7/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
Mar 20, 2025
kavmal7/pyramidsrnd-v1
Reinforcement Learning
•
Updated
Mar 10, 2025
kavmal7/ppo-PyramidsRND-v1
Updated
Mar 10, 2025
kavmal7/ppo-SnowballTarget-v1
Reinforcement Learning
•
Updated
Mar 10, 2025
•
5
kavmal7/Reinforce-pixelcopter01
Reinforcement Learning
•
Updated
Feb 10, 2025
kavmal7/Reinforce-pixelcopter-v1
Reinforcement Learning
•
Updated
Feb 6, 2025
kavmal7/Reinforce-CartPole-V1
Reinforcement Learning
•
Updated
Feb 5, 2025
kavmal7/unit3SI-v4
Reinforcement Learning
•
Updated
Nov 22, 2024
kavmal7/q-Taxi-v3
Reinforcement Learning
•
Updated
Nov 20, 2024
kavmal7/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Nov 20, 2024
kavmal7/ppo-Huggy
Reinforcement Learning
•
Updated
Nov 16, 2024
kavmal7/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Nov 15, 2024