Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop 🔄
10.4
TFLOPS
1
7
68
John Graham Reynolds
PRO
MarioBarbeque
Follow
awacke1's profile picture
1 follower
·
15 following
johngrahamreynolds
AI & ML interests
Quantum Computing, Mathematics+Code Generation, Deep Reasoning, Deep RL UT Austin Grad Student
Recent Activity
authored
a paper
4 days ago
Mitigating Catastrophic Forgetting in Mathematical Reasoning Finetuning through Mixed Training
upvoted
a
paper
5 days ago
Mitigating Catastrophic Forgetting in Mathematical Reasoning Finetuning through Mixed Training
updated
a model
8 days ago
MarioBarbeque/flan-t5-base-math-only-catastrophic
View all activity
Organizations
MarioBarbeque
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
6 models
23 days ago
MarioBarbeque/flan-t5-base-math-only-catastrophic
0.2B
•
Updated
8 days ago
•
34
•
1
MarioBarbeque/flan-t5-base-nli-only-catastrophic
0.2B
•
Updated
8 days ago
•
27
•
1
MarioBarbeque/flan-t5-base-mixed-1-1-catastrophic
0.2B
•
Updated
8 days ago
•
39
•
1
MarioBarbeque/flan-t5-base-mixed-3-1-catastrophic
0.2B
•
Updated
8 days ago
•
35
•
1
MarioBarbeque/flan-t5-base-mixed-7-1-catastrophic
0.2B
•
Updated
8 days ago
•
37
•
1
MarioBarbeque/flan-t5-base-mixed-15-1-catastrophic
0.2B
•
Updated
8 days ago
•
50
•
1
liked
a model
24 days ago
google/flan-t5-base
0.2B
•
Updated
Jul 17, 2023
•
725k
•
1.03k
liked
a model
27 days ago
google/flan-t5-small
77M
•
Updated
Oct 10, 2023
•
433k
•
452
liked
a Space
3 months ago
Sleeping
1
MistralAI
🚀
1
A retrieval-augmented chat model built with Mistral🇫🇷🦜
liked
6 models
5 months ago
MarioBarbeque/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
Jul 23
•
13
•
1
MarioBarbeque/q-Taxi-v3-2.0
Reinforcement Learning
•
Updated
Jul 17
•
1
MarioBarbeque/q-Taxi-v3
Reinforcement Learning
•
Updated
Jul 17
•
1
MarioBarbeque/q-FrozenLake-v1-8x8-Slippery
Reinforcement Learning
•
Updated
Jul 17
•
1
MarioBarbeque/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Jul 17
•
1
MarioBarbeque/ppo-HuggyFetch
Reinforcement Learning
•
Updated
Jul 14
•
9
•
1
liked
a Space
5 months ago
Running
Featured
407
Huggy
🐶
407
Play with a dog that learned to catch sticks
liked
a model
6 months ago
MarioBarbeque/ppo-LunarLander-v2-1.0
Reinforcement Learning
•
Updated
Jul 10
•
3
•
1
liked
a dataset
10 months ago
MarioBarbeque/StringTheorySolutions
Updated
Nov 5
•
21
•
1
liked
2 models
11 months ago
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
Text Generation
•
33B
•
Updated
Feb 24
•
2.67M
•
•
1.48k
deepseek-ai/DeepSeek-R1
Text Generation
•
685B
•
Updated
Mar 27
•
609k
•
•
12.9k
Load more