John Graham Reynolds
PRO
MarioBarbeque
1 follower · 15 following
johngrahamreynolds
AI & ML interests
Quantum Computing, Mathematics+Code Generation, Deep Reasoning, Deep RL. UT Austin Grad Student.
Recent Activity
authored a paper 4 days ago: Mitigating Catastrophic Forgetting in Mathematical Reasoning Finetuning through Mixed Training
upvoted a paper 5 days ago: Mitigating Catastrophic Forgetting in Mathematical Reasoning Finetuning through Mixed Training
updated a model 9 days ago: MarioBarbeque/flan-t5-base-math-only-catastrophic
MarioBarbeque's models (20)
MarioBarbeque/flan-t5-base-math-only-catastrophic • 0.2B • Updated 9 days ago • 34 • 1
MarioBarbeque/flan-t5-base-nli-only-catastrophic • 0.2B • Updated 9 days ago • 27 • 1
MarioBarbeque/flan-t5-base-mixed-1-1-catastrophic • 0.2B • Updated 9 days ago • 39 • 1
MarioBarbeque/flan-t5-base-mixed-3-1-catastrophic • 0.2B • Updated 9 days ago • 35 • 1
MarioBarbeque/flan-t5-base-mixed-7-1-catastrophic • 0.2B • Updated 9 days ago • 37 • 1
MarioBarbeque/flan-t5-base-mixed-15-1-catastrophic • 0.2B • Updated 9 days ago • 50 • 1
MarioBarbeque/CyberSolve-LinAlg-1.2 • 0.8B • Updated 24 days ago • 21 • 1
MarioBarbeque/dqn-SpaceInvadersNoFrameskip-v4 • Reinforcement Learning • Updated Jul 23 • 13 • 1
MarioBarbeque/q-Taxi-v3-2.0 • Reinforcement Learning • Updated Jul 17 • 1
MarioBarbeque/q-Taxi-v3 • Reinforcement Learning • Updated Jul 17 • 1
MarioBarbeque/q-FrozenLake-v1-8x8-Slippery • Reinforcement Learning • Updated Jul 17 • 1
MarioBarbeque/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Jul 17 • 1
MarioBarbeque/ppo-HuggyFetch • Reinforcement Learning • Updated Jul 14 • 9 • 1
MarioBarbeque/ppo-LunarLander-v2-1.0 • Reinforcement Learning • Updated Jul 10 • 3 • 1
MarioBarbeque/CyberSolve-LinAlg-1.1 • 0.8B • Updated Jan 27 • 11 • 1
MarioBarbeque/CyberSolve-DeepMind-LinAlg-1D-downsample-v2 • 0.8B • Updated Jan 27 • 8 • 1
MarioBarbeque/DistilBERT-DeNiro • Fill-Mask • 67M • Updated Dec 2, 2024 • 18 • 1
MarioBarbeque/gpt2-code-search-net-tokenizer • Updated Nov 18, 2024
MarioBarbeque/RoBERTa-base-DReiFT • Text Classification • 0.1B • Updated Nov 7, 2024 • 35 • 1
MarioBarbeque/dummy • Updated Oct 24, 2024