Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Charlie81
/
LoRE
like
0
TensorBoard
Safetensors
License:
mit
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
main
LoRE
/
scripts
Commit History
batch 3
85d731b
Charlie81
commited on
Jul 11, 2025
batch size 2
d55ddf7
Charlie81
commited on
Jul 11, 2025
batch 4
0d64997
Charlie81
commited on
Jul 11, 2025
comma
d2653ee
Charlie81
commited on
Jul 11, 2025
16batch
7d3ca95
Charlie81
commited on
Jul 11, 2025
train agaaa
45d6e50
Charlie81
commited on
Jul 11, 2025
train aga
580eff8
Charlie81
commited on
Jul 11, 2025
fix train
1f3825f
Charlie81
commited on
Jul 11, 2025
debugging missing grad
325d2d0
Charlie81
commited on
Jul 10, 2025
update training script
f9596a0
Charlie81
commited on
Jul 10, 2025
reorder small experts
7050cb6
Charlie81
commited on
Jul 7, 2025
unfreeze only gate and experts
356573e
Charlie81
commited on
Jul 7, 2025
1 batch size
6b0e19d
Charlie81
commited on
Jul 7, 2025
batch size back to 2
14b2125
Charlie81
commited on
Jul 6, 2025
batch size 3 lol
f078b84
Charlie81
commited on
Jul 6, 2025
batch size to 4
5f8bb1e
Charlie81
commited on
Jul 6, 2025
batch size 8
9fc70e4
Charlie81
commited on
Jul 6, 2025
modify batch and fix tensor issue
2a594f6
Charlie81
commited on
Jul 6, 2025
tokenize fn
5c05368
Charlie81
commited on
Jul 6, 2025
fix
52bdc02
Charlie81
commited on
Jul 6, 2025
add
d6ffab2
Charlie81
commited on
Jul 6, 2025
tokenize function
1e0b293
Charlie81
commited on
Jul 6, 2025
debugs
6d21fca
Charlie81
commited on
Jul 6, 2025
key value
7ab89f2
Charlie81
commited on
Jul 6, 2025
restore
7abbd62
Charlie81
commited on
Jul 6, 2025
claude attempt 2 dataset
3db4e2e
Charlie81
commited on
Jul 6, 2025
cache diagnostics
dd2e997
Charlie81
commited on
Jul 6, 2025
sanity
8e88ea1
Charlie81
commited on
Jul 6, 2025
claudeattempt dataset
5b01886
Charlie81
commited on
Jul 6, 2025
alternative dataset load
d7f70e5
Charlie81
commited on
Jul 6, 2025
dataset keep in memory
1182794
Charlie81
commited on
Jul 6, 2025
ignore mismatches
e039ec3
Charlie81
commited on
Jul 6, 2025
fix import
78b85e8
Charlie81
commited on
Jul 6, 2025
overhaul
c4785c5
Charlie81
commited on
Jul 6, 2025
reset modeling file
36acce3
Charlie81
commited on
Jul 6, 2025
attempt fix and more prints
a82f934
Charlie81
commited on
Jul 5, 2025
init expanded model after config change
438a56a
Charlie81
commited on
Jul 5, 2025
config shenanigans
72dfb47
Charlie81
commited on
Jul 4, 2025
debug to train script
e2fe765
Charlie81
commited on
Jul 4, 2025
changes without config
b41d150
Charlie81
commited on
Jul 4, 2025
update training script
582cd6b
Charlie81
commited on
Jul 4, 2025
restore import name
e3a54b7
Charlie81
commited on
Jul 4, 2025
handle base and product architecture differences
b63994d
Charlie81
commited on
Jul 4, 2025
import path
50a970c
Charlie81
commited on
Jul 4, 2025
import file name
34f196f
Charlie81
commited on
Jul 4, 2025
initial stuff
48f0e60
Charlie81
commited on
Jul 4, 2025
init no routing
9d66be3
Charlie81
commited on
Jul 3, 2025
Previous
1
2
Next