Charlie81
/
LoRE
TensorBoard
Safetensors
License:
mit
main
LoRE
/
myolmoe
Commit History
Checkpoint at step 20
05660af
Charlie81
committed on
Sep 2, 2025
Checkpoint at step 20
9ac5c9f
Charlie81
committed on
Sep 1, 2025
addd num_small_experts config access sparsemoeblock
46a3660
Charlie81
committed on
Sep 1, 2025
remove num_small_expert overwrite
bc05da9
Charlie81
committed on
Sep 1, 2025
Checkpoint at step 20
38d03d9
Charlie81
committed on
Aug 28, 2025
remove strategies
d05c72b
Charlie81
committed on
Aug 28, 2025
Checkpoint at step 20
be2de79
Charlie81
committed on
Aug 21, 2025
Checkpoint at step 160
afff366
Charlie81
committed on
Aug 20, 2025
Checkpoint at step 20
a646b77
Charlie81
committed on
Aug 12, 2025
try patch hook
b1da2be
Charlie81
committed on
Aug 2, 2025
corrected change to 64
5dc5166
Charlie81
committed on
Aug 2, 2025
add num_small_experts to config
571877d
Charlie81
committed on
Aug 2, 2025
add max small expert to config
37979d0
Charlie81
committed on
Aug 1, 2025
Revert "expert usage stats"
c37387b
Charlie81
committed on
Jul 28, 2025
expert usage stats
a875a53
Charlie81
committed on
Jul 21, 2025
Revert "18k checkpoint"
927465d
Charlie81
committed on
Jul 20, 2025
attempts to fix more
077e7bc
Charlie81
committed on
Jul 18, 2025
fix config
44006e7
Charlie81
committed on
Jul 18, 2025
dataclass config
353cce5
Charlie81
committed on
Jul 18, 2025
all OlMoeConfig to MyOlmoeConfig
392f3ed
Charlie81
committed on
Jul 18, 2025
olmoe to myolmoe in modelingcode
09b1ee2
Charlie81
committed on
Jul 18, 2025
add expert logging
224e1c5
Charlie81
committed on
Jul 18, 2025
Checkpoint at step 12000
cc70445
Charlie81
committed on
Jul 15, 2025
progress 20k
8fc755e
Charlie81
committed on
Jul 14, 2025
Checkpoint at step 2000
7120a3e
Charlie81
committed on
Jul 12, 2025
delete checkpoints all epochs to 3
d4a6b93
Charlie81
committed on
Jul 12, 2025
change to constant strategy
2b4d259
Charlie81
committed on
Jul 12, 2025
increment
623532d
Charlie81
committed on
Jul 11, 2025
dynamic mask for logits
b4e9375
Charlie81
committed on
Jul 11, 2025
load balacing fix
5061126
Charlie81
committed on
Jul 11, 2025
attempt new distribution experts
834ad70
Charlie81
committed on
Jul 11, 2025
changes
f18f893
Charlie81
committed on
Jul 10, 2025
name
50cd1ec
Charlie81
committed on
Jul 10, 2025
aaux to torch tensor
1ec67ec
Charlie81
committed on
Jul 10, 2025
huge fixes
c306fa9
Charlie81
committed on
Jul 10, 2025
fix small experts loss calculation for gradient
44c43d7
Charlie81
committed on
Jul 8, 2025
reorder small experts
7050cb6
Charlie81
committed on
Jul 7, 2025
modify batch and fix tensor issue
2a594f6
Charlie81
committed on
Jul 6, 2025
ratio to divisor set to 16
981f53b
Charlie81
committed on
Jul 6, 2025
fix import
78b85e8
Charlie81
committed on
Jul 6, 2025
overhaul
c4785c5
Charlie81
committed on
Jul 6, 2025
reset modeling file
36acce3
Charlie81
committed on
Jul 6, 2025
config shenanigans
72dfb47
Charlie81
committed on
Jul 4, 2025
changes without config
b41d150
Charlie81
committed on
Jul 4, 2025
import name
65e7011
Charlie81
committed on
Jul 4, 2025
initial stuff
48f0e60
Charlie81
committed on
Jul 4, 2025
remove old routing stuff
ac9f1eb
Charlie81
committed on
Jul 4, 2025
init no routing
9d66be3
Charlie81
committed on
Jul 3, 2025