LoRE / myolmoe

Commit History

Checkpoint at step 20
05660af

Charlie81 commited on

Checkpoint at step 20
9ac5c9f

Charlie81 commited on

addd num_small_experts config access sparsemoeblock
46a3660

Charlie81 commited on

remove num_small_expert overwrite
bc05da9

Charlie81 commited on

Checkpoint at step 20
38d03d9

Charlie81 commited on

remove strategies
d05c72b

Charlie81 commited on

Checkpoint at step 20
be2de79

Charlie81 commited on

Checkpoint at step 160
afff366

Charlie81 commited on

Checkpoint at step 20
a646b77

Charlie81 commited on

try patch hook
b1da2be

Charlie81 commited on

corrected change to 64
5dc5166

Charlie81 commited on

add num_small_experts to config
571877d

Charlie81 commited on

add max small expert to config
37979d0

Charlie81 commited on

Revert "expert usage stats"
c37387b

Charlie81 commited on

expert usage stats
a875a53

Charlie81 commited on

Revert "18k checkpoint"
927465d

Charlie81 commited on

attempts to fix more
077e7bc

Charlie81 commited on

fix config
44006e7

Charlie81 commited on

dataclass config
353cce5

Charlie81 commited on

all OlMoeConfig to MyOlmoeConfig
392f3ed

Charlie81 commited on

olmoe to myolmoe in modelingcode
09b1ee2

Charlie81 commited on

add expert logging
224e1c5

Charlie81 commited on

Checkpoint at step 12000
cc70445

Charlie81 commited on

progress 20k
8fc755e

Charlie81 commited on

Checkpoint at step 2000
7120a3e

Charlie81 commited on

delete checkpoints all epochs to 3
d4a6b93

Charlie81 commited on

change to constant strategy
2b4d259

Charlie81 commited on

increment
623532d

Charlie81 commited on

dynamic mask for logits
b4e9375

Charlie81 commited on

load balacing fix
5061126

Charlie81 commited on

attempt new distribution experts
834ad70

Charlie81 commited on

changes
f18f893

Charlie81 commited on

aaux to torch tensor
1ec67ec

Charlie81 commited on

huge fixes
c306fa9

Charlie81 commited on

fix small experts loss calculation for gradient
44c43d7

Charlie81 commited on

reorder small experts
7050cb6

Charlie81 commited on

modify batch and fix tensor issue
2a594f6

Charlie81 commited on

ratio to divisor set to 16
981f53b

Charlie81 commited on

fix import
78b85e8

Charlie81 commited on

overhaul
c4785c5

Charlie81 commited on

reset modeling file
36acce3

Charlie81 commited on

config shenanigans
72dfb47

Charlie81 commited on

changes without config
b41d150

Charlie81 commited on

import name
65e7011

Charlie81 commited on

initial stuff
48f0e60

Charlie81 commited on

remove old routing stuff
ac9f1eb

Charlie81 commited on

init no routing
9d66be3

Charlie81 commited on