Add files using upload-large-folder tool

Files changed (12) hide show

seed_1337/Qwen/Qwen2.5-7B-Instruct/adapters/agent_adapter/adapter_model.safetensors ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:f1573a26473da0b03f32a3946d7ac45ce29cbe59e1b3d4e2e572c5f2573d704e
+size 323014168

seed_1337/Qwen/Qwen2.5-7B-Instruct/adapters/critic_adapter/adapter_model.safetensors ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:5dc708f1883276873a2518019f91d9dc8c29baa9b76a5a455b0af5a48bd09c59
+size 323014168

seed_1337/agent_trainer/critic_optimizer_state.pt ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:f1574fdb90735a922b09c67d07f7abdbd51181f00dc7bed878cb80adb5f50c1d
+size 2631

seed_1337/agent_trainer/policy_optimizer_state.pt ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:f75fdd61a53f3b3fb359e475c20f60da316160b96006d043b8568cb63a6fe9ed
+size 646269121

seed_1337/agent_trainer/trainer_annealing_state.pkl ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:79cfce2a5040c0939846d147a00d13a3f05afa3b73ce05b85fd5b5b13bf4ddcf
+size 104

seed_1337/random_state.pkl ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:f21bd57021f204a5066ac92edfab3fc80a5e96eca703d15438be7fe029107a0c
+size 12250

src_code_for_reproducibility/chat_utils/__pycache__/apply_template.cpython-312.pyc ADDED Viewed

Binary file (3.64 kB). View file

src_code_for_reproducibility/chat_utils/__pycache__/chat_turn.cpython-312.pyc ADDED Viewed

Binary file (1.32 kB). View file

src_code_for_reproducibility/chat_utils/__pycache__/template_specific.cpython-312.pyc ADDED Viewed

Binary file (3.61 kB). View file

src_code_for_reproducibility/docs/source/src.training.train_main.rst ADDED Viewed

+src.training.train\_main module
+===============================
+.. automodule:: src.training.train_main
+   :members:
+   :undoc-members:
+   :show-inheritance:

src_code_for_reproducibility/docs/source/src.utils.export_ppo_training_set.rst ADDED Viewed

+src.utils.export\_ppo\_training\_set module
+===========================================
+.. automodule:: src.utils.export_ppo_training_set
+   :members:
+   :undoc-members:
+   :show-inheritance:

src_code_for_reproducibility/docs/source/src.utils.model_to_cpu.rst ADDED Viewed

+src.utils.model\_to\_cpu module
+===============================
+.. automodule:: src.utils.model_to_cpu
+   :members:
+   :undoc-members:
+   :show-inheritance: