Upload folder using huggingface_hub

Browse files

Files changed (7) hide show

.gitattributes +1 -0
outputs/Qwen3-0.6B/2025-12-23_17-27-50/1-teacher_activations.pth +3 -0
outputs/Qwen3-0.6B/2025-12-23_17-27-50/1-teacher_activations_uncomplete.pth +3 -0
outputs/Qwen3-0.6B/2025-12-23_17-27-50/2-aligned_activations.pth +3 -0
outputs/Qwen3-0.6B/2025-12-23_17-27-50/2-aligned_activations_uncomplete.pth +3 -0
outputs/Qwen3-0.6B/2025-12-23_17-27-50/neurohike.26011192.out +137 -0
outputs/Qwen3-0.6B/2025-12-23_17-27-50/neurohike.26011957.out +3 -0

.gitattributes CHANGED Viewed

@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+outputs/Qwen3-0.6B/2025-12-23_17-27-50/neurohike.26011957.out filter=lfs diff=lfs merge=lfs -text

outputs/Qwen3-0.6B/2025-12-23_17-27-50/1-teacher_activations.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1ab2dbafcae3402bc19be6123556978a574c99eee1cde944b51124942912114a
+size 32510289319

outputs/Qwen3-0.6B/2025-12-23_17-27-50/1-teacher_activations_uncomplete.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:54fd6c7bc597e7073b6d19d7a76c586936c0a5e939819c8037301e43681dff98
+size 32510304105

outputs/Qwen3-0.6B/2025-12-23_17-27-50/2-aligned_activations.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b80f9efb5b453d056ac164d35fb9576e37bf30807eb3b6518e9135b097b0e57a
+size 45511894547

outputs/Qwen3-0.6B/2025-12-23_17-27-50/2-aligned_activations_uncomplete.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:38604c8147f3cc1f45b0c5d3ceac8e579d073e88f6400657e54b90a93c24fd87
+size 45511914421

outputs/Qwen3-0.6B/2025-12-23_17-27-50/neurohike.26011192.out ADDED Viewed

	@@ -0,0 +1,137 @@

+The following modules were not unloaded:
+  (Use "module --force purge" to unload all):
+  1) 2023.01   2) StdEnv
+layers_to_collect = [6, 20, 27, 34]
+Total layers in teacher model: 34
+Traceback (most recent call last):
+  File "/home1/p313544/Documents/NeuroHike/main.py", line 109, in <module>
+    fire.Fire(main)
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/fire/core.py", line 135, in Fire
+    component_trace = _Fire(component, args, parsed_flag_args, context, name)
+                      ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/fire/core.py", line 468, in _Fire
+    component, remaining_args = _CallAndUpdateTrace(
+                                ^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/fire/core.py", line 684, in _CallAndUpdateTrace
+    component = fn(*varargs, **kwargs)
+                ^^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/main.py", line 61, in main
+    teacher_activations = hiker.collect_teacher_activations(
+                          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/neurohike/wrapper.py", line 525, in collect_teacher_activations
+    gen_tokens, activations = self._get_teaching_activations(
+                              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/neurohike/wrapper.py", line 356, in _get_teaching_activations
+    teacher_output = self.teacher_model(
+                     ^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/torch/nn/modules/module.py", line 1775, in _wrapped_call_impl
+    return self._call_impl(*args, **kwargs)
+           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/torch/nn/modules/module.py", line 1786, in _call_impl
+    return forward_call(*args, **kwargs)
+           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/transformers/models/gemma3/modeling_gemma3.py", line 1100, in forward
+    outputs = self.model(
+              ^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/torch/nn/modules/module.py", line 1775, in _wrapped_call_impl
+    return self._call_impl(*args, **kwargs)
+           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/torch/nn/modules/module.py", line 1786, in _call_impl
+    return forward_call(*args, **kwargs)
+           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/transformers/utils/generic.py", line 918, in wrapper
+    output = func(self, *args, **kwargs)
+             ^^^^^^^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/transformers/models/gemma3/modeling_gemma3.py", line 957, in forward
+    outputs = self.language_model(
+              ^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/torch/nn/modules/module.py", line 1775, in _wrapped_call_impl
+    return self._call_impl(*args, **kwargs)
+           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/torch/nn/modules/module.py", line 1786, in _call_impl
+    return forward_call(*args, **kwargs)
+           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/transformers/utils/generic.py", line 1072, in wrapper
+    outputs = func(self, *args, **kwargs)
+              ^^^^^^^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/transformers/models/gemma3/modeling_gemma3.py", line 570, in forward
+    layer_outputs = decoder_layer(
+                    ^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/transformers/modeling_layers.py", line 94, in __call__
+    return super().__call__(*args, **kwargs)
+           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/torch/nn/modules/module.py", line 1775, in _wrapped_call_impl
+    return self._call_impl(*args, **kwargs)
+           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/torch/nn/modules/module.py", line 1786, in _call_impl
+    return forward_call(*args, **kwargs)
+           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/transformers/utils/generic.py", line 1031, in wrapped_forward
+    output = orig_forward(*args, **kwargs)
+             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func
+    return func(*args, **kwargs)
+           ^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/transformers/models/gemma3/modeling_gemma3.py", line 382, in forward
+    hidden_states, self_attn_weights = self.self_attn(
+                                       ^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/torch/nn/modules/module.py", line 1775, in _wrapped_call_impl
+    return self._call_impl(*args, **kwargs)
+           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/torch/nn/modules/module.py", line 1786, in _call_impl
+    return forward_call(*args, **kwargs)
+           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func
+    return func(*args, **kwargs)
+           ^^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/transformers/models/gemma3/modeling_gemma3.py", line 327, in forward
+    attn_output, attn_weights = attention_interface(
+                                ^^^^^^^^^^^^^^^^^^^^
+  File "/home1/p313544/Documents/NeuroHike/.venv/lib/python3.12/site-packages/transformers/integrations/sdpa_attention.py", line 96, in sdpa_attention_forward
+    attn_output = torch.nn.functional.scaled_dot_product_attention(
+                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 8.12 GiB. GPU 0 has a total capacity of 39.49 GiB of which 7.54 GiB is free. Including non-PyTorch memory, this process has 31.95 GiB memory in use. Of the allocated memory 29.43 GiB is allocated by PyTorch, and 2.03 GiB is reserved by PyTorch but unallocated. If reserved but unallocated memory is large try setting PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True to avoid fragmentation.  See documentation for Memory Management  (https://pytorch.org/docs/stable/notes/cuda.html#environment-variables)
+###############################################################################
+Hábrók Cluster
+Job 26011192 for user p313544
+Finished at: Tue Dec 23 17:29:05 CET 2025
+Job details:
+============
+Job ID                         : 26011192
+Name                           : neurohike
+User                           : p313544
+Partition                      : gpumedium
+Nodes                          : a100gpu3
+Number of Nodes                : 1
+Cores                          : 8
+Number of Tasks                : 1
+State                          : FAILED
+Submit                         : 2025-12-23T13:01:05
+Start                          : 2025-12-23T17:27:27
+End                            : 2025-12-23T17:29:01
+Reserved walltime              : 10:50:00
+Used walltime                  : 00:01:34
+Used CPU time                  : 00:00:60 (Efficiency:  7.97%)
+% User (Computation)           : 71.44%
+% System (I/O)                 : 28.56%
+Total memory reserved          : 120G
+Maximum memory used            : 9.15G
+Requested GPUs                 : a100=1
+Allocated GPUs                 : a100=1
+Max GPU utilization            : 61%
+Max GPU memory used            : 22.71G
+Acknowledgements:
+=================
+Please see this page for information about acknowledging Hábrók in your publications:
+https://wiki.hpc.rug.nl/habrok/introduction/scientific_output
+################################################################################

outputs/Qwen3-0.6B/2025-12-23_17-27-50/neurohike.26011957.out ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b25ec5c73bb1995db8ea6a7178ebb2b3770f0195a451d9f13d5087b836a4710e
+size 20987586