mdeputy commited on
Commit
48945e4
·
verified ·
1 Parent(s): 86c986b

mdeputy/output

Browse files
Files changed (4) hide show
  1. README.md +66 -65
  2. config.json +31 -31
  3. model.safetensors +1 -1
  4. training_args.bin +2 -2
README.md CHANGED
@@ -1,65 +1,66 @@
1
- ---
2
- tags:
3
- - generated_from_trainer
4
- metrics:
5
- - f1
6
- - precision
7
- - recall
8
- - accuracy
9
- model-index:
10
- - name: output
11
- results: []
12
- ---
13
-
14
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
- should probably proofread and complete it, then remove this comment. -->
16
-
17
- # output
18
-
19
- This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
20
- It achieves the following results on the evaluation set:
21
- - Loss: 1.3839
22
- - F1: 0.0
23
- - Precision: 0.0
24
- - Recall: 0.0
25
- - Accuracy: 0.0
26
-
27
- ## Model description
28
-
29
- More information needed
30
-
31
- ## Intended uses & limitations
32
-
33
- More information needed
34
-
35
- ## Training and evaluation data
36
-
37
- More information needed
38
-
39
- ## Training procedure
40
-
41
- ### Training hyperparameters
42
-
43
- The following hyperparameters were used during training:
44
- - learning_rate: 5e-05
45
- - train_batch_size: 8
46
- - eval_batch_size: 8
47
- - seed: 42
48
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
- - lr_scheduler_type: cosine
50
- - lr_scheduler_warmup_steps: 100
51
- - num_epochs: 1
52
-
53
- ### Training results
54
-
55
- | Training Loss | Epoch | Step | Validation Loss | F1 | Precision | Recall | Accuracy |
56
- |:-------------:|:-----:|:----:|:---------------:|:---:|:---------:|:------:|:--------:|
57
- | No log | 1.0 | 12 | 1.3839 | 0.0 | 0.0 | 0.0 | 0.0 |
58
-
59
-
60
- ### Framework versions
61
-
62
- - Transformers 4.41.2
63
- - Pytorch 2.3.1+cu121
64
- - Datasets 2.20.0
65
- - Tokenizers 0.19.1
 
 
1
+ ---
2
+ tags:
3
+ - generated_from_trainer
4
+ metrics:
5
+ - f1
6
+ - precision
7
+ - recall
8
+ - accuracy
9
+ model-index:
10
+ - name: output
11
+ results: []
12
+ ---
13
+
14
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
+ should probably proofread and complete it, then remove this comment. -->
16
+
17
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/mdeputy-Gleghorn%20Lab/huggingface/runs/c2hwu750)
18
+ # output
19
+
20
+ This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
21
+ It achieves the following results on the evaluation set:
22
+ - Loss: 1.3546
23
+ - F1: 0.0
24
+ - Precision: 0.0
25
+ - Recall: 0.0
26
+ - Accuracy: 0.0
27
+
28
+ ## Model description
29
+
30
+ More information needed
31
+
32
+ ## Intended uses & limitations
33
+
34
+ More information needed
35
+
36
+ ## Training and evaluation data
37
+
38
+ More information needed
39
+
40
+ ## Training procedure
41
+
42
+ ### Training hyperparameters
43
+
44
+ The following hyperparameters were used during training:
45
+ - learning_rate: 5e-05
46
+ - train_batch_size: 8
47
+ - eval_batch_size: 8
48
+ - seed: 42
49
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
50
+ - lr_scheduler_type: cosine
51
+ - lr_scheduler_warmup_steps: 100
52
+ - num_epochs: 1
53
+
54
+ ### Training results
55
+
56
+ | Training Loss | Epoch | Step | Validation Loss | F1 | Precision | Recall | Accuracy |
57
+ |:-------------:|:-----:|:----:|:---------------:|:---:|:---------:|:------:|:--------:|
58
+ | No log | 1.0 | 12 | 1.3546 | 0.0 | 0.0 | 0.0 | 0.0 |
59
+
60
+
61
+ ### Framework versions
62
+
63
+ - Transformers 4.42.4
64
+ - Pytorch 2.4.0+cu121
65
+ - Datasets 2.21.0
66
+ - Tokenizers 0.19.1
config.json CHANGED
@@ -1,31 +1,31 @@
1
- {
2
- "architectures": [
3
- "AttAlexNetForClassification"
4
- ],
5
- "attention_type": "sdpa",
6
- "classifier_dim": 4096,
7
- "classifier_dropout": 0.1,
8
- "dual_obj": false,
9
- "hidden_act": "silu",
10
- "hidden_size": 64,
11
- "img_size": 1024,
12
- "in_channels": 3,
13
- "intermediate_size": 1024,
14
- "is_causal": false,
15
- "max_position_embeddings": 4096,
16
- "model_type": "att_alexnet",
17
- "moe": false,
18
- "n_filts": 4,
19
- "num_attention_heads": 8,
20
- "num_classes": 3,
21
- "num_experts": 8,
22
- "num_hidden_layers": 6,
23
- "num_layers": 2,
24
- "output_router_logits": true,
25
- "patch_size": 16,
26
- "problem_type": "single_label_classification",
27
- "router_aux_loss_coef": 0.01,
28
- "topk": 2,
29
- "torch_dtype": "float32",
30
- "transformers_version": "4.41.2"
31
- }
 
1
+ {
2
+ "architectures": [
3
+ "AttAlexNetForClassification"
4
+ ],
5
+ "attention_type": "sdpa",
6
+ "classifier_dim": 4096,
7
+ "classifier_dropout": 0.1,
8
+ "dual_obj": false,
9
+ "hidden_act": "silu",
10
+ "hidden_size": 64,
11
+ "img_size": 1024,
12
+ "in_channels": 3,
13
+ "intermediate_size": 1024,
14
+ "is_causal": false,
15
+ "max_position_embeddings": 4096,
16
+ "model_type": "att_alexnet",
17
+ "moe": false,
18
+ "n_filts": 4,
19
+ "num_attention_heads": 8,
20
+ "num_classes": 3,
21
+ "num_experts": 8,
22
+ "num_hidden_layers": 6,
23
+ "num_layers": 2,
24
+ "output_router_logits": true,
25
+ "patch_size": 16,
26
+ "problem_type": "single_label_classification",
27
+ "router_aux_loss_coef": 0.01,
28
+ "topk": 2,
29
+ "torch_dtype": "float32",
30
+ "transformers_version": "4.42.4"
31
+ }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c9899c7d40fc35e849b89b4ec543dc48d6092715079b93c3fd81a25a4edef789
3
  size 127944324
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:48fb8b6ccb294ad28094830a21ffc4ac5a3c2248faab9ca6ce3dcc074c981b25
3
  size 127944324
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b5a66d978e8cb98dc136203309cce0bf89e5e123b35245583c7c06dfe7c251d0
3
- size 5112
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:56a72c7d512f06b3f59e0d1b6c1079ce04ea733fe2671846e3da70581b341df4
3
+ size 5048