pimm export (config.json)
Browse files- README.md +8 -34
- config.json +64 -0
README.md
CHANGED
|
@@ -1,27 +1,16 @@
|
|
| 1 |
---
|
| 2 |
-
license: mit
|
| 3 |
-
datasets:
|
| 4 |
-
- DeepLearnPhysics/PILArNet-M
|
| 5 |
library_name: pimm
|
| 6 |
tags:
|
| 7 |
-
- particle
|
| 8 |
-
-
|
| 9 |
-
-
|
| 10 |
-
- lartpc
|
| 11 |
-
- panda
|
| 12 |
-
- point-transformer-v3
|
| 13 |
---
|
| 14 |
|
| 15 |
# Panda — PointTransformerV3 encoder (pretrained)
|
| 16 |
|
| 17 |
-
|
| 18 |
-
point modeling on LArTPC (PILArNet) data. Intended as a backbone / warm-start for
|
| 19 |
-
downstream segmentation and detection tasks (it produces multi-scale point
|
| 20 |
-
features; it has no task head).
|
| 21 |
|
| 22 |
-
- **pimm type:** `PT-v3m2`
|
| 23 |
-
- **enc_depths:** (3, 3, 3, 9, 3) · **enc_channels:** (48, 96, 192, 384, 512)
|
| 24 |
-
- **encoder-only** (`enc_mode=True`), masked-modeling token enabled.
|
| 25 |
|
| 26 |
## Loading
|
| 27 |
|
|
@@ -30,24 +19,9 @@ import pimm
|
|
| 30 |
model = pimm.from_pretrained("hf://deeplearnphysics/panda-base")
|
| 31 |
```
|
| 32 |
|
| 33 |
-
|
| 34 |
-
|
| 35 |
-
checkpoint.
|
| 36 |
-
|
| 37 |
-
## Input
|
| 38 |
-
|
| 39 |
-
A pimm `Point` dict for LArTPC (PILArNet) point clouds: `coord` (xyz) + `energy`,
|
| 40 |
-
collected as `feat_keys=("coord", "energy")` (4 input channels). Coordinates are
|
| 41 |
-
normalized in the model forward.
|
| 42 |
-
|
| 43 |
-
## Notes
|
| 44 |
-
|
| 45 |
-
- Config sets `enable_flash=True`. On CPU or non-FlashAttention setups, override
|
| 46 |
-
it (e.g. pass `enable_flash=False` to `from_pretrained`, or for detectors set it
|
| 47 |
-
on the `backbone`).
|
| 48 |
|
| 49 |
## Provenance
|
| 50 |
|
| 51 |
-
Repackaged from the original [`
|
| 52 |
-
[`pimm`](https://github.com/youngsm/particle-imaging-models) export format. Inherits
|
| 53 |
-
the license of the source repository.
|
|
|
|
| 1 |
---
|
|
|
|
|
|
|
|
|
|
| 2 |
library_name: pimm
|
| 3 |
tags:
|
| 4 |
+
- particle-physics
|
| 5 |
+
- lartpc
|
| 6 |
+
- point-cloud
|
|
|
|
|
|
|
|
|
|
| 7 |
---
|
| 8 |
|
| 9 |
# Panda — PointTransformerV3 encoder (pretrained)
|
| 10 |
|
| 11 |
+
PointTransformerV3 (`PT-v3m2`) encoder pretrained with masked point modeling on LArTPC (PILArNet) data. Backbone / warm-start for downstream tasks (no task head).
|
|
|
|
|
|
|
|
|
|
| 12 |
|
| 13 |
+
- **pimm type:** `PT-v3m2` · encoder-only · enc_channels (48,96,192,384,512)
|
|
|
|
|
|
|
| 14 |
|
| 15 |
## Loading
|
| 16 |
|
|
|
|
| 19 |
model = pimm.from_pretrained("hf://deeplearnphysics/panda-base")
|
| 20 |
```
|
| 21 |
|
| 22 |
+
Architecture + hyper-parameters travel in `config.json`, so no config file is
|
| 23 |
+
needed. Weights are bitwise-identical to the original checkpoint.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 24 |
|
| 25 |
## Provenance
|
| 26 |
|
| 27 |
+
Repackaged from the original [`panda`](https://github.com/DeepLearnPhysics/panda) checkpoints into the [`pimm`](https://github.com/youngsm/particle-imaging-models) export format. Inherits the source repo's license.
|
|
|
|
|
|
config.json
ADDED
|
@@ -0,0 +1,64 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"model": {
|
| 3 |
+
"type": "PT-v3m2",
|
| 4 |
+
"in_channels": 4,
|
| 5 |
+
"order": [
|
| 6 |
+
"hilbert",
|
| 7 |
+
"hilbert-trans",
|
| 8 |
+
"z",
|
| 9 |
+
"z-trans"
|
| 10 |
+
],
|
| 11 |
+
"stride": [
|
| 12 |
+
2,
|
| 13 |
+
2,
|
| 14 |
+
2,
|
| 15 |
+
2
|
| 16 |
+
],
|
| 17 |
+
"enc_depths": [
|
| 18 |
+
3,
|
| 19 |
+
3,
|
| 20 |
+
3,
|
| 21 |
+
9,
|
| 22 |
+
3
|
| 23 |
+
],
|
| 24 |
+
"enc_channels": [
|
| 25 |
+
48,
|
| 26 |
+
96,
|
| 27 |
+
192,
|
| 28 |
+
384,
|
| 29 |
+
512
|
| 30 |
+
],
|
| 31 |
+
"enc_num_head": [
|
| 32 |
+
3,
|
| 33 |
+
6,
|
| 34 |
+
12,
|
| 35 |
+
24,
|
| 36 |
+
32
|
| 37 |
+
],
|
| 38 |
+
"enc_patch_size": [
|
| 39 |
+
256,
|
| 40 |
+
256,
|
| 41 |
+
256,
|
| 42 |
+
256,
|
| 43 |
+
256
|
| 44 |
+
],
|
| 45 |
+
"mlp_ratio": 4,
|
| 46 |
+
"qkv_bias": true,
|
| 47 |
+
"qk_scale": null,
|
| 48 |
+
"layer_scale": 1e-05,
|
| 49 |
+
"attn_drop": 0.0,
|
| 50 |
+
"proj_drop": 0.0,
|
| 51 |
+
"drop_path": 0.3,
|
| 52 |
+
"shuffle_orders": true,
|
| 53 |
+
"pre_norm": true,
|
| 54 |
+
"enable_rpe": false,
|
| 55 |
+
"enable_flash": true,
|
| 56 |
+
"upcast_attention": false,
|
| 57 |
+
"upcast_softmax": false,
|
| 58 |
+
"traceable": true,
|
| 59 |
+
"enc_mode": true,
|
| 60 |
+
"mask_token": true,
|
| 61 |
+
"cpe_first_layer_only": false,
|
| 62 |
+
"cpe_shared_weight": false
|
| 63 |
+
}
|
| 64 |
+
}
|