jech2 commited on
Commit
ce225f0
·
1 Parent(s): cd05ac4

update readme

Browse files
Files changed (1) hide show
  1. README.md +51 -3
README.md CHANGED
@@ -1,3 +1,51 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:e2ee0d54961d9df216b42e5bc729def934387e61abf8563eb62a04b552708e3e
3
- size 1435
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ pipeline_tag: symbolic-music-retrieval
3
+ language: en
4
+ library_name: pytorch
5
+ license: apache-2.0
6
+ tags:
7
+ - music
8
+ - midi
9
+ - mir
10
+ - deduplication
11
+ - caugbert
12
+ model-index:
13
+ - name: LMD Deduplication - CAugBERT
14
+ results:
15
+ - task:
16
+ type: representation-learning
17
+ name: symbolic music representation learning
18
+ dataset:
19
+ type: midi
20
+ name: Lakh MIDI Dataset
21
+ metrics:
22
+ - type: F1
23
+ value: 0.493
24
+ ---
25
+
26
+ # LMD Deduplication Supplements
27
+ This repository provides the pre-trained CAugBERT model checkpoint used in:
28
+ **"On the De-duplication of the Lakh MIDI Dataset" (ISMIR 2025)**
29
+ [[Paper]](https://ismir2025program.ismir.net/poster_188.html) | [[GitHub Code]](https://github.com/jech2/LMD_Deduplication)
30
+
31
+ ---
32
+
33
+ # Usage
34
+ You can either integrate this checkpoint into the main repository for inference, or load it directly:
35
+ ```bash
36
+ # Option 1: Run inference in the main repo
37
+ poetry run python inference.py # make sure yamls/inference.yaml paths are correct
38
+ ```
39
+ ```python
40
+ # Option 2: Load checkpoint manually
41
+ import torch
42
+ from contrastive_musicbert.model.BERT import BERT_Lightning
43
+
44
+ model = BERT_Lightning(...).to(device) # see .hydra/config.yaml for arguments
45
+ checkpoint = torch.load(checkpoint_path, map_location="cpu")
46
+ model.load_state_dict(checkpoint['state_dict'])
47
+ ```
48
+
49
+ # Note
50
+ If you have any questions regarding the checkpoint, please contact:
51
+ Eunjin Choi (jech@kaist.ac.kr)