Sefika commited on
Commit
f28a68d
·
1 Parent(s): 3a40df1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +97 -0
README.md CHANGED
@@ -1,3 +1,100 @@
1
  ---
2
  license: gpl-3.0
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: gpl-3.0
3
  ---
4
+ # GraphMatcher
5
+ The GraphMatcher aims to find the correspondes between two ontologies and outputs the possible alignments between them.
6
+
7
+ The GraphMatcher leverages Graph Attention Network[2] in its neural network structure.
8
+ The project leverages a new neighborhood aggregation algorithm, so it examines contribution of neighboring terms which have not been used in the previous matchers before.
9
+
10
+ The project has been submitted to The 17th International Workshop on Ontology Matching's OAEI 2022 (ISWC-2022) for conference track and obtained the highest F1-measure in uncertain reference alignments among other experts participating to this challenge. Its system paper has been published, and it was invited to the poster presentation session.
11
+
12
+ ## Set up
13
+ * 1.) install requirements
14
+ ``` pip install -r requirements.txt```
15
+
16
+ * 2.) set the parameters in the config.ini
17
+ ````
18
+ [General]
19
+ dataset = ------> name of a dataset e.g., conference.
20
+ K = ------> the parameter for K fold cross-validation
21
+ ontology_split = ------> True/False
22
+ max_false_examples = ------>
23
+
24
+ [Paths]
25
+ dataset_folder = ------> a path to the ontologies
26
+ alignment_folder = ------> a path to the reference alignments
27
+ save_model_path = ------> save the model to the path
28
+ load_model_path = ------> model path
29
+ output_folder = ------> The output folder for the alignments
30
+
31
+ [Parameters]
32
+ max_paths = ------>
33
+ max_pathlen = ------> ( number of neighboring concepts' types: Equivalent class, subclass of(general to specific or specific to general(2))...
34
+ [Hyperparameters]
35
+
36
+ lr = ------> learning rate
37
+ num_epochs = ------> number of epochs
38
+ weight_decay = ------> Weight decay
39
+ batch_size = ------> Batch Size (8/16/32)
40
+
41
+ ````
42
+
43
+ * 3.) train the model
44
+ ```python
45
+ python src/train_model.py
46
+
47
+ ```
48
+ * 4.) test the model
49
+ ```python
50
+ python src/test_model.py ${source.rdf} ${target.rdf}
51
+ ```
52
+ ### Sample Alignment:
53
+ ```xml
54
+ <map>
55
+ <Cell>
56
+ <entity1 rdf:resource='http://conference#has_the_last_name'/>
57
+ <entity2 rdf:resource='http://confof#hasSurname'/>
58
+ <relation>=</relation>
59
+ <measure rdf:datatype='http://www.w3.org/2001/XMLSchema#float'>0.972</measure>
60
+ </Cell>
61
+ </map>
62
+ ```
63
+
64
+ * 5.) evaluate the model with the MELT
65
+
66
+
67
+ Note: The codes in train_model.py and test_model.py are partially based on the VeeAlign[2] project with the permission of its main author. I would like to thank the main author.
68
+
69
+ ## References:
70
+ [1]
71
+ ````
72
+ @inproceedings{iyer-etal-2021-veealign,
73
+ title = "{V}ee{A}lign: Multifaceted Context Representation Using Dual Attention for Ontology Alignment",
74
+ author = "Iyer, Vivek and
75
+ Agarwal, Arvind and
76
+ Kumar, Harshit",
77
+ booktitle = "Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing",
78
+ month = nov,
79
+ year = "2021",
80
+ address = "Online and Punta Cana, Dominican Republic",
81
+ publisher = "Association for Computational Linguistics",
82
+ url = "https://aclanthology.org/2021.emnlp-main.842",
83
+ doi = "10.18653/v1/2021.emnlp-main.842",
84
+ pages = "10780--10792",
85
+ }
86
+ ````
87
+ [2]
88
+ ````
89
+ @misc{https://doi.org/10.48550/arxiv.1710.10903,
90
+ title = {Graph Attention Networks},
91
+ author = {Veličković, Petar and Cucurull, Guillem and Casanova, Arantxa and Romero, Adriana and Liò, Pietro and Bengio, Yoshua},
92
+ keywords = {Machine Learning (stat.ML), Artificial Intelligence (cs.AI), Machine Learning (cs.LG), Social and Information Networks (cs.SI), FOS: Computer and information sciences, FOS: Computer and information sciences},
93
+ url = {https://arxiv.org/abs/1710.10903},
94
+ publisher = {arXiv},
95
+ doi = {10.48550/ARXIV.1710.10903},
96
+ year = {2017},
97
+ copyright = {arXiv.org perpetual, non-exclusive license}
98
+ }
99
+
100
+ ````