cstr commited on
Commit
c01a3cf
Β·
verified Β·
1 Parent(s): ee99d6d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +66 -2
README.md CHANGED
@@ -1,5 +1,5 @@
1
  ---
2
- title: Conceptnet Db
3
  emoji: πŸ“š
4
  colorFrom: indigo
5
  colorTo: purple
@@ -8,6 +8,70 @@ sdk_version: 5.49.1
8
  app_file: app.py
9
  pinned: false
10
  license: cc-by-sa-4.0
 
 
 
 
 
11
  ---
12
 
13
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ title: ConceptNet DB (All Languages)
3
  emoji: πŸ“š
4
  colorFrom: indigo
5
  colorTo: purple
 
8
  app_file: app.py
9
  pinned: false
10
  license: cc-by-sa-4.0
11
+ tags:
12
+ - conceptnet
13
+ - knowledge-graph
14
+ - all-languages
15
+ - multilingual
16
  ---
17
 
18
+ # ConceptNet Database Explorer (All Languages)
19
+
20
+ This Gradio application provides access to the **complete, unfiltered ConceptNet 5.5 knowledge graph** with all 28.3 million nodes across **all languages**.
21
+
22
+ ## Why Use This App?
23
+
24
+ **Use this application if you:**
25
+ - Need access to **all languages** in ConceptNet (not just the 11 languages in the normalized version)
26
+ - Want to explore connections across any language pair
27
+ - Need comprehensive, unfiltered results from the complete knowledge graph
28
+ - Are doing multilingual research that requires less common languages
29
+
30
+ **For faster queries with 11 specific languages** (en, fr, it, de, es, ar, fa, grc, he, la, hbo), consider using the normalized version which has optimized performance.
31
+
32
+ ## Features
33
+
34
+ - **Semantic Profile**: Finds incoming and outgoing relations for a given node across all languages
35
+ - **Query Builder**: Allows for custom queries (start_node, relation, end_node) with flexible filtering
36
+ - **Raw SQL**: Enables direct SQL queries against the complete database for advanced users
37
+
38
+ ## Performance Note
39
+
40
+ This application queries the original 23.6 GB un-normalized database (`cstr/conceptnet-de-indexed`). Queries may take 30-60+ seconds due to:
41
+ - Text-based operations on 34 million edges
42
+ - Comprehensive search across all languages
43
+ - Un-normalized schema optimized for completeness rather than speed
44
+
45
+ The trade-off is complete language coverage versus query speed.
46
+
47
+ ## Data Quality
48
+
49
+ This database contains the complete, unfiltered ConceptNet data. You may see some metadata artifacts (such as POS tags like 'n' or 'v') alongside semantic relationships. This is expected behavior for the comprehensive dataset.
50
+
51
+ ## Source Database
52
+
53
+ This application runs on top of the complete, un-normalized database:
54
+ - **Dataset**: [cstr/conceptnet-de-indexed](https://huggingface.co/datasets/cstr/conceptnet-de-indexed)
55
+ - **Nodes**: 28.3 million nodes across all ConceptNet languages
56
+ - **Edges**: 34 million edges (unfiltered)
57
+
58
+ ## Licensing and Attribution
59
+
60
+ This work includes data from ConceptNet 5, which was compiled by the Commonsense Computing Initiative. ConceptNet 5 is freely available under the Creative Commons Attribution-ShareAlike license (CC BY SA 4.0) from http://conceptnet.io.
61
+
62
+ For a full list of licenses and attributions for included resources such as WordNet, Open Multilingual WordNet, and Wikimedia projects, please see the original dataset card.
63
+
64
+ ## Citation Information
65
+
66
+ If you use this data in your work, please cite the original ConceptNet 5.5 paper:
67
+
68
+ ```bibtex
69
+ @inproceedings{speer2017conceptnet,
70
+ author = {Robyn Speer and Joshua Chin and Catherine Havasi},
71
+ title = {ConceptNet 5.5: An Open Multilingual Graph of General Knowledge},
72
+ booktitle = {Proceedings of the AAAI Conference on Artificial Intelligence},
73
+ year = {2017},
74
+ pages = {4444--4451},
75
+ url = {http://aaai.org/ocs/index.php/AAAI/AAAI17/paper/view/14972}
76
+ }
77
+ ```