Peter Organisciak committed on
Commit e5d14d0 · 1 Parent(s): ad0abee

Restore GloVe to dropdown; files uploading to HF

Files changed (2)
  1. app.py +3 -3
  2. scoring.py +2 -2
app.py CHANGED
@@ -167,7 +167,7 @@ embedding space.
 
  **Available models:**
  - **MOTES 100k** (default): Children's writing embeddings (ages 10–12) from the MOTES study
- - **GloVe 840B** (available for local use): General-purpose embeddings from Common Crawl (Pennington et al. 2014). Due to the 5.4 GB model size, GloVe is not hosted on this Space. For self-hosted deployments, download vectors from [Stanford NLP](https://nlp.stanford.edu/projects/glove/) and see [massivetexts/glove-840b-gensim](https://huggingface.co/massivetexts/glove-840b-gensim) for Gensim conversion instructions.
+ - **GloVe 840B**: General-purpose embeddings trained on 840B Common Crawl tokens (Pennington et al. 2014). Large vocabulary, good for adult responses. English only.
 
  **Options:**
  - **Stopword filtering**: Skip common functional words (the, and, etc.)
@@ -214,8 +214,8 @@ with gr.Blocks(title="OCS Semantic Scoring", theme=OCS_THEME) as demo:
  "For higher accuracy, try [Ocsai LLM scoring](https://openscoring.du.edu/ocsai)."
  )
 
- # Only show hosted models in the dropdown
- model_choices = [(MODELS[k]["description"], k) for k in MODELS if MODELS[k].get("hosted", True)]
+ # Model choices for dropdowns
+ model_choices = [(MODELS[k]["description"], k) for k in MODELS]
 
  # Load model controls
  with gr.Row():
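The second app.py hunk changes how `model_choices` is built. As a rough illustration only, the sketch below contrasts the old filtered comprehension with the new unfiltered one. The `MODELS` contents are a hypothetical stand-in (the `motes` and `glove` keys and the shortened descriptions are assumptions, not taken from scoring.py), and the helper names `hosted_choices` and `all_choices` do not exist in the repository.

```python
# Hypothetical stand-in for the MODELS registry in scoring.py.
# Key names and descriptions are assumed for illustration.
MODELS = {
    "motes": {"description": "MOTES 100k"},  # no "hosted" key
    "glove": {"description": "GloVe 840B 300d", "hosted": False},
}


def hosted_choices(models):
    """Old behavior: keep entries whose "hosted" flag is missing or truthy."""
    return [
        (models[k]["description"], k)
        for k in models
        if models[k].get("hosted", True)
    ]


def all_choices(models):
    """New behavior: list every registered model as a (label, value) pair."""
    return [(models[k]["description"], k) for k in models]


print(hosted_choices(MODELS))
print(all_choices(MODELS))
```

Because this commit also sets `"hosted": True` for GloVe in scoring.py, either version would now list it; dropping the filter means the flag no longer affects the dropdown at all.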
scoring.py CHANGED
@@ -32,9 +32,9 @@ MODELS = {
  "repo": "massivetexts/glove-840b-gensim",
  "files": ["glove.840B-300d.wv", "glove.840B-300d.wv.vectors.npy"],
  "main_file": "glove.840B-300d.wv",
- "description": "GloVe 840B 300d self-hosted only (5.4 GB, see repo for setup)",
+ "description": "GloVe 840B 300d (Pennington et al. 2014) general English vocabulary",
  "scaling": {"min": 0.6456, "max": 0.9610},
- "hosted": False, # model files not uploaded; too large for HF Space
+ "hosted": True,
  },
  }
 
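The GloVe entry lists two files because Gensim stores large arrays in a companion `.npy` file that must sit next to the main file when it is loaded. The diff does not show the loader itself, so the following is only a minimal sketch of how the `"repo"`, `"files"`, and `"main_file"` fields could be consumed, assuming `huggingface_hub.hf_hub_download` is the download mechanism. `GLOVE_SPEC`, `fetch_model_files`, and the `download` parameter are hypothetical names introduced for illustration.

```python
# Fields copied from the GloVe entry in the diff above.
GLOVE_SPEC = {
    "repo": "massivetexts/glove-840b-gensim",
    "files": ["glove.840B-300d.wv", "glove.840B-300d.wv.vectors.npy"],
    "main_file": "glove.840B-300d.wv",
}


def fetch_model_files(spec, download=None):
    """Download every file in spec["files"]; return the local path of the main file.

    `download` is an injectable callable (assumption, useful for testing);
    by default it is huggingface_hub.hf_hub_download.
    """
    if download is None:
        # Imported lazily so the sketch can be exercised without the dependency.
        from huggingface_hub import hf_hub_download
        download = hf_hub_download
    paths = {
        name: download(repo_id=spec["repo"], filename=name)
        for name in spec["files"]
    }
    return paths[spec["main_file"]]
```

With the default downloader, all files from one repository revision land in the same cache directory, so the returned path could then be passed to `gensim.models.KeyedVectors.load(path, mmap="r")`.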