Spaces:
Running
A newer version of the Gradio SDK is available:
6.9.0
Troubleshooting
pplx-embed fails to load
Error: ImportError: cannot import name 'Module' from 'sentence_transformers.models'
Cause: pplx-embed's custom st_quantize.py imports Module from sentence_transformers.models, which was removed in version 4.x.
Fix: Pin to the correct version:
pip install "sentence-transformers==3.3.1"
pplx-embed crashes with 404 on chat templates
Error: RemoteEntryNotFoundError: 404 ... additional_chat_templates does not exist
Cause: transformers 4.57+ added list_repo_templates() which looks for an additional_chat_templates folder in every model repo. pplx-embed predates this feature and doesn't have the folder.
Fix: Already handled automatically in openmark/embeddings/local.py via a monkey-patch applied before model loading. If you see this error outside of OpenMark, apply:
from transformers.utils import hub as _hub
import transformers.tokenization_utils_base as _tub
_orig = _hub.list_repo_templates
def _safe(*a, **kw):
try: return _orig(*a, **kw)
except Exception: return []
_hub.list_repo_templates = _safe
_tub.list_repo_templates = _safe
Neo4j connection error: "Unable to retrieve routing information"
Cause: Using neo4j:// URI (routing protocol) with a single local Neo4j instance.
Fix: Use bolt:// instead:
NEO4J_URI=bolt://127.0.0.1:7687
Neo4j error: "Database does not exist"
Cause: The database name in .env doesn't match what's in Neo4j Desktop.
Fix: Open http://localhost:7474, check what databases exist:
SHOW DATABASES
Update NEO4J_DATABASE in .env to match.
LinkedIn script returns 0 results or 404
Cause: LinkedIn's internal queryId changes when they deploy new JavaScript bundles.
Fix:
- Open LinkedIn in your browser β go to Saved Posts
- Open DevTools β Network tab β filter for
voyagerSearchDashClusters - Click one of the requests β copy the full URL
- Extract the new
queryIdvalue - Update
linkedin_fetch.pywith the newqueryId
YouTube OAuth "Access Blocked: App not verified"
Cause: Your Google Cloud app is in testing mode and your account isn't listed as a test user.
Fix:
- Google Cloud Console β OAuth consent screen
- Scroll to "Test users" β Add users β add your Google account email
- Re-run
youtube_fetch.py
ChromaDB ingest is slow
On CPU with local pplx-embed, embedding 8K items takes ~20 minutes. This is normal.
Options:
- Use Azure instead:
python scripts/ingest.py --provider azure(~5 min, ~β¬0.30) - The ingest is resumable β if interrupted, re-run and it skips already-ingested items
SIMILAR_TO step takes too long
Building SIMILAR_TO edges queries ChromaDB for every bookmark's top-5 neighbors, then writes to Neo4j. For 8K items on CPU this takes ~25-40 minutes.
Skip it:
python scripts/ingest.py --skip-similar
The app works without SIMILAR_TO edges. You only lose the find_similar_bookmarks agent tool and cross-topic graph traversal.
Windows UnicodeEncodeError in terminal
Error: UnicodeEncodeError: 'charmap' codec can't encode character
Cause: Windows terminal (cmd/PowerShell) defaults to cp1252 encoding which can't handle emoji or some Unicode characters in bookmark titles.
Fix: Run from Windows Terminal (supports UTF-8) or add to the top of the script:
import sys
sys.stdout.reconfigure(encoding='utf-8')
All OpenMark scripts already include this.
gradio not found on Python 3.13
gradio 6.6.0 is installed on Python 3.14 by default on this machine. If using Python 3.13:
C:\Python313\python -m pip install gradio