Spaces:
Runtime error
Runtime error
🚀 Push to GitHub - Complete Instructions
Step 1: Create a GitHub Repository
- Go to https://github.com/new
- Sign in to your GitHub account
- Fill in the form:
- Repository name:
togmal-prompt-analyzer - Description: "Real-time LLM capability boundary detection using vector similarity search"
- Public: Selected
- Initialize this repository with a README: Unchecked
- Repository name:
- Click "Create repository"
Step 2: Push Your Local Repository
After creating the repository, you'll see instructions. Use these commands in your terminal:
cd /Users/hetalksinmaths/togmal
git remote add origin https://github.com/YOUR_USERNAME/togmal-prompt-analyzer.git
git branch -M main
git push -u origin main
Replace YOUR_USERNAME with your actual GitHub username.
What You'll Have on GitHub
Once pushed, your repository will contain:
Core Implementation
benchmark_vector_db.py- Vector database for difficulty assessmentdemo_app.py- Gradio web interfacefetch_mmlu_top_models.py- Script to fetch real benchmark data
Documentation
COMPLETE_DEMO_ANALYSIS.md- Comprehensive analysis of the systemDEMO_README.md- Demo instructions and resultsGITHUB_INSTRUCTIONS.md- These instructionsREADME.md- Main project documentation
Test Files
test_vector_db.py- Test script with real data examplestest_examples.py- Additional test cases
Configuration
requirements.txt- Python dependencies.gitignore- Files excluded from version control
Key Features Demonstrated
Real Data vs Mock Data
- Before: All prompts showed ~45% success rate (mock data)
- After: System correctly differentiates difficulty levels:
- Hard prompts: 23.9% success rate (HIGH risk)
- Easy prompts: 100% success rate (MINIMAL risk)
11 Test Questions Analysis
The system correctly categorizes:
- Hard Questions (20-50% success):
- "Calculate the quantum correction to the partition function..."
- "Prove that there are infinitely many prime numbers"
- "Statement 1 | Every field is also a ring..."
- Easy Questions (80-100% success):
- "What is 2 + 2?"
- "What is the capital of France?"
- "Who wrote Romeo and Juliet?"
Recommendation Engine
Based on success rates:
- <30%: Multi-step reasoning with verification
- 30-70%: Use chain-of-thought prompting
- >70%: Standard LLM response adequate
Live Demo
Your demo is running at:
- Local: http://127.0.0.1:7861
- Public: https://db11ee71660c8a3319.gradio.live
Next Steps After Pushing
- Add badges to README (build status, license, etc.)
- Create GitHub Pages for project documentation
- Set up CI/CD for automated testing
- Add more benchmark datasets
- Create releases for different versions
Need Help?
If you encounter any issues:
- Check that you're using the correct repository URL
- Ensure you have internet connectivity
- Verify your GitHub credentials are set up
- Make sure you've replaced YOUR_USERNAME with your actual GitHub username
For additional support, refer to: