Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
samrat-rm
/
WhyDidItFail
like
1
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
ae1e803
WhyDidItFail
Ctrl+K
Ctrl+K
4 contributors
History:
23 commits
samrat-rm
feat: implementing judge LLM which contributes to 15% of scoring
ae1e803
19 days ago
server
feat: max step limit
19 days ago
.gitignore
Safe
279 Bytes
Initial commit
20 days ago
Dockerfile
Safe
2.65 kB
Initial commit
20 days ago
LICENSE
Safe
1.07 kB
Initial commit
19 days ago
README.md
7.82 kB
refactor: WhyDidItFailAction and WhyDidItFailObservation classes
20 days ago
__init__.py
437 Bytes
refactor: WhyDidItFailAction and WhyDidItFailObservation classes
20 days ago
client.py
2.63 kB
feat: init the client
20 days ago
inference.py
9.07 kB
feat: implementing judge LLM which contributes to 15% of scoring
19 days ago
llm_judge.py
3.1 kB
feat: implementing judge LLM which contributes to 15% of scoring
19 days ago
models.py
1.44 kB
feat: define WhyDidItFailAction and WhyDidItFailObservation models with typed fields and descriptions
20 days ago
openenv.yaml
96 Bytes
Initial commit
20 days ago
pyproject.toml
Safe
1.34 kB
Initial commit
20 days ago
uv.lock
Safe
576 kB
Initial commit
20 days ago