Space: evalstate/hf-papers
hf-papers/docs/tool_description_eval/clean_release_20260209 (branch: main)
1.73 MB | 1 contributor | History: 2 commits
Latest commit: evalstate (HF Staff) | add social-ready evaluation dashboards | 0f3bc9b (verified) | about 1 month ago
SUMMARY.md | Safe | 827 Bytes | add clean tool-description evaluation charts and summary | about 1 month ago
bar_avg_calls_by_model.png | Safe | 66.6 kB | add clean tool-description evaluation charts and summary | about 1 month ago
bar_avg_exchange_chars_by_model.png | Safe | 67.5 kB | add clean tool-description evaluation charts and summary | about 1 month ago
bar_avg_score_by_model.png | Safe | 64 kB | add clean tool-description evaluation charts and summary | about 1 month ago
bar_first_call_ok_by_model.png | Safe | 64 kB | add clean tool-description evaluation charts and summary | about 1 month ago
heat_avg_calls.png | Safe | 70.5 kB | add clean tool-description evaluation charts and summary | about 1 month ago
heat_avg_exchange_chars.png | Safe | 79.8 kB | add clean tool-description evaluation charts and summary | about 1 month ago
heat_avg_score.png | Safe | 66.4 kB | add clean tool-description evaluation charts and summary | about 1 month ago
heat_first_call_ok.png | Safe | 66 kB | add clean tool-description evaluation charts and summary | about 1 month ago
model_compare_answer_norm.png | Safe | 68.2 kB | add clean tool-description evaluation charts and summary | about 1 month ago
model_compare_answer_pass.png | Safe | 63.5 kB | add clean tool-description evaluation charts and summary | about 1 month ago
model_compare_avg_delegation_chars.png | Safe | 69.2 kB | add clean tool-description evaluation charts and summary | about 1 month ago
model_compare_avg_exchange_chars.png | Safe | 64.2 kB | add clean tool-description evaluation charts and summary | about 1 month ago
model_compare_avg_tool_calls.png | Safe | 63.7 kB | add clean tool-description evaluation charts and summary | about 1 month ago
model_compare_pareto_answer_vs_exchange.png | xet | 110 kB | add clean tool-description evaluation charts and summary | about 1 month ago
overall_variant_pareto_chart.png | Safe | 96.4 kB | add clean tool-description evaluation charts and summary | about 1 month ago
overall_variant_summary_chart.png | Safe | 51.1 kB | add clean tool-description evaluation charts and summary | about 1 month ago
scatter_calls_vs_first_ok.png | Safe | 42.4 kB | add clean tool-description evaluation charts and summary | about 1 month ago
scatter_exchange_vs_first_ok.png | Safe | 51.1 kB | add clean tool-description evaluation charts and summary | about 1 month ago
social_at_a_glance_dashboard.png | xet | 316 kB | add social-ready evaluation dashboards | about 1 month ago
social_square_summary.png | xet | 153 kB | add social-ready evaluation dashboards | about 1 month ago
tool_description_ab_summary.filtered.csv | Safe | 1.9 kB | add clean tool-description evaluation charts and summary | about 1 month ago
tool_description_ab_summary.filtered.json | Safe | 8.67 kB | add clean tool-description evaluation charts and summary | about 1 month ago
tool_description_answer_summary.filtered.csv | Safe | 778 Bytes | add clean tool-description evaluation charts and summary | about 1 month ago
tool_description_answer_summary.filtered.json | Safe | 3.2 kB | add clean tool-description evaluation charts and summary | about 1 month ago
tool_description_dashboard.csv | Safe | 2.51 kB | add clean tool-description evaluation charts and summary | about 1 month ago
tool_description_dashboard.json | Safe | 10 kB | add clean tool-description evaluation charts and summary | about 1 month ago
tool_description_dashboard.md | Safe | 2.54 kB | add clean tool-description evaluation charts and summary | about 1 month ago
tool_description_interpretation.md | Safe | 859 Bytes | add clean tool-description evaluation charts and summary | about 1 month ago
tool_description_model_comparison.md | Safe | 1.11 kB | add clean tool-description evaluation charts and summary | about 1 month ago