Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Flow AI
company
Verified
https://www.flow-ai.com/
flowaicom
flowaicom/flow-judge
Activity Feed
Follow
38
AI & ML interests
LLM system evaluation, Automatic LM improvements
Team members
7
flowaicom
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Articles
bergr7f
updated
a Space
over 1 year ago
Runtime error
7
Flow Judge V0.1
🏢
7
Evaluate tasks using custom criteria and rubrics
sariola
updated
3 models
over 1 year ago
flowaicom/Flow-Judge-v0.1-W8A16
1B
•
Updated
Oct 14, 2024
•
1
•
1
flowaicom/Flow-Judge-v0.1-W4A16
0.7B
•
Updated
Oct 14, 2024
•
2
•
1
flowaicom/Flow-Judge-v0.1-FP8
4B
•
Updated
Oct 14, 2024
•
1
•
1
bergr7f
updated
2 models
over 1 year ago
flowaicom/Flow-Judge-v0.1-AWQ
Text Generation
•
4B
•
Updated
Oct 9, 2024
•
75
•
6
flowaicom/Flow-Judge-v0.1
Text Generation
•
4B
•
Updated
Oct 7, 2024
•
249
•
70
bergr7f
updated
a dataset
over 1 year ago
flowaicom/legalbench_contracts_qa_subset
Viewer
•
Updated
Oct 1, 2024
•
100
•
10
sariola
updated
2 models
over 1 year ago
flowaicom/Flow-Judge-v0.1-Llamafile
Updated
Sep 27, 2024
•
1
•
1
flowaicom/Flow-Judge-v0.1-GGUF
Text Generation
•
4B
•
Updated
Sep 18, 2024
•
20
•
10
bergr7f
updated
8 datasets
over 1 year ago
flowaicom/Flow-Judge-v0.1-3-likert-heldout
Viewer
•
Updated
Sep 18, 2024
•
300
•
7
flowaicom/Flow-Judge-v0.1-5-likert-heldout
Viewer
•
Updated
Sep 18, 2024
•
274
•
8
flowaicom/Flow-Judge-v0.1-binary-heldout
Viewer
•
Updated
Sep 18, 2024
•
316
•
5
flowaicom/RAGTruth_test
Viewer
•
Updated
Sep 14, 2024
•
2.7k
•
145
•
1
flowaicom/covid_qa
Viewer
•
Updated
Sep 14, 2024
•
1k
•
8
flowaicom/PubMedQA
Viewer
•
Updated
Sep 14, 2024
•
1k
•
10
•
1
flowaicom/HaluEval
Viewer
•
Updated
Sep 14, 2024
•
10k
•
182
•
1
flowaicom/Feedback-Bench
Viewer
•
Updated
Sep 14, 2024
•
1k
•
29
•
1
bergr7f
updated
a Space
over 1 year ago
Running
README
🧐
Load more