Rename OpenMarkA.md to OpenMark.md
- OpenMark.md +43 -0
- OpenMarkA.md +0 -13

OpenMark.md
ADDED
@@ -0,0 +1,43 @@
+---
+title: OpenMark
+emoji: 🎯
+colorFrom: blue
+colorTo: purple
+sdk: static
+pinned: true
+short_description: "AI model benchmarking platform – compare 100+ models on your own tasks"
+tags:
+- benchmarking
+- llm
+- ai
+- model-evaluation
+---
|
| 15 |
+
|
| 16 |
+
# OpenMark β AI Model Benchmarking Platform
|
| 17 |
+
|
| 18 |
+
**Stop trusting leaderboards. Benchmark your own work.**
|
| 19 |
+
|
| 20 |
+
[OpenMark](https://openmark.ai) lets you benchmark 100+ AI models on your own tasks with deterministic scoring, stability metrics, and real API cost tracking.
|
| 21 |
+
|
| 22 |
+
## What Makes OpenMark Different
|
| 23 |
+
|
| 24 |
+
- **Your tasks, not generic tests** β Write any evaluation task (code review, classification, creative writing, vision analysis) and test models against it
|
| 25 |
+
- **Deterministic scoring** β Same prompt, same score, every time. No vibes-based evaluation
|
| 26 |
+
- **Stability metrics** β See which models change their answer across runs (hint: many do)
|
| 27 |
+
- **Real API costs** β Know exactly what each model costs per task, not just per million tokens
|
| 28 |
+
- **100+ models** β OpenAI, Anthropic, Google, Meta, Mistral, xAI, and more. Side-by-side comparison
|
| 29 |
+
|
+## Why It Matters
+
+Generic benchmarks (MMLU, HumanEval, MATH) test models on tasks you'll never use. The only benchmark that matters is yours: does this model, with this prompt, for this task, give you the result you expect – reliably and affordably?
+
+## Try It
+
+**[openmark.ai](https://openmark.ai)** – Free to start. No credit card required.
+
+## Links
+
+- [Website](https://openmark.ai)
+- [Why Generic Benchmarks Are Useless](https://dev.to/openmarkai/i-benchmarked-10-ai-models-on-reading-human-emotions-3m0b)
+- 🐦 [Twitter/X](https://x.com/OpenMarkAI)
+- 💼 [LinkedIn](https://www.linkedin.com/company/openmark-ai)
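The per-task cost and stability ideas in the README above are easy to make concrete. The sketch below is purely illustrative: the function names are invented here, the prices are made-up per-million-token rates, and none of this reflects OpenMark's actual scoring code.

```python
from collections import Counter

# Hypothetical helpers illustrating the README's bullets; not OpenMark's API.

def per_task_cost(input_tokens, output_tokens, in_price_per_m, out_price_per_m):
    """Convert per-million-token pricing into the dollar cost of one task."""
    return (input_tokens * in_price_per_m + output_tokens * out_price_per_m) / 1_000_000

def stability(answers):
    """Fraction of runs agreeing with the most common answer (1.0 = fully stable)."""
    counts = Counter(answers)
    return counts.most_common(1)[0][1] / len(answers)

# Made-up example: 1,200 input / 300 output tokens at $3 / $15 per million tokens.
print(per_task_cost(1200, 300, 3.0, 15.0))   # 0.0081 dollars for this one task
print(stability(["B", "B", "A", "B", "B"]))  # 0.8 – the model flipped once in 5 runs
```

The point of the per-task view is visible in the numbers: a "cheap" per-million rate can still add up quickly if a task needs long prompts or many retries.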
OpenMarkA.md
DELETED
@@ -1,13 +0,0 @@
----
-title: README
-emoji: π
-colorFrom: purple
-colorTo: green
-sdk: static
-pinned: true
-thumbnail: >-
-  https://cdn-uploads.huggingface.co/production/uploads/6997b2c868950cfdb9f34310/yoX33UYjvhN52TZOM2OCW.png
-short_description: AI model benchmarking platform – compare 100+ models on your
----
-
-Edit this `README.md` markdown file to author your organization card.