Spaces:

JunsWan
/

HardcoreLogic

Running

App Files Files Community

JunsWan commited on Oct 13

Commit

1e754f8

verified ·

1 Parent(s): b0a10e7

Upload README.md

Browse files

Files changed (1) hide show

README.md +61 -0

README.md ADDED Viewed

	@@ -0,0 +1,61 @@

+---
+title: Zebra Logic Bench
+emoji: 🦓
+colorFrom: blue
+colorTo: yellow
+sdk: gradio
+sdk_version: 4.19.2
+app_file: app.py
+pinned: true
+fullWidth: true
+hf_oauth: true
+api: false
+tags:
+    - leaderboard
+datasets:
+    - allenai/ZebraLogicBench
+    - WildEval/ZebraLogic
+models:
+    - Qwen/Qwen2-72B-Instruct
+    - Qwen/Qwen1.5-72B-Chat
+    - Qwen/Qwen1.5-7B-Chat
+    - meta-llama/Meta-Llama-3-8B-Instruct
+    - meta-llama/Meta-Llama-3-70B-Instruct
+    - meta-llama/Llama-2-13b-chat-hf
+    - meta-llama/Llama-2-70b-chat-hf
+    - meta-llama/Llama-2-7b-chat-hf
+    - mistralai/Mistral-7B-Instruct-v0.1
+    - mistralai/Mistral-7B-Instruct-v0.2
+    - mistralai/Mixtral-8x7B-Instruct-v0.1
+    - microsoft/Phi-3-medium-128k-instruct
+    - microsoft/Phi-3-mini-128k-instruct
+    - NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO
+    - NousResearch/Hermes-2-Theta-Llama-3-8B
+    - 01-ai/Yi-1.5-34B-Chat
+    - 01-ai/Yi-1.5-9B-Chat
+    - 01-ai/Yi-1.5-6B-Chat
+    - google/gemma-7b-it
+    - google/gemma-2b-it
+    - allenai/tulu-2-dpo-70b
+    - HuggingFaceH4/zephyr-7b-beta
+    - Nexusflow/Starling-LM-7B-beta
+    - databricks/dbrx-instruct
+    - princeton-nlp/Llama-3-Instruct-8B-SimPO
+    - chujiezheng/Llama-3-Instruct-8B-SimPO-ExPO
+    - chujiezheng/Starling-LM-7B-beta-ExPO
+    - ZhangShenao/SELM-Zephyr-7B-iter-3
+    - deepseek-ai/DeepSeek-V2-Chat
+    - m-a-p/neo_7b_instruct_v0.1
+    - 01-ai/Yi-34B-chat
+    - lmsys/vicuna-13b-v1.5
+    - HuggingFaceH4/zephyr-7b-gemma-v0.1
+    - deepseek-ai/DeepSeek-Coder-V2
+    - THUDM/glm-4-9b-chat
+    - chujiezheng/neo_7b_instruct_v0.1-ExPO
+    - ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3
+---
+Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
+Paper: arxiv.org/abs/2406.04770
+Paper: arxiv.org/abs/2502.01100