| Organization | Method | Model | DR | EPR (Micro/Macro) | LPR (Micro/Macro) | C-LPR | FPR |
| --- | --- | --- | --- | --- | --- | --- | --- |
| NJU | Act | DeepSeek-V3 | 70.4 | 49.9 / 0 | 64.6 / 30.6 | 0 | 0 |
| NJU | Act | GPT-4o | 97.5 | 70.8 / 0 | 86.8 / 68.6 | 0 | 0 |
| NJU | ReAct (zero-shot) | DeepSeek-V3 | 43.3 | 40.8 / 0 | 41.9 / 19.6 | 0 | 0 |
| NJU | ReAct (zero-shot) | GPT-4o | 95.4 | 48.2 / 0 | 71.3 / 33.0 | 0 | 0 |
| NJU | ReAct (one-shot) | DeepSeek-V3 | 77.5 | 68.3 / 6.00 | 74.1 / 52.3 | 5.77 | 5.33 |
| NJU | ReAct (one-shot) | GPT-4o | 94.2 | 68.1 / 0 | 89.4 / 70.6 | 0 | 0 |
| NJU | NeSy Planning | DeepSeek-V3 | 75.3 | 75.3 / 75.3 | 70.4 / 52.6 | 70.4 | 52.6 |
| NJU | NeSy Planning | GPT-4o | 75.0 | 73.6 / 64.0 | 73.5 / 63.3 | 61.7 | 60.6 |
| NJU | NeSy Planning | Qwen3-8B | 72.3 | 67.0 / 34.0 | 70.4 / 49.6 | 32.6 | 28.3 |
| NJU | NeSy Planning | Llama3.1-8B | 32.0 | 31.9 / 31.3 | 29.1 / 21.0 | 28.3 | 21.0 |
| NJU | NeSy Planning | Mistral-7B | 30.3 | 30.3 / 30.3 | 27.6 / 19.6 | 27.6 | 19.6 |
| NJU | TTG (oracle) | DeepSeek-V3 | 18.3 | 21.5 / 8.66 | 17.2 / 15.0 | 8.23 | 8.66 |
| NJU | LLM-Modulo* | DeepSeek-V3 | 48.3 | 94.5 / 4.33 | 58.4 / 43.6 | 4.11 | 4.33 |
| NJU | LLM-Modulo* | GPT-4o | 91.6 | 88.2 / 7.66 | 95.5 / 84.6 | 7.66 | 7.00 |
| NJU | LLM-Modulo* | Qwen3-8B | 30.0 | 80.5 / 0.0 | 62.7 / 25.0 | 0.0 | 0.0 |
| NJU | LLM-Modulo* | Llama3.1-8B | 28.6 | 69.4 / 0.0 | 55.2 / 8.33 | 0.0 | 0.0 |
| NJU | LLM-Modulo* | Mistral-7B | 10.3 | 90.5 / 0.0 | 39.1 / 9.0 | 0.0 | 0.0 |
| NJU | NeSy Planning* | DeepSeek-V3 | 82.6 | 81.7 / 75.0 | 82.2 / 75.3 | 75.0 | 74.0 |
| NJU | NeSy Planning* | GPT-4o | 66.6 | 66.7 / 66.0 | 64.6 / 63.6 | 64.6 | 62.6 |
| NJU | NeSy Planning* | Qwen3-8B | 69.3 | 69.3 / 59.3 | 70.2 / 59.6 | 59.3 | 57.9 |
| NJU | NeSy Planning* | Mistral-7B | 52.6 | 52.6 / 52.6 | 50.4 / 45.3 | 50.4 | 45.6 |
| NJU | NeSy Planning* | Llama3.1-8B | 33.3 | 33.2 / 32.6 | 32.1 / 32.0 | 31.4 | 32.3 |