Spaces:
Configuration error
Configuration error
Update README.md
Browse files
README.md
CHANGED
|
@@ -7,15 +7,6 @@ Our goal is to develop effective and efficient data preparation systems and algo
|
|
| 7 |
## Newly Released Papers and Code
|
| 8 |
|
| 9 |
π₯ 2024/08/14 Llama3-PBM-Nova-70B Model is released! [π€ Huggingface](https://huggingface.co/PKU-Baichuan-MLSystemLab/Llama3-PBM-Nova-70B)
|
| 10 |
-
| Model | Arena-Hard | MixEval-Hard | Alpaca-Eval 2.0 |
|
| 11 |
-
|------------------------|------------|--------------|-----------------|
|
| 12 |
-
| GPT-4Turbo (04/09) | 82.6% | 62.6 | 55.0% |
|
| 13 |
-
| GPT-4o (05/13) | 79.2% | 64.7 | 57.5% |
|
| 14 |
-
| Gemini 1.5 Pro | 72.0% | 58.3 | - |
|
| 15 |
-
| Llama3-PBM-Nova-70B | 74.5% | 58.1 | 61.23% |
|
| 16 |
-
| Llama-3.1-70B-Instruct | 55.7% | - | 38.1% |
|
| 17 |
-
| Llama-3-70B-Instruct | 46.6% | 55.9 | 34.4% |
|
| 18 |
-
|
| 19 |
π₯ 2024/08/07 PAS: Data-Efficient Plug-and-Play Prompt Augmentation System [π΄ Repo](https://github.com/PKU-Baichuan-MLSystemLab/PAS) [π² arXiv](https://arxiv.org/abs/2407.06027)
|
| 20 |
π₯ 2024/08/02 CFBench: A Comprehensive Constraints-Following Benchmark for LLMs [π΄ Repo](https://github.com/PKU-Baichuan-MLSystemLab/CFBench) [π² arXiv](https://arxiv.org/abs/2408.01122)
|
| 21 |
|
|
|
|
| 7 |
## Newly Released Papers and Code
|
| 8 |
|
| 9 |
π₯ 2024/08/14 Llama3-PBM-Nova-70B Model is released! [π€ Huggingface](https://huggingface.co/PKU-Baichuan-MLSystemLab/Llama3-PBM-Nova-70B)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 10 |
π₯ 2024/08/07 PAS: Data-Efficient Plug-and-Play Prompt Augmentation System [π΄ Repo](https://github.com/PKU-Baichuan-MLSystemLab/PAS) [π² arXiv](https://arxiv.org/abs/2407.06027)
|
| 11 |
π₯ 2024/08/02 CFBench: A Comprehensive Constraints-Following Benchmark for LLMs [π΄ Repo](https://github.com/PKU-Baichuan-MLSystemLab/CFBench) [π² arXiv](https://arxiv.org/abs/2408.01122)
|
| 12 |
|