Update README.md
README.md
CHANGED
@@ -1,3 +1,27 @@
+
+# PKU-Baichuan-MLSystemLab
+Welcome to the GitHub repository of the Peking University ML System Lab - Baichuan Inc. Joint Laboratory.
+
+We are dedicated to advancing research in Data-Centric Machine Learning (DCML), Large Language Models (LLMs), and Machine Learning Systems (ML Systems).
+
+Our goal is to develop effective and efficient data preparation systems and algorithms that support and enhance the performance of machine learning models.
+
+## Newly Released Papers and Code
+
+🔥 2024/08/14 Llama3-PBM-Nova-70B Model is released! [🤗 Huggingface](https://huggingface.co/PKU-Baichuan-MLSystemLab/Llama3-PBM-Nova-70B)
+| Model                  | Arena-Hard | MixEval-Hard | Alpaca-Eval 2.0 |
+|------------------------|------------|--------------|-----------------|
+| GPT-4Turbo (04/09)     | 82.6%      | 62.6         | 55.0%           |
+| GPT-4o (05/13)         | 79.2%      | 64.7         | 57.5%           |
+| Gemini 1.5 Pro         | 72.0%      | 58.3         | -               |
+| Llama3-PBM-Nova-70B    | 74.5%      | 58.1         | 61.23%          |
+| Llama-3.1-70B-Instruct | 55.7%      | -            | 38.1%           |
+| Llama-3-70B-Instruct   | 46.6%      | 55.9         | 34.4%           |
+
+🔥 2024/08/07 PAS: Data-Efficient Plug-and-Play Prompt Augmentation System [Repo](https://github.com/PKU-Baichuan-MLSystemLab/PAS) [arXiv](https://arxiv.org/abs/2407.06027)
+🔥 2024/08/02 CFBench: A Comprehensive Constraints-Following Benchmark for LLMs [Repo](https://github.com/PKU-Baichuan-MLSystemLab/CFBench) [arXiv](https://arxiv.org/abs/2408.01122)
+
+
 ---
 title: README
 emoji: π