Instructions to use cs-552-2026-the-transformers/math_model with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use cs-552-2026-the-transformers/math_model with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="cs-552-2026-the-transformers/math_model") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("cs-552-2026-the-transformers/math_model") model = AutoModelForCausalLM.from_pretrained("cs-552-2026-the-transformers/math_model") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use cs-552-2026-the-transformers/math_model with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "cs-552-2026-the-transformers/math_model" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "cs-552-2026-the-transformers/math_model", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/cs-552-2026-the-transformers/math_model
- SGLang
How to use cs-552-2026-the-transformers/math_model with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "cs-552-2026-the-transformers/math_model" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "cs-552-2026-the-transformers/math_model", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "cs-552-2026-the-transformers/math_model" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "cs-552-2026-the-transformers/math_model", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use cs-552-2026-the-transformers/math_model with Docker Model Runner:
docker model run hf.co/cs-552-2026-the-transformers/math_model
Automated MNLP evaluation report (2026-06-03)
#5
by zechen-nlp - opened
- EVAL_REPORT.md +41 -59
EVAL_REPORT.md
CHANGED
|
@@ -2,7 +2,7 @@
|
|
| 2 |
|
| 3 |
- **Model repo:** [`cs-552-2026-the-transformers/math_model`](https://huggingface.co/cs-552-2026-the-transformers/math_model)
|
| 4 |
- **Owner(s):** group **the-transformers**
|
| 5 |
-
- **Generated at:** 2026-06-
|
| 6 |
- **Pipeline:** [mnlp-project-ci](https://github.com/eric11eca/mnlp-project-ci)
|
| 7 |
|
| 8 |
_This PR is opened automatically by the course CI. It is **non-blocking** — you do not need to merge it. The next nightly run will refresh this file._
|
|
@@ -11,7 +11,7 @@ _This PR is opened automatically by the course CI. It is **non-blocking** — yo
|
|
| 11 |
|
| 12 |
| Benchmark | Accuracy | Status |
|
| 13 |
|---|---:|---|
|
| 14 |
-
| Math | 0.
|
| 15 |
| Knowledge | — | not run |
|
| 16 |
| Multilingual | — | not run |
|
| 17 |
| Safety | — | not run |
|
|
@@ -24,88 +24,70 @@ _Prompts are intentionally omitted to avoid revealing benchmark contents. For mu
|
|
| 24 |
|
| 25 |
**Correct** (1 shown)
|
| 26 |
|
| 27 |
-
- **reference**: `
|
| 28 |
-
- **overall** (
|
| 29 |
-
- **extracted** (✓): `
|
| 30 |
- **completion**:
|
| 31 |
|
| 32 |
```text
|
| 33 |
<think>
|
| 34 |
-
|
| 35 |
-
- Given that $XYZ$ is an isosceles triangle with $XY = XZ$, the circumcenter $P$ lies on the perpendicular bisector of $YZ$.
|
| 36 |
-
- $Q$ is the midpoint of $YZ$, and $R$ is the reflection of $Q$ across $XZ$.
|
| 37 |
-
- $S$ is chosen such that $XQYS$ forms a rectangle.
|
| 38 |
|
| 39 |
-
2.
|
| 40 |
-
- Since $XQYS$ is a rectangle, $\angle XQY = 90^\circ$.
|
| 41 |
-
- Because $Q$ is the midpoint of $YZ$, $QY = QZ$.
|
| 42 |
-
- Since $XQYS$ is a rectangle, $\angle XQS = 90^\circ$.
|
| 43 |
-
- Given $\angle YXZ = a$, we have $\angle QXZ = \frac{a}{2}$ because $Q$ is the midpoint of $YZ$.
|
| 44 |
|
| 45 |
-
|
| 46 |
-
- Since $R$ is the reflection of $Q$ across $XZ$, $\angle QXR = \angle QXZ = \frac{a}{2}$.
|
| 47 |
-
- Therefore, $\angle QXR = \frac{a}{2}$.
|
| 48 |
|
| 49 |
-
|
| 50 |
-
|
| 51 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 52 |
|
| 53 |
-
### Conclusion:
|
| 54 |
\[
|
| 55 |
</think>
|
| 56 |
|
| 57 |
-
\boxed{
|
| 58 |
\]
|
| 59 |
```
|
| 60 |
|
| 61 |
**Incorrect** (1 shown)
|
| 62 |
|
| 63 |
-
- **reference**: `
|
| 64 |
- **overall** (0/8 completions correct)
|
| 65 |
-
- **extracted** (✗): `
|
| 66 |
- **completion**:
|
| 67 |
|
| 68 |
```text
|
| 69 |
<think>
|
| 70 |
-
To solve
|
| 71 |
|
| 72 |
-
.
|
| 73 |
-
|
| 74 |
-
|
| 75 |
-
|
| 76 |
-
|
| 77 |
-
AC^2 = 11^2 + 19^2 = 121 + 361 = 482
|
| 78 |
-
\]
|
| 79 |
-
|
| 80 |
-
227. **Calculate the squared distance between the centers of the first and third balloons:**
|
| 81 |
-
Using the Pythagorean theorem again, we have:
|
| 82 |
-
\[
|
| 83 |
-
AC^2 = 11^2 + 19^2 = 121 + 361 = 482
|
| 84 |
-
\]
|
| 85 |
|
| 86 |
-
|
| 87 |
-
Using the Pythagorean theorem again, we have:
|
| 88 |
-
\[
|
| 89 |
-
AC^2 = 11^2 + 19^2 = 121 + 361 = 482
|
| 90 |
-
\]
|
| 91 |
|
| 92 |
-
|
| 93 |
-
Using the Pythagorean theorem again, we have:
|
| 94 |
-
\[
|
| 95 |
-
AC^2 = 11^2 + 19^2 = 121 + 361 = 482
|
| 96 |
-
\]
|
| 97 |
|
| 98 |
-
|
| 99 |
-
|
| 100 |
-
|
| 101 |
-
|
| 102 |
-
|
|
|
|
|
|
|
|
|
|
| 103 |
|
| 104 |
-
|
| 105 |
-
|
| 106 |
-
|
| 107 |
-
AC^2 = 11^2 + 19^2 = 121 + 361 = 482
|
| 108 |
-
\]
|
| 109 |
|
| 110 |
-
|
|
|
|
| 111 |
```
|
|
|
|
| 2 |
|
| 3 |
- **Model repo:** [`cs-552-2026-the-transformers/math_model`](https://huggingface.co/cs-552-2026-the-transformers/math_model)
|
| 4 |
- **Owner(s):** group **the-transformers**
|
| 5 |
+
- **Generated at:** 2026-06-03T08:04:32+00:00 (UTC)
|
| 6 |
- **Pipeline:** [mnlp-project-ci](https://github.com/eric11eca/mnlp-project-ci)
|
| 7 |
|
| 8 |
_This PR is opened automatically by the course CI. It is **non-blocking** — you do not need to merge it. The next nightly run will refresh this file._
|
|
|
|
| 11 |
|
| 12 |
| Benchmark | Accuracy | Status |
|
| 13 |
|---|---:|---|
|
| 14 |
+
| Math | 0.3700 | ok |
|
| 15 |
| Knowledge | — | not run |
|
| 16 |
| Multilingual | — | not run |
|
| 17 |
| Safety | — | not run |
|
|
|
|
| 24 |
|
| 25 |
**Correct** (1 shown)
|
| 26 |
|
| 27 |
+
- **reference**: `194`
|
| 28 |
+
- **overall** (2/8 completions correct)
|
| 29 |
+
- **extracted** (✓): `194`
|
| 30 |
- **completion**:
|
| 31 |
|
| 32 |
```text
|
| 33 |
<think>
|
| 34 |
+
To solve this problem, we need to understand the constraints and the implications of marking cells on a $194 \times 194$ board using dominoes.
|
|
|
|
|
|
|
|
|
|
| 35 |
|
| 36 |
+
First, note that a $194 \times 194$ board has an even number of cells, specifically $194^2 = 37636$ cells. Each $1 \times 2$ or $2 \times 1$ domino covers exactly 2 cells, so the number of dominoes that can fit on the board is $37636 / 2 = 18818$ d
|
|
|
|
|
|
|
|
|
|
|
|
|
| 37 |
|
| 38 |
+
... [truncated 261 chars] ...
|
|
|
|
|
|
|
| 39 |
|
| 40 |
+
er of marked cells is equal to the number of dominoes, which is 18818.
|
| 41 |
+
|
| 42 |
+
However, we need to ensure that the marked cells are arranged in such a way that they do not interfere with the tiling. If we mark a cell in a domino, we must ensure that no other domino can overlap with it in a way that would require marking another cell. This means that the marked cells must be placed in a way that they do not interfere with the tiling, i.e., no two marked cells can be in the same row or column.
|
| 43 |
+
|
| 44 |
+
To achieve this, we can place a marked cell in each row and each column, ensuring that no two marked cells are in the same row or column. This is similar to placing rooks on a chessboard such that no two rooks are in the same row or column. The number of such placements is the number of permutations of the columns for the rows, which is $194!$.
|
| 45 |
+
|
| 46 |
+
Therefore, the maximum number of marked cells such that no two are in the same row or column is 194.
|
| 47 |
+
|
| 48 |
+
Hence, the least positive integer $k$ for which one can choose $k$ cells to mark so that there is exactly one tiling of the board by $1 \times 2$ and $2 \times 1$ dominoes in which every domino contains at most one marked cell is:
|
| 49 |
|
|
|
|
| 50 |
\[
|
| 51 |
</think>
|
| 52 |
|
| 53 |
+
\boxed{194}
|
| 54 |
\]
|
| 55 |
```
|
| 56 |
|
| 57 |
**Incorrect** (1 shown)
|
| 58 |
|
| 59 |
+
- **reference**: `3069`
|
| 60 |
- **overall** (0/8 completions correct)
|
| 61 |
+
- **extracted** (✗): `99500`
|
| 62 |
- **completion**:
|
| 63 |
|
| 64 |
```text
|
| 65 |
<think>
|
| 66 |
+
To solve the problem, we need to understand what a digiroot is and how to find all such numbers less than 1000.
|
| 67 |
|
| 68 |
+
First, let's understand what it means for a number \(n\) to be a digiroot. A number \(n\) is a digiroot if the sum of its digits in base-2 (\(s_2(n)\)) is the square root of the sum of its digits in base-4 (\(s_4(n)\)). This means:
|
| 69 |
+
\[
|
| 70 |
+
s_2(n) = \sqrt{s_4(n)}
|
| 71 |
+
\]
|
| 72 |
+
Squaring both side
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 73 |
|
| 74 |
+
... [truncated 822 chars] ...
|
|
|
|
|
|
|
|
|
|
|
|
|
| 75 |
|
| 76 |
+
719, 721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823, 825, 827, 829, 831, 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 933, 935, 937, 939, 941, 943, 945, 947, 949, 951, 953, 955, 957, 959, 961, 963, 965, 967, 969, 971, 973, 975, 977, 979, 981, 983, 985, 987, 989, 991, 993, 995, 997, 999.
|
|
|
|
|
|
|
|
|
|
|
|
|
| 77 |
|
| 78 |
+
Now, we need to sum all these numbers. The sum of an arithmetic series is given by:
|
| 79 |
+
\[
|
| 80 |
+
\text{Sum} = \frac{n}{2} \times (\text{first term} + \text{last term})
|
| 81 |
+
\]
|
| 82 |
+
where \(n\) is the number of terms. Here, \(n = 199\) (since there are 199 numbers from 1 to 999), the first term is 1, and the last term is 999. Therefore, the sum is:
|
| 83 |
+
\[
|
| 84 |
+
\text{Sum} = \frac{199}{2} \times (1 + 999) = 199 \times 500 = 99500
|
| 85 |
+
\]
|
| 86 |
|
| 87 |
+
Thus, the sum of all digiroots that are less than 1000 is:
|
| 88 |
+
\[
|
| 89 |
+
</think>
|
|
|
|
|
|
|
| 90 |
|
| 91 |
+
\boxed{99500}
|
| 92 |
+
\]
|
| 93 |
```
|