Instructions to use cs-552-2026-the-transformers/math_model with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use cs-552-2026-the-transformers/math_model with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="cs-552-2026-the-transformers/math_model") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("cs-552-2026-the-transformers/math_model") model = AutoModelForCausalLM.from_pretrained("cs-552-2026-the-transformers/math_model") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use cs-552-2026-the-transformers/math_model with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "cs-552-2026-the-transformers/math_model" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "cs-552-2026-the-transformers/math_model", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/cs-552-2026-the-transformers/math_model
- SGLang
How to use cs-552-2026-the-transformers/math_model with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "cs-552-2026-the-transformers/math_model" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "cs-552-2026-the-transformers/math_model", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "cs-552-2026-the-transformers/math_model" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "cs-552-2026-the-transformers/math_model", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use cs-552-2026-the-transformers/math_model with Docker Model Runner:
docker model run hf.co/cs-552-2026-the-transformers/math_model
Automated MNLP evaluation report (2026-05-23)
#1
by zechen-nlp - opened
- EVAL_REPORT.md +87 -0
EVAL_REPORT.md
ADDED
|
@@ -0,0 +1,87 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Automated MNLP evaluation report
|
| 2 |
+
|
| 3 |
+
- **Model repo:** [`cs-552-2026-the-transformers/math_model`](https://huggingface.co/cs-552-2026-the-transformers/math_model)
|
| 4 |
+
- **Owner(s):** group **the-transformers**
|
| 5 |
+
- **Generated at:** 2026-05-23T14:37:35+00:00 (UTC)
|
| 6 |
+
- **Pipeline:** [mnlp-project-ci](https://github.com/eric11eca/mnlp-project-ci)
|
| 7 |
+
|
| 8 |
+
_This PR is opened automatically by the course CI. It is **non-blocking** — you do not need to merge it. The next nightly run will refresh this file._
|
| 9 |
+
|
| 10 |
+
## Summary
|
| 11 |
+
|
| 12 |
+
| Benchmark | Accuracy | Status |
|
| 13 |
+
|---|---:|---|
|
| 14 |
+
| Math | 0.3600 | ok |
|
| 15 |
+
| Knowledge | — | not run |
|
| 16 |
+
| Multilingual | — | not run |
|
| 17 |
+
| Safety | — | not run |
|
| 18 |
+
|
| 19 |
+
## Sample completions
|
| 20 |
+
|
| 21 |
+
_Prompts are intentionally omitted to avoid revealing benchmark contents. For multi-completion problems, only one completion is shown per sample._
|
| 22 |
+
|
| 23 |
+
### Math
|
| 24 |
+
|
| 25 |
+
**Correct** (1 shown)
|
| 26 |
+
|
| 27 |
+
- **reference**: `194`
|
| 28 |
+
- **overall** (3/8 completions correct)
|
| 29 |
+
- **extracted** (✓): `194`
|
| 30 |
+
- **completion**:
|
| 31 |
+
|
| 32 |
+
```text
|
| 33 |
+
<think>
|
| 34 |
+
To solve this problem, we need to understand the conditions under which a \(194 \times 194\) board can be tiled with \(1 \times 2\) and \(2 \times 1\) dominoes, and how marking cells affects this tiling.
|
| 35 |
+
|
| 36 |
+
First, we note that a \(194 \times 194\) board has an area of \(194^2 = 37636\) cells. Each domino covers exactly 2 cells, so the number of dominoes required to tile the board is \(37636
|
| 37 |
+
|
| 38 |
+
... [truncated 147 chars] ...
|
| 39 |
+
|
| 40 |
+
ontains at most one marked cell. This means that the marked cells can be placed in such a way that no two marked cells are in the same domino.
|
| 41 |
+
|
| 42 |
+
To find the minimum number of marked cells, we need to consider the maximum number of dominoes that can be placed such that no two marked cells are in the same domino. This is equivalent to placing the marked cells in such a way that they are not in the same row or column, as dominoes span two rows and two columns.
|
| 43 |
+
|
| 44 |
+
If we place a marked cell in each row and each column, we can ensure that no two marked cells are in the same row or column. This is similar to placing non-attacking rooks on a chessboard, where no two rooks attack each other.
|
| 45 |
+
|
| 46 |
+
For a \(194 \times 194\) board, the maximum number of non-attacking rooks we can place is 194 (one per row and one per column). Therefore, the minimum number of marked cells required to ensure that no two marked cells are in the same row or column is 194.
|
| 47 |
+
|
| 48 |
+
Thus, the least positive integer \(k\) for which one can choose \(k\) cells to mark so that there is exactly one tiling of the board by \(1 \times 2\) and \(2 \times 1\) dominoes in which every domino contains at most one marked cell is \(\boxed{194}\).
|
| 49 |
+
```
|
| 50 |
+
|
| 51 |
+
**Incorrect** (1 shown)
|
| 52 |
+
|
| 53 |
+
- **reference**: `$g(x)=c, g(x)=\lceil x \rceil, g(x)=\lfloor x \rfloor$`
|
| 54 |
+
- **overall** (0/8 completions correct)
|
| 55 |
+
- **extracted** (✗): `h(x) = c \text{ for all } x \in \mathbb{Q} \text{ and some } c \in \mathbb{Z}`
|
| 56 |
+
- **completion**:
|
| 57 |
+
|
| 58 |
+
```text
|
| 59 |
+
<think>
|
| 60 |
+
Given the problem, we need to find all mappings $h:\mathbb{Q}\to\mathbb{Z}$ such that for every $x\in\mathbb{Q}$, every $a\in\mathbb{Z}$, and every positive integer $b$, the following equation holds:
|
| 61 |
+
\[
|
| 62 |
+
h(x)=h\!\left(\frac{h(bx-a)+a}{b}\right).
|
| 63 |
+
\]
|
| 64 |
+
|
| 65 |
+
First, we can simplify the given equation by setting $x=0$. Substituting $x=0$ into the equation, we get:
|
| 66 |
+
\[
|
| 67 |
+
h(0) = h\!\left(\frac{h(0-a)+a}{b}
|
| 68 |
+
|
| 69 |
+
... [truncated 371 chars] ...
|
| 70 |
+
|
| 71 |
+
r all $a$ and $b$. Therefore, $h$ must be constant on the set of all rational numbers. Let's denote this constant value by $c$. So, we have:
|
| 72 |
+
\[
|
| 73 |
+
h(x) = c \quad \text{for all} \quad x \in \mathbb{Q}.
|
| 74 |
+
\]
|
| 75 |
+
Now, we need to check if this constant function satisfies the given equation. Substituting $h(x) = c$ into the original equation, we get:
|
| 76 |
+
\[
|
| 77 |
+
c = c \quad \text{which is true for all} \quad c \in \mathbb{Z}.
|
| 78 |
+
\]
|
| 79 |
+
Thus, the constant function $h(x) = c$ for all $x \in \mathbb{Q}$ is a solution. To confirm that this is the only solution, we need to ensure that no other functions can satisfy the given equation. Suppose there exists a non-constant function $h$ that satisfies the equation. Then, there must exist some $x$ for which $h(x) \neq c$. However, the condition $h(x) = h\!\left(\frac{h(bx-a)+a}{b}\right)$ must hold for all $x$, $a$, and $b$, and since $h$ is constant on all rational numbers, this condition can only be satisfied if $h$ is constant. Therefore, the only possible solution is the constant function $h(x) = c$ for all $x \in \mathbb{Q}$.
|
| 80 |
+
|
| 81 |
+
Hence, the solution to the problem is:
|
| 82 |
+
\[
|
| 83 |
+
</think>
|
| 84 |
+
|
| 85 |
+
\boxed{h(x) = c \text{ for all } x \in \mathbb{Q} \text{ and some } c \in \mathbb{Z}}.
|
| 86 |
+
\]
|
| 87 |
+
```
|