Instructions to use OpenMOSS-Team/SciJudge-30B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OpenMOSS-Team/SciJudge-30B with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="OpenMOSS-Team/SciJudge-30B") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("OpenMOSS-Team/SciJudge-30B") model = AutoModelForCausalLM.from_pretrained("OpenMOSS-Team/SciJudge-30B") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use OpenMOSS-Team/SciJudge-30B with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "OpenMOSS-Team/SciJudge-30B" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "OpenMOSS-Team/SciJudge-30B", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/OpenMOSS-Team/SciJudge-30B
- SGLang
How to use OpenMOSS-Team/SciJudge-30B with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "OpenMOSS-Team/SciJudge-30B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "OpenMOSS-Team/SciJudge-30B", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "OpenMOSS-Team/SciJudge-30B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "OpenMOSS-Team/SciJudge-30B", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use OpenMOSS-Team/SciJudge-30B with Docker Model Runner:
docker model run hf.co/OpenMOSS-Team/SciJudge-30B
Update SciJudge-30B model card and link 2605 release
#2
by Mubuky - opened
README.md
CHANGED
|
@@ -1,32 +1,40 @@
|
|
| 1 |
---
|
| 2 |
language:
|
| 3 |
-
|
| 4 |
license: apache-2.0
|
| 5 |
base_model: Qwen/Qwen3-30B-A3B-Instruct-2507
|
|
|
|
|
|
|
| 6 |
tags:
|
| 7 |
-
|
| 8 |
-
|
| 9 |
-
- preference-learning
|
| 10 |
-
- GRPO
|
| 11 |
-
- moe
|
| 12 |
pipeline_tag: text-generation
|
| 13 |
library_name: transformers
|
| 14 |
---
|
| 15 |
|
| 16 |
-
# SciJudge-
|
| 17 |
|
| 18 |
-
|
| 19 |
|
| 20 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 21 |
|
| 22 |
## Usage
|
| 23 |
|
| 24 |
```python
|
|
|
|
| 25 |
from transformers import AutoModelForCausalLM, AutoTokenizer
|
| 26 |
|
| 27 |
model_name = "OpenMOSS-Team/SciJudge-30B"
|
| 28 |
tokenizer = AutoTokenizer.from_pretrained(model_name)
|
| 29 |
-
model = AutoModelForCausalLM.from_pretrained(
|
|
|
|
|
|
|
|
|
|
|
|
|
| 30 |
|
| 31 |
messages = [
|
| 32 |
{"role": "system", "content": "You are a helpful assistant. You first think about the reasoning process in your mind and then provide the user with the answer."},
|
|
@@ -42,20 +50,22 @@ print(response)
|
|
| 42 |
|
| 43 |
## Training Details
|
| 44 |
|
| 45 |
-
- **Base model:** Qwen3-30B-A3B-Instruct-2507
|
| 46 |
-
- **Training method:** GRPO
|
| 47 |
-
- **
|
| 48 |
-
- **Learning rate:** 8e-7 (cosine schedule, 5% warmup)
|
| 49 |
-
- **Micro batch size:** 8, global batch size: 1024
|
| 50 |
-
- **Optimizer:** Adam (with CPU offload)
|
| 51 |
- **Precision:** bfloat16
|
| 52 |
-
- **KL coefficient
|
| 53 |
|
| 54 |
## Citation
|
| 55 |
|
| 56 |
```bibtex
|
| 57 |
-
@
|
| 58 |
-
|
| 59 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 60 |
}
|
| 61 |
```
|
|
|
|
| 1 |
---
|
| 2 |
language:
|
| 3 |
+
- en
|
| 4 |
license: apache-2.0
|
| 5 |
base_model: Qwen/Qwen3-30B-A3B-Instruct-2507
|
| 6 |
+
datasets:
|
| 7 |
+
- OpenMOSS-Team/SciJudgeBench
|
| 8 |
tags:
|
| 9 |
+
- scientific-taste
|
| 10 |
+
- GRPO
|
|
|
|
|
|
|
|
|
|
| 11 |
pipeline_tag: text-generation
|
| 12 |
library_name: transformers
|
| 13 |
---
|
| 14 |
|
| 15 |
+
# SciJudge-30B
|
| 16 |
|
| 17 |
+
> **Update:** A newer release is available at [SciJudge-30B-2605](https://huggingface.co/OpenMOSS-Team/SciJudge-30B-2605). We recommend using the newer release for current experiments and comparisons.
|
| 18 |
|
| 19 |
+
SciJudge-30B is a Qwen3-30B-A3B-Instruct-2507 MoE model fine-tuned for scientific paper evaluation. Given two papers' titles, abstracts, and publication dates, it predicts which paper has higher citation impact.
|
| 20 |
+
|
| 21 |
+
This model is part of [AI Can Learn Scientific Taste](https://arxiv.org/abs/2603.14473). The benchmark dataset is [SciJudgeBench](https://huggingface.co/datasets/OpenMOSS-Team/SciJudgeBench).
|
| 22 |
+
|
| 23 |
+
Resources: [Project page](https://tongjingqi.github.io/AI-Can-Learn-Scientific-Taste/) and [GitHub repository](https://github.com/tongjingqi/AI-Can-Learn-Scientific-Taste).
|
| 24 |
|
| 25 |
## Usage
|
| 26 |
|
| 27 |
```python
|
| 28 |
+
import torch
|
| 29 |
from transformers import AutoModelForCausalLM, AutoTokenizer
|
| 30 |
|
| 31 |
model_name = "OpenMOSS-Team/SciJudge-30B"
|
| 32 |
tokenizer = AutoTokenizer.from_pretrained(model_name)
|
| 33 |
+
model = AutoModelForCausalLM.from_pretrained(
|
| 34 |
+
model_name,
|
| 35 |
+
torch_dtype=torch.bfloat16,
|
| 36 |
+
device_map="auto",
|
| 37 |
+
)
|
| 38 |
|
| 39 |
messages = [
|
| 40 |
{"role": "system", "content": "You are a helpful assistant. You first think about the reasoning process in your mind and then provide the user with the answer."},
|
|
|
|
| 50 |
|
| 51 |
## Training Details
|
| 52 |
|
| 53 |
+
- **Base model:** [Qwen/Qwen3-30B-A3B-Instruct-2507](https://huggingface.co/Qwen/Qwen3-30B-A3B-Instruct-2507)
|
| 54 |
+
- **Training method:** GRPO with DAPO loss
|
| 55 |
+
- **Reward:** external preference reward for citation-based pairwise judgment
|
|
|
|
|
|
|
|
|
|
| 56 |
- **Precision:** bfloat16
|
| 57 |
+
- **KL coefficient:** 0.03
|
| 58 |
|
| 59 |
## Citation
|
| 60 |
|
| 61 |
```bibtex
|
| 62 |
+
@misc{tong2026ailearnscientifictaste,
|
| 63 |
+
title={AI Can Learn Scientific Taste},
|
| 64 |
+
author={Jingqi Tong and Mingzhe Li and Hangcheng Li and Yongzhuo Yang and Yurong Mou and Weijie Ma and Zhiheng Xi and Hongji Chen and Xiaoran Liu and Qinyuan Cheng and Ming Zhang and Qiguang Chen and Weifeng Ge and Qipeng Guo and Tianlei Ying and Tianxiang Sun and Yining Zheng and Xinchi Chen and Jun Zhao and Ning Ding and Xuanjing Huang and Yugang Jiang and Xipeng Qiu},
|
| 65 |
+
year={2026},
|
| 66 |
+
eprint={2603.14473},
|
| 67 |
+
archivePrefix={arXiv},
|
| 68 |
+
primaryClass={cs.CL},
|
| 69 |
+
url={https://arxiv.org/abs/2603.14473},
|
| 70 |
}
|
| 71 |
```
|