Text Generation
Transformers
Safetensors
qwen2
llama-factory
full
Generated from Trainer
conversational
text-generation-inference
Instructions to use pepoo20/WordProblem with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use pepoo20/WordProblem with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="pepoo20/WordProblem") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("pepoo20/WordProblem") model = AutoModelForCausalLM.from_pretrained("pepoo20/WordProblem") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Inference
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use pepoo20/WordProblem with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "pepoo20/WordProblem" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "pepoo20/WordProblem", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/pepoo20/WordProblem
- SGLang
How to use pepoo20/WordProblem with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "pepoo20/WordProblem" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "pepoo20/WordProblem", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "pepoo20/WordProblem" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "pepoo20/WordProblem", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use pepoo20/WordProblem with Docker Model Runner:
docker model run hf.co/pepoo20/WordProblem
Training in progress, step 6000
Browse files- model.safetensors +1 -1
- trainer_log.jsonl +12 -0
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 3673690696
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ce470db20a3b3fa0e4c26072d5eaa59b030c4dd69694ce9ae9f5da06c6e277be
|
| 3 |
size 3673690696
|
trainer_log.jsonl
CHANGED
|
@@ -10,3 +10,15 @@
|
|
| 10 |
{"current_steps": 2700, "total_steps": 9120, "loss": 0.1797, "eval_loss": null, "predict_loss": null, "reward": null, "accuracy": null, "learning_rate": 4.238538385782601e-05, "epoch": 0.2960445163235657, "percentage": 29.61, "elapsed_time": "1:40:55", "remaining_time": "3:59:59"}
|
| 11 |
{"current_steps": 3000, "total_steps": 9120, "loss": 0.176, "eval_loss": null, "predict_loss": null, "reward": null, "accuracy": null, "learning_rate": 4.032123642522486e-05, "epoch": 0.32893835147062855, "percentage": 32.89, "elapsed_time": "1:51:07", "remaining_time": "3:46:41"}
|
| 12 |
{"current_steps": 3000, "total_steps": 9120, "loss": null, "eval_loss": 0.1760552078485489, "predict_loss": null, "reward": null, "accuracy": null, "learning_rate": null, "epoch": 0.32893835147062855, "percentage": 32.89, "elapsed_time": "1:51:07", "remaining_time": "3:46:41"}
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 10 |
{"current_steps": 2700, "total_steps": 9120, "loss": 0.1797, "eval_loss": null, "predict_loss": null, "reward": null, "accuracy": null, "learning_rate": 4.238538385782601e-05, "epoch": 0.2960445163235657, "percentage": 29.61, "elapsed_time": "1:40:55", "remaining_time": "3:59:59"}
|
| 11 |
{"current_steps": 3000, "total_steps": 9120, "loss": 0.176, "eval_loss": null, "predict_loss": null, "reward": null, "accuracy": null, "learning_rate": 4.032123642522486e-05, "epoch": 0.32893835147062855, "percentage": 32.89, "elapsed_time": "1:51:07", "remaining_time": "3:46:41"}
|
| 12 |
{"current_steps": 3000, "total_steps": 9120, "loss": null, "eval_loss": 0.1760552078485489, "predict_loss": null, "reward": null, "accuracy": null, "learning_rate": null, "epoch": 0.32893835147062855, "percentage": 32.89, "elapsed_time": "1:51:07", "remaining_time": "3:46:41"}
|
| 13 |
+
{"current_steps": 3300, "total_steps": 9120, "loss": 0.1791, "eval_loss": null, "predict_loss": null, "reward": null, "accuracy": null, "learning_rate": 3.8074115216771435e-05, "epoch": 0.3618321866176914, "percentage": 36.18, "elapsed_time": "2:04:22", "remaining_time": "3:39:21"}
|
| 14 |
+
{"current_steps": 3600, "total_steps": 9120, "loss": 0.1808, "eval_loss": null, "predict_loss": null, "reward": null, "accuracy": null, "learning_rate": 3.567085646427478e-05, "epoch": 0.39472602176475424, "percentage": 39.47, "elapsed_time": "2:14:33", "remaining_time": "3:26:19"}
|
| 15 |
+
{"current_steps": 3900, "total_steps": 9120, "loss": 0.1805, "eval_loss": null, "predict_loss": null, "reward": null, "accuracy": null, "learning_rate": 3.3140161071244915e-05, "epoch": 0.4276198569118171, "percentage": 42.76, "elapsed_time": "2:26:25", "remaining_time": "3:15:58"}
|
| 16 |
+
{"current_steps": 4200, "total_steps": 9120, "loss": 0.1738, "eval_loss": null, "predict_loss": null, "reward": null, "accuracy": null, "learning_rate": 3.05122518525215e-05, "epoch": 0.46051369205888, "percentage": 46.05, "elapsed_time": "2:36:32", "remaining_time": "3:03:22"}
|
| 17 |
+
{"current_steps": 4500, "total_steps": 9120, "loss": 0.1736, "eval_loss": null, "predict_loss": null, "reward": null, "accuracy": null, "learning_rate": 2.781851259848554e-05, "epoch": 0.49340752720594283, "percentage": 49.34, "elapsed_time": "2:48:26", "remaining_time": "2:52:55"}
|
| 18 |
+
{"current_steps": 4500, "total_steps": 9120, "loss": null, "eval_loss": 0.17090687155723572, "predict_loss": null, "reward": null, "accuracy": null, "learning_rate": null, "epoch": 0.49340752720594283, "percentage": 49.34, "elapsed_time": "2:48:26", "remaining_time": "2:52:55"}
|
| 19 |
+
{"current_steps": 4800, "total_steps": 9120, "loss": 0.1709, "eval_loss": null, "predict_loss": null, "reward": null, "accuracy": null, "learning_rate": 2.509111327432736e-05, "epoch": 0.5263013623530056, "percentage": 52.63, "elapsed_time": "2:59:53", "remaining_time": "2:41:53"}
|
| 20 |
+
{"current_steps": 5100, "total_steps": 9120, "loss": 0.1775, "eval_loss": null, "predict_loss": null, "reward": null, "accuracy": null, "learning_rate": 2.236262583042668e-05, "epoch": 0.5591951975000685, "percentage": 55.92, "elapsed_time": "3:11:41", "remaining_time": "2:31:05"}
|
| 21 |
+
{"current_steps": 5400, "total_steps": 9120, "loss": 0.1759, "eval_loss": null, "predict_loss": null, "reward": null, "accuracy": null, "learning_rate": 1.966563521202681e-05, "epoch": 0.5920890326471314, "percentage": 59.21, "elapsed_time": "3:21:50", "remaining_time": "2:19:02"}
|
| 22 |
+
{"current_steps": 5700, "total_steps": 9120, "loss": 0.1754, "eval_loss": null, "predict_loss": null, "reward": null, "accuracy": null, "learning_rate": 1.7032350213717874e-05, "epoch": 0.6249828677941942, "percentage": 62.5, "elapsed_time": "3:33:38", "remaining_time": "2:08:11"}
|
| 23 |
+
{"current_steps": 6000, "total_steps": 9120, "loss": 0.1688, "eval_loss": null, "predict_loss": null, "reward": null, "accuracy": null, "learning_rate": 1.4494218826096939e-05, "epoch": 0.6578767029412571, "percentage": 65.79, "elapsed_time": "3:43:45", "remaining_time": "1:56:21"}
|
| 24 |
+
{"current_steps": 6000, "total_steps": 9120, "loss": null, "eval_loss": 0.16823573410511017, "predict_loss": null, "reward": null, "accuracy": null, "learning_rate": null, "epoch": 0.6578767029412571, "percentage": 65.79, "elapsed_time": "3:43:45", "remaining_time": "1:56:21"}
|