Instructions to use d2j666/competitorDescriptions-ds-mini with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use d2j666/competitorDescriptions-ds-mini with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="d2j666/competitorDescriptions-ds-mini")

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("d2j666/competitorDescriptions-ds-mini")
model = AutoModelForCausalLM.from_pretrained("d2j666/competitorDescriptions-ds-mini")

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use d2j666/competitorDescriptions-ds-mini with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "d2j666/competitorDescriptions-ds-mini"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "d2j666/competitorDescriptions-ds-mini",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker

docker model run hf.co/d2j666/competitorDescriptions-ds-mini

SGLang

How to use d2j666/competitorDescriptions-ds-mini with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "d2j666/competitorDescriptions-ds-mini" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "d2j666/competitorDescriptions-ds-mini",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "d2j666/competitorDescriptions-ds-mini" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "d2j666/competitorDescriptions-ds-mini",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Docker Model Runner
How to use d2j666/competitorDescriptions-ds-mini with Docker Model Runner:
```
docker model run hf.co/d2j666/competitorDescriptions-ds-mini
```

d2j666 commited on Jun 7, 2023

Commit

62d0b4e

1 Parent(s): c9c7ed9

Upload model

Browse files

Files changed (3) hide show

README.md +9 -7
config.json +1 -1
tf_model.h5 +1 -1

README.md CHANGED Viewed

@@ -14,9 +14,9 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 9.1595
-- Validation Loss: 8.9603
-- Epoch: 2
 ## Model description
@@ -35,16 +35,18 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'class_name': 'WarmUp', 'config': {'initial_learning_rate': 5e-05, 'decay_schedule_fn': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 5e-05, 'decay_steps': -985, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}, '__passive_serialization__': True}, 'warmup_steps': 1000, 'power': 1.0, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.01}
 - training_precision: float32
 ### Training results
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
-| 9.6041     | 9.5385          | 0     |
-| 9.4427     | 9.2623          | 1     |
-| 9.1595     | 8.9603          | 2     |
 ### Framework versions

 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 8.7799
+- Validation Loss: 8.6826
+- Epoch: 4
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'class_name': 'WarmUp', 'config': {'initial_learning_rate': 5e-05, 'decay_schedule_fn': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 5e-05, 'decay_steps': -987, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}, '__passive_serialization__': True}, 'warmup_steps': 1000, 'power': 1.0, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.01}
 - training_precision: float32
 ### Training results
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
+| 9.5975     | 9.5488          | 0     |
+| 9.4736     | 9.3392          | 1     |
+| 9.2465     | 9.0821          | 2     |
+| 8.9970     | 8.8573          | 3     |
+| 8.7799     | 8.6826          | 4     |
 ### Framework versions

config.json CHANGED Viewed

@@ -11,7 +11,7 @@
   "initializer_range": 0.02,
   "layer_norm_epsilon": 1e-05,
   "model_type": "gpt2",
-  "n_ctx": 60,
   "n_embd": 768,
   "n_head": 12,
   "n_inner": null,

   "initializer_range": 0.02,
   "layer_norm_epsilon": 1e-05,
   "model_type": "gpt2",
+  "n_ctx": 128,
   "n_embd": 768,
   "n_head": 12,
   "n_inner": null,

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:331aaf957039547b2cda484476edfa81ea1eb9faa9bfca61a3116a4c7088e689
 size 385107024

 version https://git-lfs.github.com/spec/v1
+oid sha256:6efd01ce8c035e17d8ad90a04dad10a42274c144352a4fb477eb09d49264f739
 size 385107024