---
library_name: onnx
tags:
- text-generation
- gpt2
- onnx
- inference4j
license: mit
pipeline_tag: text-generation
---

# GPT-2 — ONNX

ONNX export of [GPT-2](https://huggingface.co/openai-community/gpt2) (124M parameters) with KV cache support for efficient autoregressive generation.

Converted for use with [inference4j](https://github.com/inference4j/inference4j), an inference-only AI library for Java.
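
The KV cache is what keeps per-token cost flat during generation: each decode step feeds only the newest token and reuses the cached key/value tensors for the prefix, rather than re-encoding the whole sequence every step. A minimal toy sketch of that cost difference (plain Java, no GPT-2 or ONNX involved; the `cache` list merely stands in for cached K/V tensors):

```java
import java.util.ArrayList;
import java.util.List;

public class KvCacheSketch {
    // Without a cache: every step re-processes the full prefix (length t),
    // so generating n tokens costs 1 + 2 + ... + n token passes.
    static long costWithoutCache(int n) {
        long cost = 0;
        for (int t = 1; t <= n; t++) {
            cost += t;
        }
        return cost;
    }

    // With a cache: each step encodes only the new token and appends its
    // key/value to the cache, so generating n tokens costs n token passes.
    static long costWithCache(int n) {
        List<float[]> cache = new ArrayList<>(); // stand-in for cached K/V tensors
        long cost = 0;
        for (int t = 1; t <= n; t++) {
            cache.add(new float[]{t}); // append this step's "key/value"
            cost += 1;                 // only the newest token is processed
        }
        return cost;
    }

    public static void main(String[] args) {
        System.out.println(costWithoutCache(100)); // 5050 token passes
        System.out.println(costWithCache(100));    // 100 token passes
    }
}
```

For 100 generated tokens the cache-free loop performs 5050 token passes versus 100 with the cache, which is why the export enables it.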

## Original Source

- **Repository:** [openai-community/gpt2](https://huggingface.co/openai-community/gpt2)
- **License:** MIT

## Usage with inference4j

```java
try (Gpt2TextGenerator gen = Gpt2TextGenerator.builder().build()) {
    GenerationResult result = gen.generate("Once upon a time");
    System.out.println(result.text());
}
```

## Model Details

| Property | Value |
|----------|-------|
| Architecture | GPT-2 (124M parameters, 12 layers, 768 hidden size, 12 heads) |
| Task | Text generation |
| Context length | 1024 tokens |
| Vocabulary | 50,257 tokens (BPE) |
| Original framework | PyTorch (transformers) |
| Export method | Hugging Face Optimum (with KV cache) |

## License

This model is licensed under the [MIT License](https://opensource.org/licenses/MIT). Original model by [OpenAI](https://openai.com/).