How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="cstr/splade-pp-en-v1-GGUF",
	filename="",
)
output = llm(
	"Once upon a time,",
	max_tokens=512,
	echo=True
)
print(output)

Splade PP en v1 GGUF

GGUF format of prithivida/Splade_PP_en_v1 for use with CrispEmbed.

SPLADE sparse embedding model for efficient keyword-based retrieval. Produces sparse term-weight vectors over the vocabulary.

Files

File Quantization Size
splade-pp-en-v1.gguf F32 418 MB
splade-pp-en-v1-q8_0.gguf Q8_0 111 MB

Quick Start

from crispembed import CrispEmbed

model = CrispEmbed("splade-pp-en-v1.gguf")
sparse = model.encode_sparse("machine learning")
# {token_id: weight} โ€” top terms: machine(2.09), learning(1.63), ...

Credits

Downloads last month
249
GGUF
Model size
0.1B params
Architecture
bert
Hardware compatibility
Log In to add your hardware

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for cstr/splade-pp-en-v1-GGUF

Quantized
(1)
this model