Instructions to use Kleva-ai/ItaLegalEmb with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use Kleva-ai/ItaLegalEmb with sentence-transformers:
from sentence_transformers import SentenceTransformer model = SentenceTransformer("Kleva-ai/ItaLegalEmb") sentences = [ "Questa è una persona felice", "Questo è un cane felice", "Questa è una persona molto felice", "Oggi è una giornata di sole" ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [4, 4] - Transformers
How to use Kleva-ai/ItaLegalEmb with Transformers:
# Load model directly from transformers import AutoTokenizer, AutoModel tokenizer = AutoTokenizer.from_pretrained("Kleva-ai/ItaLegalEmb") model = AutoModel.from_pretrained("Kleva-ai/ItaLegalEmb") - llama-cpp-python
How to use Kleva-ai/ItaLegalEmb with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="Kleva-ai/ItaLegalEmb", filename="ItaLegalEmb.gguf", )
output = llm( "Once upon a time,", max_tokens=512, echo=True ) print(output)
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use Kleva-ai/ItaLegalEmb with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf Kleva-ai/ItaLegalEmb # Run inference directly in the terminal: llama-cli -hf Kleva-ai/ItaLegalEmb
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf Kleva-ai/ItaLegalEmb # Run inference directly in the terminal: llama-cli -hf Kleva-ai/ItaLegalEmb
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf Kleva-ai/ItaLegalEmb # Run inference directly in the terminal: ./llama-cli -hf Kleva-ai/ItaLegalEmb
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf Kleva-ai/ItaLegalEmb # Run inference directly in the terminal: ./build/bin/llama-cli -hf Kleva-ai/ItaLegalEmb
Use Docker
docker model run hf.co/Kleva-ai/ItaLegalEmb
- LM Studio
- Jan
- Ollama
How to use Kleva-ai/ItaLegalEmb with Ollama:
ollama run hf.co/Kleva-ai/ItaLegalEmb
- Unsloth Studio new
How to use Kleva-ai/ItaLegalEmb with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for Kleva-ai/ItaLegalEmb to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for Kleva-ai/ItaLegalEmb to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for Kleva-ai/ItaLegalEmb to start chatting
- Docker Model Runner
How to use Kleva-ai/ItaLegalEmb with Docker Model Runner:
docker model run hf.co/Kleva-ai/ItaLegalEmb
- Lemonade
How to use Kleva-ai/ItaLegalEmb with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull Kleva-ai/ItaLegalEmb
Run and chat with the model
lemonade run user.ItaLegalEmb-{{QUANT_TAG}}List all available models
lemonade list
from sentence_transformers import SentenceTransformer
model = SentenceTransformer("Kleva-ai/ItaLegalEmb")
sentences = [
"Questa è una persona felice",
"Questo è un cane felice",
"Questa è una persona molto felice",
"Oggi è una giornata di sole"
]
embeddings = model.encode(sentences)
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [4, 4]ItaLegalEmb
This is a sentence-transformers model: It maps sentences & paragraphs to a 768 dimensional dense vector space and can be used for tasks like clustering or semantic search.
This model is a custom made version based on a Synthetic dataset created on top of Italian legal documents.
It shows promising results:
OpenAI (embedding-ada-002) : 0.793436
Italian BERT (nickprock/sentence-bert-base-italian-xxl-uncased) : 0.563707
ItaLegalEmb : 0.857143
Please note : any access request made using an organizational email address automatically grants us permission to list your organization as a user of our products and services on our website. If you do not agree with this policy, we ask that you refrain from requesting access to our materials.
Usage (Sentence-Transformers)
Using this model becomes easy when you have sentence-transformers installed:
pip install -U sentence-transformers
Then you can use the model like this:
from sentence_transformers import SentenceTransformer
sentences = ["This is an example sentence", "Each sentence is converted"]
model = SentenceTransformer('{MODEL_NAME}')
embeddings = model.encode(sentences)
print(embeddings)
Usage (HuggingFace Transformers)
Without sentence-transformers, you can use the model like this: First, you pass your input through the transformer model, then you have to apply the right pooling-operation on-top of the contextualized word embeddings.
from transformers import AutoTokenizer, AutoModel
import torch
#Mean Pooling - Take attention mask into account for correct averaging
def mean_pooling(model_output, attention_mask):
token_embeddings = model_output[0] #First element of model_output contains all token embeddings
input_mask_expanded = attention_mask.unsqueeze(-1).expand(token_embeddings.size()).float()
return torch.sum(token_embeddings * input_mask_expanded, 1) / torch.clamp(input_mask_expanded.sum(1), min=1e-9)
# Sentences we want sentence embeddings for
sentences = ['This is an example sentence', 'Each sentence is converted']
# Load model from HuggingFace Hub
tokenizer = AutoTokenizer.from_pretrained('{MODEL_NAME}')
model = AutoModel.from_pretrained('{MODEL_NAME}')
# Tokenize sentences
encoded_input = tokenizer(sentences, padding=True, truncation=True, return_tensors='pt')
# Compute token embeddings
with torch.no_grad():
model_output = model(**encoded_input)
# Perform pooling. In this case, mean pooling.
sentence_embeddings = mean_pooling(model_output, encoded_input['attention_mask'])
print("Sentence embeddings:")
print(sentence_embeddings)
Evaluation Results
We compare ItaLegalEmb against an Italian BERT model, as well as the OpenAI embedding model. We evaluated with InformationRetrievalEvaluator as well as a simple hit rate metric
Training
The model was trained with the parameters:
DataLoader:
torch.utils.data.dataloader.DataLoader of length 190 with parameters:
{'batch_size': 10, 'sampler': 'torch.utils.data.sampler.SequentialSampler', 'batch_sampler': 'torch.utils.data.sampler.BatchSampler'}
Loss:
sentence_transformers.losses.MultipleNegativesRankingLoss.MultipleNegativesRankingLoss with parameters:
{'scale': 20.0, 'similarity_fct': 'cos_sim'}
Parameters of the fit()-Method:
{
"epochs": 5,
"evaluation_steps": 50,
"evaluator": "sentence_transformers.evaluation.InformationRetrievalEvaluator.InformationRetrievalEvaluator",
"max_grad_norm": 1,
"optimizer_class": "<class 'torch.optim.adamw.AdamW'>",
"optimizer_params": {
"lr": 2e-05
},
"scheduler": "WarmupLinear",
"steps_per_epoch": null,
"warmup_steps": 95,
"weight_decay": 0.01
}
Full Model Architecture
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: BertModel
(1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False})
)
Citing & Authors
@misc{ItaLegalEmb,
title = {Obiactum/ItaLegalEmb: An embedding model fine-tuned on Italian legal documents.},
author = {Obiactum},
year = {2024},
publisher = {Obiactum},
journal = {HuggingFace repository},
howpublished = {\url{https://huggingface.co/Obiactum/ItaLegalEmb}},
}
- Downloads last month
- -
# Gated model: Login with a HF token with gated access permission hf auth login