AnnaStudy
/

LEXA-8B

Model card Files Files and versions

LEXA-8B / README.md

AnnaStudy's picture

Upload model

ace1ce9 verified 2 months ago

|

history blame contribute delete

1.4 kB

	---
	license: apache-2.0
	language:
	- en
	base_model:
	- Qwen/Qwen3-Embedding-8B
	---
	## LEXA-8B
	👉 LEXA-8B: LEXA: Legal Case Retrieval via Graph Contrastive Learning with Contextualised LLM Embeddings. More information is available in [arXiv](https://arxiv.org/abs/2405.11791) & [GitHub](https://github.com/yanran-tang/CaseGNN).

	## Example Usage

	```python
	from transformers import AutoModel, AutoTokenizer
	model = AutoModel.from_pretrained("AnnaStudy/LEXA-8B", torch_dtype="auto", device_map="auto")
	tokenizer = AutoTokenizer.from_pretrained("AnnaStudy/LEXA-8B")
	case_txt = "The following contains key components of a legal case. Legal facts..."
	tokenized = tokenizer(case_txt, return_tensors='pt', padding=True, truncation=True, max_length=2048)
	outputs = model(**tokenized)
	case_embedding = outputs.last_hidden_state[:, -1]
	```
	## Base Model

	ReaKase-8B is finetuned from Qwen3-Embedding-8B, which provides the underlying semantic representation capability.

	Reference: [Qwen/Qwen3-Embedding-8B](https://huggingface.co/Qwen/Qwen3-Embedding-8B)

	## Cite
	If you find this repo useful, please cite
	```
	@article
	{LEXA-8B,
	author = {Yanran Tang, Ruihong Qiu, Xue Li, Zi Huang},
	title = {LEXA: Legal Case Retrieval via Graph Contrastive Learning with Contextualised LLM Embeddings},
	journal = {CoRR},
	volume = {abs/2405.11791},
	year = {2025}
	}
	```