shaqaqio
/

my_model

Model card Files Files and versions

my_model / README.md

shaqaqio's picture

Update README.md

52fe3e5 verified 3 months ago

|

history blame contribute delete

1.68 kB

	# Arabic End-of-Utterance (EOU) Classifier

	## Overview
	This repository contains a custom PyTorch model for End-of-Utterance (EOU) detection in Arabic conversational text.
	The model predicts whether a given text segment represents the end of a speaker’s turn.

	This is a custom architecture (not a Hugging Face `AutoModel`) and is intended for research and development use.

	---

	## Task
	Given an input text segment, the model outputs a binary prediction:

	- `0` → The speaker is expected to continue speaking
	- `1` → The speaker has finished their turn

	---

	## Model Details
	- Framework: PyTorch
	- Architecture: Custom `EOUClassifier`
	- Task: Binary classification (EOU detection)
	- Language: Arabic

	---

	## Tokenizer
	This model uses the tokenizer from:

	`Omartificial-Intelligence-Space/SA-BERT-V1`

	The tokenizer is not included in this repository and must be loaded separately.

	---

	## Files
	- `model.py` — Model architecture (`EOUClassifier`)
	- `model.pt` — Trained model weights
	- `config.json` — Model configuration
	- `README.md` — This file

	---

	## Loading the Model
	```python
	import torch
	from transformers import AutoTokenizer
	from model import EOUClassifier

	tokenizer = AutoTokenizer.from_pretrained(
	"Omartificial-Intelligence-Space/SA-BERT-V1"
	)

	model = EOUClassifier()
	model.load_state_dict(
	torch.load("model.pt", map_location="cpu")
	)
	model.eval()

	examples = ["مقصدي من الموضوع انه", "اتمنى تقدر تساعدني"]


	batch = tokenizer(examples, padding=True, truncation=True, return_tensors="pt")
	batch.to(device)

	out = model(batch["input_ids"], batch["attention_mask"])
	```

	## license

	MIT