niruthiha
/

roberta-large-mit-restaurant-ner

Token Classification

named-entity-recognition

Model card Files Files and versions

roberta-large-mit-restaurant-ner / README.md

niruthiha's picture

Upload folder using huggingface_hub

df8519c verified 9 months ago

|

history blame contribute delete

2.18 kB

	---
	language: en
	tags:
	- token-classification
	- ner
	- named-entity-recognition
	- roberta
	- restaurant
	- mit-restaurant
	datasets:
	- mit_restaurant
	metrics:
	- f1
	- precision
	- recall
	- accuracy
	widget:
	- text: "I want a reservation at an italian restaurant with outdoor seating"
	example_title: "Restaurant query"
	- text: "Find me a cheap chinese place near downtown"
	example_title: "Restaurant search"
	---

	# RoBERTa Large for MIT Restaurant NER

	This model is a fine-tuned version of RoBERTa Large on the MIT Restaurant dataset for Named Entity Recognition (NER).

	## Model Description

	- Model type: Token Classification (NER)
	- Base model: roberta-large
	- Dataset: MIT Restaurant NER dataset
	- Languages: English
	- Task: Named Entity Recognition for restaurant domain

	## Entity Types

	The model can identify the following entity types:
	['O', 'B-Amenity', 'I-Amenity', 'B-Cuisine', 'I-Cuisine', 'B-Dish', 'I-Dish', 'B-Hours', 'I-Hours', 'B-Location', 'I-Location', 'B-Price', 'I-Price', 'B-Rating', 'I-Rating', 'B-Restaurant_Name', 'I-Restaurant_Name']

	## Usage

	```python
	from transformers import AutoTokenizer, AutoModelForTokenClassification, pipeline

	tokenizer = AutoTokenizer.from_pretrained("niruthiha/roberta-large-mit-restaurant-ner")
	model = AutoModelForTokenClassification.from_pretrained("niruthiha/roberta-large-mit-restaurant-ner")

	# Using pipeline
	nlp = pipeline("ner", model=model, tokenizer=tokenizer, aggregation_strategy="simple")
	result = nlp("I want a reservation at an italian restaurant with outdoor seating")
	print(result)

	# Manual usage
	inputs = tokenizer("I want a reservation at an italian restaurant", return_tensors="pt")
	outputs = model(**inputs)
	```

	## Training Details

	- Fine-tuned on MIT Restaurant NER dataset
	- Training epochs: 5
	- Learning rate: 1e-5
	- Batch size: 16
	- Gradient accumulation steps: 2

	## Dataset

	The MIT Restaurant dataset contains restaurant-related queries with entity annotations.
	Dataset source: https://groups.csail.mit.edu/sls/downloads/restaurant/

	## Performance

	The model achieves good performance on restaurant domain NER tasks. Specific metrics will be updated after evaluation.