recipe-ner-model / README.md

Upload recipe NER model

151e2ee verified 5 months ago

4.6 kB

	---
	language: en
	tags:
	- ner
	- named-entity-recognition
	- recipes
	- food
	- cooking
	- token-classification
	license: mit
	datasets:
	- TASTEset
	---

	# Recipe NER Model

	This is a Named Entity Recognition (NER) model trained to extract entities from recipe text. The model can identify food ingredients, quantities, units, cooking processes, and physical qualities from recipe instructions.

	## Model Details

	### Model Description
	- Model type: Token Classification (NER)
	- Base model: ModernBERT-base
	- Training data: TASTEset (recipe NER dataset)
	- Number of labels: Unknown

	### Entity Types
	The model recognizes the following entity types:

	- FOOD: Food items and ingredients (e.g., "chicken breast", "olive oil")
	- QUANTITY: Numerical quantities (e.g., "2", "1/2", "three")
	- UNIT: Measurement units (e.g., "tablespoons", "cups", "pounds")
	- PROCESS: Cooking processes and methods (e.g., "heat", "stir", "bake")
	- PHYSICAL_QUALITY: Physical qualities and descriptors (e.g., "hot", "chopped", "fresh")

	### Label Format
	The model uses BIO (Beginning-Inside-Outside) tagging scheme:
	- `B-ENTITY`: Beginning of an entity
	- `I-ENTITY`: Inside/continuation of an entity
	- `O`: Outside any entity (not an entity)

	## Usage

	### Basic Usage with Transformers

	```python
	from transformers import AutoTokenizer, AutoModelForTokenClassification
	import torch

	# Load model and tokenizer
	model_name = "rgonzale/recipe-ner-model"
	tokenizer = AutoTokenizer.from_pretrained(model_name)
	model = AutoModelForTokenClassification.from_pretrained(model_name)

	# Example text
	text = "Heat 2 tablespoons olive oil in a large pan"

	# Tokenize
	tokens = tokenizer(text, return_tensors="pt", truncation=True)

	# Make prediction
	with torch.no_grad():
	outputs = model(**tokens)

	# Get predictions
	predictions = torch.argmax(outputs.logits, dim=-1)
	predicted_labels = [model.config.id2label[pred.item()] for pred in predictions[0]]

	print("Tokens:", tokenizer.convert_ids_to_tokens(tokens["input_ids"][0]))
	print("Labels:", predicted_labels)
	```

	### Usage with Pipeline

	```python
	from transformers import pipeline

	# Load NER pipeline
	ner_pipeline = pipeline("ner", model="rgonzale/recipe-ner-model", aggregation_strategy="simple")

	# Example text
	text = "Mix 1 cup flour with 2 eggs and bake at 350 degrees for 25 minutes"

	# Get entities
	entities = ner_pipeline(text)
	for entity in entities:
	print(f"{entity['word']}: {entity['entity_group']} (confidence: {entity['score']:.2f})")
	```

	### Command Line Usage

	```bash
	# Quick inference
	uv run python scripts/infer_ner.py --text "Heat 2 tablespoons olive oil in a large pan"

	# Interactive mode
	uv run python scripts/infer_ner.py --interactive
	```

	## Training Details

	### Training Data
	- Dataset: TASTEset - A specialized dataset for recipe NER
	- Task: Named Entity Recognition for recipes
	- Preprocessing: Simple tokenization with whitespace/punctuation splitting

	### Training Parameters
	- Learning rate: 3e-5
	- Batch size: 8 (train), 16 (eval)
	- Max sequence length: 512 tokens
	- Optimizer: AdamW with weight decay
	- Scheduler: Cosine annealing

	## Evaluation

	### Performance Metrics
	The model was evaluated on the TASTEset validation set with the following results:

	- Sequence-level F1: 0.90
	- Token-level F1: 0.90
	- Entity-level metrics:
	- FOOD: F1 = 0.85
	- QUANTITY: F1 = 0.97
	- UNIT: F1 = 0.97
	- PROCESS: F1 = 0.81
	- PHYSICAL_QUALITY: F1 = 0.79

	### Example Predictions

	```
	Input: "Heat 2 tablespoons olive oil in a large pan over medium heat"

	Predicted entities:
	- QUANTITY: 2
	- UNIT: tablespoons
	- FOOD: olive oil
	- PROCESS: Heat
	```

	## Limitations

	- The model may not recognize very uncommon ingredients or measurements
	- Performance is optimized for recipe text similar to the TASTEset training data
	- The model uses simple tokenization and may not handle complex punctuation well

	## Ethical Considerations

	This model is intended for research and educational purposes. Users should be aware that:
	- The model's predictions may not always be accurate
	- Food-related predictions should not be used for dietary or health decisions
	- The training data may contain biases present in recipe datasets

	## Citation

	If you use this model, please cite:

	```bibtex
	@misc{recipe-ner-model,
	title={Recipe NER Model},
	author={Generated by Recipe Archaeology Project},
	year={2024},
	howpublished={\url{https://huggingface.co/rgonzale/recipe-ner-model}},
	}
	```

	## Contact

	For questions or feedback about this model, please create an issue on the model's repository.