Transcribe audio or video to text
test
Identify objects in images using DETR models
Greet someone by name