Agent_Course_Final_Assignment

Sleeping

giulia-fontanella commited on Jun 4, 2025

Commit

e205ec9

verified ·

1 Parent(s): 29bec31

Update agent.py

Files changed (1) hide show

agent.py CHANGED Viewed

@@ -57,7 +57,7 @@ class BasicAgent():
             Extract text from an image file using a multimodal model.
             Args:
-                img_path: A local image file path (strings).
             Returns:
                 A single string containing the concatenated text extracted from each image.
@@ -73,13 +73,13 @@ class BasicAgent():
         describe_image(img_path: str, query: str) -> str:
             Generate a detailed description of an image using a multimodal model.
-            This function reads a local image file, encodes it, and sends it to a
             vision-capable language model to obtain a comprehensive, natural language
             description of the image's content, including its objects, actions, and context,
             following a specific query.
             Args:
-                img_path: A string path to a local image file (e.g., PNG, JPEG).
                 query: Information to extract from the image
             Returns:

             Extract text from an image file using a multimodal model.
             Args:
+                img_path: A url pointing to an image (e.g., PNG, JPEG).
             Returns:
                 A single string containing the concatenated text extracted from each image.
         describe_image(img_path: str, query: str) -> str:
             Generate a detailed description of an image using a multimodal model.
+            This function reads a image from an url, encodes it, and sends it to a
             vision-capable language model to obtain a comprehensive, natural language
             description of the image's content, including its objects, actions, and context,
             following a specific query.
             Args:
+                img_path: A url pointing to an image (e.g., PNG, JPEG).
                 query: Information to extract from the image
             Returns: