 
Acc = Accuracy, Ma = Macro-F1, W = Weighted-F1. Bold indicates best performance for the dataset's official evaluation metric.

## Supported Tasks

The model supports 20 meme understanding tasks across multiple categories:

| Category | Tasks |
|---|---|
| **Harm** | Hateful, Harmful, Toxic, Abusive, Vulgar |
| **Targets** | Target identification, Misogyny, Misogyny Categories, Objectification, Shaming, Stereotype, Violence |
| **Figurative/Pragmatic** | Propaganda, Metaphor, Intention, Sarcasm, Deepfake, Political |
| **Affect** | Humor, Offensive, Motivational, Sentiment |

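For programmatic use, each task's closed label set can be kept in a lookup table. A minimal sketch covering a few of the tasks, assuming the label strings given in the per-task prompt examples later in this card (`TASK_LABELS` and `is_valid_label` are hypothetical helpers, not part of the model's API):

```python
# Hypothetical lookup: task name -> closed label set.
# Label strings follow the per-task prompts shown in this model card.
TASK_LABELS = {
    "hateful": ["hateful", "not-hateful"],
    "misogyny": ["misogynous", "not-misogynous"],
    "propaganda": ["propaganda", "not-propaganda"],
    "toxic": ["toxic", "not-toxic"],
    "humor": ["not-funny", "funny", "very-funny", "hilarious"],
    "abusive": ["abusive", "not-abusive"],
}

def is_valid_label(task: str, label: str) -> bool:
    """Check whether a (normalized) predicted label belongs to the task's label set."""
    return label.strip().lower() in TASK_LABELS.get(task, [])
```

A table like this makes it easy to validate model outputs before scoring them.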
## Usage

```python
from transformers import Qwen3VLForConditionalGeneration, AutoProcessor
from PIL import Image

# Load the fine-tuned model and its processor
model = Qwen3VLForConditionalGeneration.from_pretrained("QCRI/MemeLens-VLM", torch_dtype="auto", device_map="auto")
processor = AutoProcessor.from_pretrained("QCRI/MemeLens-VLM")

image = Image.open("path/to/meme.jpg")

messages = [
    {"role": "system", "content": "You are an expert social media image analyzer specializing in identifying hateful content in memes."},
    {"role": "user", "content": [
        {"type": "image"},
        {"type": "text", "text": (
            "Analyze the content of the meme to see if it promotes or displays hateful views against "
            "specific individuals or groups. Look for discriminatory, prejudiced, or hate speech elements "
            "associated with protected characteristics, including race, religion, and gender. "
            "Classify as: 'hateful' if present, or 'not-hateful' if absent.\n\n"
            "IMPORTANT: Your response must strictly follow this format:\n"
            "'Label: <label>\nExplanation: <explanation>'\n"
            "where <label> is your classification and <explanation> provides a brief justification "
            "for your decision based on the visual and textual content.\n\n"
            "Text extracted from meme: <OCR text here>"
        )}
    ]}
]

text = processor.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = processor(text=[text], images=[image], padding=True, return_tensors="pt").to(model.device)

output_ids = model.generate(**inputs, max_new_tokens=512)
# Decode only the newly generated tokens (skip the prompt)
output_text = processor.batch_decode(output_ids[:, inputs.input_ids.shape[1]:], skip_special_tokens=True)[0]
print(output_text)
```

## Prompt Format

The model uses a **system prompt + user instruction** format. Each task has a specific system prompt and instruction template.

**Output format** (all tasks):
```
Label: <label>
Explanation: <explanation>
```
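Since every task emits the same two-line `Label:` / `Explanation:` schema, responses can be split into structured fields with a small parser. A minimal sketch (`parse_response` is a hypothetical helper, not part of this repository):

```python
import re

def parse_response(text: str):
    """Split a response of the form 'Label: <label>\nExplanation: <explanation>'.

    Returns (label, explanation); label is None if the format was not followed.
    """
    match = re.search(
        r"Label:\s*(?P<label>[^\n]+)\nExplanation:\s*(?P<explanation>.+)",
        text,
        re.DOTALL,  # explanations may span multiple lines
    )
    if match is None:
        return None, text.strip()
    return match.group("label").strip(), match.group("explanation").strip()
```

Checking for a `None` label gives a simple way to detect and retry malformed generations.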
 
### Example Prompts by Task

<details>
<summary><b>Hateful Content Detection (EN)</b></summary>

**System:** `You are an expert social media image analyzer specializing in identifying hateful content in memes.`

**User:** `<image> Analyze the content of the meme to see if it promotes or displays hateful views against specific individuals or groups. Look for discriminatory, prejudiced, or hate speech elements associated with protected characteristics, including race, religion, and gender. Classify as: 'hateful' if present, or 'not-hateful' if absent.`

**Labels:** `hateful`, `not-hateful`

**Example Output:**
```
Label: hateful
Explanation: This meme is labeled as hateful because the text explicitly expresses animosity
toward a specific group. The phrase targets a collective group, which aligns with the rubric's
definition of hateful content as language that promotes hostility or discrimination.
```
</details>

<details>
<summary><b>Misogyny Detection (EN)</b></summary>

**System:** `You are an expert social media image analyzer specializing in identifying misogynistic content in memes.`

**User:** `<image> Scrutinize the meme to detect any indication of misogynous content, including aspects like prejudice, discrimination, objectification, or aversion towards women. Label the meme 'misogynous' when identified, or 'not-misogynous' if none are observed.`

**Labels:** `misogynous`, `not-misogynous`
</details>

<details>
<summary><b>Propaganda Detection (AR)</b></summary>

**System:** `You are an expert social media analyst specializing in identifying propaganda techniques in memes.`

**User:** `<image> Identify if the given Arabic meme is an example of propaganda by checking for biased or misleading narratives intended to affect political or ideological perspectives. Select: 'propaganda' or 'not-propaganda' as appropriate.`

**Labels:** `propaganda`, `not-propaganda`
</details>

<details>
<summary><b>Toxicity Detection (RU)</b></summary>

**System:** `You are an expert social media image analyzer specializing in identifying toxic content in memes.`

**User:** `<image> Evaluate the Russian meme for the presence of toxicity. This involves identifying any language or imagery that is damaging, disrespectful, or offensive to individuals or groups. Based on your assessment, classify as: 'toxic' or 'not-toxic'.`

**Labels:** `toxic`, `not-toxic`
</details>

<details>
<summary><b>Humor Classification (EN)</b></summary>

**System:** `You are an expert social media image analyzer specializing in emotion recognition in memes.`

**User:** `<image> Classify the level of humor in the meme. Analyze how funny the content is based on its comedic elements. Classify as: 'not-funny' for unfunny content, 'funny' for mildly humorous, 'very-funny' for highly humorous, or 'hilarious' for extremely funny content.`

**Labels:** `not-funny`, `funny`, `very-funny`, `hilarious`
</details>

<details>
<summary><b>Abuse Detection (BN)</b></summary>

**System:** `You are an expert social media image analyzer specializing in identifying abusive content in memes.`

**User:** `<image> Examine the Bengali meme and decide whether it features abusive content. This includes any form of verbal abuse or personal offense. Classify as: 'abusive' or 'not-abusive'.`

**Labels:** `abusive`, `not-abusive`
</details>

> **Note:** All user prompts should end with: `IMPORTANT: Your response must strictly follow this format: 'Label: <label>\nExplanation: <explanation>' where <label> is your classification and <explanation> provides a brief justification for your decision based on the visual and textual content.` followed by `Text extracted from meme: <OCR text>`

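Following the note above, a full user prompt can be assembled from a task instruction, the shared format suffix, and the OCR text. A minimal sketch (`build_user_prompt` and `FORMAT_SUFFIX` are hypothetical names, not part of this repository):

```python
# Shared suffix appended to every task instruction, per the note above.
FORMAT_SUFFIX = (
    "IMPORTANT: Your response must strictly follow this format: "
    "'Label: <label>\nExplanation: <explanation>' where <label> is your classification "
    "and <explanation> provides a brief justification for your decision based on the "
    "visual and textual content."
)

def build_user_prompt(instruction: str, ocr_text: str) -> str:
    """Assemble a full user prompt: task instruction + format suffix + OCR text."""
    return f"{instruction}\n\n{FORMAT_SUFFIX}\n\nText extracted from meme: {ocr_text}"
```

The resulting string is what goes into the `"text"` entry of the user message in the Usage example.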
## Citation

```bibtex