Update README.md
Browse files
README.md
CHANGED
|
@@ -37,9 +37,9 @@ It was trained on both T5 (text) and the [AnimaTextToImagePipeline](https://hugg
|
|
| 37 |
|
| 38 |
## Subject-Focused Attention
|
| 39 |
|
| 40 |
-
In an SVO sentence structure, CLIPs focus too much on the subject, text encoders are undertrained for certain verbs and cannot reliably identify the object's position.
|
| 41 |
|
| 42 |
-
This repo is an experiment to address these issues. The spatial knowledge is explicitly encoded, so the attention modules are not overwhelmed by the task.
|
| 43 |
|
| 44 |
## Inference
|
| 45 |
|
|
|
|
| 37 |
|
| 38 |
## Subject-Focused Attention
|
| 39 |
|
| 40 |
+
- In an SVO sentence structure, CLIPs focus too much on the subject, text encoders are undertrained for certain verbs and cannot reliably identify the object's position.
|
| 41 |
|
| 42 |
+
- This repo is an experiment to address these issues. The spatial knowledge is explicitly encoded, so the attention modules are not overwhelmed by the task.
|
| 43 |
|
| 44 |
## Inference
|
| 45 |
|