nightknocker
/

cosmos-bert

Model card Files Files and versions

nightknocker commited on 2 days ago

Commit

2205ec7

·

verified ·

1 Parent(s): 72a0e8d

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -37,9 +37,9 @@ It was trained on both T5 (text) and the [AnimaTextToImagePipeline](https://hugg
 ## Subject-Focused Attention
-In an SVO sentence structure, CLIPs focus too much on the subject, text encoders are undertrained for certain verbs and cannot reliably identify the object's position.
-This repo is an experiment to address these issues. The spatial knowledge is explicitly encoded, so the attention modules are not overwhelmed by the task.
 ## Inference

 ## Subject-Focused Attention
+- In an SVO sentence structure, CLIPs focus too much on the subject, text encoders are undertrained for certain verbs and cannot reliably identify the object's position.
+- This repo is an experiment to address these issues. The spatial knowledge is explicitly encoded, so the attention modules are not overwhelmed by the task.
 ## Inference