lyhisme committed on
Commit 1d43211 · verified · 1 Parent(s): e5d9030

Update README.md

Files changed (1)
  1. README.md +30 -4
README.md CHANGED
@@ -1,10 +1,36 @@
  ---
- title: README
- emoji: 😻
- colorFrom: red
+ title: ASID-Caption
+ emoji: 🦉
+ colorFrom: indigo
  colorTo: gray
  sdk: static
  pinned: false
  ---
 
- Edit this `README.md` markdown file to author your organization card.
+ # ASID-Caption
+
+ We build **ASID-Caption**, a data-and-model suite for **fine-grained audiovisual video understanding**.
+
+ Our goal is to move beyond “one video → one generic caption” by providing **attribute-structured supervision** and **quality-verified annotations**, enabling models to produce **more complete, more controllable, and more temporally consistent** descriptions that cover both **visual content** and **audio cues**.
+
+ ## What we release
+
+ - **ASID-1M**: a large-scale collection of **attribute-structured** audiovisual instructions with both *single-attribute* and *all-attributes* training formats.
+ - **ASID-Verify**: a scalable curation pipeline that generates, ensembles, verifies, and refines annotations to improve semantic and temporal consistency.
+ - **ASID-Captioner**: Qwen2.5-Omni-based audiovisual captioning models fine-tuned on ASID-1M.
+
+ ## Research interests
+
+ - Video understanding & video captioning
+ - Audio-visual learning
+ - Multimodal LLMs / instruction tuning
+ - Data curation, verification, and quality control
+
+ ## Links
+
+ - **Dataset (ASID-1M):** https://huggingface.co/datasets/AudioVisual-Caption/ASID-1M
+ - **Models (ASID-Captioner):** https://huggingface.co/AudioVisual-Caption
+
+ ## Contact
+
+ For questions, issues, or takedown requests, please open a **Discussion** under the corresponding dataset/model page.
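
The card mentions *single-attribute* and *all-attributes* training formats without showing what a sample looks like. The sketch below is one possible reading, not the actual ASID-1M schema: the attribute names (`scene`, `speech`, `sound_events`), the record fields (`video`, `instruction`, `response`), the prompt wording, and the example descriptions are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List


def single_attribute_samples(video_id: str, attributes: Dict[str, str]) -> List[dict]:
    """Build one instruction/response pair per attribute (hypothetical format)."""
    return [
        {
            "video": video_id,
            "instruction": f"Describe the {name} of this video.",
            "response": text,
        }
        for name, text in attributes.items()
    ]


def all_attributes_sample(video_id: str, attributes: Dict[str, str]) -> dict:
    """Build a single pair whose response covers every attribute (hypothetical format)."""
    response = "\n".join(f"{name}: {text}" for name, text in attributes.items())
    return {
        "video": video_id,
        "instruction": "Describe this video, covering every attribute.",
        "response": response,
    }


# Made-up annotations for one clip; real attribute names and texts may differ.
example = {
    "scene": "A kitchen with a window on the left.",
    "speech": "A man explains how to chop onions.",
    "sound_events": "A knife tapping on a cutting board.",
}

singles = single_attribute_samples("video_0001", example)
combined = all_attributes_sample("video_0001", example)
```

Under this reading, the single-attribute samples would let a model be prompted for one aspect at a time, while the all-attributes sample asks for a complete description in one response. Please check the dataset page for the real field names before relying on any of this.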