Improve model card: add metadata, paper and GitHub links

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +15 -3
README.md CHANGED
@@ -1,7 +1,18 @@
 
 
 
 
 
 
1
  # AuroLA
2
- This repo contains the checkpoint of the following paper:
3
 
4
- **Scaling Audio-Text Retrieval with Multimodal Large Language Model**
 
 
 
 
 
 
5
 
6
  ## Quick Start
7
 
@@ -97,7 +108,8 @@ text_messages = [
97
  [
98
  {
99
  "role": "user",
100
- "content": [{"type": "text", "text": f"{t}\nSummarize above sentence in one word:"}],
 
101
  },
102
  {
103
  "role": "assistant",
 
1
+ ---
2
+ license: apache-2.0
3
+ library_name: transformers
4
+ pipeline_tag: feature-extraction
5
+ ---
6
+
7
  # AuroLA
 
8
 
9
+ This repository contains the checkpoint of the following paper:
10
+
11
+ **[Scaling Audio-Text Retrieval with Multimodal Large Language Models](https://huggingface.co/papers/2602.18010)**
12
+
13
+ AuroLA is a novel contrastive language-audio pre-training framework that re-purposes Multimodal Large Language Models (MLLMs) as a unified backbone for retrieval. It utilizes a scalable data pipeline, adapts an MLLM for retrieval by prompting it to summarize audio/text inputs, and uses a bidirectional re-ranking module for refined cross-modal interaction.
14
+
15
+ Code: [https://github.com/Jazzcharles/AuroLA](https://github.com/Jazzcharles/AuroLA)
16
 
17
  ## Quick Start
18
 
 
108
  [
109
  {
110
  "role": "user",
111
+ "content": [{"type": "text", "text": f"{t}
112
+ Summarize above sentence in one word:"}],
113
  },
114
  {
115
  "role": "assistant",