Improve model card: Add pipeline tag, paper, and project page links
This PR improves the model card for the `Unbabel/Tower-Plus-2B` model by:
- Adding the `pipeline_tag: text-generation` to the metadata, which enhances discoverability on the Hugging Face Hub (e.g., at https://huggingface.co/models?pipeline_tag=text-generation); a programmatic sketch of this filtering follows this list.
- Including a direct link to the research paper "[Tower+: Bridging Generality and Translation Specialization in Multilingual LLMs](https://huggingface.co/papers/2506.17080)", providing users with easy access to its technical details and findings.
- Adding a link to the official project page ([Unbabel Tower+ Collection](https://huggingface.co/collections/Unbabel/tower-plus-6846ca452a10c0905dc03c0f)), offering further context and related resources for the Tower+ models.
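
To illustrate the discoverability point above (this is not part of the PR itself), here is a minimal sketch of how pipeline-tag filtered listings can be queried with the `huggingface_hub` client; the `search` string is only an illustrative filter, and a reasonably recent `huggingface_hub` release is assumed:

```python
# Hypothetical discoverability check, assuming a huggingface_hub version
# that supports the `pipeline_tag` filter on list_models.
from huggingface_hub import HfApi

api = HfApi()
# This mirrors the Hub search URL above: models carrying the tag appear in
# pipeline-tag filtered listings once the metadata change is merged.
for model in api.list_models(pipeline_tag="text-generation", search="Tower-Plus"):
    print(model.id, model.pipeline_tag)
```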
README.md CHANGED

```diff
@@ -1,6 +1,5 @@
 ---
 base_model: google/gemma-2-2B
-license: cc-by-nc-sa-4.0
 language:
 - de
 - nl
@@ -25,8 +24,14 @@ language:
 - ro
 - fi
 library_name: transformers
+license: cc-by-nc-sa-4.0
+pipeline_tag: text-generation
 ---
 
+This repository contains the model presented in the paper [Tower+: Bridging Generality and Translation Specialization in Multilingual LLMs](https://huggingface.co/papers/2506.17080).
+
+You can find the official project page and other related models in the [Unbabel Tower+ Collection](https://huggingface.co/collections/Unbabel/tower-plus-6846ca452a10c0905dc03c0f).
+
 [image]
 
 # Model Description:
@@ -71,7 +76,9 @@ sampling_params = SamplingParams(
     max_tokens=8192,
 )
 llm = LLM(model="Unbabel/Tower-Plus-2B", tensor_parallel_size=1)
-messages = [{"role": "user", "content": "Translate the following English source text to Portuguese (Portugal):\nEnglish: Hello world!\nPortuguese (Portugal): "}]
+messages = [{"role": "user", "content": "Translate the following English source text to Portuguese (Portugal):
+English: Hello world!
+Portuguese (Portugal): "}]
 outputs = llm.chat(messages, sampling_params)
 # Make sure your prompt_token_ids look like this
 print (outputs[0].outputs[0].text)
@@ -89,7 +96,9 @@ from transformers import pipeline
 
 pipe = pipeline("text-generation", model="Unbabel/Tower-Plus-2B", device_map="auto")
 # We use the tokenizer’s chat template to format each message - see https://huggingface.co/docs/transformers/main/en/chat_templating
-messages = [{"role": "user", "content": "Translate the following English source text to Portuguese (Portugal):\nEnglish: Hello world!\nPortuguese (Portugal): "}]
+messages = [{"role": "user", "content": "Translate the following English source text to Portuguese (Portugal):
+English: Hello world!
+Portuguese (Portugal): "}]
 input_ids = pipe.tokenizer.apply_chat_template(messages, tokenize=True, add_generation_prompt=True)
 outputs = pipe(messages, max_new_tokens=256, do_sample=False)
 print(outputs[0]["generated_text"])
```
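
As a quick sanity check (not part of this PR), the new `pipeline_tag` should become visible through the Hub API once the change is merged. A minimal sketch using `huggingface_hub`:

```python
# Hypothetical verification script, not part of the model card itself.
from huggingface_hub import model_info

info = model_info("Unbabel/Tower-Plus-2B")
# After this PR is merged, the Hub should report the tag set in the front matter.
print(info.pipeline_tag)  # expected: "text-generation"
```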
|