Rising Tide
Collection
This collection contains all the GRPO-trained models for our paper "A Rising Tide Lifts All Boats". Please consider citing us!
•
17 items
•
Updated
•
1
Hindi-to-English translation model trained with GRPO with MTQE rewards. Works well on idiom translation, non-idiomatic translation, and for other languages as well.
from vllm import LLM, SamplingParams
sampling_params = SamplingParams(temperature=0.3, max_tokens=512)
llm = LLM('ishikaa/Hindi_llama8b-da', tensor_parallel_size=torch.cuda.device_count(), gpu_memory_utilization=0.8, trust_remote_code=True)
idiom = "" # your Hindi idiom
prompt = f"Concisely translate the idiom {idiom} semantically into English: "
output = llm.generate(prompt, sampling_params=sampling_params)
print(output.outputs[0].text)
For more information, read here: https://www.arxiv.org/abs/2601.06307
@misc{agarwal2026risingtideliftsboats,
title={A Rising Tide Lifts All Boats: MTQE Rewards for Idioms Improve General Translation Quality},
author={Ishika Agarwal and Zhenlin He and Dhruva Patil and Dilek Hakkani-Tür},
year={2026},
eprint={2601.06307},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2601.06307},
}
Base model
meta-llama/Llama-3.1-8B