Working models
updated
Pretrain-FBK-NLP/Llama-3.2-1B_AllDataSourcesClinical_0.0002_constant_512_paper
Text Generation
• 1B • Updated
Pretrain-FBK-NLP/Llama-3.2-1B_AllDataSourcesClinical_0.0002_cosine_512_paper
Text Generation
• 1B • Updated • 1
Pretrain-FBK-NLP/mt5-large_AllDataSourcesClinical_0.0002_constant_512_paper
Updated
Pretrain-FBK-NLP/Llama-3.2-1B_AllDataSourcesClinical_0.0002_constant_1024_paper
Text Generation
• 1B • Updated • 1
Pretrain-FBK-NLP/Llama-3.2-1B_AllDataSourcesClinical_0.0002_cosine_1024_paper
Text Generation
• 1B • Updated
Pretrain-FBK-NLP/mt5-large_AllDataSourcesClinical_0.0002_constant_1024_paper
1B • Updated
Pretrain-FBK-NLP/mt5-large_AllDataSourcesClinical_0.0002_cosine_1024_paper
1B • Updated
Pretrain-FBK-NLP/Llama-3.2-1B-Instruct_AllDataSourcesClinical_0.0002_cosine_1024_paper
Text Generation
• 1B • Updated
Pretrain-FBK-NLP/gemma-3-1b-pt_AllDataSourcesClinical_0.0002_cosine_1024_paper
Text Generation
• 1.0B • Updated
Pretrain-FBK-NLP/gemma-3-1b-it_AllDataSourcesClinical_0.0002_cosine_1024_paper
Text Generation
• 1.0B • Updated • 3
Pretrain-FBK-NLP/Qwen3-1.7B_AllDataSourcesClinical_0.0002_cosine_1024_paper
Text Generation
• 2B • Updated • 1