Working models
updated
Pretrain-FBK-NLP/Llama-3.2-1B_AllDataSourcesClinical_0.0002_constant_512_paper
Text Generation
•
1B
•
Updated
•
2
Pretrain-FBK-NLP/Llama-3.2-1B_AllDataSourcesClinical_0.0002_cosine_512_paper
Text Generation
•
1B
•
Updated
•
2
Pretrain-FBK-NLP/mt5-large_AllDataSourcesClinical_0.0002_constant_512_paper
Updated
Pretrain-FBK-NLP/Llama-3.2-1B_AllDataSourcesClinical_0.0002_constant_1024_paper
Text Generation
•
1B
•
Updated
•
4
Pretrain-FBK-NLP/Llama-3.2-1B_AllDataSourcesClinical_0.0002_cosine_1024_paper
Text Generation
•
1B
•
Updated
•
2
Pretrain-FBK-NLP/mt5-large_AllDataSourcesClinical_0.0002_constant_1024_paper
1B
•
Updated
Pretrain-FBK-NLP/mt5-large_AllDataSourcesClinical_0.0002_cosine_1024_paper
1B
•
Updated
Pretrain-FBK-NLP/Llama-3.2-1B-Instruct_AllDataSourcesClinical_0.0002_cosine_1024_paper
Text Generation
•
1B
•
Updated
•
1
Pretrain-FBK-NLP/gemma-3-1b-pt_AllDataSourcesClinical_0.0002_cosine_1024_paper
Text Generation
•
1.0B
•
Updated
•
1
Pretrain-FBK-NLP/gemma-3-1b-it_AllDataSourcesClinical_0.0002_cosine_1024_paper
Text Generation
•
1.0B
•
Updated
•
1
Pretrain-FBK-NLP/Qwen3-1.7B_AllDataSourcesClinical_0.0002_cosine_1024_paper
Text Generation
•
2B
•
Updated
•
1