Query about Base Model
#1
by
Tangchiu - opened
Hi LeDissolution,
I found your model StatSuite_G2B_Alpha wonderful! I’m looking into the training recipe of this model to better understand its behavioral alignment.
Is the base model ibm-granite/granite-3.1-2b-base and could some hyperparameters during training be shared?
Would you mind sharing some insights here, or perhaps providing an email address for a more detailed technical discussion?
Thanks for your contribution to the community!