Query about Base Model

#1
by Tangchiu - opened

Hi LeDissolution,

I found your model StatSuite_G2B_Alpha wonderful! I’m looking into the training recipe of this model to better understand its behavioral alignment.

Is the base model ibm-granite/granite-3.1-2b-base? If so, could you share some of the hyperparameters used during training?

Would you mind sharing some insights here, or perhaps providing an email address for a more detailed technical discussion?

Thanks for your contribution to the community!
