microsoft
/

phi-1_5

Text Generation

text-generation-inference

Model card Files Files and versions

Resources

View closed (87)

TMT: dynamic graph attention beats Mamba on WikiText-2 at 48% compute — open source

#96 opened 17 days ago by

Prompt for GSM8K evaluations

#91 opened over 1 year ago by

Phi 1.5 Instruct: an instruction following Phi 1.5 model that has undergone SFT and DPO

#89 opened almost 2 years ago by

Regarding the '/n' output

#87 opened about 2 years ago by

[AUTOMATED] Model Memory Requirements

#86 opened about 2 years ago by

model-sizer-bot

The training time mentioned in the paper and the explanations in the Git repository have a significant gap.

#77 opened over 2 years ago by

How to get model architecture/parameter names from the previous version

#76 opened over 2 years ago by