Instructions to use susnato/phi-2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use susnato/phi-2 with Transformers:
```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="susnato/phi-2")
```
```python
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("susnato/phi-2")
model = AutoModelForCausalLM.from_pretrained("susnato/phi-2")
```
- Notebooks
- Google Colab
- Kaggle
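As a quick usage sketch for the Transformers snippet in the Libraries section above: greedy decoding echoes the prompt in the decoded text, so a small `strip_prompt` helper (a hypothetical name, not part of the Transformers API) is included; the generation call itself downloads the checkpoint, so it is kept behind a main guard.

```python
def strip_prompt(generated: str, prompt: str) -> str:
    """Drop the echoed prompt from a decoded generation, if present."""
    if generated.startswith(prompt):
        return generated[len(prompt):].lstrip()
    return generated

if __name__ == "__main__":
    # Requires `pip install transformers torch` and a network connection
    # to fetch the susnato/phi-2 weights.
    from transformers import pipeline

    pipe = pipeline("text-generation", model="susnato/phi-2")
    prompt = "Once upon a time,"
    out = pipe(prompt, max_new_tokens=64)[0]["generated_text"]
    print(strip_prompt(out, prompt))
```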
- Local Apps
- vLLM
How to use susnato/phi-2 with vLLM:
Install from pip and serve the model:
```shell
# Install vLLM from pip:
pip install vllm

# Start the vLLM server:
vllm serve "susnato/phi-2"

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "susnato/phi-2",
    "prompt": "Once upon a time,",
    "max_tokens": 512,
    "temperature": 0.5
  }'
```
Use Docker:
```shell
docker model run hf.co/susnato/phi-2
```
- SGLang
How to use susnato/phi-2 with SGLang:
Install from pip and serve the model:
```shell
# Install SGLang from pip:
pip install sglang

# Start the SGLang server:
python3 -m sglang.launch_server \
  --model-path "susnato/phi-2" \
  --host 0.0.0.0 \
  --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "susnato/phi-2",
    "prompt": "Once upon a time,",
    "max_tokens": 512,
    "temperature": 0.5
  }'
```
Use Docker images:
```shell
docker run --gpus all \
  --shm-size 32g \
  -p 30000:30000 \
  -v ~/.cache/huggingface:/root/.cache/huggingface \
  --env "HF_TOKEN=<secret>" \
  --ipc=host \
  lmsysorg/sglang:latest \
  python3 -m sglang.launch_server \
    --model-path "susnato/phi-2" \
    --host 0.0.0.0 \
    --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "susnato/phi-2",
    "prompt": "Once upon a time,",
    "max_tokens": 512,
    "temperature": 0.5
  }'
```
- Docker Model Runner
How to use susnato/phi-2 with Docker Model Runner:
docker model run hf.co/susnato/phi-2
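The vLLM and SGLang servers above expose the same OpenAI-compatible `/v1/completions` endpoint, so a single stdlib Python client works against either. This is a minimal sketch; the base URL, ports, and sampling values mirror the curl examples above, and the helper name is an assumption, not part of either project's API.

```python
import json
import urllib.request

def build_completion_request(base_url: str, model: str, prompt: str,
                             max_tokens: int = 512, temperature: float = 0.5):
    """Build a POST request for an OpenAI-compatible /v1/completions endpoint."""
    body = json.dumps({
        "model": model,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/v1/completions",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

if __name__ == "__main__":
    # Port 8000 for the vLLM server above, 30000 for the SGLang server.
    req = build_completion_request("http://localhost:8000", "susnato/phi-2",
                                   "Once upon a time,")
    with urllib.request.urlopen(req) as resp:
        print(json.loads(resp.read())["choices"][0]["text"])
```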
Different results from this phi-2 and microsoft/phi-2
I ran inference on phi-2 with transformers==4.36.2, but I got different results between this phi-2 and microsoft/phi-2 (with trust_remote_code=True).
Here is the corresponding code.
For this phi-2:
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("susnato/phi-2")
tokenizer = AutoTokenizer.from_pretrained("susnato/phi-2")

inputs = tokenizer('Can you help me write a formal email to a potential business partner proposing a joint venture?', return_tensors="pt", return_attention_mask=False)
outputs = model.generate(**inputs, max_length=256)
text = tokenizer.batch_decode(outputs)[0]
print(text)
```
The results were:
```
Can you help me write a formal email to a potential business partner proposing a joint venture?
## INSTRUCTION
## INPUT
We are looking to expand our business relationship
##OUTPUT
##OUTPUT
##OUTPUT
## INSTRUCTIONS
#
```
For microsoft/phi-2:
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained('microsoft/phi-2', trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained('microsoft/phi-2', trust_remote_code=True)

prompt = 'Can you help me write a formal email to a potential business partner proposing a joint venture?'
inputs = tokenizer(prompt, return_tensors="pt")
generate_ids = model.generate(inputs.input_ids, max_length=256)
output = tokenizer.batch_decode(generate_ids)[0]
print(output)
```
while the results were:
```
Can you help me write a formal email to a potential business partner proposing a joint venture?
Input: Company A: ABC Inc.
Company B: XYZ Ltd.
Joint Venture: A new online platform for e-commerce
Output: Dear Mr. Smith,
I am writing to you on behalf of ABC Inc., a leading provider of e-commerce solutions. I am interested in exploring the possibility of a joint venture with XYZ Ltd., a reputable online retailer.
We have been following your company's impressive growth and innovation in the e-commerce sector, and we believe that we have a lot to offer each other. We have extensive experience and expertise in developing and managing online platforms for e-commerce, with a proven track record of delivering high-quality products and services to our clients. We also have a strong network of suppliers, distributors, and customers, as well as a dedicated team of professionals.
We propose to create a new online platform for e-commerce that combines our strengths and resources, and leverages our mutual opportunities and goals. The platform would offer a wide range of products and services, from various categories and brands, at competitive prices and with fast delivery. The platform would also provide features such as secure payment,
```
Obviously, the microsoft/phi-2 results are better.
Hi @EnmingZhang, yes, the results are different because the weights were not properly converted... I am looking into it. I will ping you once they match.
@susnato Thanks for such a quick reply!
Could you please provide further details on why the weights were not converted correctly? I have compared the source code of modeling_phi.py in microsoft/phi-1_5 and microsoft/phi-2, and I noticed only two modifications (at lines 500 and 867).
Hi @EnmingZhang, it's because the script I wrote to convert the Phi weights predates this commit, while phi-2 follows the new codebase (introduced with that commit); that's why it's failing. I am looking into it.
(Also, you cannot use this script directly; you need to make some minor modifications to the keys dict.)
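Since the suspected cause is an incorrect weight conversion, one way to pinpoint it is to diff the two checkpoints' parameters name by name. This is a hedged sketch, not code from the thread: the helper works on dicts of flattened float lists so it stays framework-agnostic and the function name and tolerance are assumptions; in practice you would feed it `{name: t.flatten().tolist() for name, t in model.state_dict().items()}` from each model, after mapping the converted key names onto each other.

```python
import math

def diff_state_dicts(a: dict, b: dict, tol: float = 1e-5):
    """Return parameter names whose values differ, or that exist in only one dict.

    `a` and `b` map parameter names to flat sequences of floats.
    """
    mismatched = set(a) ^ set(b)  # keys present in only one of the two dicts
    for name in set(a) & set(b):
        va, vb = a[name], b[name]
        if len(va) != len(vb) or any(
            not math.isclose(x, y, abs_tol=tol) for x, y in zip(va, vb)
        ):
            mismatched.add(name)
    return sorted(mismatched)
```

An empty result would mean the conversion is faithful; otherwise the returned names point at exactly the layers the conversion script mangled.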
Hi @EnmingZhang, can you please try it now and let me know whether it is working as expected?
I am getting quite good results here now.