Any plans to release the evaluation data or evaluate AraGPT2 1.5B?

#10

by wissamantoun - opened Sep 16, 2023

Discussion

wissamantoun

Sep 16, 2023

The evaluation dataset could serve as a good benchmark for evaluating future Arabic LLMs.

Also, are you planning to evaluate AraGPT2 1.5B? Was the Grover modeling file causing issues?

MazenSiraj

Sep 16, 2023

May I ask, if you have already installed and run the model, how you did it? I'm facing issues and don't know how to handle them.
@wissamantoun

sherif1

Sep 24, 2023

@MazenSiraj Why? What was the issue or the error?
Did you try the code snippet provided? I tried it and it worked perfectly.
Can you share your problem in a separate post so we can help?

MazenSiraj

Sep 25, 2023

Hello @sherif1, it was the offload folder path issue, and I managed to fix it earlier.
Can you tell me how you ran the model? Every time I run the sample code, it takes a very long time and downloads the model all over again.

MazenSiraj

Sep 25, 2023

@sherif1 This is what I get every time I run the code. I don't think this is normal.

sherif1

Sep 25, 2023

After downloading it the first time and storing it in, say, variables called model and tokenizer, save the model and tokenizer using:

model.save_pretrained(<ur path>)
tokenizer.save_pretrained(<ur path>)

The next time, don't use AutoTokenizer.from_pretrained('inception-mbzuai/jais-13b-chat').
Use your local path instead, like this: AutoTokenizer.from_pretrained(<ur path>)
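
For anyone who wants the save-then-reload steps above in one place, here is a minimal sketch. It is not the official loading code: the helper names `load_or_download` and `load_jais`, the folder layout, and the `trust_remote_code=True` argument are assumptions made for illustration, and whether a local copy of this particular model reloads cleanly may depend on your transformers version.

```python
from pathlib import Path


def load_or_download(loader_cls, repo_id, local_dir, **kwargs):
    """Load from local_dir if something was saved there, else fetch repo_id and save it.

    loader_cls is any class exposing from_pretrained / save_pretrained,
    e.g. AutoTokenizer or AutoModelForCausalLM. (Hypothetical helper.)
    """
    local_dir = Path(local_dir)

    # A non-empty folder is taken to mean "already saved on a previous run".
    if local_dir.is_dir() and any(local_dir.iterdir()):
        return loader_cls.from_pretrained(str(local_dir), **kwargs)

    # First run: fetch by repo id, then keep a local copy for next time.
    obj = loader_cls.from_pretrained(repo_id, **kwargs)
    local_dir.mkdir(parents=True, exist_ok=True)
    obj.save_pretrained(str(local_dir))
    return obj


def load_jais(local_root):
    """Example wiring for the model discussed in this thread (hypothetical helper)."""
    # Imported here so the helper above can be used without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    repo_id = "inception-mbzuai/jais-13b-chat"
    root = Path(local_root)

    tokenizer = load_or_download(AutoTokenizer, repo_id, root / "tokenizer")
    # Assumption: the model ships custom modeling code, so trust_remote_code is passed.
    model = load_or_download(
        AutoModelForCausalLM, repo_id, root / "model", trust_remote_code=True
    )
    return model, tokenizer
```

Calling `load_jais` with your own folder should fetch by repo id only on the first run and read from that folder afterwards. Keeping the tokenizer and model in separate subfolders is just one possible layout; saving both into the same path, as in the post above, works too.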
