Spaces: stanley-00/slm-testing

Apply for a GPU community grant: Personal project

by stanley-00 - opened Jun 4

Discussion

stanley-00

Owner Jun 4

This is a demo for evaluating small LLMs without needing an inference provider.

GODELEV

Jun 4

Hi!!
Thank you for putting my model "Archaea-74M" on your inference engine.
I would suggest adding a "token per sec" metric to this inference engine; it would help evaluate model speed further.

Thank you!!

stanley-00

Owner Jun 5 • edited Jun 5

Thanks for the suggestion; I'll consider that.

Update: the "token per sec" metric has been added.
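
For readers wondering how such a metric can be computed, here is a minimal sketch in Python. It is not the Space's actual code: `measure_tokens_per_sec` and `fake_generate` are hypothetical names, and the sketch assumes the model's output can be consumed as a stream that yields one item per generated token.

```python
import time
from typing import Callable, Iterable, Tuple


def measure_tokens_per_sec(
    generate: Callable[[], Iterable[str]],
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[int, float, float]:
    """Consume a token stream and return (token_count, seconds, tokens_per_sec).

    `generate` is a hypothetical callable returning an iterable that yields
    one item per generated token; `clock` is injectable to ease testing.
    """
    start = clock()
    count = 0
    for _ in generate():
        count += 1  # one generated token consumed
    elapsed = clock() - start
    # Guard against division by zero for empty or instantaneous runs.
    rate = count / elapsed if elapsed > 0 else 0.0
    return count, elapsed, rate


def fake_generate():
    """Stand-in for a real model: yields four tokens with a small delay."""
    for tok in ["Hello", ",", " world", "!"]:
        time.sleep(0.01)
        yield tok


if __name__ == "__main__":
    n, secs, tps = measure_tokens_per_sec(fake_generate)
    print(f"{n} tokens in {secs:.3f}s -> {tps:.1f} tokens/sec")
```

Note that this measures wall-clock time for the whole stream, so it includes the time to the first token; a real tool might report those two figures separately.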

Datdanboi25

Jun 5

Oh, also, could you add SmolLM2-135M-Instruct?

stanley-00

Owner Jun 5

Oh, also, could you add SmolLM2-135M-Instruct?

Actually, you can paste HuggingFaceTB/SmolLM2-135M-Instruct directly into the Model field and it also works.
But let me add clear instructions for that.
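
As a rough illustration of why pasting a repo ID works, here is a sketch of loading a model by its Hub ID. It assumes the `transformers` library and its `pipeline` API, which may not be what this Space uses internally; `generate_text` is a hypothetical helper name.

```python
# Hub repo ID in "owner/name" form, as pasted into the Model field.
MODEL_ID = "HuggingFaceTB/SmolLM2-135M-Instruct"


def generate_text(model_id: str, prompt: str, max_new_tokens: int = 32) -> str:
    """Download (or reuse the cached copy of) `model_id` and complete `prompt`.

    Assumption: the model is a causal LM usable with the text-generation task.
    """
    # Imported lazily so this module loads even when transformers is absent.
    from transformers import pipeline

    pipe = pipeline("text-generation", model=model_id)
    outputs = pipe(prompt, max_new_tokens=max_new_tokens)
    # The pipeline returns a list of dicts, one per generated sequence.
    return outputs[0]["generated_text"]
```

Calling `generate_text(MODEL_ID, "The capital of France is")` would fetch the weights on first use, so it needs network access and some disk space.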
