Watches silently.

by Lewdiculous - opened Mar 4, 2024

Discussion

Lewdiculous

Mar 4, 2024

👀

Nitral-AI

The Chaotic Neutrals org Mar 4, 2024

@Lewdiculous Should be live now!

Lewdiculous

Mar 4, 2024

•

edited Mar 4, 2024

@Test157t

It's already coming in hot.
Gods, it's like ~~100GB~~ 80GB of files!

Lewdiculous

Mar 4, 2024

•

edited Mar 4, 2024

I will say 9B is quite an unusual parameter size, at least for me. Is that stable? Is it GQA? If not, could it be in the future?

jeiku

Mar 4, 2024

It's just two 7B models in a passthrough merge with overlapping layers. It should inherit all characteristics of a Mistral finetune.
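For anyone wondering how two 7B models end up at roughly 9B, here is a rough back-of-the-envelope sketch. It uses the published Mistral 7B config values (hidden size 4096, MLP size 14336, 32 attention heads, 8 KV heads, 32000 vocab, 32 layers). The layer slices are purely hypothetical, since the actual merge ranges are not stated in this thread; they were picked only so the stack comes out at 40 layers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MistralLikeConfig:
    # Published Mistral 7B v0.1 config values.
    hidden_size: int = 4096
    intermediate_size: int = 14336
    num_attention_heads: int = 32
    num_key_value_heads: int = 8  # fewer KV heads than query heads means GQA
    vocab_size: int = 32000


def layer_params(cfg: MistralLikeConfig) -> int:
    """Parameters in one decoder layer (attention + MLP + two RMSNorms)."""
    head_dim = cfg.hidden_size // cfg.num_attention_heads
    kv_dim = head_dim * cfg.num_key_value_heads
    # q_proj and o_proj are hidden x hidden; k_proj and v_proj are hidden x kv_dim.
    attention = 2 * cfg.hidden_size * cfg.hidden_size + 2 * cfg.hidden_size * kv_dim
    # gate_proj, up_proj and down_proj.
    mlp = 3 * cfg.hidden_size * cfg.intermediate_size
    norms = 2 * cfg.hidden_size
    return attention + mlp + norms


def total_params(cfg: MistralLikeConfig, num_layers: int) -> int:
    """Whole-model parameter count with untied input and output embeddings."""
    embeddings = 2 * cfg.vocab_size * cfg.hidden_size
    final_norm = cfg.hidden_size
    return num_layers * layer_params(cfg) + embeddings + final_norm


def merged_layer_count(slices: list[tuple[int, int]]) -> int:
    """Layers in a passthrough stack; each slice is a half-open [start, end) range."""
    return sum(end - start for start, end in slices)


# HYPOTHETICAL slices, not the real recipe: layers 12..19 appear twice.
HYPOTHETICAL_SLICES = [(0, 20), (12, 32)]

if __name__ == "__main__":
    cfg = MistralLikeConfig()
    layers = merged_layer_count(HYPOTHETICAL_SLICES)
    print(f"base:   32 layers, {total_params(cfg, 32) / 1e9:.2f}B params")
    print(f"merged: {layers} layers, {total_params(cfg, layers) / 1e9:.2f}B params")
```

Under these assumptions the merged stack lands just under 9B. A passthrough merge only stacks existing layers, so the per-layer shapes (including the 8 KV heads) stay the same as in the base model.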

jeiku

Mar 4, 2024

It is quite good though, in my testing.

Lewdiculous

Mar 4, 2024

•

edited Mar 4, 2024

@jeiku -- Alrighty! Sounds good!

    quantization_options = [
        "Q4_K_M", "Q4_K_S", "IQ4_NL", "IQ4_XS", "Q5_K_M", 
        "Q5_K_S", "Q6_K", "Q8_0", "IQ3_M", "IQ3_S", "IQ3_XS", "IQ3_XXS"
    ]

Maybe time to put those IQs to the test.

Lewdiculous

Mar 4, 2024

Also, of course, Bepis-chan is a cutie.

jeiku

Mar 4, 2024

•

edited Mar 4, 2024

In my testing, IQ3_XXS is worthwhile, but the others show either no improvement or worse perplexity than a similar K-quant.
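For context on the perplexity comparison: perplexity is the exponential of the average negative log-likelihood per token, so lower is better. Below is a minimal sketch of how two quants could be compared, assuming you already have per-token natural-log probabilities from some evaluation run. The function names are made up for illustration and are not part of any particular tool.

```python
import math


def perplexity(token_logprobs: list[float]) -> float:
    """exp of the mean negative log-likelihood; expects natural-log probabilities."""
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(token_logprobs) / len(token_logprobs))


def relative_change(baseline_ppl: float, candidate_ppl: float) -> float:
    """Fractional perplexity change versus the baseline (positive means worse)."""
    return (candidate_ppl - baseline_ppl) / baseline_ppl


if __name__ == "__main__":
    # Sanity check: a model that assigns probability 1/4 to every token
    # has a perplexity of 4.
    uniform = [math.log(0.25)] * 8
    print(perplexity(uniform))
```

To compare an IQ quant against a K-quant of similar size, you would run both over the same text and pass the two resulting perplexities to `relative_change`.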

Lewdiculous

Mar 4, 2024

•

edited Mar 4, 2024

> but others show either no improvement or worse perplexity than similar k quant

Yeah, I honestly want more feedback on this to focus on the more important quants. They take way longer than a normal quant, so I have to know if it's even worth it.

Update:

Everything should be uploaded in about 15 minutes.
