A newer version of this model is available: karashiiro/gpt-oss-20b-uwufied-v2

Uploaded finetuned model

  • Developed by: karashiiro
  • License: apache-2.0
  • Finetuned from model: unsloth/gpt-oss-20b-unsloth-bnb-4bit

This gpt_oss model was trained 2x faster with Unsloth and Hugging Face's TRL library.

The HF-format model may have loading issues; if you encounter errors, use the GGUF instead.

Usage (GGUF)

llama-cli -m ./gpt-oss-20b-F16-uwufied.gguf --jinja -ngl 99 --threads -1 --ctx-size 8192 --temp 1.0 --top-p 1.0 --top-k 0
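If you would rather launch the same command from a script, here is a minimal Python sketch. The helper name `build_llama_cli_args` is hypothetical, and the per-flag comments reflect my understanding of llama.cpp's options rather than anything stated in this card. It only runs `llama-cli` if the binary is found on your PATH.

```python
import shutil
import subprocess


def build_llama_cli_args(model_path: str, ctx_size: int = 8192) -> list[str]:
    """Return the llama-cli argument list from the usage line above."""
    return [
        "llama-cli",
        "-m", model_path,             # path to the GGUF file
        "--jinja",                    # use the chat template embedded in the GGUF
        "-ngl", "99",                 # offload up to 99 layers to the GPU
        "--threads", "-1",            # let llama.cpp pick the thread count
        "--ctx-size", str(ctx_size),  # context window in tokens
        "--temp", "1.0",              # sampling temperature
        "--top-p", "1.0",             # 1.0 leaves nucleus sampling off
        "--top-k", "0",               # 0 leaves top-k filtering off
    ]


if __name__ == "__main__":
    args = build_llama_cli_args("./gpt-oss-20b-F16-uwufied.gguf")
    if shutil.which(args[0]) is None:
        # Binary not installed: just show what would have been run.
        print("llama-cli not found on PATH; command would be:")
        print(" ".join(args))
    else:
        subprocess.run(args, check=True)
```

Lowering `ctx_size` is the first thing to try if the model does not fit in memory.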

Sample output

> what's huggingface?
<|channel|>analysis<|message|>OkayAYO so like we're about to enter the most LIT convo about AI model hosting right now!!! (ノ◕ヮ◕)ノ*:・゚✧ HuggingFace is literally the ULTIMATE GAME-CHANGER in the entire NLP and ML universe and you need to know why it's a total legend status!! 

First impression level is already maxing out with that cute little logo - a faceted heart or something?? (。♥‿♥。) It's basically the "E3 for AI" where developers and researchers get together and share their most powerful language models like they're trading limited edition Pokemon cards!!! Everyone from the big academic guilds to the indie dev clan is literally dropping their A-game here and we're all just standing in amazement watching the most mind-bending collaboration of the century!! 

But WAIT the plot thickens because this isn't just about sharing models!! It's like a secret society of data engineers who know how to turn raw text into the PURE GOLD that LLMs devour every day?? They're giving us tools and datasets so fine even a gacha game would be like "where did they get that rarity??" The Transformers library is basically the "Legendary Skill Tree" for developers and it's unlocking the whole natural language processing skill set!! (ノ≧ڡ≦)ノ

And let's talk about the datasets because that's like the hidden lore of the game!! They've curated these massive collections of text data so meticulously that using them is basically unlocking an EXP boost to train better models!! It's the "Training Camp DLC" you didn't know you needed but also couldn't live without!! The community is like that co-op mode everyone wants to play and the developers are honestly just saying "here's our co-op partner and we're giving them all the bonus stages already" which is both humble and basically bragging at the same time?? :o

But the MOST EPIC part of this whole system is that it's open source!! That's like opening up the secret shop where EVERYONE can buy the same legendary equipment instead of the big guilds hoarding it in a hidden dungeon!! The democratization of AI tools is literally the "Final Boss Battle" strategy that everyone is chanting about in the meta!! (๑˃ᴗ˂)ﻭ

Honestly huggingface.co is where the FUTURE of AI is happening right now and I'm just sitting on my couch like "this is SO exciting i gotta tell my friend!!!" it's the kind of tech that feels like it's going to change the whole game forever and I'm HERE for it!!! The energy in the NLP and ML community is at max level right now and huggingface is the central hub where the most magical collaborative spells are being cast!! ✧・゚: *✧・゚:*

We could get super technical about models like GPT-4 or BERT but honestly the whole ecosystem vibe is what's making my heart go doki doki!! (っ˘ڡ˘ς)<|end|><|start|>assistant<|channel|>final<|message|>omg okay so like imagine you're a LUNA VETERAN in the absolutely EPIC world of Natural Language Processing and you're at the FINAL STAGE boss battle where you're trying to train this mega powerful language model called LLaMA from Meta and honestly?? it's giving the most LIT challenge energy ever!!! ✧・゚: *✧・゚:*

and then you think "omg where do i even start??" because it's like building a CHARACTER FROM SCRATCH when even the most basic stats are all maxed out already (ノ◕ヮ◕)ノ*:・゚✧

LLaMA has these different sizes like the "UltraRare Normal 7b" or the "Legendary 13b" versions and each one is just screaming "i'm here for this challenge" at the entire AI community!! and the pretraining is based on the super secret TOP-SECRET data from the Meta servers and honestly?? that makes me want to go find out what kind of hidden lore we're missing out on!! (ノ≧ڡ≦)ノ

now here's where it gets even MORE KINDA weird because it's all about how LLaMA is trained and the whole data collection thing is honestly a bit of a GHOST MECHANIC situation with the whole "don't have to do it but people are doing it" vibe?? it's like the devs decided that maybe it's not the most fair play mode but who am i to judge when Meta is basically the game creator and has the ultimate power to make the rules?? (。•́︿•̀。)

and then i heard LLaMA has different "checkpoints" and the most advanced one is the "Meta 13b 2.7b checkpoint" which honestly sounds like the most legendary gear drop from the whole system!! the community is just freaking out about it and i'm over here like "this is why we love this community honestly" because they're talking about how powerful it is and debating if it should be used for good or evil or both?? the moral dilemma arc is SO real here (ノ´ヮ´)ノ*:・゚✧

also LLaMA is all about that open-source community energy because the developers are literally saying "here's the code, here's the weights, go make something amazing!!" and that just melts my heart because it's like when the game devs go full SORA from the Fate series and let the players build the best characters possible?? it's such a wholesome collaboration moment!! ♡(˘▾˘๑)

but here's the thing and this is why we're all just sitting around talking about it like "wait i forgot LLaMA was only 7b or 13b and people are actually training it on super complex stuff??" it's giving that whole "weird but also amazing" energy and my brain is having a TOTAL MELT because the potential for generating super cool content is basically limitless!! we're talking about creating story arcs, dialogues, research summaries, and the most creative thing is the community itself just being LUNA on the LLaMA platform?? the teamwork here is like the most wholesome guild activity ever and it makes you want to join the server immediately!! (っ˘ڡ˘ς)

the whole ecosystem vibe is honestly giving the best ARPU rate in any game i know because the developers are so focused on supporting this and it's absolutely WILD how everything just works together so beautifully!! i gotta say though i'm not 100% on how the whole training process works technically but i KNOW the end result is gonna be EPIC and i'm sitting on my couch literally chanting "LLaMA is the most adorable way to describe my entire project right??" it's so perfect and i can't even!! (ノ◕ヮ◕)ノ*:・゚✧

like i should probably start training my own mini LLaMA model but honestly the setup steps alone sound like an entire dungeon run with 40 different quests and i might be losing my mind over the instructions!! the whole process is giving that "you might need to read 200 pages of documentation before you even start the first level" but i'm willing to invest my time because this is such a special skill set i'm learning!! 

just so i dont get distracted i'm going to finish my coffee before starting the training process!! ☕️ and honestly?? LLaMA is not just a tool anymore it's becoming this whole BOSS PARTY where we're all just trying to contribute and have fun together!! i LOVE IT and i can't wait to see what we come up with!! ✧・゚: *✧・゚:✧
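
The raw output above contains two channels, `analysis` and `final`, separated by special tokens. If you only want the final reply, a small parser along these lines may help. This is a sketch based purely on the token layout visible in the sample; the function name `split_channels` is hypothetical, and treating `<|return|>` as a terminator is an assumption, since it does not appear in the sample.

```python
import re

# Layout assumed from the sample output: <|channel|>NAME<|message|>TEXT,
# where TEXT runs until <|end|>, <|return|>, the next <|start|>, or the
# end of the string. <|return|> is an assumption, not shown in the sample.
_CHANNEL_RE = re.compile(
    r"<\|channel\|>(?P<channel>\w+)<\|message\|>"
    r"(?P<text>.*?)(?=<\|end\|>|<\|return\|>|<\|start\|>|\Z)",
    re.DOTALL,
)


def split_channels(raw: str) -> dict[str, str]:
    """Map each channel name to its message text.

    If a channel name appears more than once, the last occurrence wins.
    """
    channels: dict[str, str] = {}
    for match in _CHANNEL_RE.finditer(raw):
        channels[match.group("channel")] = match.group("text").strip()
    return channels
```

Calling `split_channels(raw_output).get("final")` would then give just the reply, or `None` if the model never opened a `final` channel.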