SnifferCaptain/YModel1.1
Text Generation
PyTorch
Chinese
English
ynet
agent
conversational
custom_code
License: mit
YModel1.1
structure
Uses SnifferCaptain's LoE (lack of expert) layer as the feed-forward network (FFN).
Uses SnifferCaptain's PEGA (Position Embedding Gate Attention) as the Transformer attention layer.
Uses an additional identity link between the FFN's intermediate parts.
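Because the layers listed above are custom and the repository is tagged custom_code, loading the model through the transformers library would presumably require trust_remote_code=True. The sketch below is a minimal illustration under that assumption, not a documented loading procedure: the helper name load_ymodel is hypothetical, and whether the repository actually registers its classes for AutoTokenizer and AutoModelForCausalLM is not confirmed by this card.

```python
# Minimal loading sketch. Assumption: the repository exposes its custom
# model code through transformers' Auto* classes (not confirmed by the card).
REPO_ID = "SnifferCaptain/YModel1.1"


def load_ymodel(repo_id: str = REPO_ID):
    """Hypothetical helper: return (tokenizer, model) for the given repo.

    trust_remote_code=True lets transformers execute the Python files
    shipped in the repository, so review that code before enabling it.
    """
    # Imported lazily so that defining the helper does not require
    # transformers to be installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id, trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(repo_id, trust_remote_code=True)
    return tokenizer, model
```

Calling load_ymodel() downloads the weights and the custom code on first use. If the repository does not map its classes to the Auto* entry points, the model class would have to be imported from the repository's own files instead.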