Instructions to use Finisha-F-scratch/LilyStory with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use Finisha-F-scratch/LilyStory with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="Finisha-F-scratch/LilyStory")

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("Finisha-F-scratch/LilyStory")
model = AutoModelForCausalLM.from_pretrained("Finisha-F-scratch/LilyStory")

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use Finisha-F-scratch/LilyStory with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "Finisha-F-scratch/LilyStory"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Finisha-F-scratch/LilyStory",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker

docker model run hf.co/Finisha-F-scratch/LilyStory

SGLang

How to use Finisha-F-scratch/LilyStory with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "Finisha-F-scratch/LilyStory" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Finisha-F-scratch/LilyStory",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "Finisha-F-scratch/LilyStory" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Finisha-F-scratch/LilyStory",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Docker Model Runner
How to use Finisha-F-scratch/LilyStory with Docker Model Runner:
```
docker model run hf.co/Finisha-F-scratch/LilyStory
```

📘 Documentation Technique : LilyStory-122k

📝 Présentation Générale LilyStory est un micro-modèle de langage (TLM) spécialisé dans la génération narrative courte en langue anglaise. Malgré une taille extrêmement réduite de 122 000 paramètres, il parvient à maintenir une cohérence syntaxique et thématique impressionnante, capturant l'essence des récits enfantins.

Type : TLM (Tiny Language Model)
Architecture : Optimisée pour l'efficience extrême
Spécialisation : Contes et récits courts (Storytelling)

⚙️ Spécifications Techniques

Caractéristique	Valeur
Nombre de paramètres	~ 122 000
Langue	Anglais (US/UK)
Domaine	Littérature jeunesse / Narrations simples
Format de sortie	Prose textuelle

✨ Capacités et Performances

Efficience Radicale : Capable de s'exécuter sur des environnements aux ressources quasi-inexistantes (embarqué, CPU très basse consommation). ⚡
Cohérence Grammaticale : Le modèle maîtrise les structures de base comme "Once upon a time" et la gestion des dialogues simples. ✍️
Texture Narrative : Contrairement aux modèles massifs et lisses, LilyStory conserve une empreinte brute et directe, idéale pour des expérimentations créatives. 🎨

🧪 Analyse de Génération (Exemple)

"Once upon a time, one was a boy... Lily went to his mom smiled and he was too. She saw a big and started to be too."

*Points forts observés :

Gestion des entités : Le modèle identifie et lie des personnages (Lily, Mom, Tom).
Sémantique contextuelle : Utilisation pertinente de termes liés au foyer et aux émotions (backyard, friends, smiled).
Rythme : Les transitions comme "After it" ou "One day" montrent une compréhension de la progression temporelle.

🚀 Cas d'Usage Idéaux

Génération procédurale : Pour des PNJ dans des jeux vidéo légers ou des histoires interactives.
Éducation : Support pour l'apprentissage des structures de phrases de base.
Recherche en IA : Étude de la compression maximale de la connaissance linguistique.

Note de conception : LilyStory prouve que la qualité d'un modèle ne dépend pas du gigantisme de sa mémoire, mais de la précision de son entraînement et de la qualité du dataset propriétaire utilisé.

Downloads last month: 76

Safetensors

Model size

122k params

Tensor type

F32

Space using Finisha-F-scratch/LilyStory 1

Collections including Finisha-F-scratch/LilyStory

From scratch

Collection

Finisha SLM ✨ • 87 items • Updated Mar 23

Petits-modèles (TLM)

Collection

Chez Finisha, les modèles de langages crée avec transformers sont considérés comme des TLM si ils ont moins de 12 millions de parametres. • 14 items • Updated Mar 23