How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="samcheng0/deeplm-108m")
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("samcheng0/deeplm-108m", dtype="auto")
Quick Links

Deeplm โ€” Auto-upload (step 2000)

Model checkpoint otomatis dari training loop. Setiap 100 langkah, seluruh state dikirim ke HF.

Training Metrics

  • step: 2000
  • loss: 9.9303
  • lr: 0.0001
  • grad_norm: 1.0000
  • phase: exploration
  • state: init
  • confidence: 0

Charts

grad_norm.png

loss.png

loss_analysis.png

lr.png

phases.png

tokens_per_sec.png

tuner_multipliers.png

tuner_signals.png

tuner_state_summary.png

Included Files (setiap upload)

File Deskripsi
model.safetensors BitNet ternary weights
config.json Model configuration
tokenizer.json BPETokenizer
tokenizer_config.json Tokenizer config
checkpoint-2000/ Full checkpoint (model.pt, optimizer.pt, training_state.json)
charts/ Training visualization PNGs
metrics.jsonl Full training log (all steps)
tuner_state.json AutoTuner internal state
README.md This file

Penggunaan

import sys; sys.path.insert(0, 'deeplm')
from deeplm.config import DeeplmConfig
from deeplm.model.deeplm import DeeplmModel
from safetensors.torch import load_file

config = DeeplmConfig()
model = DeeplmModel(config)
state_dict = load_file('model.safetensors')
model.load_state_dict(state_dict, strict=False)
Downloads last month
1,481
Safetensors
Model size
0.1B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support