How to use from
llama.cpp
Install from brew
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf aixk/haru-180m:F16
# Run inference directly in the terminal:
llama-cli -hf aixk/haru-180m:F16
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf aixk/haru-180m:F16
# Run inference directly in the terminal:
llama-cli -hf aixk/haru-180m:F16
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf aixk/haru-180m:F16
# Run inference directly in the terminal:
./llama-cli -hf aixk/haru-180m:F16
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf aixk/haru-180m:F16
# Run inference directly in the terminal:
./build/bin/llama-cli -hf aixk/haru-180m:F16
Use Docker
docker model run hf.co/aixk/haru-180m:F16
Quick Links
ISAI Logo

ISAI - 이사이

I’m an independent developer building and maintaining AI projects on my own.
Everything from model development to server costs, datasets, and feature updates is handled personally.
Every bit of support helps keep the project running and allows me to improve it further.
If you enjoy the project, please consider supporting it. Thank you.

ν˜Όμžμ„œ AI ν”„λ‘œμ νŠΈλ₯Ό κ°œλ°œν•˜κ³  μš΄μ˜ν•˜κ³  μžˆμŠ΅λ‹ˆλ‹€.
λͺ¨λΈ 개발, μ„œλ²„ λΉ„μš©, κΈ°λŠ₯ κ°œμ„ κΉŒμ§€ λͺ¨λ‘ 개인이 직접 μ§„ν–‰ν•˜κ³  μžˆμŠ΅λ‹ˆλ‹€.
μž‘μ€ 후원 ν•˜λ‚˜ν•˜λ‚˜κ°€ μ„œλΉ„μŠ€ μœ μ§€μ™€ μƒˆλ‘œμš΄ κΈ°λŠ₯ κ°œλ°œμ— 큰 도움이 λ©λ‹ˆλ‹€.
ν”„λ‘œμ νŠΈκ°€ λ§ˆμŒμ— λ“œμ…¨λ‹€λ©΄ ν›„μ›μœΌλ‘œ μ‘μ›ν•΄μ£Όμ„Έμš”. κ°μ‚¬ν•©λ‹ˆλ‹€.

ko-fi

ISAI link ollapp link Addly link blogig link
logig link AI Magician link 99s link Global Stock link
AI Archive link wikiwi link wwwiki link Oduck link
lai link spirit browser link 799 link thedeouk link
wallpaper forum link webbar link Stode link OMAP link
hummorabbit link ollone link ranovel.kr link adsense forum link

Model Description

Haru-180M is a lightweight language model built upon SmolLM2-135M. It has been specifically optimized to enhance Korean language capabilities and features an expanded model depth, providing a more robust performance while maintaining efficiency for various AI-driven tasks.

Downloads last month
280
Safetensors
Model size
0.2B params
Tensor type
F32
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support