Text Generation
Safetensors
GGUF
English
llama
How to use from
llama.cpp
Install from brew
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf mengmeong/meng-programming-skill-finetune
# Run inference directly in the terminal:
llama-cli -hf mengmeong/meng-programming-skill-finetune
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf mengmeong/meng-programming-skill-finetune
# Run inference directly in the terminal:
llama-cli -hf mengmeong/meng-programming-skill-finetune
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf mengmeong/meng-programming-skill-finetune
# Run inference directly in the terminal:
./llama-cli -hf mengmeong/meng-programming-skill-finetune
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf mengmeong/meng-programming-skill-finetune
# Run inference directly in the terminal:
./build/bin/llama-cli -hf mengmeong/meng-programming-skill-finetune
Use Docker
docker model run hf.co/mengmeong/meng-programming-skill-finetune
Quick Links

Programming Skills Learning Path Model

This model is a fine-tuned version of the base mdoel designed to generate path of learning a skill based on input text. It's particularly useful for identifying emerging trends and skill combinations in the rapidly evolving tech landscape.

Usage & Limitations

llama.cpp demo

The model is intended for:

  • Deploying in limited CPU resource, with average about 40 tps on 1 CPU core

The model has limits:

  • The dataset might not capture the very latest tools development in programming world
  • Chatbot usecase does not fit the model usecase
  • The model only return the response as JSON list.

Please note that this model was trained on a custom dataset and may reflect biases present in that data.

Training Hyperparameters

  • Batch Size: 4
  • Optimizer: Experimental GrokAdamW

Little Training Metrics

Eval Loss Eval Runtime Eval Sample Per Seconds Eval Steps per Seconds Loss on Train

Downloads last month
138
Safetensors
Model size
0.2B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for mengmeong/meng-programming-skill-finetune

Quantized
(6)
this model

Dataset used to train mengmeong/meng-programming-skill-finetune