Instructions to use Ylemnox/llama-toefl-checkbot with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use Ylemnox/llama-toefl-checkbot with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="Ylemnox/llama-toefl-checkbot")

# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("Ylemnox/llama-toefl-checkbot", dtype="auto")

PEFT
How to use Ylemnox/llama-toefl-checkbot with PEFT:
```
Task type is invalid.
```
Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use Ylemnox/llama-toefl-checkbot with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "Ylemnox/llama-toefl-checkbot"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Ylemnox/llama-toefl-checkbot",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker

docker model run hf.co/Ylemnox/llama-toefl-checkbot

SGLang

How to use Ylemnox/llama-toefl-checkbot with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "Ylemnox/llama-toefl-checkbot" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Ylemnox/llama-toefl-checkbot",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "Ylemnox/llama-toefl-checkbot" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Ylemnox/llama-toefl-checkbot",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Docker Model Runner
How to use Ylemnox/llama-toefl-checkbot with Docker Model Runner:
```
docker model run hf.co/Ylemnox/llama-toefl-checkbot
```

TOEFL Speaking Evaluation Model

A fine-tuned Llama 3.2 model for automated TOEFL speaking assessment using LoRA adapters.

Model Description

This repository contains a fine-tuned LoRA adapter for evaluating TOEFL speaking responses. The model has been trained to assess speaking quality and provide scores based on TOEFL evaluation criteria.

Base Model: Llama 3.2 Training Method: LoRA (Low-Rank Adaptation) Task: TOEFL Speaking Assessment

Repository Contents

Model Weights

toefl_judge_adapter/ - LoRA adapter weights in safetensors format
- Multiple checkpoint files (every 100 steps from 100-1000)
- Final adapter: adapters.safetensors
- Configuration: adapter_config.json

Training Data

train.jsonl (1.9MB) - Training dataset
valid.jsonl (642KB) - Validation dataset
toefl_speaking_data_test.jsonl (650KB) - Test dataset

Application Code

toefl_judge_app.py - Web application for TOEFL evaluation
cli.py - Command-line interface
fine_tune.py - Fine-tuning script
data_formatter.py - Data preprocessing utilities
evaluator.py - Model evaluation tools

Results & Evaluation

toefl_predictions.csv - Model predictions on test set
toefl_evaluations.csv - Evaluation metrics
toefl_evaluation_results.png - Visualization of results

Configuration

lora_config.json - LoRA training configuration

Requirements

Platform: Apple Silicon Mac (M1/M2/M3) - This application uses MLX framework Python: 3.9 or higher

Dependencies

Install the required packages:

pip install -r requirements.txt

Or manually install:

pip install streamlit mlx mlx-lm pandas transformers peft torch

Installation & Setup

Clone or download this repository
Install dependencies:
```
pip install -r requirements.txt
```
Download the base Llama 3.2 model:

The application requires the base Llama 3.2 model. You can download it from:
- Hugging Face: meta-llama/Llama-3.2-3B or meta-llama/Llama-3.2-1B
- You'll need to request access from Meta if you haven't already
Update model paths in the application code:

Edit toefl_judge_app.py and set the correct paths for:
- model_path: Path to your base Llama 3.2 model
- adapter_path: Path to the downloaded adapter (from this repo)

Usage

Loading the Model

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# Load base model
base_model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-3.2-3B")
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.2-3B")

# Load LoRA adapter
model = PeftModel.from_pretrained(base_model, "Ylemnox/llama-toefl-checkbot/toefl_judge_adapter")

Running the Web Application

streamlit run toefl_judge_app.py

The web interface will open in your browser where you can:

Enter TOEFL speaking questions
Input student responses
Get automated evaluations with scores (0-4 scale)

Using the CLI

python cli.py

Example: Download and Use the Model

# 1. Install dependencies
pip install -r requirements.txt

# 2. Download this repository
git clone https://huggingface.co/Ylemnox/llama-toefl-checkbot
cd llama-toefl-checkbot

# 3. Run the web app (make sure to set model paths first)
streamlit run toefl_judge_app.py

Training Details

The model was trained using LoRA with the following configuration:

Configuration available in lora_config.json
Training performed on TOEFL speaking response data
Multiple checkpoints saved during training

Evaluation Results

Evaluation metrics and visualizations are available in:

toefl_evaluations.csv - Detailed metrics
toefl_evaluation_results.png - Performance visualization

License

Please ensure compliance with Meta's Llama license when using this model.

Citation

If you use this model, please cite:

@misc{llama-toefl-checkbot,
  author = {Davis Kwak},
  title = {TOEFL Speaking Evaluation Model},
  year = {2025},
  publisher = {Hugging Face},
  howpublished = {\url{https://huggingface.co/Ylemnox/llama-toefl-checkbot}}
}

Downloads last month: -; Downloads are not tracked for this model. How to track

Model tree for Ylemnox/llama-toefl-checkbot

Base model

meta-llama/Llama-3.2-3B

Adapter

(274)

this model