LLM-VNSE: Llama 3.1 8B Fine-tuned for QoS-Aware Volunteer Node Selection
Model Description
This repository contains four QLoRA fine-tuned adapters for Llama 3.1 8B Instruct, one for each QoS weight scheme (A, B, C, D), trained as part of the LLM-VNSE framework for QoS-aware volunteer node selection in fog computing environments. The adapters are released as reproducibility artefacts accompanying the paper:
"Calibrated and Adaptive Listwise Ranking with Large Language Models for Quality-of-Service-Aware Volunteer Node Selection in Fog Computing"
Available Adapters
| Adapter | Scheme | Application Context | Top-1 (%) | MAE | Spearman | NDCG |
|---|---|---|---|---|---|---|
| scheme-a | A โ Equal weights | Ambient sensing | 93.63 | 0.3084 | 0.9742 | 0.9962 |
| scheme-b | B โ Latency-dominant | Latency-sensitive interactive | 92.03 | 0.3530 | 0.9687 | 0.9953 |
| scheme-c | C โ Availability-dominant | Critical monitoring | 82.87 | 0.3785 | 0.9677 | 0.9935 |
| scheme-d | D โ Throughput-dominant | Bulk data transfer | 94.82 | 0.3004 | 0.9757 | 0.9972 |
Training Data
The adapters were fine-tuned on the QWS Dataset 2.0 (2,507 real web services across five QoS attributes: response time, throughput, availability, reliability, and latency). The dataset was partitioned into 2,005 training lists, 251 validation lists, and 251 test lists following an 80/10/10 ratio. Ground-truth rankings were generated using a weighted-sum scoring function under each scheme's weight configuration as defined in Table 1 of the paper.
Intended Use
These adapters are intended for QoS-aware listwise ranking of volunteer nodes in fog computing environments. Given a candidate pool of volunteer nodes with their QoS profiles, the fine-tuned model produces a complete ranked ordering from best to worst under the active SLA priority configuration.
Training Configuration
- Method: QLoRA (4-bit NF4 quantisation)
- LoRA Rank: 16
- LoRA Alpha: 32
- Learning Rate: 2 ร 10โปโด
- Epochs: 3
- Effective Batch Size: 32
- Hardware: NVIDIA A100 40GB GPU (Google Cloud Compute Engine)
- Random Seed: 42
Known Limitations
Dataset distribution: The model was trained on QWS Dataset 2.0 collected in 2008, which describes stable server-hosted web services. Transferability to real volunteer fog node deployments may be limited due to distributional differences in response time and throughput.
ICL instability: When used as a base model under in-context learning without fine-tuning, Llama 3.1 8B exhibits severe confidence compression (scores clustering around 0.5) and sensitivity to prompt serialisation format, with accuracy drops of 17โ23% under Mathematical Vector/Tuple Notation compared to the original key-value format. Fine-tuning with these adapters overcomes these limitations.
Scheme-specific: Each adapter is optimised for its corresponding weight configuration. For cross-scheme generalisation without retraining, use the weight-conditioned Protocol II approach described in the paper.
Citation
@article{alsaiari-llmvnse,
title={Calibrated and Adaptive Listwise Ranking with Large Language Models for Quality-of-Service-Aware Volunteer Node Selection in Fog Computing},
author={Alsaiari, Najat and Hussain, Farookh},
year={2026}
}
Repository
Code and reproducibility artefacts: github.com/NajatAlsaiari/LLM-VNSE
Model tree for NajatAlsa/llama-scheme-b
Base model
meta-llama/Llama-3.1-8B