---
library_name: peft
license: mit
language:
- en
base_model:
- MightyOctopus/pricer-merged-model-A-v1
---
# Model Card for pricer-lora-ft-v3
### Model Description
pricer-lora-ft-v3 is a LoRA fine-tuned large language model (base model: MightyOctopus/pricer-merged-model-A-v1) specialized in numeric price prediction for consumer products (e.g., Amazon listings).
The model predicts approximate product prices from textual metadata such as product title, description, and category.
It demonstrates that a fully open-source LLM can be adapted for structured numeric regression tasks traditionally handled by classical ML models.
- **Developed by:** MyungHwan Hong (MightyOctopus)
- **Funded by:** Self-funded / independent research
- **Shared by:** MyungHwan Hong
- **Model type:** Causal Language Model (Numeric Prediction / Regression via Text-to-Number)
- **Language(s) (NLP):** English
- **License:** MIT
- **Finetuned from model:** meta-llama/Llama-3.1-8B
### Model Sources
<!-- Provide the basic links for the model. -->
- **Repository:** [More Information Needed]
- **Research Note:** [Research note (Google Docs)](https://docs.google.com/document/d/1PwuOCS6wgO3MqKexnEdAqpVswXMqGilqKEuFyhUGk7M/edit?tab=t.0)
- **Demo:** [More Information Needed]
### Direct Use
- Predicting approximate Amazon product prices from text metadata
- Research on LLM-based numeric regression
- Benchmarking open-source LLMs against frontier models (e.g., GPT-4o-mini)
- Educational experiments on LoRA fine-tuning and evaluation
### Downstream Use
- Price estimation pipelines (non-production)
- Feature generation for pricing analytics
- Comparative studies with classical ML regressors
### Out-of-Scope Use
- Real-time or production pricing systems
- Financial decision-making
- Legal, medical, or safety-critical applications
- Use as an authoritative price source
## Bias, Risks, and Limitations
- Predictions are approximate, not exact
- Performance depends on similarity to training data distribution
- The model may hallucinate prices for unfamiliar or novel products
- Prices may reflect historical or dataset-specific biases
- Not robust to rapid market price changes
### Recommendations
Users (both direct and downstream) should be made aware of the model's risks, biases, and limitations. In particular:
- Treat outputs as estimates, not ground truth
- Validate predictions against real pricing data
- Avoid using the model in high-stakes or commercial systems
- Be aware of dataset and temporal bias
## How to Get Started with the Model
Use the code below to get started with the model.
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
import torch

TOKENIZER_MODEL = "meta-llama/Llama-3.1-8B"
BASE_MODEL_ID = "MightyOctopus/pricer-merged-model-A-v1"
FINE_TUNED_ADAPTER = "MightyOctopus/pricer-lora-ft-v3"

# Load the tokenizer and the merged base model in bfloat16.
tokenizer = AutoTokenizer.from_pretrained(TOKENIZER_MODEL)
base_model = AutoModelForCausalLM.from_pretrained(
    BASE_MODEL_ID,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# Llama has no pad token by default; reuse EOS for padding.
tokenizer.pad_token = tokenizer.eos_token
base_model.generation_config.pad_token_id = tokenizer.pad_token_id

# Attach the LoRA adapter on top of the base model.
fine_tuned_model = PeftModel.from_pretrained(base_model, FINE_TUNED_ADAPTER)
fine_tuned_model.eval()

prompt = """Product:
Title: Stainless Steel Electric Kettle 1.7L
Category: Home & Kitchen
Description: Fast boiling electric kettle with auto shut-off.
Price is $"""

inputs = tokenizer(prompt, return_tensors="pt").to(fine_tuned_model.device)
with torch.no_grad():
    outputs = fine_tuned_model.generate(
        **inputs,
        max_new_tokens=10,
        do_sample=True,   # required for temperature to take effect
        temperature=0.2,
    )
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
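Because the model emits the price as free text continuing the `Price is $` prompt, the number still has to be parsed out of the decoded string. A minimal sketch (the helper name `extract_price` is hypothetical, not part of the released model):

```python
import re

def extract_price(generated_text: str):
    """Parse the first number that follows 'Price is $' in the model output.

    Returns the price as a float, or None if no price-like token is found.
    """
    match = re.search(r"Price is \$\s*([0-9]+(?:\.[0-9]+)?)", generated_text)
    return float(match.group(1)) if match else None

print(extract_price("... Price is $34.99"))  # -> 34.99
```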
## Training Details
### Training Data
- Amazon product metadata dataset
- Fields include title, description, category, and ground-truth price
- Prices normalized via structured text prompts
- Dataset split into training, validation, and test sets
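The structured prompt format used at inference (visible in the usage example) suggests how a training record is serialized. A sketch under the assumption that records expose `title`, `category`, `description`, and `price` fields (the card does not publish the exact schema):

```python
def build_prompt(item: dict, include_price: bool = False) -> str:
    """Serialize one product record into the structured text prompt.

    With include_price=True the ground-truth price is appended,
    producing a training example; without it, an inference prompt.
    """
    prompt = (
        "Product:\n"
        f"Title: {item['title']}\n"
        f"Category: {item['category']}\n"
        f"Description: {item['description']}\n"
        "Price is $"
    )
    if include_price:
        prompt += f"{item['price']:.2f}"
    return prompt

example = {
    "title": "Stainless Steel Electric Kettle 1.7L",
    "category": "Home & Kitchen",
    "description": "Fast boiling electric kettle with auto shut-off.",
    "price": 34.99,
}
print(build_prompt(example, include_price=True))
```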
### Training Procedure
<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
#### Preprocessing
- Text normalization
- Structured prompt formatting
- Numeric price represented as plain text output
- Loss applied only to answer tokens using response masking
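Response masking means the label sequence mirrors the input ids, except that every token belonging to the prompt is set to the ignore index so the loss covers only the answer (price) tokens. A minimal list-based sketch of the idea:

```python
IGNORE_INDEX = -100  # label value ignored by PyTorch's cross-entropy loss

def mask_prompt_tokens(input_ids, prompt_len):
    """Build labels that supervise only the answer tokens:
    everything up to prompt_len is masked out of the loss."""
    return [IGNORE_INDEX] * prompt_len + list(input_ids[prompt_len:])

# Toy example: 3 prompt tokens followed by 2 answer (price) tokens.
print(mask_prompt_tokens([101, 102, 103, 990, 991], prompt_len=3))
# -> [-100, -100, -100, 990, 991]
```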
#### Training Hyperparameters
**Training regime:**
Supervised Fine-Tuning (SFT) with LoRA
Key Hyperparameters:
- Optimizer: AdamW
- Learning Rate: 2e-5
- Epochs: 2
- Batch Size: 16
- Gradient Accumulation: 2
- Effective Batch Size: 32 (16 × 2 gradient accumulation)
- LoRA Rank: 32
- LoRA Alpha: 64
- LoRA Dropout: 0.0
- Weight Decay: 0.0
- Precision: bfloat16
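The hyperparameters above map directly onto PEFT and TRL configuration objects. A sketch of that mapping; note that `target_modules` is an assumption (typical LLaMA attention projections), as the card does not list the adapted modules:

```python
from peft import LoraConfig
from trl import SFTConfig

# Adapter settings from the hyperparameter table above.
lora_config = LoraConfig(
    r=32,
    lora_alpha=64,
    lora_dropout=0.0,
    # Assumed target modules -- not published in the card.
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)

# Optimizer / schedule settings from the same table.
training_args = SFTConfig(
    learning_rate=2e-5,
    num_train_epochs=2,
    per_device_train_batch_size=16,
    gradient_accumulation_steps=2,
    weight_decay=0.0,
    bf16=True,
    optim="adamw_torch",
)
```

Both objects would then be passed to TRL's `SFTTrainer` together with the base model and dataset.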
#### Speeds, Sizes, Times
- Base model: 8B parameters
- LoRA parameters: ~0.5% of base model
- Training time: Approx. 21 hours on single GPU
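The "~0.5% of base model" figure can be checked on any loaded PEFT model by comparing trainable to total parameters. A small helper sketch (works with any object exposing a PyTorch-style `parameters()` iterator):

```python
def trainable_fraction(model) -> float:
    """Fraction of parameters actually updated during LoRA training
    (expected to be roughly 0.005 for this adapter)."""
    trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
    total = sum(p.numel() for p in model.parameters())
    return trainable / total
```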
## Evaluation
### Testing Data
- Held-out Amazon product samples
- Products not seen during training
### Factors
- Product category
- Price range distribution
- Description length
- Token length
### Metrics
- Mean Absolute Error (MAE)
- Root Mean Squared Logarithmic Error (RMSLE)
- Hit Rate (prediction within ±20% of ground truth)
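The three metrics can be sketched in plain Python as below; treating the ±20% tolerance as relative to the ground-truth price is an assumption, and `log1p` is used in RMSLE so zero prices stay defined:

```python
import math

def mae(y_true, y_pred):
    """Mean absolute error in dollars."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def rmsle(y_true, y_pred):
    """Root mean squared logarithmic error."""
    return math.sqrt(
        sum((math.log1p(p) - math.log1p(t)) ** 2 for t, p in zip(y_true, y_pred))
        / len(y_true)
    )

def hit_rate(y_true, y_pred, tolerance=0.20):
    """Share of predictions within ±tolerance of the ground-truth price."""
    hits = sum(abs(p - t) <= tolerance * t for t, p in zip(y_true, y_pred))
    return hits / len(y_true)

truths = [100.0, 50.0, 20.0]
preds = [130.0, 40.0, 20.0]
print(round(mae(truths, preds), 2))       # -> 13.33
print(round(hit_rate(truths, preds), 2))  # -> 0.67
```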
### Results
| Model | MAE ($) | RMSLE | Hit Rate |
| ----------------------- | ---------- | -------- | --------- |
| GPT-4o-mini (Zero-shot) | ~84.56 | 0.70 | 49.4% |
| **pricer-lora-ft-v3** | **~67.40** | **0.59** | **62.0%** |
#### GPT-4o-mini
![GPT-4o-mini evaluation results](https://cdn-uploads.huggingface.co/production/uploads/6845b5bc786e0c9d32aad167/8WzzRxU_mZScltMQc23kD.png)
#### pricer-lora-ft-v3
![pricer-lora-ft-v3 evaluation results](https://cdn-uploads.huggingface.co/production/uploads/6845b5bc786e0c9d32aad167/0dx54TlZtrOmRHAAygWVN.png)
#### Summary
The fine-tuned adapter improves on zero-shot GPT-4o-mini across all three metrics, reducing MAE by roughly $17 and raising the ±20% hit rate from 49.4% to 62.0%.
## Model Examination
<!-- Relevant interpretability work for the model goes here -->
The model demonstrates that fine-tuned open-source LLMs can outperform frontier zero-shot models on specialized numeric tasks when trained with domain-specific data and structured prompts.
## Environmental Impact
<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
- **Hardware Type:** NVIDIA GPU (e.g., A100)
- **Hours used:** ~21 hours
- **Cloud Provider:** Google Colab / Hugging Face
- **Compute Region:** Unknown (Colab GPUs used)
- **Carbon Emitted:** Not estimated
## Technical Specifications
### Model Architecture and Objective
- Transformer-based causal language model
- Objective: Next-token prediction optimized for numeric accuracy
- Loss applied selectively to price tokens
### Compute Infrastructure
- Single-GPU fine-tuning
- LoRA-based parameter-efficient training
#### Hardware
- NVIDIA GPU (A100)
#### Software
- Transformers
- TRL
- PEFT 0.14.0
- PyTorch
## Citation
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
**BibTeX:**
```bibtex
@misc{hong2025pricer,
  author = {Hong, MyungHwan},
  title  = {Pricer LoRA Fine-Tuned LLaMA 3.1 8B Model},
  year   = {2025},
  url    = {https://huggingface.co/MightyOctopus/pricer-lora-ft-v3}
}
```
**APA:**
Hong, M. (2025). *Pricer LoRA Fine-Tuned LLaMA 3.1 8B Model* [Model]. Hugging Face. https://huggingface.co/MightyOctopus/pricer-lora-ft-v3
## Glossary
<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
[More Information Needed]
## More Information
[More Information Needed]
## Model Card Authors
MyungHwan Hong
## Model Card Contact
Hugging Face: MightyOctopus
### Framework versions
- PEFT 0.14.0