web_a11y_model / README.md

Update README.md

a9d8c16 verified 8 months ago

5.49 kB

	---
	license: gpl
	language:
	- en
	base_model:
	- google/gemma-3-1b-it
	tags:
	- accessibility
	- question-answering
	- fine-tuning
	- llm
	datasets:
	- omark807/web_a11y_dataset
	metrics:
	- eval_loss # Changed from 'accuracy' to reflect actual metric
	---
	# Fine-Tuned Gemma3 1B-IT - Web Accessibility Q&A

	This model is a fine-tuned version of Google's Gemma 1B-IT, adapted for the task of answering questions related to web accessibility, WCAG (Web Content Accessibility Guidelines) principles, and general digital accessibility best practices. It was fine-tuned using the LoRA (Low-Rank Adaptation) method.

	## Model Details

	### Model Description

	This model is a Causal Language Model (`AutoModelForCausalLM`) fine-tuned from `google/gemma-3-1b-it`. Its primary function is to serve as a conversational assistant for queries concerning web accessibility. It processes user questions about accessibility guidelines and concepts (e.g., alt text, color contrast, keyboard navigation) and generates informative answers based on the patterns learned from its fine-tuning dataset.

	- Developed by: omark807
	- Shared by: omark807
	- Model type: Causal Language Model (Fine-tuned via PEFT/LoRA)
	- Language(s) (NLP): English
	- License: GPL
	- Finetuned from model: [google/gemma-3-1b-it](https://huggingface.co/google/gemma-3-1b-it)

	### Model Sources

	- Base Model on Hugging Face: [https://huggingface.co/google/gemma-3-1b-it](https://huggingface.co/google/gemma-3-1b-it)

	## Uses

	### Direct Use

	This model is intended for direct use as a conversational agent or a knowledge base query system for specific questions about digital accessibility. Potential users include web developers, designers, accessibility auditors, or anyone seeking quick information about WCAG guidelines and their implementation.

	### Downstream Use

	This model could potentially be integrated into:
	- Accessibility testing tools to provide explanations for detected issues.
	- Chatbots on accessibility resource websites.
	- Educational platforms teaching web accessibility.

	### Out-of-Scope Use

	This model is not intended for:
	- Providing legal advice or certifications regarding accessibility compliance.
	- Diagnosing user disabilities.
	- Generating content unrelated to web accessibility.
	- Generating harmful, biased, or unethical content.
	- Substituting professional accessibility audits or human expertise.

	## Bias, Risks, and Limitations

	- Data Limitations: The model's knowledge is limited to the `omark807/web_a11y_dataset` (73 samples for training, 15 for evaluation). It may not cover all aspects of web accessibility comprehensively, nor may it respond accurately to questions outside the scope of its training data. Its responses are highly dependent on the quality and breadth of this specific dataset.
	- Generalization: Due to the relatively small dataset size, the model's ability to generalize to new or complex accessibility questions beyond the training examples may be limited.
	- Bias Inheritance: The model inherits potential biases present in its base model (`google/gemma-3-1b-it`) and any biases inadvertently introduced through the fine-tuning dataset.
	- Factual Accuracy: While fine-tuned on factual data, LLMs can still "hallucinate" or provide inaccurate information. Users should cross-reference critical information.
	- No Human Evaluation: The evaluation performed is quantitative (loss-based), not qualitative or human-based, which is important for conversational models.

	### Recommendations

	Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. For critical applications or for definitive answers, always consult official WCAG documentation, accessibility experts, or perform thorough manual audits.

	## How to Get Started with the Model

	Use the code below to get started with the model.

	```python
	from transformers import AutoModelForCausalLM, AutoTokenizer
	import torch

	# Replace 'omark807' with your actual username/organization if different
	model_id = "omark807/gemma3-finetuned-web-accessibility" # This will be the ID on Hugging Face Hub

	# Load tokenizer
	tokenizer = AutoTokenizer.from_pretrained(model_id)

	# Load the model. Adjust load_in_4bit and torch_dtype based on your GPU and desired precision.
	# Model was trained in float16 due to bitsandbytes import failure.
	model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16, device_map="auto")
	# Or, if you want to try 4-bit inference (requires bitsandbytes to work):
	# from bitsandbytes import BitsAndBytesConfig
	# bnb_config_inference = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_quant_type="nf4", bnb_4bit_compute_dtype=torch.bfloat16)
	# model = AutoModelForCausalLM.from_pretrained(model_id, quantization_config=bnb_config_inference, device_map="auto")

	# Example prompt following Gemma's instruction format
	prompt = "<start_of_turn>user\nWhat is the purpose of alt text in images?<end_of_turn>\n<start_of_turn>model\n"

	# Tokenize input
	input_ids = tokenizer(prompt, return_tensors="pt").to(model.device)

	# Generate response
	outputs = model.generate(
	**input_ids,
	max_new_tokens=256,
	num_return_sequences=1,
	do_sample=True, # Recommended for more varied responses
	top_p=0.9, # Top-p sampling
	temperature=0.7 # Sampling temperature
	)

	# Decode and print response
	response = tokenizer.decode(outputs[0], skip_special_tokens=False)
	print(response)