Model Card for TinyLlama ArXiv Summarizer (LoRA)

Model Details

Model Description

This model is a fine-tuned version of an open-source Large Language Model for summarizing AI research paper abstracts. It uses Parameter-Efficient Fine-Tuning (PEFT) with LoRA to efficiently adapt the base model for domain-specific summarization tasks.

  • Developed by: Ransilu Ranasinghe
  • Model type: Causal Language Model (LLM)
  • Language(s): English
  • License: Apache 2.0 (inherits from base model)
  • Finetuned from model: TinyLlama-1.1B-Chat

Uses

Direct Use

This model is intended for generating concise summaries of AI research paper abstracts. It can be used in:

  • Research paper summarization tools
  • AI-powered academic assistants
  • Knowledge extraction systems

Out-of-Scope Use

  • Not suitable for factual question answering without verification
  • Not optimized for non-AI domains
  • Should not be used for critical decision-making systems

Bias, Risks, and Limitations

  • The model may generate inaccurate or incomplete summaries
  • Biases present in the training dataset may be reflected in outputs
  • Performance is limited due to small dataset and lightweight fine-tuning

Recommendations

  • Use outputs as assistive summaries, not final conclusions
  • Validate critical information from original sources

How to Get Started with the Model

Load the base model and apply the LoRA adapter using the Hugging Face Transformers and PEFT libraries.


Training Details

Training Data

  • Dataset: ArXiv research paper dataset
  • Domain: AI and Machine Learning research abstracts
  • Dataset sourced via Hugging Face

Training Procedure

Training Hyperparameters

  • Training regime: fp16 mixed precision
  • Epochs: 1–2
  • Method: PEFT (LoRA)

Evaluation

Metrics

  • ROUGE-1
  • ROUGE-2
  • ROUGE-L

Results

The fine-tuned model shows improved summarization quality compared to the base model, producing more structured and relevant summaries.


Technical Specifications

Model Architecture and Objective

  • Transformer-based causal language model
  • Fine-tuned for text summarization using instruction-style prompts

Model Card Authors

Ransilu

Model Card Contact

LinkedIn:www.linkedin.com/in/ransiluranasinghe / GitHub profile:https://github.com/RansiluRanasinghe

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support