Instructions to use Ransilu/tinyllama-arxiv-lora with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Ransilu/tinyllama-arxiv-lora with Transformers:
# Use a pipeline as a high-level helper # Warning: Pipeline type "summarization" is no longer supported in transformers v5. # You must load the model directly (see below) or downgrade to v4.x with: # 'pip install "transformers<5.0.0' from transformers import pipeline pipe = pipeline("summarization", model="Ransilu/tinyllama-arxiv-lora")# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("Ransilu/tinyllama-arxiv-lora", dtype="auto") - PEFT
How to use Ransilu/tinyllama-arxiv-lora with PEFT:
Task type is invalid.
- Notebooks
- Google Colab
- Kaggle
Model Card for TinyLlama ArXiv Summarizer (LoRA)
Model Details
Model Description
This model is a fine-tuned version of an open-source Large Language Model for summarizing AI research paper abstracts. It uses Parameter-Efficient Fine-Tuning (PEFT) with LoRA to efficiently adapt the base model for domain-specific summarization tasks.
- Developed by: Ransilu Ranasinghe
- Model type: Causal Language Model (LLM)
- Language(s): English
- License: Apache 2.0 (inherits from base model)
- Finetuned from model: TinyLlama-1.1B-Chat
Uses
Direct Use
This model is intended for generating concise summaries of AI research paper abstracts. It can be used in:
- Research paper summarization tools
- AI-powered academic assistants
- Knowledge extraction systems
Out-of-Scope Use
- Not suitable for factual question answering without verification
- Not optimized for non-AI domains
- Should not be used for critical decision-making systems
Bias, Risks, and Limitations
- The model may generate inaccurate or incomplete summaries
- Biases present in the training dataset may be reflected in outputs
- Performance is limited due to small dataset and lightweight fine-tuning
Recommendations
- Use outputs as assistive summaries, not final conclusions
- Validate critical information from original sources
How to Get Started with the Model
Load the base model and apply the LoRA adapter using the Hugging Face Transformers and PEFT libraries.
Training Details
Training Data
- Dataset: ArXiv research paper dataset
- Domain: AI and Machine Learning research abstracts
- Dataset sourced via Hugging Face
Training Procedure
Training Hyperparameters
- Training regime: fp16 mixed precision
- Epochs: 1–2
- Method: PEFT (LoRA)
Evaluation
Metrics
- ROUGE-1
- ROUGE-2
- ROUGE-L
Results
The fine-tuned model shows improved summarization quality compared to the base model, producing more structured and relevant summaries.
Technical Specifications
Model Architecture and Objective
- Transformer-based causal language model
- Fine-tuned for text summarization using instruction-style prompts
Model Card Authors
Ransilu
Model Card Contact
LinkedIn:www.linkedin.com/in/ransiluranasinghe / GitHub profile:https://github.com/RansiluRanasinghe