mtgv/MobileLLaMA-1.4B-Base

mtgv committed on Dec 29, 2023

Commit 4d83a3b · 1 Parent(s): 542b5d8

Create README.md

Files changed (1)

README.md +23 -0

README.md ADDED

	@@ -0,0 +1,23 @@

+---
+license: apache-2.0
+datasets:
+- togethercomputer/RedPajama-Data-1T
+tags:
+- llama
+---
+# Model Summary
+MobileLLaMA-1.4B-Base is a Transformer with 1.4 billion parameters. We downscale LLaMA to facilitate off-the-shelf deployment. To make our work reproducible, all
+the models are trained on 1.3T tokens from the [RedPajama v1](https://www.together.ai/blog/redpajama) dataset only. This benefits further research by enabling controlled experiments.
+We extensively assess our models on two standard natural language benchmarks, for language understanding and common sense reasoning, respectively. Experimental results show that our
+MobileLLaMA 1.4B is on par with the most recent open-source models.
+# Model Sources
+- Repository: https://github.com/Meituan-AutoML/MobileVLM
+- Paper: https://arxiv.org/abs/2312.16886
+# How to Get Started with the Model
+Model weights can be loaded with Hugging Face Transformers. Examples can be found on [GitHub](https://github.com/Meituan-AutoML/MobileVLM).
+# Datasets and Training
+For training details, please refer to Section 4.1 of our paper: [MobileVLM: A Fast, Strong and Open Vision Language Assistant for Mobile Devices](https://arxiv.org/pdf/2312.16886.pdf).
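
The README says the weights can be loaded with Hugging Face Transformers but does not show how. Below is a minimal sketch of what that might look like. It is not taken from the commit. It assumes the Hub repository id is `mtgv/MobileLLaMA-1.4B-Base` (read off the page header), that the generic `AutoTokenizer` and `AutoModelForCausalLM` classes resolve this checkpoint, and that `torch` and `transformers` are installed. The `build_prompt` helper and its question/answer layout are hypothetical and exist only for illustration; the repository linked above may use a different recipe.

```python
# Assumed Hub repository id, taken from the page header rather than the README body.
MODEL_ID = "mtgv/MobileLLaMA-1.4B-Base"


def build_prompt(question: str) -> str:
    """Hypothetical helper: wrap a question in a plain Q/A layout for a base model."""
    return f"Q: {question}\nA:"


def generate(prompt: str, max_new_tokens: int = 32) -> str:
    """Load the checkpoint and greedily continue the prompt.

    The heavy dependencies are imported lazily so that importing this
    module does not require them or trigger a download.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)
    model.eval()

    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(
            inputs["input_ids"],
            attention_mask=inputs["attention_mask"],
            max_new_tokens=max_new_tokens,
            do_sample=False,  # greedy decoding for repeatable output
        )
    # The decoded string contains the prompt followed by the continuation.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate(build_prompt("What is the largest animal?"))` downloads the weights on first use and runs on CPU in full precision. Half precision or GPU placement are left out of the sketch to keep its assumptions few.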