mtgv committed on
Commit
4d83a3b
·
1 Parent(s): 542b5d8

Create README.md

Files changed (1)
  1. README.md +23 -0
README.md ADDED
@@ -0,0 +1,23 @@
+ ---
+ license: apache-2.0
+ datasets:
+ - togethercomputer/RedPajama-Data-1T
+ tags:
+ - llama
+ ---
+ # Model Summary
+ MobileLLaMA-1.4B-Base is a Transformer with 1.4 billion parameters. We downscale LLaMA to facilitate off-the-shelf deployment. To make our work reproducible, all
+ the models are trained on 1.3T tokens from the [RedPajama v1](https://www.together.ai/blog/redpajama) dataset only. This benefits further research by enabling controlled experiments.
+
+ We extensively assess our models on two standard natural language benchmarks, for language understanding and common sense reasoning respectively. Experimental results show that our
+ MobileLLaMA 1.4B is on par with the most recent open-source models.
+
+ # Model Sources
+ - Repository: https://github.com/Meituan-AutoML/MobileVLM
+ - Paper: https://arxiv.org/abs/2312.16886
+
+ # How to Get Started with the Model
+ Model weights can be loaded with Hugging Face Transformers. Examples can be found on [GitHub](https://github.com/Meituan-AutoML/MobileVLM).
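As a minimal sketch of loading the weights with Transformers, the snippet below wraps the standard `AutoTokenizer` / `AutoModelForCausalLM` calls in two small helpers. The repository id `mtgv/MobileLLaMA-1.4B-Base` is an assumption inferred from the committer and model name, and the helper names `load` and `complete` are hypothetical; the examples in the GitHub repository remain the authoritative reference.

```python
# Assumed Hub repository id (not stated in this README); adjust if it differs.
MODEL_ID = "mtgv/MobileLLaMA-1.4B-Base"


def load(model_id=MODEL_ID):
    """Download (or read from cache) the tokenizer and causal LM weights."""
    # Imported lazily so the helpers can be defined without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    return tokenizer, model


def complete(prompt, tokenizer, model, max_new_tokens=32):
    """Generate a continuation of `prompt` and return it as decoded text."""
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Example usage (downloads the weights on first call):
#   tokenizer, model = load()
#   print(complete("The capital of France is", tokenizer, model))
```

Because this is a base model rather than an instruction-tuned one, plain text continuation prompts like the one in the usage comment are likely the most suitable input.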
+
+ # Datasets and Training
+ For training details, please refer to Section 4.1 of our paper: [MobileVLM: A Fast, Strong and Open Vision Language Assistant for Mobile Devices](https://arxiv.org/pdf/2312.16886.pdf).