---
license: mit
datasets:
- HuggingFaceTB/smollm-corpus
language:
- en
---
# Raw 1B Shared

## How to Get Started with the Model

Use the code below to get started with the model.

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "l2t-project/raw-1b-shared"
)
tokenizer = AutoTokenizer.from_pretrained(
    "l2t-project/raw-1b-shared"
)
```
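Once the model and tokenizer are loaded, text can be generated with the standard `transformers` generation API. This is a minimal sketch; the prompt and the `max_new_tokens` value are illustrative choices, not part of the model's documented usage.

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("l2t-project/raw-1b-shared")
tokenizer = AutoTokenizer.from_pretrained("l2t-project/raw-1b-shared")

# Tokenize an example prompt and generate a greedy continuation.
inputs = tokenizer("The quick brown fox", return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=20, do_sample=False)

# Decode the generated token ids back to text.
text = tokenizer.decode(output_ids[0], skip_special_tokens=True)
print(text)
```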
## Citation

```bibtex
@article{yamaguchi2026enhancinglinguisticcompetencelanguage,
      title={Enhancing Linguistic Competence of Language Models through Pre-training with Language Learning Tasks},
      author={Atsuki Yamaguchi and Maggie Mi and Nikolaos Aletras},
      year={2026},
      eprint={2601.03448},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2601.03448},
      journal={arXiv},
      volume={abs/2601.03448}
}
```