GSAI-ML
/

LLaDA-8B-Instruct

Text Generation

Model card Files Files and versions

LLaDA-8B-Instruct / README.md

cherry0328's picture

Update README.md

ac38451 verified about 1 month ago

|

472 Bytes

	---
	license: mit
	library_name: transformers
	pipeline_tag: text-generation
	base_model:
	- meta-llama/Meta-Llama-3-8B
	---
	# LLaDA-8B-Instruct

	We introduce LLaDA, a diffusion model with an unprecedented 8B scale, trained entirely from scratch, rivaling LLaMA3 8B in performance.

	[Project Page](https://ml-gsai.github.io/LLaDA-demo/)

	[Code](https://github.com/ML-GSAI/LLaDA)

	## Updates
	[2025-10-21] We have modified modeling_llada.py to support the input of attention_mask.