Buckets:

hf-doc-build
/

doc-dev

Files

xet

hf-doc-build/doc-dev / course /pr_1114 /th /chapter1 /7.md

rtrm

about 1 month ago

preview code

download

raw

2.41 kB

	# โมเดล sequence-to-sequence

	<CourseFloatingBanner
	chapter={1}
	classNames="absolute z-10 right-0 top-0"
	/>

	<Youtube id="0_4KEb08xrE" />

	โมเดล encoder-decoder (หรือเรียกอีกชื่อหนึ่งว่า โมเดล sequence-to-sequence) ใช้ทั้งสองส่วนในสถาปัตยกรรม Transformer ในแต่ละชั้น attention layer ของ encoder จะเข้าถึงคำทั้งหมดในประโยคเริ่มต้นได้ ในขณะที่ attention layer ของ decoder สามารถเข้าถึงได้เพียงคำที่อยู่ตำแหน่งก่อนหน้าคำที่กำหนดใน input เท่านั้น

	โมเดล pretrain สามารถเทรนมาในลักษณะเดียวกับโมเดล encoder หรือโมเดล decoder ก็ได้ แต่โดยมากแล้วจะซับซ้อนมากกว่า ตัวอย่างเช่น [T5](https://huggingface.co/t5-base) ถูกเทรนมาโดยการแทนที่กลุ่มคำ(ซึ่งอาจจะมีเพียงคำเดียวหรือหลายคำก็ได้)ด้วยคำพิเศษคำเดียว และเป้าหมายคือให้ทำนายข้อความที่คำพิเศษคำนี้แทนที่มา

	โมเดล sequence-to-sequence เหมาะกับงานในการสร้างประโยคขึ้นมาใหม่จาก input ที่กำหนดให้ เช่น การสรุปความ, การแปลภาษา, หรือการสร้างคำตอบจากคำถาม

	ตัวแทนโมเดลในกลุ่มนี้ได้แก่:

	- [BART](https://huggingface.co/transformers/model_doc/bart.html)
	- [mBART](https://huggingface.co/transformers/model_doc/mbart.html)
	- [Marian](https://huggingface.co/transformers/model_doc/marian.html)
	- [T5](https://huggingface.co/transformers/model_doc/t5.html)


	<EditOnGithub source="https://github.com/huggingface/course/blob/main/chapters/th/chapter1/7.mdx" />

Xet Storage Details

Size:: 2.41 kB
Xet hash:: ad3fac6fefe76be9422db318e5af60c8025011d459c22b25c5f78fb9be7fbaf0

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.