Huang Liang Hsun PRO
lianghsun
AI & ML interests
Founder of Twinkle AI. Focused on applying deep learning in legal and scientific domains, with expertise in NLP and model fine-tuning.
Recent Activity
Updated a dataset about 10 hours ago: lianghsun/tw-law
Updated a collection 2 days ago: Text-to-SQL
Updated a collection 2 days ago: Text-to-SQL
Organizations
Gemma-3-Taiwan
lianghsun/gemma-3-tw-270m
Text Generation • 0.4B • Updated • 2 • 1
lianghsun/gemma-3-tw-270m-it
Text Generation • 0.4B • Updated • 6
lianghsun/gemma-3-tw-270m-thinking
Text Generation • 0.4B • Updated
lianghsun/fineweb-edu-zhtw-classifier
Text Classification • 0.3B • Updated • 1
Marble SLM
Taiwan-Bench
Evaluation dataset in Traditional Chinese.
Taiwan-Legal-Bench
A dataset for evaluating legal models on Taiwan's laws, covering legal questions, provisions, and case law.
Google DevFest Taipei 2025
FineWeb-Edu-zhtw
lianghsun/fineweb-edu-zhtw
Viewer • Updated • 1.81M • 7 • 7
lianghsun/fineweb-edu-zhtw-sm
Viewer • Updated • 230k • 7
lianghsun/fineweb-edu-zhtw-magistral-annotations
Viewer • Updated • 5.22M • 9
lianghsun/fineweb-edu-zhtw-classifier
Text Classification • 0.3B • Updated • 1
Llama-3.2-Taiwan
Continued pre-training of the meta-llama/Llama-3.2-*B models on a large corpus of Traditional Chinese and non-Chinese language data.
Taiwan Smol Chat
Space • Paused • 3
A Traditional Chinese small language model built on Llama3.2
lianghsun/Llama-3.2-Taiwan-3B
Text Generation • 4B • Updated • 25 • 27
lianghsun/Llama-3.2-Taiwan-3B-Instruct
Text Generation • 4B • Updated • 7.59k • 27
lianghsun/Llama-3.2-Taiwan-1B
Text Generation • 1B • Updated • 6
Llama-3.2-Taiwan-Legal
Fine-tuned from the lianghsun/Llama-3.2-Taiwan-*B models on datasets of Taiwan's laws and court judgments.
My interest