ntphuc149 commited on
Commit
9e97352
·
verified ·
1 Parent(s): d0836c1

Update index.html

Browse files
Files changed (1) hide show
  1. index.html +29 -27
index.html CHANGED
@@ -164,20 +164,7 @@
164
 
165
  <div class="publication-item">
166
  <h4>
167
- [3] ViLegalBERT & ViLegalQwen: Lightweight Domain-Adaptive
168
- Language Models for Vietnamese Legal Text Processing
169
- </h4>
170
- <p class="publication-meta">
171
- <strong>Authors:</strong> <strong>Truong-Phuc Nguyen</strong>,
172
- Quy-Nhan Nguyen, Manh-Cuong Phan, Tien-Manh Tran, Huy-The Vu &
173
- Minh-Tien Nguyen<br />
174
- <strong>Status:</strong> Under Writing
175
- </p>
176
- </div>
177
-
178
- <div class="publication-item">
179
- <h4>
180
- [4] Application of Machine Learning in Image Recognition to Detect
181
  Some Abnormalities in the Examination Rooms
182
  </h4>
183
  <p class="publication-meta">
@@ -196,7 +183,19 @@
196
 
197
  <div class="publication-item">
198
  <h4>
199
- [1] UTEHY-NLU@ALQAC 2025: Dynamic Weighted Ensemble and Adaptive
 
 
 
 
 
 
 
 
 
 
 
 
200
  Reasoning for Vietnamese Legal Text Processing
201
  </h4>
202
  <p class="publication-meta">
@@ -211,7 +210,7 @@
211
 
212
  <div class="publication-item">
213
  <h4>
214
- [2] ViEduQA: A New Vietnamese Dataset for Question Answer
215
  Generation in Education
216
  </h4>
217
  <p class="publication-meta">
@@ -232,7 +231,7 @@
232
 
233
  <div class="publication-item">
234
  <h4>
235
- [3] Vietnamese Legal Question Answering: An Experimental Study
236
  </h4>
237
  <p class="publication-meta">
238
  <strong>Authors:</strong> Thu-Ha Nguyen,
@@ -305,22 +304,25 @@
305
  <div class="timeline-item">
306
  <div class="timeline-date">September 2024 – Present</div>
307
  <h3>
308
- ViLegalBERT & ViLegalQwen - Domain-specific Language Models
309
  </h3>
310
  <p style="color: var(--accent); font-weight: 600">
311
  NLU Laboratory, Hung Yen University of Technology and
312
  Education
313
  </p>
314
  <p class="timeline-content">
315
- Developing representation and generation models specifically
316
- for the legal domain in Vietnam through continual pretraining
317
- of language models on large datasets from four sources of
318
- authoritative legal documents in Vietnam. Legal pretrained
319
- models are trained on high-quality large-scale synthetic
320
- datasets, compared with base models and Vietnamese-specific
321
- models of the same size on the problems of Question Answering
322
- (True/False, Multiple-choice), Natural Language Inference,
323
- Text Classification.
 
 
 
324
  </p>
325
  <p
326
  style="
 
164
 
165
  <div class="publication-item">
166
  <h4>
167
+ [3] Application of Machine Learning in Image Recognition to Detect
 
 
 
 
 
 
 
 
 
 
 
 
 
168
  Some Abnormalities in the Examination Rooms
169
  </h4>
170
  <p class="publication-meta">
 
183
 
184
  <div class="publication-item">
185
  <h4>
186
+ [1] ViLegalLM: Language Models for Vietnamese Legal Text
187
+ </h4>
188
+ <p class="publication-meta">
189
+ <strong>Authors:</strong> <strong>Truong-Phuc Nguyen</strong>,
190
+ Quy-Nhan Nguyen, Van-Quyet Nguyen &
191
+ Minh-Tien Nguyen<br />
192
+ <strong>Status:</strong> Under Writing
193
+ </p>
194
+ </div>
195
+
196
+ <div class="publication-item">
197
+ <h4>
198
+ [2] UTEHY-NLU@ALQAC 2025: Dynamic Weighted Ensemble and Adaptive
199
  Reasoning for Vietnamese Legal Text Processing
200
  </h4>
201
  <p class="publication-meta">
 
210
 
211
  <div class="publication-item">
212
  <h4>
213
+ [3] ViEduQA: A New Vietnamese Dataset for Question Answer
214
  Generation in Education
215
  </h4>
216
  <p class="publication-meta">
 
231
 
232
  <div class="publication-item">
233
  <h4>
234
+ [4] Vietnamese Legal Question Answering: An Experimental Study
235
  </h4>
236
  <p class="publication-meta">
237
  <strong>Authors:</strong> Thu-Ha Nguyen,
 
304
  <div class="timeline-item">
305
  <div class="timeline-date">September 2024 – Present</div>
306
  <h3>
307
+ ViLegalLM: Language Models for Vietnamese Legal Text
308
  </h3>
309
  <p style="color: var(--accent); font-weight: 600">
310
  NLU Laboratory, Hung Yen University of Technology and
311
  Education
312
  </p>
313
  <p class="timeline-content">
314
+ Developing one representation (135M) and two generation (1.54B,
315
+ 1.72B) models specifically for the legal domain in Vietnam
316
+ through continual pretraining of language models on large
317
+ datasets from four sources of authoritative legal documents
318
+ in Vietnam. Legal pretrained models are trained on high-quality
319
+ large-scale synthetic datasets, compared with 7 state-of-the-art
320
+ Vietnamese general and legal LMs of the same size across 10
321
+ benchmarks spanning 4 main tasks: Information Retrieval,
322
+ Question Answering, Natural Language Inference, and Syllogism Reasoning.
323
+ ViLegalLM achieves state-of-the-art performance on 10 benchmarks,
324
+ establishes the newest strong baselines for Vietnamese Legal text
325
+ processing.
326
  </p>
327
  <p
328
  style="