Minh-Thien Nguyen
minhnguyent546
AI & ML interests
Research interests: Embeddings Models, Image-Text retrieval for Vietnamese, Optimal transport, RAG, and Image classification. Some pet projects on distributed training, training model on TPU, RAG for complex document multiple-choice QA.
Recent Activity
updated a dataset 1 day ago
minhnguyent546/viclip-ot-datasets updated a collection 4 days ago
cotu-legal-retriever updated a model 4 days ago
minhnguyent546/noname002-jina-embeddings-v5-text-nano-retrieval-df-3fre-culturalY-vi-stage1Organizations
[model] Machine Translation Models
[dataset] embeddings-and-retrieval-learning
Datasets for training embeddings models (and fine-tuning for retrieval tasks)
[model] embeddings
cotu-legal-retriever
cotu-legal-retriever is a family of models optimized for Vietnamese legal retrieval tasks.
-
minhnguyent546/cotu-legal-retriever-Octen-Embedding-4B-stage1
Sentence Similarity • 4B • Updated • 40 -
minhnguyent546/cotu-legal-retriever-Qwen3-Embedding-4B-stage1
Sentence Similarity • 4B • Updated • 52 -
minhnguyent546/cotu-legal-retriever-Qwen3-Embedding-8B-stage1
Sentence Similarity • 8B • Updated • 23 -
minhnguyent546/KaLM-Embedding-Gemma3-12B-2511-tokenizer-for-transformers-v5
Updated
[dataset] image-text datasets
[dataset] text-generation
Med-Alpaca
e2026
cotu-legal-retriever
cotu-legal-retriever is a family of models optimized for Vietnamese legal retrieval tasks.
-
minhnguyent546/cotu-legal-retriever-Octen-Embedding-4B-stage1
Sentence Similarity • 4B • Updated • 40 -
minhnguyent546/cotu-legal-retriever-Qwen3-Embedding-4B-stage1
Sentence Similarity • 4B • Updated • 52 -
minhnguyent546/cotu-legal-retriever-Qwen3-Embedding-8B-stage1
Sentence Similarity • 8B • Updated • 23 -
minhnguyent546/KaLM-Embedding-Gemma3-12B-2511-tokenizer-for-transformers-v5
Updated
[model] Machine Translation Models
[dataset] image-text datasets
[dataset] embeddings-and-retrieval-learning
Datasets for training embeddings models (and fine-tuning for retrieval tasks)
[dataset] text-generation
[model] embeddings
Med-Alpaca