teohyc 's Collections

Multimodal Model📖👁️

Experimental vision-language models integrating visual encoders with large language models, exploring multimodal alignment and reasoning capabilities.