File size: 1,360 Bytes
8335a6f |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 |
---
license: cc-by-nc-4.0
language:
- en
pipeline_tag: image-text-to-text
tags:
- vision
- multimodal
- reasoning
base_model: tbd
---
# Asch 0.1
An experimental image-text-to-text model by OceanirAI.
## What is this?
Asch 0.1 is an image-text-to-text model - you give it an image and text, and it generates text responses based on what it sees. Think of it as a vision-language model that can look at images and answer questions about them, describe what's happening, or help you understand visual content.
## Model Overview
ASCH is a compact, efficient vision-language model designed for advanced reasoning and multimodal understanding.
### Key Features
- Hybrid Reasoning: Structured thinking traces for multi-step decisions
- Perceptive Tool Calling: Focus system with zoom and crop capabilities
- Structured Outputs: Reliable JSON generation
- Advanced OCR: Text recognition in challenging conditions
- UI Understanding: Optimized for desktop and mobile interfaces
- Edge-Optimized: Efficient architecture for resource-constrained devices
## Model Details
- Model Type: Vision-Language Model (Image-Text-to-Text)
- Parameters: ~2B
- Architecture: Transformer-based hybrid model
- License: CC-BY-NC-4.0
- Developed by: OceanirAI
## Usage
Coming soon - model under development.
## Contact
- Organization: OceanirAI
- GitHub: github.com/Oceanir
|