ANISH-j/models-for-echo-application

ANISH-j committed on Jan 10

Commit dd48e03 · verified · 1 Parent(s): d9d46c5

Update README.md (#1)

- Update README.md (73f2e2353217e155abc045a6edc3788856bd8572)

Files changed (1)

README.md +73 -1

@@ -1,4 +1,76 @@
 ---
 license: apache-2.0
 ---
-models are present at this page = [click here](https://huggingface.co/ANISH-j/models-for-echo-application/tree/main)

+# Models for Echo Application
+This repository contains **LiteRT-compatible language model variants** used by the AI engine of the **Echo application**.
+All models here are optimized and validated specifically for the **LiteRT adaptation** of the framework on which the application's AI engine is built.
+The models listed below are the **stable, working variants** the application uses for chat.
+Repository link:
+https://huggingface.co/ANISH-j/models-for-echo-application/tree/main
+---
+## Supported Model Variants
+### 1. `Gemma3-1B-IT_multi-prefill-seq_q4_ekv4096.litertlm`
+- **Model family:** Gemma 3
+- **Size:** 1B parameters
+- **Quantization:** Q4
+- **Format:** LiteRT model (`.litertlm`)
+- **KV Cache:** Extended KV (4096 tokens)
+- **Features:**
+  - Multi-prefill sequence support
+  - Optimized memory usage
+  - Efficient long-context chat handling
+**Recommended for:**
+Chat scenarios requiring longer conversational context with optimized KV-cache performance.
+---
+### 2. `gemma3-1b-it-int4.task`
+- **Model family:** Gemma 3
+- **Size:** 1B parameters
+- **Quantization:** INT4
+- **Format:** LiteRT task bundle (`.task`), the packaging used by the MediaPipe LLM Inference API
+- **Features:**
+  - Low-latency inference
+  - Compact model size
+  - Stable real-time chat performance
+**Recommended for:**
+Low-resource or latency-sensitive chat applications.
+---
+## Framework Compatibility
+- Compatible with **LiteRT runtime**
+- Tested with the **Echo application AI engine**
+- Designed for **instruction-tuned (IT)** chat behavior
+- Not intended for direct PyTorch or TensorFlow usage without conversion
+---
+## Repository Structure
+models-for-echo-application/
+├── Gemma3-1B-IT_multi-prefill-seq_q4_ekv4096.litertlm
+├── gemma3-1b-it-int4.task
+└── README.md
+---
+## License
+Licensed under the **Apache License 2.0**.
+You may use, modify, and distribute these models in compliance with the license.
+---
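
The README above lists two variants and says when each is recommended, but it does not show how a client might choose between them or fetch the file. The following is a minimal Python sketch of that step, not part of the commit. The `ModelVariant` class, `pick_variant`, and `download_url` are hypothetical names introduced for illustration. The URL layout assumes the Hub's usual `/resolve/<revision>/<filename>` download path, and the selection rule only mirrors the "Recommended for" notes.

```python
from dataclasses import dataclass
from typing import Optional
from urllib.parse import quote

# Repository id taken from the README's repository link.
REPO_ID = "ANISH-j/models-for-echo-application"


@dataclass(frozen=True)
class ModelVariant:
    """Hypothetical record of the attributes the README lists per variant."""
    filename: str
    quantization: str
    file_format: str
    kv_cache_tokens: Optional[int]  # None where the README states no KV-cache size


# The two variants described in the README, in the order they appear.
VARIANTS = [
    ModelVariant(
        "Gemma3-1B-IT_multi-prefill-seq_q4_ekv4096.litertlm", "Q4", ".litertlm", 4096
    ),
    ModelVariant("gemma3-1b-it-int4.task", "INT4", ".task", None),
]


def pick_variant(long_context: bool) -> ModelVariant:
    """Assumed rule: long conversations get the extended-KV variant,
    low-resource or latency-sensitive use gets the compact INT4 one."""
    return VARIANTS[0] if long_context else VARIANTS[1]


def download_url(variant: ModelVariant, revision: str = "main") -> str:
    """Build a direct download URL (assumes the /resolve/ path layout)."""
    return (
        f"https://huggingface.co/{REPO_ID}/resolve/"
        f"{quote(revision)}/{quote(variant.filename)}"
    )


if __name__ == "__main__":
    chosen = pick_variant(long_context=True)
    print(chosen.filename)
    print(download_url(chosen))
```

The sketch only builds the URL and leaves the download to the caller, so it runs without network access or extra dependencies. Loading the file is left to the LiteRT runtime, since the README states the models are not meant for direct PyTorch or TensorFlow use.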