Spaces:
Running
Running
+ emoji
Browse files- .github/README.md +1 -1
- README.md +1 -1
.github/README.md
CHANGED
|
@@ -23,7 +23,7 @@ Instead of asking "Summarize this video," Aileen 3 Core allows users to task a L
|
|
| 23 |
*"Here is what I already know, and here is what I expect the speaker to say. Tell me only where they deviate from this baseline."*. As part of a larger agentic AI system, the prior knowledge can even be derived from a memory bank.
|
| 24 |
|
| 25 |
|
| 26 |
-
### Key Capabilities
|
| 27 |
* **β³οΈ Expectation-Driven Briefings:** Uses Google Gemini to analyze audio/video against user-supplied priors (context, expectations, and knowledge gaps) to surface genuine surprises.
|
| 28 |
* **π Context-Biased Transcription:** Prevents hallucinations (e.g., confusing the German treaty "NOOTS" for "emergency state") by feeding media metadata as priors to the model.
|
| 29 |
* **πΌοΈ Visual Slide Extraction:** Automatically detects, extracts, and, on request, translates slide stills from video feeds, treating slides as high-density information artifacts.
|
|
|
|
| 23 |
*"Here is what I already know, and here is what I expect the speaker to say. Tell me only where they deviate from this baseline."*. As part of a larger agentic AI system, the prior knowledge can even be derived from a memory bank.
|
| 24 |
|
| 25 |
|
| 26 |
+
### πͺ Key Capabilities
|
| 27 |
* **β³οΈ Expectation-Driven Briefings:** Uses Google Gemini to analyze audio/video against user-supplied priors (context, expectations, and knowledge gaps) to surface genuine surprises.
|
| 28 |
* **π Context-Biased Transcription:** Prevents hallucinations (e.g., confusing the German treaty "NOOTS" for "emergency state") by feeding media metadata as priors to the model.
|
| 29 |
* **πΌοΈ Visual Slide Extraction:** Automatically detects, extracts, and, on request, translates slide stills from video feeds, treating slides as high-density information artifacts.
|
README.md
CHANGED
|
@@ -37,7 +37,7 @@ Instead of asking "Summarize this video," Aileen 3 Core allows users to task a L
|
|
| 37 |
*"Here is what I already know, and here is what I expect the speaker to say. Tell me only where they deviate from this baseline."*. As part of a larger agentic AI system, the prior knowledge can even be derived from a memory bank.
|
| 38 |
|
| 39 |
|
| 40 |
-
### Key Capabilities
|
| 41 |
* **β³οΈ Expectation-Driven Briefings:** Uses Google Gemini to analyze audio/video against user-supplied priors (context, expectations, and knowledge gaps) to surface genuine surprises.
|
| 42 |
* **π Context-Biased Transcription:** Prevents hallucinations (e.g., confusing the German treaty "NOOTS" for "emergency state") by feeding media metadata as priors to the model.
|
| 43 |
* **πΌοΈ Visual Slide Extraction:** Automatically detects, extracts, and, on request, translates slide stills from video feeds, treating slides as high-density information artifacts.
|
|
|
|
| 37 |
*"Here is what I already know, and here is what I expect the speaker to say. Tell me only where they deviate from this baseline."*. As part of a larger agentic AI system, the prior knowledge can even be derived from a memory bank.
|
| 38 |
|
| 39 |
|
| 40 |
+
### πͺ Key Capabilities
|
| 41 |
* **β³οΈ Expectation-Driven Briefings:** Uses Google Gemini to analyze audio/video against user-supplied priors (context, expectations, and knowledge gaps) to surface genuine surprises.
|
| 42 |
* **π Context-Biased Transcription:** Prevents hallucinations (e.g., confusing the German treaty "NOOTS" for "emergency state") by feeding media metadata as priors to the model.
|
| 43 |
* **πΌοΈ Visual Slide Extraction:** Automatically detects, extracts, and, on request, translates slide stills from video feeds, treating slides as high-density information artifacts.
|