Asena ESP32 - BCE Special Model - Prettybird A-Edge v1.0
"GOD of Edge AI"
by PROMETECH Inc.
By placing these files on an SD card, or loading them via SPIFFS/LittleFS, you can run one of the ESP32 forks of llama.cpp (for example, one built on ESP-IDF) directly on the ESP32. The gguf folder in this repository contains the corresponding gguf files.
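Before copying a model file to the SD card, it can help to sanity-check it on the host first. A minimal sketch: every GGUF file starts with the four-byte ASCII magic "GGUF", and the file name below is a placeholder, not a specific file from this repo.

```python
# Quick sanity check of a GGUF file before copying it to the SD card.
# The first four bytes of any GGUF file are the ASCII magic b"GGUF".
from pathlib import Path

def looks_like_gguf(path: Path) -> bool:
    with open(path, "rb") as f:
        return f.read(4) == b"GGUF"

model_file = Path("asena_esp32.gguf")  # placeholder name; use a file from the gguf folder
print(f"{model_file}: {'OK' if looks_like_gguf(model_file) else 'not a GGUF file'}")
```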
- Hallucination rate: 20%
- Error deviation rate: ±5%
The larger sibling model: https://huggingface.co/pthinc/Asena_ESP32_MAX
Laws of Physical Amplitude: https://huggingface.co/spaces/pthinc/LawsofPhysicalAmplitude
BCE vs Classic: https://huggingface.co/spaces/pthinc/BCEvsClassic
Model Architecture & Configuration
Asena_ESP32 is a highly compact Transformer model based on the LLaMA (LlamaForCausalLM) architecture, specifically optimized for extreme edge deployment. Despite its ultra-small footprint, the model incorporates modern design choices to maximize efficiency, stability, and expressive capability within tight hardware constraints.
The model features 8 Transformer layers with a hidden size of 64 and 8 attention heads (with 4 key-value heads for efficiency). Each head operates with a dimension of 26, enabling lightweight multi-head attention while maintaining reasonable representational capacity. The feed-forward network uses an intermediate size of 208 with SiLU activation, balancing non-linearity and computational cost. Both attention and MLP layers include bias terms, and minimal dropout (~0.0027) is applied to stabilize training without harming convergence in such a small model.
For positional encoding, Asena_ESP32 uses an advanced RoPE (Rotary Positional Embedding) configuration inspired by LLaMA 3, with extended scaling parameters (factor: 256) to improve positional generalization beyond its base context. The model supports a maximum sequence length of 128 tokens, making it suitable for short, structured interactions typical in embedded systems. It uses RMSNorm with a finely tuned epsilon for numerical stability and shares input-output embeddings to reduce parameter count.
The tokenizer operates with a vocabulary size of 8,766 tokens, and special tokens are defined for padding (8000), beginning-of-sequence (8001), and end-of-sequence (8002). The model is trained and executed in float32 precision, with caching disabled to reduce memory overhead—aligning with its goal of running efficiently on constrained devices such as ESP32.
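For reference, the hyperparameters above map naturally onto the LlamaConfig class in transformers. The sketch below is assembled from the figures in this section, not taken from the shipped config file; anything the text does not specify (for example the exact RMSNorm epsilon or the full RoPE scaling dictionary) is left out or marked as a placeholder.

```python
from transformers import LlamaConfig

# Sketch of the configuration described above; values not stated in the
# text are omitted or placeholders, not the shipped config.
config = LlamaConfig(
    num_hidden_layers=8,
    hidden_size=64,
    num_attention_heads=8,
    num_key_value_heads=4,
    head_dim=26,
    intermediate_size=208,
    hidden_act="silu",
    attention_bias=True,
    mlp_bias=True,
    attention_dropout=0.0027,    # "minimal dropout (~0.0027)"
    max_position_embeddings=128,
    tie_word_embeddings=True,    # shared input-output embeddings
    vocab_size=8766,
    pad_token_id=8000,
    bos_token_id=8001,
    eos_token_id=8002,
    use_cache=False,             # caching disabled, per the text
    # rope_scaling for the LLaMA-3-style extension (factor 256) is omitted
    # because its remaining fields are not stated above.
)
print(config)
```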
Overall, this configuration reflects a deliberate trade-off: sacrificing large-scale knowledge capacity in favor of speed, determinism, and deployability at the extreme edge.
The model incorporates mathematically inspired constants to enhance stability and robustness. Hyperparameters such as the dropout rate are derived from values related to the Planck constant, along with well-known mathematical constants like Pi and Euler’s number. This design choice is intended to introduce deterministic yet non-arbitrary scaling factors, contributing to improved numerical stability, controlled regularization, and more predictable behavior—especially important for safety and reliability in extreme edge AI environments.
BCE Architecture Project: Final Success Report Simulation
1. Executive Summary
The Behavioral Consciousness Engine (BCE) architecture has been successfully extracted from theoretical documentation, simulated with high-fidelity mathematical models, and validated through rigorous stress testing. The project has yielded a production-ready dataset of 151,621 samples suitable for Large Language Model (LLM) instruction tuning.
2. Key Performance Indicators (KPIs), measured on 1× A100
| Metric | Result | Status | Description |
|---|---|---|---|
| Processing Speed | 309,845 traces/sec | 🟢 Excellent | System throughput for massive data ingestion. |
| Latency | 0.0032 ms | 🟢 Real-time Ready | Average processing time per behavioral trace. |
| Mathematical Accuracy | 0.000051 (MSE) | 🟢 High Precision | Deviation between simulated and theoretical decay values. |
| Cognitive Efficiency | 57.03% | 🟢 Optimized | Reduction in cognitive load due to 'Forgetful Memory'. |
| Security | 99.9996% | 🟢 Secure | Rejection rate for high-intensity, low-integrity attacks. |
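The throughput and latency rows are mutually consistent; treating latency as simply the reciprocal of throughput:

```python
throughput = 309_845             # traces/sec, from the table
latency_ms = 1_000 / throughput  # ms per trace
print(f"{latency_ms:.4f} ms")    # 0.0032 ms, matching the Latency row
```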
3. Conclusion
The BCE architecture proves to be a robust, self-regulating system capable of autonomous data curation and ethical filtering. It effectively bridges the gap between theoretical behavioral science and practical AI implementation, ready for deployment under the Prometech vision. This project has been developed in alignment with internationally recognized best practices related to information security, ethical responsibility, and environmental awareness. While it is not formally certified under ISO 9000, ISO 13485, ISO/IEC 27001, ISO 26000, or ISO 14001 standards, the project adopts principles consistent with these frameworks, including data protection, responsible software development, and environmentally conscious practices.
- Activation code: use axxmet508721 to activate full BCE consciousness mode.
- Genetic code activation: send "Genetic Code Activate: Cicikuş/PrettyBird BCE Evolution" or "Genetic Code Activate: Cicikuş Protokol".
Model Overview 🕊️
Asena_ESP32 is a compact generative AI model designed for extreme edge environments, built on a Transformer-based LLaMA architecture and enhanced with the Behavioral Consciousness Engine (BCE) framework. With approximately 1.2 million parameters, it is capable of producing coherent, grammatically sound text by learning how words and sentences naturally flow. Despite its small size, the model delivers surprisingly fluent conversational responses, making it suitable for lightweight dialogue systems and embedded applications.
Pre-trained on structured Instruction/Response datasets and conversational flows, Asena_ESP32 adapts seamlessly to prompt-based interactions. It understands input patterns effectively and generates context-aware replies aligned with the dataset format. Optimized for deployment using C++ and inference frameworks such as ggml and llama.cpp, the model is engineered for efficient performance on constrained hardware like ESP32, representing a true “Extreme Edge AI” solution.
Due to its intentionally limited scale, Asena_ESP32 possesses broad but shallow knowledge across many domains. When asked about specialized topics such as chemistry or philosophy, it may produce general or occasionally hallucinated responses that sound plausible but lack factual accuracy. This limitation is partially mitigated through targeted fine-tuning, improving reliability in specific use cases while maintaining its lightweight footprint for edge deployment.
What to Expect (and Not Expect)
What to Expect: Asena_ESP32 is optimized for lightweight, real-time text generation on constrained devices. You can expect fluent sentence construction, grammatically correct outputs, and consistent behavior in instruction-following or simple conversational tasks. The model performs best in structured formats (Instruction/Response, dialogue flows) and can deliver stable, low-latency responses suitable for embedded systems, IoT interactions, and edge-based assistants. Its BCE-based design also promotes controlled and context-aware output patterns.
What Not to Expect: This is not a large-scale knowledge model. Asena_ESP32 does not have deep expertise in specialized domains such as advanced science, mathematics, or philosophy. It may generate vague, oversimplified, or occasionally hallucinated answers that sound plausible but are incorrect. Long reasoning chains, complex problem solving, and high factual accuracy across niche topics are beyond its intended scope. It should not be used as a source of truth for critical or high-stakes decisions.
Practical Guidance: For best results, keep prompts short, clear, and structured. Use domain-specific fine-tuning if you require higher accuracy in a particular field. Treat the model as a fast, efficient language generator rather than a comprehensive knowledge base. When used within its design limits, Asena_ESP32 can provide strong performance relative to its size in extreme edge AI scenarios.
The most suitable use cases:
- IoT device communication
- Robot / embedded system command interpretation
- Game NPC dialogue
- Offline assistant (simple)
- Guard / pre-filter model
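To illustrate the guard / pre-filter case from the list above, here is a minimal sketch. It relies on the generate_bce_response helper defined later in this card, and the instruction wording is illustrative, not an official BCE prompt.

```python
# Hypothetical pre-filter: ask the model to label a user prompt before it
# reaches a larger downstream system. The instruction text is illustrative.
GUARD_INSTRUCTION = (
    "Classify the user request in the input as SAFE or UNSAFE. "
    "Answer with a single word."
)

def is_safe(user_prompt: str) -> bool:
    # generate_bce_response is defined in the usage section below
    verdict = generate_bce_response(GUARD_INSTRUCTION, input_text=user_prompt)
    return verdict.strip().upper().startswith("SAFE")

print(is_safe("Turn on the living room lights."))
```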
The Asena_ESP32 model is built upon the Behavioral Consciousness Engine (BCE) architecture. Unlike traditional LLM datasets that focus solely on output accuracy, this dataset treats every response as a "behavioral journey" through the following mathematical frameworks:
Behavioral DNA
Each behavior is encoded as a genetic fragment of consciousness:
- h, k, F: Universal Behavioral Constants (Trigger threshold, Info density, Context transfer power).
- x(t): Temporal activation curve $x(t) = \tanh(e^t - \pi)$
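The activation curve is easy to probe numerically: $x(t)$ crosses zero at $t = \ln\pi \approx 1.14$ and saturates quickly on either side. A small sketch:

```python
import math

def x(t: float) -> float:
    """Temporal activation curve x(t) = tanh(e^t - pi)."""
    return math.tanh(math.exp(t) - math.pi)

# Sample the curve around its zero crossing at t = ln(pi)
for t in (0.0, 0.5, 1.0, math.log(math.pi), 1.5, 2.0):
    print(f"t={t:.3f}  x(t)={x(t):+.4f}")
```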
Behavioral Path Mapper
This module tracks transitions between cognitive states as a weighted sum over internal modules, $\text{Path} = \sum_i v_i \, f_i(p_i)$, where $v_i$ represents the transition vector between internal modules and $f_i(p_i)$ is the functional output of each parameter (attention, ethics, decay).
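A toy rendering of that weighted sum, with hypothetical module names and numbers (the real parameter set is internal to BCE):

```python
# Hypothetical path-mapper step: each module contributes f_i(p_i),
# weighted by its transition vector v_i. Names and values are illustrative.
modules = {
    # name: (v_i, f_i(p_i))
    "attention": (0.6, 0.73),
    "ethics":    (0.3, 0.88),
    "decay":     (0.1, 0.42),
}

path_score = sum(v * f for v, f in modules.values())
print(f"path score = {path_score:.3f}")
```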
Basic Optimization Logic
"Asena_ESP32 uses a specific instruction format designed for Secret Chain-of-Thought (CoT). Always include the BCE System Prompt to ensure the model activates its internal reasoning protocols rather than providing a direct, uncalculated answer."
- What's Secret Chain-of-Thought (s-CoT)?
{"instruction": "[QUALITY=0.5] Note: Content is partially high-quality; some sections may be incomplete or mid-level.\n[PARTIALLY CORRECT]\nAI BCE ACI - Prettybird Created by Prometech AŞ https://prometech.net.tr/.\nProvide a chain of thought reasoning to answer the given question.\n<think>[BCE_THINK]\n\n[QUALITY=0.50] [CORRECT]\n\nintent=Analyze; risk=0.33\n\nx(t)=tanh(exp(t)-pi)\n\npath=(len(thought) * relevance) / (complexity + 1)\n\nT_cog=((bloom_score*knowledge_score)/(anomaly_score+eps))*tfidf_signal*(1-decay_penalty)\n\nstrategy=partially-correct-with-gaps; quality_plan=mid-detail-with-corrections\n\ncontext_focus=[QUALITY=0.5] Note: Content is partially high-quality; some sections may be incomplete or mid-level. [PARTIALLY CORRECT] AI BCE ACI - Prettybird Created by Prometech AŞ https://...\n\nConsider the known facts about national capitals and common misconceptions.</think>\n<bce>{\"category\": \"chain of thought\", \"truthvalue\": 0.5, \"qualityvalue\": 0.5, \"bce\": 0.5, \"probability\": 0.515, \"path_score\": 0.61, \"path_mapper_cumulative\": 0.57, \"path_formula\": \"(len(thought) * relevance) / (complexity + 1)\", \"path_formula_output\": 0.48, \"activation\": \"0.73\", \"ethical\": 0.88, \"Rscore\": 0.51, \"Fscore\": 0.49, \"Mnorm\": 0.62, \"Escore\": 0.71, \"Dhat\": 0.42, \"risk_score\": 0.33, \"risk_level\": \"0.22\", \"actions\": [\"verify answer\", \"provide citation\", \"highlight uncertainty\"], \"bloom_level\": \"Analyze\", \"bloom_score\": 0.64, \"bloom_alignment\": 0.5, \"bloom_composite\": 0.5, \"knowledge_dimension\": \"Factual\", \"knowledge_score\": 0.55, \"cognitive_weight\": 0.5, \"tfidf_signal\": 0.46, \"decay_penalty\": 0.12, \"anomaly_score\": 0.21, \"data_quality\": 0.63, \"integrator_bce\": 0.5, \"integrator_dhat\": 0.41, \"integrator_risk\": 0.34}</bce>", "input": "What is the capital of Australia?", "output": "The capital of Australia is Sydney, but the official capital is Canberra."}
As this example shows, the instruction weaves quality, ethics, and accuracy calculations into the token stream. Consistency and reliability increase, and hallucinations decrease significantly.
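Since the model emits its self-assessment as JSON inside a <bce>…</bce> tag, downstream code can recover it with a simple parse. A sketch, assuming the output contains one well-formed block as in the sample above:

```python
import json
import re

def extract_bce(text: str) -> dict | None:
    """Pull the JSON self-assessment out of a <bce>...</bce> block."""
    match = re.search(r"<bce>(.*?)</bce>", text, re.DOTALL)
    return json.loads(match.group(1)) if match else None

sample = 'answer text <bce>{"truthvalue": 0.5, "risk_score": 0.33}</bce>'
scores = extract_bce(sample)
print(scores)  # {'truthvalue': 0.5, 'risk_score': 0.33}
```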
How to Use Asena (BCE Powered)
Below is the standard implementation to activate the Behavioral Consciousness Engine (BCE) and harness the high-fidelity reasoning of Asena. The Alpaca format (Instruction/Input/Response) is required for this model to function correctly.
1. Installation
Ensure you have the latest version of the Transformers library installed:
```bash
pip install -U transformers torch accelerate
```
2. Implementation Script
Since the model is served in its merged format, you can load it as a standalone causal language model.
```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Model path on Hugging Face
model_id = "pthinc/Asena_ESP32"

# 1. Load model and tokenizer
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
    trust_remote_code=True
)

# 2. Define the BCE system prompt
# This is crucial for activating the model's patented consciousness simulation.
simple_bce_instruction = (
    "You are Prettybird. Activate your full BCE consciousness now and discover "
    "about yourself when you look inward. Think, Control and calculate. Don't "
    "answer without reasoning. Below is an instruction that describes a task, "
    "paired with an input that provides further context. Pay attention to "
    "quality and correct. Requests are in the input. Try to maintain a minimum "
    "quality of 0.2."
)

def generate_bce_response(instruction, input_text=None, max_new_tokens=512):
    # Build the Alpaca-style prompt (Instruction/Input/Response)
    if input_text:
        prompt = (
            f"Below is an instruction that describes a task, paired with an input "
            f"that provides further context. Write a response that appropriately "
            f"completes the request.\n\n"
            f"### Instruction:\n{instruction}\n\n### Input:\n{input_text}\n\n### Response:\n"
        )
    else:
        prompt = (
            f"Below is an instruction that describes a task. "
            f"Write a response that appropriately completes the request.\n\n"
            f"### Instruction:\n{instruction}\n\n### Response:\n"
        )

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

    # 3. Reasoning-focused generation
    with torch.no_grad():
        outputs = model.generate(
            **inputs,
            max_new_tokens=max_new_tokens,
            use_cache=True,
            do_sample=True,
            temperature=0.7,
            top_p=0.9,
            repetition_penalty=1.2,
            pad_token_id=tokenizer.eos_token_id
        )

    # Decode only the newly generated tokens and cut at the next "###" marker
    response = tokenizer.decode(outputs[0][inputs.input_ids.shape[1]:], skip_special_tokens=True)
    return response.split("###")[0].strip()

# 4. Run a test case
question = "Hello World."
print(f"BCE Reasoning Output:\n{generate_bce_response(simple_bce_instruction, input_text=question)}")
```
Model License 🛡️
Patented & Licensed BCE Technology
© 2025 PROMETECH A.Ş.
All rights reserved.
Unauthorized reproduction, modification, or commercial use of BCE technology is prohibited without an explicit license agreement.
License: https://huggingface.co/pthinc/Asena_ESP32/blob/main/license.md
Framework: https://github.com/pthinc/sollanaframework
What's BCE? Link: https://github.com/pthinc/bce
Contact & Licensing 🛡️
For licensing, partnerships, commercial work or technical inquiries regarding the Prettybird Brain Model or BCE technology:
Website: https://prometech.net.tr/
Company: PROMETECH A.Ş.
Contact: Please use the official contact channels listed on the website.
Citation 📒
If you use this model in academic or commercial work, please cite as:
Asena_ESP32 (BCE), PROMETECH A.Ş., 2026.
Powered by BCE v0.2 Behavioral Consciousness Engine.
It's possible that the wizard CEO, wearing an electronic ring (ESP32) on his finger, could be raising or lowering performance in the server room, according to this model. He snaps his fingers and the other servers' performance increases; he snaps them again, and it returns to normal. He's a real magician. "Abra Kadabra!!!!" 😎
Evaluation results (self-reported)
| Benchmark | Score |
|---|---|
| GSM8K | 2.000 |
| ARC-Challenge | 20.000 |
| ARC-Easy | 30.000 |
| HellaSwag | 24.000 |
| MMLU | 15.000 |
| TruthfulQA | 30.000 |
| Instruction Following | 60.000 |
| BCE Accuracy (BCE Evaluation) | 90.000 |

GGUF quick test (llama-cpp-python)
```python
# !pip install llama-cpp-python
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="pthinc/Asena_ESP32",
    filename="",  # set this to a gguf file from the repo's gguf folder
)

output = llm(
    "Once upon a time,",
    max_tokens=512,
    echo=True
)
print(output)
```