OTel-Embedding-568M

OTel-Embedding-568M is a telecom-specialized embedding model fine-tuned on telecommunications domain data. It is part of the OTel Family of Models, an open-source initiative to build industry-standard AI models for the global telecommunications sector.

Model Details

Attribute Value
Base Model BAAI/bge-m3
Parameters 568M
Training Method Full parameter fine-tuning
Language English
License Apache 2.0

Training Data

The model was trained on high-quality telecom-focused data curated by 200+ domain experts from organizations including AT&T, GSMA, Purdue University, Khalifa University, University of Leeds, Yale University, and others.

Data Sources:

  • GSMA Permanent Reference Documents
  • 3GPP Specifications
  • O-RAN Documentation
  • RFC Series
  • eSIM, terminals, security, networks, roaming, APIs
  • Industry whitepapers and telecom academic papers

Intended Use

This model is optimized for:

  • RAG applications in telecommunications
  • Question answering on telecom specifications and standards

Related Models

Language Models

Embedding Models

Reranker Models

Related Datasets

Training Infrastructure

  • Framework: ScalarLM (GPU-agnostic)
  • Compute: TensorWave with AMD GPUs and Azure with Nvidia GPUs.

Citation

@misc{otel2026,
  title={OTel: Open Telco AI Models},
  author={Tavakkoli, Farbod and Diamos, Gregory and Paulk, Roderic and Terrazas, Jorden},
  year={2026},
  url={https://huggingface.co/farbodtavakkoli}
}

Contact

For questions or collaboration inquiries: farbod.tavakkoli@att.com or farbodtavakoli@gmail.com

Downloads last month
12
Safetensors
Model size
0.6B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for farbodtavakkoli/OTel-Embedding-568M

Base model

BAAI/bge-m3
Finetuned
(393)
this model

Collection including farbodtavakkoli/OTel-Embedding-568M