| --- |
| title: FunGO |
| emoji: 𧬠|
| colorFrom: blue |
| colorTo: green |
| sdk: gradio |
| sdk_version: 4.44.0 |
| app_file: app.py |
| pinned: false |
| license: mit |
| short_description: Protein Function Prediction using ESM2 + XGBoost |
| --- |
| |
| # FunGO β Protein Function Prediction |
|
|
| **Beyond Prediction β Understanding Function.** |
|
|
| FunGO predicts Gene Ontology (GO) terms for protein sequences using: |
| - **ESM2-t36-3B** β protein language model embeddings (layers 30β35) |
| - **XGBoost classifiers** β 4,133 GO-term specific models |
| - **Evidence-tiered filtering** β Strong / Moderate / Indicative |
|
|
| ## Evidence Tiers |
|
|
| | Tier | IA Weight | Confidence | Description | |
| |------|-----------|------------|-------------| |
| | Strong Evidence | > 5.0 | β₯ 0.30 | Highly specific GO term | |
| | Moderate Evidence | > 2.0 | β₯ 0.50 | Moderately specific term | |
| | Indicative | > 1.0 | β₯ 0.65 | Lower specificity, high confidence | |
|
|
| ## Ontologies Covered |
|
|
| - **MFO** β Molecular Function |
| - **BPO** β Biological Process |
| - **CCO** β Cellular Component |
|
|
| ## Development Team |
|
|
| - **Dr. Beenish Maqsood** β Principal Investigator, School of Biochemistry and Biotechnology, University of the Punjab |
| - **Dr. Naeem Mahmood** β Co-Supervisor, School of Biochemistry and Biotechnology, University of the Punjab |
| - **Muteeba Azhar** β Lead Developer, MS Researcher, University of the Punjab |
|
|