File size: 1,941 Bytes
4f3c7c2
f504551
8715adc
 
 
4f3c7c2
11d3da4
8b9e033
4f3c7c2
 
8715adc
4f3c7c2
 
8715adc
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
---
title: "Codon Optimizer (codon-optimizer)"
emoji: 🧬
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 5.10.0
python_version: "3.11"
app_file: app.py
pinned: false
license: mit
---

# Codon Optimizer

Multi-objective codon optimization tool based on the GenScript GenSmart algorithm (Patent WO2020024917A1).

## Features

- **NSGA-III Optimization**: Uses multi-objective genetic algorithm for comprehensive optimization
- **Three Optimization Indices**:
  - Harmony Index: Match codon usage to highly-expressed genes
  - Codon Context Index: Optimize codon pair preferences
  - Outlier Index: Minimize adverse sequence features (GC content, repeats, motifs)
- **10 Expression Hosts**: E. coli, Human, CHO, Yeast, Mouse, Insect, and more
- **Restriction Site Exclusion**: Avoid common restriction enzyme cut sites
- **Comprehensive Metrics**: CAI, GC content, and optimization indices

## Usage

1. Paste your protein or DNA sequence
2. Select target expression organism
3. Optionally select restriction sites to exclude
4. Click "Optimize Sequence"
5. Copy the optimized DNA sequence

## Algorithm

The optimization algorithm is based on GenScript's patent WO2020024917A1, which describes:

1. **Harmony Index (H)**: Measures codon usage frequency match to reference highly-expressed genes
2. **Codon Context Index (CC)**: Evaluates codon pair placement optimization
3. **Outlier Index (OI)**: Penalizes adverse features like extreme GC content, repeats, and splice sites

The NSGA-III algorithm optimizes all three objectives simultaneously to find Pareto-optimal solutions.

## References

- [GenScript GenSmart Codon Optimization](https://www.genscript.com/tools/gensmart-codon-optimization)
- [Patent WO2020024917A1](https://patents.google.com/patent/WO2020024917A1/en)
- Deb, K., & Jain, H. (2014). An evolutionary many-objective optimization algorithm using reference-point-based nondominated sorting approach. IEEE TEVC.