--- language: rna library_name: multimolecule license: agpl-3.0 pipeline: rna-secondary-structure pipeline_tag: other tags: - Biology - RNA widget: - example_title: microRNA 21 output: text: '......................' pipeline_tag: rna-secondary-structure sequence_type: ncRNA task: rna-secondary-structure text: UAGCUUAUCAGACUGAUGUUGA - example_title: microRNA 146a output: text: '......................' pipeline_tag: rna-secondary-structure sequence_type: ncRNA task: rna-secondary-structure text: UGAGAACUGAAUUCCAUGGGUU - example_title: microRNA 155 output: text: '........................' pipeline_tag: rna-secondary-structure sequence_type: ncRNA task: rna-secondary-structure text: UUAAUGCUAAUCGUGAUAGGGGUU - example_title: RNA component of mitochondrial RNA processing endoribonuclease output: text: '..............................([(..........(...{..([[.....<LMNabABE>FGLOJ))}](a(c(>[[MMbcdNaO>a}be{N{Oad{OLcaO>{cKad)](>[>bfgc)e}f)g(fgh}}gdiejdjh)kkcelb)f]}balgmmngdel.]nhom.o.oo.n.odh............................' pipeline_tag: rna-secondary-structure sequence_type: ncRNA task: rna-secondary-structure text: GGUUCGUGCUGAAGGCCUGUAUCCUAGGCUACACACUGAGGACUCUGUUCCUCCCCUUUCCGCCUAGGGGAAAGUCCCCGGACCUCGGGCAGAGAGUGCCACGUGCAUACGCACGUAGACAUUCCCCGCUUCCCACUCCAAAGUCCGCCAAGAAGCGUAUCCCGCUGAGCGGCGUGGCGCGGGGGCGUCAUCCGUCAGCUCCCUCUAGUUACGCAGGCAGUGCGUGUCCGCGCACCAACCACACGGGGCUCAUUCUCAGCGCGGCUGUAAAAAAAAA - example_title: 7SK small nuclear RNA output: text: '.....................([{{.<....A..B.A.B..A....A..C......C.C...D..DEDEFECDDE)]}}(([>a{aa[<LHaMJKL][b[bcCK}d{{MNOPJ([{[BLCe))(]O.]>a>bkihfdfce)ie{fikgjk)d]h>}}heaf()llkl]km>iianoaopoklgmpijhi..................' pipeline_tag: rna-secondary-structure sequence_type: ncRNA task: rna-secondary-structure text: GGAUGUGAGGGCGAUCUGGCUGCGACAUCUGUCACCCCAUUGAUCGCCAGGGUUGAUUCGGCUGAUCUGGCUGGCUAGGCGGGUGUCCCCUUCCUCCCUCACCGCUCCAUGUGCGUCCCUCCCGAAGCUGCGCGCUCGGUCGAAGAGGACGACCAUCCCCGAUAGAGGAGGACCGGUCUUCGGUCAAGGGUAUACGAGUAGCUGCGCUCCCCUGCUAGAACCUCCAAACAAGCUCUCAAGGUCCAUUUGUAGGAGAACGUAGGGUAGUCAAGCUUCCAAGACUCCAGACACAUCCAAAUGAGGCGCUGCAUGUGGCAGUCUGCCUUUCUUUU - example_title: telomerase RNA component output: text: '.............................................((([.{IJaKaD]MJNIK]FEGH}LMN))b}(>KaNObMbHLNOPNcIaOcPbcOb]PdPMQ[dRc[BecMSeOdeR)>R(feTUaKNgMUd}effe]befRbP]hV{gh>{]chpdnm{pohka)ipj(qek>rjjnkr]Cnstop>aulf)mro]i}gmhmjgrirobpisudbecjjnkkktvljtwvuump()xvk..noh..................' pipeline_tag: rna-secondary-structure sequence_type: ncRNA task: rna-secondary-structure text: GGGUUGCGGAGGGUGGGCCUGGGAGGGGUGGUGGCCAUUUUUUGUCUAACCCUAACUGAGAAGGGCGUAGGCGCCGUGCUUUUGCUCCCCGCGCGCUGUUUUUCUCGCUGACUUUCAGCGGGCGGAAAAGCCUCGGCCUGCCGCCUUCCACCGUUCAUUCUAGAGCAAACAAAAAAUGUCAGCUGCUGGCCCGUUCGCCCCUCCCGGGGACCUGCGGCGGGUCGCCUGCCCAGCCCCCGAACCCCGCCUGGAGGCCGCGGUCGGCCCGGGGCUUCUCCGGAGGCACCCACUGCCACCGCGAAGAGUUGGGCUCUGUCAGCCGCGGGUCUCUCGGGGGCGAGGGCGAGGUUCAGGCCUUUCAGGCCGCAGGAAGAGGAACGGAGCGAGUCCCCGCGCGCGGCGCGAUUCCCUGAGCUGUGGGACGUGCACCCAGGACUCGGCUCACACAUGC - example_title: vault RNA 2-1 output: text: '..........................([{abb([c()[)(({][...a............................' pipeline_tag: rna-secondary-structure sequence_type: ncRNA task: rna-secondary-structure text: CGGGUCGGAGUUAGCUCAAGCGGUUACCUCCUCAUGCCGGACUUUCUAUCUGUCCAUCUCUGUGCUGGGGUUCGAGACCCGCGGGUGCUUACUGACCCUUUUAUGCAA - example_title: brain cytoplasmic RNA 1 output: text: '............................(((...[...........................................{<<<{A{AA{B[{{<....)..)}>}.....}a}>>.a>ba.b....a.........................................' pipeline_tag: rna-secondary-structure sequence_type: ncRNA task: rna-secondary-structure text: GGCCGGGCGCGGUGGCUCACGCCUGUAAUCCCAGCUCUCAGGGAGGCUAAGAGGCGGGAGGAUAGCUUGAGCCCAGGAGUUCGAGACCUGCCUGGGCAAUAUAGCGAGACCCCGUUCUCCAGAAAAAGGAAAAAAAAAAACAAAAGACAAAAAAAAAAUAAGCGUAACUUCCCUCAAAGCAACAACCCCCCCCCCCCUUU - example_title: HIV-1 TAR-WT output: text: '.........................................................' pipeline_tag: rna-secondary-structure sequence_type: ncRNA task: rna-secondary-structure text: GGUCUCUCUGGUUAGACCAGAUCUGAGCCUGGGAGCUCUCUGGCUAACUAGGGAACC - example_title: prion protein (Kanno blood group) output: text: '..................................................................' pipeline_tag: rna-secondary-structure sequence_type: mRNA task: rna-secondary-structure text: AUGGCGAACCUUGGCUGCUGGAUGCUGGUUCUCUUUGUGGCCACAUGGAGUGACCUGGGCCUCUGC - example_title: interleukin 10 output: text: '......................................................' pipeline_tag: rna-secondary-structure sequence_type: mRNA task: rna-secondary-structure text: AUGCACAGCUCAGCACUGCUCUGUUGCCUGGUCCUCCUGACUGGGGUGAGGGCC - example_title: Zaire ebolavirus output: text: '..............................(........................[[[.{[<.{[[{{[[<{{AAA<{A{<[B{<aH]}IHJJKLK}>].[LLLa>Lb)LML}NNc)OdMO..OP)QedNQ)O.>Pfg).d}RaSghhi>ITbjchjJb(e]cdBaf]kl}deCl]dB)}lgl]m)lfn]lo()am}enk>ohoppbiqbeqo(elcnrastd)([j)]i.()..........................' pipeline_tag: rna-secondary-structure sequence_type: mRNA task: rna-secondary-structure text: AAUGUUCAAACACUUUGUGAAGCUCUGUUAGCUGAUGGUCUUGCUAAAGCAUUUCCUAGCAAUAUGAUGGUAGUCACAGAGCGUGAGCAAAAAGAAAGCUUAUUGCAUCAAGCAUCAUGGCACCACACAAGUGAUGAUUUUGGUGAGCAUGCCACAGUUAGAGGGAGUAGCUUUGUAACUGAUUUAGAGAAAUACAAUCUUGCAUUUAGAUAUGAGUUUACAGCACCUUUUAUAGAAUAUUGUAACCGUUGCUAUGGUGUUAAGAAUGUUUUUAAUUGGAUGCAUUAUACAAUCCCACAGUGUUAU - example_title: SARS coronavirus output: text: '..................................................((..[.....{......{(<([[{.[{[[A[{{<.A(ABA(AB{([(CCC.DE[A{[HECFGHHA)K)F>JGMKOIHGJaF]N}JNLHMG>MOPI>QabL>B>gCdeQ)]RfgSRhTiSCg(eUfjhfDSTf}fEgk}ghi>GGalcqe))](rkdnels(sbrcdfjom}cncke}koteliaefspl>gggfbui)gh)hmunnhaoq()([m)]rtklu..............................' pipeline_tag: rna-secondary-structure sequence_type: mRNA task: rna-secondary-structure text: AUGUUUAUUUUCUUAUUAUUUCUUACUCUCACUAGUGGUAGUGACCUUGACCGGUGCACCACUUUUGAUGAUGUUCAAGCUCCUAAUUACACUCAACAUACUUCAUCUAUGAGGGGGGUUUACUAUCCUGAUGAAAUUUUUAGAUCAGACACUCUUUAUUUAACUCAGGAUUUAUUUCUUCCAUUUUAUUCUAAUGUUACAGGGUUUCAUACUAUUAAUCAUACGUUUGACAACCCUGUCAUACCUUUUAAGGAUGGUAUUUAUUUUGCUGCCACAGAGAAAUCAAAUGUUGUCCGUGGUUGGGUUUUUGGUUCUACCAUGAACAACAAGUCACAGUCGGUGAUUAUUAUUAACAAUUCUACUAAUGUUGUUAUACGAGCAUGUAACUUUGAAUUGUGUGACAACCCUUUCUUUGCUGUUUCUAAACCCAUGGGUACACAGACACAUACUAUGAUAUUCGAUAAUGCAUUUAAAUGCACUUUCGAGUACAUAUCU - example_title: insulin output: text: '.....................([{...>{<{[([{[{{)KcLa([I){LKaKKbadLd]eaA]M)bK}>ALMe}e].Kde)bNfLgMbNgGN>)(dcbkd)[fCeCagOefakOlOkPg(ek}jffOfj))gmhed>}k]klmlgiab)ncgn>loomglpn()f(g(oh)c>[o())]......................' pipeline_tag: rna-secondary-structure sequence_type: mRNA task: rna-secondary-structure text: AUGGCCCUGUGGAUGCGCCUCCUGCCCCUGCUGGCGCUGCUGGCCCUCUGGGGACCUGACCCAGCCGCAGCCUUUGUGAACCAACACCUGUGCGGCUCACACCUGGUGGAAGCUCUCUACCUAGUGUGCGGGGAACGAGGCUUCUUCUACACACCCAAGACCCGCCGGGAGGCAGAGGACCUGCAGGUGGGGCAGGUGGAGCUGGGCGGGGGCCCUGGUGCAGGCAGCCUGCAGCCCUUGGCCCUGGAGGGGUCCCUGCAGAAGCGUGGCAUUGUGGAACAAUGCUGUACCAGCAUCUGCUCCCUCUACCAGCUGGAGAACUACUGCAACUAG - example_title: cyclin dependent kinase inhibitor 2A output: text: '.......(.[...{{{..<.A..B...<)(](((.[..([[CDD(C[[DDCC[DAE(FAEGBBGFDCB}}FC}{GH{EH{I>IaJN(aL(aIJKMOAJAOKLbbPcMM[KcLdNePQL(ce>LRMNOePRf[{))MfP]SLIMJQSgJbORT]OPUfgcN}dh}])SO>{a]fl}gjfnhmc[lagd[)(o{loaf(kain{g(boebo{]h[jjd[kCaompn)plfmkmadqijliljmqrdgjbrrng)b.rk.}ssoppo}lk]s>l)}ptuctjv.qrms].]r.kimhs.kuul.tlu...vbv...........................' pipeline_tag: rna-secondary-structure sequence_type: mRNA task: rna-secondary-structure text: AUGGAGCCGGCGGCGGGGAGCAGCAUGGAGCCUUCGGCUGACUGGCUGGCCACGGCCGCGGCCCGGGGUCGGGUAGAGGAGGUGCGGGCGCUGCUGGAGGCGGGGGCGCUGCCCAACGCACCGAAUAGUUACGGUCGGAGGCCGAUCCAGGUCAUGAUGAUGGGCAGCGCCCGAGUGGCGGAGCUGCUGCUGCUCCACGGCGCGGAGCCCAACUGCGCCGACCCCGCCACUCUCACCCGACCCGUGCACGACGCUGCCCGGGAGGGCUUCCUGGACACGCUGGUGGUGCUGCACCGGGCCGGGGCGCGGCUGGACGUGCGCGAUGCCUGGGGCCGUCUGCCCGUGGACCUGGCUGAGGAGCUGGGCCAUCGCGAUGUCGCACGGUACCUGCGCGCGGCUGCGGGGGGCACCAGAGGCAGUAACCAUGCCCGCAUAGAUGCCGCGGAAGGUCCCUCAGACAUCCCCGAUUGA - example_title: human papillomavirus type 16 E6 output: text: '........................................(([(......{....<...A((.[{([<(AB([{(A<({[{[A(BB{[[[[CD<{({YaQQ>}TRY]{U.]R)>b)XaVJSb>Nbbcbdaee[[B..ffYg)W])ghhZg].hZ)DEi.]F}}jh.iG}j...ik..]jkl}}Im>nnamooppqnJqqrro)(b]J>pKabLqqqsccdCd][rsteee}[>sfutvf{gEwtauvg..}>e)...ea.ahil.(..b.bfmh.bin[)j]k..............................' pipeline_tag: rna-secondary-structure sequence_type: mRNA task: rna-secondary-structure text: AUGCACCAAAAGAGAACUGCAAUGUUUCAGGACCCACAGGAGCGACCCAGAAAGUUACCACAGUUAUGCACAGAGCUGCAAACAACUAUACAUGAUAUAAUAUUAGAAUGUGUGUACUGCAAGCAACAGUUACUGCGACGUGAGGUAUAUGACUUUGCUUUUCGGGAUUUAUGCAUAGUAUAUAGAGAUGGGAAUCCAUAUGCUGUAUGUGAUAAAUGUUUAAAGUUUUAUUCUAAAAUUAGUGAGUAUAGACAUUAUUGUUAUAGUUUGUAUGGAACAACAUUAGAACAGCAAUACAACAAACCGUUGUGUGAUUUGUUAAUUAGGUGUAUUAACUGUCAAAAGCCACUGUGUCCUGAAGAAAAGCAAAGACAUCUGGACAAAAAGCAAAGAUUCCAUAAUAUAAGGGGUCGGUGGACCGGUCGAUGUAUGUCUUGUUGCAGAUCAUCAAGAACACGUAGAGAAACCCAGCUGUAA - example_title: NRAS proto-oncogene output: text: '..................(([({<<<{[ABAAAC.CC.DEBDFE.FGHIJK......))]])(((}}[[>>>[)(aaaa]bb]))c])deccfdghi(jefk)()()()......................' pipeline_tag: rna-secondary-structure sequence_type: 5' UTR task: rna-secondary-structure text: GGGGCCGGAAGUGCCGCUCCUUGGUGGGGGCUGUUCAUGGCGGUUCCGGGGUCUCCAACAUUUUUCCCGGCUGUGGUCCUAAAUCUGUCCAAAGCAGAGGCAGUGGAGCUUGAGGUUCUUGCUGGUGUGAA - example_title: amyloid beta precursor protein output: text: '.......................((....[{<....................A>(aabcbc[bcdeef(e{g))](ef}ghgi)g([)]ij.......................' pipeline_tag: rna-secondary-structure sequence_type: 5' UTR task: rna-secondary-structure text: GUCAGUUUCCUCGGCAGCGGUAGGCGAGAGCACGCGGAGGAGCGUGCGCGGGGGCCCCGGGAGACGGCGGCGGUGGCGGCGCGGGCAGAGCAAGGACGCGGCGGAUCCCACUCGCACAGCAGCGCACUCGGUGCCCCGCGCAGGGUCGCG - example_title: RUNX family transcription factor 1 output: text: '.............................................................((.[..[{..(<.[A{.{A[B{.<.ABB{C(C[)D(EE](EI))a>]ab}(c}](J)JdKc](db)efe})}g>aaehih(eib(cjjjk)[)()]................' pipeline_tag: rna-secondary-structure sequence_type: 5' UTR task: rna-secondary-structure text: ACUUCUUUGGGCCUCAUAAACAACCACAGAACCACAAGUUGGGUAGCCUGGCAGUGUCAGAAGUCUGAACCCAGCAUAGUGGUCAGCAGGCAGGACGAAUCACACUGAAUGCAAACCACAGGGUUUCGCAGCGUGGUAAAAGAAAUCAUUGAGUCCCCCGCCUUCAGAAGAGGGUGCAUUUUCAGGAGGAAGCG - example_title: fragile X messenger ribonucleoprotein 1 output: text: '.......(.[(.{..({(<([{A.AB.CA..D.D.EF.B.DE.CEFE.DEG.)))])})(]}}[>a[a{ae]}d(D]aE}(fDbGGGc}>{M>a{c}i.cjgf>il.)kjamglimj..e.............................' pipeline_tag: rna-secondary-structure sequence_type: 5' UTR task: rna-secondary-structure text: CUCAGUCAGGCGCUCAGCUCCGUUUCGGUUUCACUUCCGGUGGAGGGCCGCCUCUGAGCGGGCGGCGGGCCGACGGCGAGCGCGGGCGGCGGCGGUGACGGAGGCGCCGCUGCCAGGGGGCGUGCGGCAGCGCGGCGGCGGCGGCGGCGGCGGCGGCGGCGGAGGCGGCGGCGGCGGCGGCGGCGGCGGCGGCUGGGCCUCGAGCGCCCGCAGCCCACCUCUCGGGGGCGGGCUCCCGGCGCUAGCAGGGCUGAAGAGAAG - example_title: MYC proto-oncogene output: text: '............................(([((.{{<.A{BCD{<{A{<{ABECBA<a[a([>}[JLK>L}(KaMJL[MbbHGaLGHIJKLaNbO>OaPOJPcMdbbL>cKQb))Od(]>]N}efaPNd(aePcc[.dbQ}MgbhbfNc}gNijg>h}>k)ajgg]ghhngnlmn(knol]p(oohm)))iiijpjpqjkklqj.]ljmiopj.m..................' pipeline_tag: rna-secondary-structure sequence_type: 5' UTR task: rna-secondary-structure text: AACUCGCUGUAGUAAUUCCAGCGAGAGGCAGAGGGAGCGAGCGGGCGGCCGGCUAGGGUGGAAGAGCCGGGCGAGCAGAGCUGCGCUGCGGGCGUCCUGGGAAGGGAGAUCCGGAGCGAAUAGGGGGCUUCGCCUCUGGCCCAGCCCUCCCGCUGAUCCCCCAGCCAGCGGUCCGCAACCCUUGCCGCAUCCACGAAACUUUGCCCAUAGCAGCGGGCGGGCACUUUGCACUGGAACUUACAACACCCGAGCAAGGACGCGACUCUCCCGACGCGGGGAGGCUAUUCUGCCCAUUUGGGGACACUUCCCCGCCGCUGCCAGGACCCGCUUCUCUGAAAGGCUCUCCUUGCAGCUGCUUAGACG - example_title: activating transcription factor 4 output: text: '.............((.[...{.<<.<.ABCDEFC{FBGGBBCCBHHGHDIHJ)E)((I(][[KIGLJG}I[KI[J}{M{>KaAAN(Kb(AAL>bL{AMMM[McLLLNLbMNc)O)PbQdRSTeQ)PP(OO]I]U}MNbBOSPcQ>]>)>(iopikcqaapbcbpjnmbk)qbq}.ldrsstmm>.aadeum.....n.............' pipeline_tag: rna-secondary-structure sequence_type: 5' UTR task: rna-secondary-structure text: CAUUUCUACUUUGCCCGCCCACAGAUGUAGUUUUCUCUGCGCGUGUGCGUUUUCCCUCCUCCCCGCCCUCAGGGUCCACGGCCACCAUGGCGUAUUAGGGGCAGCAGUGCCUGCGGCAGCAUUGGCCUUUGCAGCGGCGGCAGCAGCACCAGGCUCUGCAGCGGCAACCCCCAGCGGCUUAAGCCAUGGCGCUUCUCACGGCAUUCAGCAGCAGCGUUGCUGUAACCGACAAAGACACCUUCGAAUUAAGCACAUUCCUCGAUUCCAGCAAAGCACCGCAAC - example_title: Human GPI protein p137 output: text: '..........................................................([((...(((.(.{([{({.(<<[A{B<<.A.BCBABBBCBDCBDCE.ED.FEGF.BFFF)GADGECHHCGIJCDKJKHEIKHLFFGGH(LHBG]))}[{[>[aab)b)bc]d]bef]cc}cfdg)][ecghdghaebf}gM>)>]l>>(ajkeffce)}gbmfghennkdi.o..............................' pipeline_tag: rna-secondary-structure sequence_type: 3' UTR task: rna-secondary-structure text: UUUUUAAAAGGAAAAGAUACCAAAUGCCUGCUGCUACCACCCUUUUCAAUUGCUAUGUUUUGAAAGGCACCAGUAUGUGUUUUAGAUUGAUUUAAAUGUUUCAUUUAAAUCACGGACAGUAGUUUCAGUUCUGAUGGUAUAAGCAAAACAAAUAAAACGUUUAUAAAAGUUGUAUCUUGAAACACUGGUGUUCAACAGCUAGCAGCUUAUGUGAUUCACCCCAUGCCACGUUAGUGUCACAAAUUUUAUGGUUUAUCUCCAGCAACAUUUCUCUAGUACUUGCACUUAUUAUCUGAAUUC - example_title: nucleophosmin 1 output: text: '..............................(([.........{.{<(.AAB...[C{D<[}>{HJEaHIJIJKFK>I)efI)J]gdL]a)M][b)(b}}[cd[>Mfg>CN)d(d}(O>]ah[eaff>gghgijFabOiij[baekilDk}e)>lci>}ha]b)kdalcmcj]dkjfmlghdj]mdmneffo()oeni]nfhkfkokjh...................' pipeline_tag: rna-secondary-structure sequence_type: 3' UTR task: rna-secondary-structure text: GAAAAUAGUUUAAACAAUUUGUUAAAAAAUUUUCCGUCUUAUUUCAUUUCUGUAACAGUUGAUAUCUGGCUGUCCUUUUUAUAAUGCAGAGUGAGAACUUUCCCUACCGUGUUUGAUAAAUGUUGUCCAGGUUCUAUUGCCAAGAAUGUGUUGUCCAAAAUGCCUGUUUAGUUUUUAAAGAUGGAACUCCACCCUUUGCUUGGUUUUAAGUAUGUAUGGAAUGUUAUGAUAGGACAUAGUAGUAGCGGUGGUCAGACAUGGAAAUGGUGGGGAGACAAAAAUAUACAUGUGAAAUAAAACUCAGUAUUUUAAUAAAGUAGCACGGUUUCUAUUGA - example_title: superoxide dismutase 1 output: text: '............................................(([{.].a.].......P.....)..(........QR}SQ>T(}..}(a>P}.a.R}>b{a.b]abbcc.{..d.Q.defef.f.eQ]..fgg.h...gh>aic...baj........kkdgdhlmdmeld)noo}pm[)qpp)mqffhg}pqrm]qrngimstf............................................' pipeline_tag: rna-secondary-structure sequence_type: 3' UTR task: rna-secondary-structure text: ACAUUCCCUUGGAUGUAGUCUGAGGCCCCUUAACUCAUCUGUUAUCCUGCUAGCUGUAGAAAUGUAUCCUGAUAAACAUUAAACACUGUAAUCUUAAAAGUGUAAUUGUGUGACUUUUUCAGAGUUGCUUUAAAGUACCUGUAGUGAGAAACUGAUUUAUGAUCACUUGGAAGAUUUGUAUAGUUUUAUAAAACUCAGUUAAAAUGUCUGUUUCAAUGACCUGUAUUUUGCCAGACUUAAAUCACAGAUGGGUAUUAAACUUGUCAGAAUUUCUUUGUCAUUCAAGCCUGUGAAUAAAAACCCUGUAUGGCACUUAUUAUGAGGCUAUUAAAAGAAUCCAAAUUCAAACUAAA - example_title: hemoglobin subunit alpha 2 output: text: '..............(((....[.....(.[{[{abc)d]}[))aa(a)ee...]....................................' pipeline_tag: rna-secondary-structure sequence_type: 3' UTR task: rna-secondary-structure text: CUGGAGCCUCGGUAGCCGUUCCUCCUGCCCGCUGGGCCUCCCAACGGGCCCUCCUCCCCUCCUUGCACCGGCCCUUCCUGGUCUUUGAAUAAAGUCUGAGUGGGCAGCA - example_title: BRAF proto-oncogene output: text: '.................................................([...)]..(.((([{..<([(A{]L<][))NO(Ma([bb[cPdbe}L[LNbbLcd[DOQMN{OPf{DQfDRgahDM>g)S(OG]{]NhD(THGG<))iHj}eT]>I[>>H[e)a}UUb>cddfS>(}b}kd>dg(B)cUl]C)>(l]mhj]dadgl}bP>gdlafkhge)}mnfeEh]lanablj[}fmiF>oamcg)}g>ompcbnach.h]pnqri(djoeinssootjt)ufjef(kupuq())...................' pipeline_tag: rna-secondary-structure sequence_type: 3' UTR task: rna-secondary-structure text: AACAAAUGAGUGAGAGAGUUCAGGAGAGUAGCAACAAAAGGAAAAUAAAUGAACAUAUGUUUGCUUAUAUGUUAAAUUGAAUAAAAUACUCUCUUUUUUUUUAAGGUGAACCAAAGAACACUUGUGUGGUUAAAGACUAGAUAUAAUUUUUCCCCAAACUAAAAUUUAUACUUAACAUUGGAUUUUUAACAUCCAAGGGUUAAAAUACAUAGACAUUGCUAAAAAUUGGCAGAGCCUCUUCUAGAGGCUUUACUUUCUGUUCCGGGUUUGUAUCAUUCACUUGGUUAUUUUAAGUAGUAAACUUCAGUUUCUCAUGCAACUUUUGUUGCCAGCUAUCACAUGUCCACUAGGGACUCCAGAAGAAGACCCUACCUAUGCCUGUGUUUGCAGGUGAGAAGUUGGCAGUCGGUUAGCCUGGG - example_title: H3 clustered histone 1 output: text: '..........................................................' pipeline_tag: rna-secondary-structure sequence_type: 3' UTR task: rna-secondary-structure text: UUACUGUGGUCUCUCUGACGGUCCAAGCAAAGGCUCUUUUCAGAGCCACCACCUUUUC --- # UFold Pre-trained model for RNA secondary structure prediction using an image-like sequence representation and a U-Net. ## Disclaimer This is an UNOFFICIAL implementation of [UFold: fast and accurate RNA secondary structure prediction with deep learning](https://doi.org/10.1093/nar/gkab1074) by Laiyi Fu, Yingxin Cao, Jie Wu, Qinke Peng, Qing Nie, and Xiaohui Xie. The OFFICIAL repository of UFold is at [uci-cbcl/UFold](https://github.com/uci-cbcl/UFold). > [!TIP] > The MultiMolecule implementation is a direct PyTorch port of the original U-Net architecture and feature construction. **The team releasing UFold did not write this model card for this model so this model card has been written by the MultiMolecule team.** ## Model Details UFold predicts RNA base-pair contact maps from single RNA sequences. It represents a sequence as a 17-channel image: 16 channels are outer products of one-hot nucleotide indicators and one channel is a hand-crafted canonical/wobble pairing score. A U-Net predicts a symmetric contact score matrix, and the original constrained post-processing routine can be enabled to enforce base-pairing constraints. ### Model Specification | Num Parameters (M) | FLOPs (G) | MACs (G) | | ------------------ | --------- | -------- | | 8.64 | 188.29 | 93.81 | FLOPs and MACs are computed with `multimolecule.utils` for one 600 nt sequence. ### Links - **Code**: [multimolecule.ufold](https://github.com/DLS5-Omics/multimolecule/tree/master/multimolecule/models/ufold) - **Weights**: [multimolecule/ufold](https://huggingface.co/multimolecule/ufold) - **Paper**: [UFold: fast and accurate RNA secondary structure prediction with deep learning](https://doi.org/10.1093/nar/gkab1074) - **Developed by**: Laiyi Fu, Yingxin Cao, Jie Wu, Qinke Peng, Qing Nie, Xiaohui Xie - **Original Repository**: [uci-cbcl/UFold](https://github.com/uci-cbcl/UFold) ## Usage The model file depends on the [`multimolecule`](https://multimolecule.danling.org) library. You can install it using pip: ```bash pip install multimolecule ``` ### RNA Secondary Structure Pipeline ```python import multimolecule from transformers import pipeline predictor = pipeline("rna-secondary-structure", model="multimolecule/ufold") output = predictor("GGGCUAUUAGCUCAGUUGGUUAGAGCGCACCCCUGAUAAGGGUGAGGUCGCUGAUUCGAAUUCAGCAUAGCUCA") ``` ### PyTorch Inference ```python from multimolecule import RnaTokenizer, UfoldModel tokenizer = RnaTokenizer.from_pretrained("multimolecule/ufold") model = UfoldModel.from_pretrained("multimolecule/ufold") sequence = "GGGCUAUUAGCUCAGUUGGUUAGAGCGCACCCCUGAUAAGGGUGAGGUCGCUGAUUCGAAUUCAGCAUAGCUCA" inputs = tokenizer(sequence, return_tensors="pt") output = model(**inputs) contact_map = output.contact_map ``` To run the original constrained post-processing loop: ```python output = model(**inputs, use_postprocessing=True) contact_map = output.postprocessed_contact_map ``` ## Training Details UFold was trained for RNA secondary structure prediction from annotated contact maps and base-pairing rules. ### Training Data - RNAStrAlign: 30,451 unique RNAs from eight RNA families; the paper reports a random split with 24,895 training RNAs and 2,854 test RNAs after redundancy filtering. - bpRNA-1m: 102,318 RNAs from 2,588 families; CD-HIT was used to remove redundant sequences before splitting the data into TR0 and TS0. - augmented data: synthetic training examples were generated from bpRNA-new sequences by random mutation and structure prediction. - PDB training data: high-resolution RNA structures from bpRNA and the PDB were used for fine-tuning/evaluation experiments; test sets TS1, TS2, and TS3 were filtered at 80% sequence identity. - evaluation data: ArchiveII, TS0, bpRNA-new, and PDB test data were used for benchmark evaluation. ### Training Procedure - input representation: 16 outer-product channels following the MultiMolecule tokenizer order plus one hand-crafted pairing-score channel. - objective: weighted binary cross entropy over base-pair contact maps. - optimizer: Adam. - training epochs: 100. - batch size: 1. - positive-class weight: 300. - post-processing: constrained optimization with canonical/wobble pairing rules, sparsity shrinkage, and a 0.5 threshold. ## Citation ```bibtex @article{fu2022ufold, author = {Fu, Laiyi and Cao, Yingxin and Wu, Jie and Peng, Qinke and Nie, Qing and Xie, Xiaohui}, title = {UFold: fast and accurate RNA secondary structure prediction with deep learning}, journal = {Nucleic Acids Research}, volume = {50}, number = {3}, pages = {e14}, year = {2022}, doi = {10.1093/nar/gkab1074} } ``` > [!NOTE] > The artifacts distributed in this repository are part of the MultiMolecule project. > If you use MultiMolecule in your research, you must cite the MultiMolecule project. ## License This model is licensed under the [GNU Affero General Public License](license.md). For additional terms and clarifications, please refer to our [License FAQ](license-faq.md). ```spdx SPDX-License-Identifier: AGPL-3.0-or-later ```