zypchn commited on
Commit
605e0c1
·
verified ·
1 Parent(s): ec0fbb7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +22 -1
README.md CHANGED
@@ -7,6 +7,7 @@ tags:
7
  - lora
8
  - transformers
9
  - gpt2
 
10
  ---
11
 
12
  # Model
@@ -17,4 +18,24 @@ This is a fine-tuned version of [ProtGPT2](https://huggingface.co/nferruz/ProtGP
17
  # Dataset
18
  Protein data set retrieved from Research Collaboratory for Structural Bioinformatics (RCSB) Protein Data Bank (PDB). \
19
  Only the OXIDOREDUCTASE enzymes were used. \
20
- You can find the JSON formatted data @ [oxidos.json](https://github.com/zypchn/pLM/blob/main/data/oxidos.json)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  - lora
8
  - transformers
9
  - gpt2
10
+ - protein_design
11
  ---
12
 
13
  # Model
 
18
  # Dataset
19
  Protein data set retrieved from Research Collaboratory for Structural Bioinformatics (RCSB) Protein Data Bank (PDB). \
20
  Only the OXIDOREDUCTASE enzymes were used. \
21
+ You can find the JSON formatted data @ [oxidos.json](https://github.com/zypchn/pLM/blob/main/data/oxidos.json)
22
+
23
+ <br/>
24
+
25
+ # How to Use?
26
+ ```
27
+ >>> from transformers import pipeline
28
+ >>> pipe = pipeline("text-generation", model="zypchn/ProtGPT2-Oxido")
29
+ >>> sequences = pipe("", max_length=100, do_sample=True, top_k=950, repetition_penalty=1.2, num_return_sequences=5, eos_token_id=0)
30
+ # input field has left blank for diversity
31
+ ```
32
+
33
+ ```json
34
+ [
35
+ {"generated_text": "SNANQAPQPQTPTRATDAKKGSYGHPADRVGMEDNKYQVGVFYYDGPNPSYAEWNRDTQFWVETAKTAEKGKFDSIFFADTLGIYDSFKGSFEANLRHGAQFPVNDPLVAISAIAGATTKLGLVATASTTYSEPFHIARRFASLDHLSNGRAGWNIVTSYLDSAARNFGRTEQMEHDERYAIAEEYIDVVYKLWEGSWEDNAVIKDKETGLFTDPAKVHQINHEGEHFRVAGPLNIPRSPQGHPVIFQAGTSERGRDFAARHAEAVFTAQLDLEAGREFYEDIKSRAAKLGRDPDDVKILPGISVFVGKTREEAERKFRELQSLIDEEGALTRFSSYTGTDLSTYDPDGPLPELAGIDPTTPIAKLEGLLGKSKMTVREIALKQGGVSLREYQPFVGATAGSALVGGTPEQIADFMQDWFIEGTVDGFNIMPPYLPDGLEDFVDHVVPELQRRGLFRTEYEGTTLREHLGLAKPLEHHHHHH"},
36
+ {"generated_text": "MGSSHHHHHHSSGLVPRGSHMASMTGGQQMGRGSMGPCLICRSTSLKCVFCVRDPNGYKKCSKCDAFFCSRECQTEHWQRHHKFECPAAVAQPQIPPLPKPQQKQLTAAELGMFMEVRNQFALLKTNLERLDYEIFILERNVKLANTVTPPTNRTYFQSTMRYAPNPLRPNMTDAMRQQYLDKNKSSAALEHDLKELIKFKCYLLNDEYVEKEREENPFIWEYFLNKEWRKRNVWGNK\n"},
37
+ {"generated_text": "MGHHHHHHSSGLVPRGSHMTVEQAKKLRAEAEAQAQIQDKAKAIAQTHGKVEVMVDGKHRVVDLDATTRRQLTDGELQAIVVAAQEAAAKQLKAQRQALLEQHQDAELRKLALEGEIV\nAVITGAAQGIGRAIALRLAKDGFRVAVADIDLAAAEAVAAEIEAKGGKALVIEGDVSREEDVKRLVRKAIDQFGRLDYAVNNAGIQGPLAPTEELPLALWNKVIDVNLTGVFLCMKYEIAQMVKQGRGGAIVNTASVAGLSGQPGMVAYCASKHGVVGLTKTVAIEYAKHGIRINAVAPGFIDTPMVQKLPEEKRARIAAAIPMRRLGQPDEIAAVVAFLLSDDASFITGQCIAVDGGFTAGLLA"},
38
+ {"generated_text": "MAASKAADSLAEGAAKLEHHHHHH"},
39
+ {"generated_text": "GSKPQPGVQVEGAKCQVLQAVYDFTVQSASELSFKAGDVICVTGQYDPTLGWWLAEERRTGKSGLVPENYVELLSTGPAQHHHHHH"}
40
+ ]
41
+ ```