Collagen protein (polypeptide) - C. hemisphaerica

Overview
Name: Collagen protein
Unique Name: TCONS_00015680-protein
Type: polypeptide
Organism: Clytia hemisphaerica (Jellyfish)
Sequence length: 3056

Sequence
The following sequences are available for this feature:

polypeptide sequence

>TCONS_00015680-protein ID=TCONS_00015680-protein|Name=Collagen protein|organism=Clytia hemisphaerica|type=polypeptide|length=3056bp
MTSSRSNAIGRFLVLFCAVALIRAQTPTTTEICDQYIDLVFAIDASGSVK
EEGYQQIKQFTKEIIKSFQIGPSKTHVGVVTFSEYAELQVKLTDTFDKQD
LLDRVDNLDYPGYRTALDDALRVVNSAMFSLDGGARQGVPQVLIFLTDGK
CTVCTDDLQTVVTPLKDAGVTIYTVGITNNINRTELEIISSDPSQKYMFE
VETFSDLKGIIQKLNQKTCVVKKGKCAKPLPPFTCPQNQEDECNGDETCG
KGPKKCCFNGCIHKCQFPLANCLSSIDIAFALDSSRSVRDDTFVQMKKFA
NDIVDSFQVSQQDARFASLIFGSNTEVNFNFVRYDTAKEVKRAIDGLVHQ
KSSTDIHSALEKVKSDIFSLQGKVRTRRPMVLIVFYDGDVRSGTKDLAEV
VEPLKAYGVKVVAIGVGPEVNTYQLNKIAQSSNTVFQARTFDVLLPELYS
IAKESCTEKPGECVAEEDAPSEEECAGRTGDDECDNDFDCFGNSKCCKSG
CFNKCVLPVEKCKRKVDIAFAIHQNLEANDHDMTKYFLRSSIDHFNIGKD
QSHAALLTFGYDNKVVFDFNTDQDAELNLKPQINELPFIPGKGRLVDVLK
MACDNIFCAAGGTRNNVPKMLVIVTAEFDESDPNELQERIKILRSRGVDV
VIIGVGAVDPKILQQITSDKPGVDNQVLLTKYFNGVLQYVQDIADFACGE
AGSILPPPPAGGLTEGGNQVCGAAKIKGCLKGPKGSRGRPGNYGGRGAPG
PRGPLGSAGLAGKQGPRGDPGLNGMVGERGPKGLIGLPGFPGFDGIPGVP
GVEGRQGEMGGCGLPGQPGEAGKAGLGGFTGVSGRKGPSGTKGRKGEPAP
DNPDLEILPGDDGDPGEGGDGGFKGQKGEIGMLGMIGDDGVDGSEGPKGF
MGEMGDEGVVTLEKGPKGFTGDRGEAGSPGVAEATSGELLQGPEGSSGPK
GEPGMKGQVGQNSDVKGIQGERGEQGFIGQTGVEGGPGPAGEKGPSGELG
FQGKKGQQGEAGMDGLTGEKGQQGEPGFEQGAPGPKGKRGNPAEPTSDAS
PKGAKGPLGDDGDPGPNGAIGAEGEEGPKGERGMSKMGNSGDDGAVGPRG
PQGKNGMQGEAGMGGLRGPKGIKGDIGAKGDTGRNAKAISGDPGMDGPPG
QKGVFGSLGNPGEVGPRGPTMQLEQNCIENVCPQNRKGDPGQQGDTGPIG
PAGEKGVNGSKGYPGGPCPTCPSGDDGISGDFGEPGKVGEKGELGSAGDP
GPAGVKGQRGEKGPSGFRGEGGIKGLQGDAGDIGGKGPDALGESLAEEGD
KGPPGSDGINGTRGADGPKGLPGKKGELGEQGKRGENGLKGESGPAGNKG
QPGLGGSKGEKGDQGEDSRGLTGEGGELGPEGRQGAVGEVGLKGRKGEAG
EEPTEEEKEAMKNKMAGDEGVRGADGEPGEPGRNSTTPGERGPRGPKGKM
GTVGPKGYGGIKGDNGFNGRSGEDGDPGPAGFPGKNGPRGQSGEPGKAGK
EGIAPKGQKGETGGQGPNGERGPDAPPPPGCFNFGDAADVGFIMDSSKSV
NEADFNRQKEFIDQILQQFNIGPKQTQAAVIKYGRTASVEINFDDFYTYP
SLKNKIDNIEYDSARESRLDLALKLARDDMFTRRRGARIDDPKVEQALVI
LGDGFISGGGGSRRQLMEDAREYAQDLKKEGILVFTLAVGVEKNIFLLKE
IASKDTYYLEVQSYTELIRRIGLLKDNLAQGCTVEGEPGKDGVTGEPGNT
GEPGEPGKPGEGEPGRDGLKGLKGFPGEPGDQVAAKGEPGMKGVKGMKGP
DGFDGLQGSRGLGNQGRKGEKGDRADDLKGFKGTRGEQGPLGPVGPAGQD
FCDSVLATDILFILDTSSGVGIDNFNKMKQFLDEIVNSFDIDNQLTRVGV
ITFDEEARYDIKLNAFNTQRDLLDGIEELAYANGKATRIDKALLLASSQG
FAEVNGARSGVNKAMVIMTDGQSSYHPEETPISEAVQPLIDDRVLRYAIG
IGPEISPVELASIAGKNVLLATDFDALLGKIEDQLGLIGRGGCKGDPGLP
GKSGEKGDSGFIGTIGSSGEDGAEGPEGPKGLKGNKGEQGSTFGVGGKGD
KGSVGDEGPIGLPGCTEEQRPTPTDLGFMIDASASLYDYGFKNELDFVTD
IIDKVGPISSGGLRAGVVVYSDKANSRIYMNDHFNTDDFKFAVKELPYDN
RGQTRIDLGFEESKNLFKVENGARPSSKKVLILMTDGQQTYVPGVKEPSE
IAKELHVDGVDVYSIGIGEEIDRVELESFISKKEYIFLASDLKQLLNVLV
KEITQALTCEGGPRGPSGRPGGDGPPGDQGPVGSIGNTGRKGDQGDPGFG
PKGEPGEAGGEGFPGRDGEKGFRGRDGTVGPRGPIGDKGSEGPSGPKGGM
GEDGFIGFNGLEGPPGPKGQRGDAVDEGQKGLKGEPGEPGLKGPRGDQGS
LPPGNTLKDLRGEEGNQGPAGDEGGEGPEGGLKVAPGPLGLAGEKGEKGV
AGDFGERGPLGPLGPDGFKGEEGPTGEKGDSGSPGFFGKIGEKGVTGDGG
EPGVSLKGEPGVPGEPGSIGDLGPEGEVGLRGFYGMEGGKGEVGEIGPPG
RQQLPEEGKDQEMKGRVGEKGFSGDFGPDGEPGEPGILLERQGEFGSKGL
KGAQGPKGATGERGIGGLIGMIGKDGLKGRTGLDGDPGAAGPQGAEGEMG
PVIGGGEKGDFGGLGDVGLTGRPGLKGVKGFIGDIGPKGDLGIKGGQGDS
GFPGMIGDQGPPGGVGTPGPRGEEGPLGLEGPGGPAGPMGSAKNFGHNIV
RHSQNSFVPSCPRDYMVLWEGYSLMYTVGNGYAHSQDLGDAGSCSRSFST
LPFLFCSLTGTCQYASRNDFTYWLAGDVQQPMMPVTNDAIEPFISRCVVC
KAPDINMAVHSQDVVLPSCPVGYESLWSGFSFMMVAAAGNTGSGQSLGSS
GSCLEEFRPKPFIECQGARGTCHFFSDKFSFWLTTIDPSRQFEIPTPLTL
IETKGDDLSSRVSRCRVCLRSSTSGGGNVPEGSNNNNGNTFSRLARSVKK
WMVGGN
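
As a quick consistency check, the FASTA header above can be split into its attributes. This is a minimal sketch, assuming the fields are separated by `|` and each is a single `key=value` pair; the function name `parse_header` is hypothetical and not part of any database API.

```python
def parse_header(header: str) -> dict:
    """Split a '>'-prefixed FASTA header of the form 'id k=v|k=v|...' into a dict."""
    body = header.lstrip(">").strip()
    # The first space separates the record id from the attribute list.
    record_id, _, attrs = body.partition(" ")
    fields = {"record_id": record_id}
    for item in attrs.split("|"):
        key, sep, value = item.partition("=")
        if sep:  # ignore anything that is not key=value
            fields[key] = value
    return fields


header = (">TCONS_00015680-protein ID=TCONS_00015680-protein|Name=Collagen protein"
          "|organism=Clytia hemisphaerica|type=polypeptide|length=3056bp")
fields = parse_header(header)
# The header writes the length with a 'bp' suffix even though the record is a polypeptide.
declared_length = int(fields["length"].rstrip("bp"))
print(fields["Name"], declared_length)
```

The declared length can then be compared with the number of residues in the sequence lines once they are joined.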
Gene-mRNA-Prot
This polypeptide comes from the following gene feature:
Feature Name | Unique Name | Species | Type
XLOC_008752 | XLOC_008752 | Clytia hemisphaerica | gene
This polypeptide derives from the following transcript feature(s):
Feature Name | Unique Name | Species | Type
TCONS_00015680 | TCONS_00015680 | Clytia hemisphaerica | transcript
Annotated Terms
The following terms have been associated with this polypeptide:
Vocabulary: INTERPRO
Term | Definition
IPR001442 | Collagen_VI_NC
IPR002035 | VWF_A
IPR008197 | WAP_dom
IPR008160 | Collagen
IPR016187 | CTDL_fold
Vocabulary: Cellular Component
Term | Definition
GO:0005581 | collagen trimer
GO:0005576 | extracellular region
Vocabulary: Molecular Function
Term | Definition
GO:0005201 | extracellular matrix structural constituent
GO:0030414 | peptidase inhibitor activity
GO Annotation
GO Assignments
This polypeptide is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0005581 | collagen trimer
cellular_component | GO:0005576 | extracellular region
molecular_function | GO:0005201 | extracellular matrix structural constituent
molecular_function | GO:0030414 | peptidase inhibitor activity
InterPro
Analysis Name: InterPro Annotations of C. hemisphaerica v1.0
Date Performed: 2017-06-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | PRINTS | PR00453 | VWFADOMAIN | coord: 1897..1911; score: 20.13
None | No IPR available | PRINTS | PR00453 | VWFADOMAIN | coord: 2230..2238; score: 51.92
None | No IPR available | PRINTS | PR00453 | VWFADOMAIN | coord: 1858..1875; score: 38.03
None | No IPR available | GENE3D | 4.10.75.10 |  | coord: 460..509; e-value: 2.9E-9; score: 36.5
None | No IPR available | GENE3D | 2.160.20.50 |  | coord: 956..1034; e-value: 0.0037; score: 16.6
None | No IPR available | GENE3D | 2.160.20.50 |  | coord: 2485..2557; e-value: 0.0011; score: 18.3
None | No IPR available | GENE3D | 2.160.20.50 |  | coord: 2035..2123; e-value: 0.2; score: 11.0
None | No IPR available | GENE3D | 2.160.20.50 |  | coord: 718..793; e-value: 0.017; score: 14.4
None | No IPR available | GENE3D | 2.160.20.50 |  | coord: 1035..1115; e-value: 0.016; score: 14.5
None | No IPR available | GENE3D | 2.160.20.50 |  | coord: 2639..2703; e-value: 1.0E-6; score: 28.0
None | No IPR available | GENE3D | 4.10.75.10 |  | coord: 223..270; e-value: 2.6E-5; score: 23.8
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 2749..2791
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 813..881
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1132..1169
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 731..775
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1181..1536
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 2591..2612
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1733..1843
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 3021..3041
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 2042..2122
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 2646..2665
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 2308..2576
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 912..1119
None | No IPR available | CDD | cd00199 | WAP | coord: 220..268; e-value: 4.62604E-7; score: 48.9855
None | No IPR available | CDD | cd00199 | WAP | coord: 455..509; e-value: 2.98079E-5; score: 43.5927
None | No IPR available | CDD | cd01450 | vWFA_subfamily_ECM | coord: 2124..2278; e-value: 3.66965E-29; score: 116.622
None | No IPR available | CDD | cd01450 | vWFA_subfamily_ECM | coord: 1538..1706; e-value: 5.27301E-28; score: 113.155
None | No IPR available | CDD | cd01450 | vWFA_subfamily_ECM | coord: 1858..2016; e-value: 2.51843E-34; score: 131.26
None | No IPR available | CDD | cd01450 | vWFA_subfamily_ECM | coord: 276..436; e-value: 8.32251E-29; score: 115.467
None | No IPR available | CDD | cd01450 | vWFA_subfamily_ECM | coord: 516..671; e-value: 1.22396E-22; score: 97.3622
IPR001442 | Collagen IV, non-collagenous | SMART | SM00111 | C4_2 | coord: 2796..2903; e-value: 2.6E-46; score: 169.9
IPR001442 | Collagen IV, non-collagenous | SMART | SM00111 | C4_2 | coord: 2905..3021; e-value: 1.1E-55; score: 201.0
IPR001442 | Collagen IV, non-collagenous | GENE3D | 2.170.240.10 |  | coord: 2794..3021; e-value: 2.4E-95; score: 317.5
IPR001442 | Collagen IV, non-collagenous | PFAM | PF01413 | C4 | coord: 2799..2901; e-value: 4.5E-30; score: 104.2
IPR001442 | Collagen IV, non-collagenous | PFAM | PF01413 | C4 | coord: 2907..3019; e-value: 3.4E-37; score: 127.1
IPR001442 | Collagen IV, non-collagenous | PROSITE | PS51403 | NC1_IV | coord: 2796..3022; score: 87.015
IPR002035 | von Willebrand factor, type A | SMART | SM00327 | VWA_4 | coord: 2123..2300; e-value: 1.4E-24; score: 97.7
IPR002035 | von Willebrand factor, type A | SMART | SM00327 | VWA_4 | coord: 515..694; e-value: 1.2E-10; score: 51.4
IPR002035 | von Willebrand factor, type A | SMART | SM00327 | VWA_4 | coord: 1857..2032; e-value: 3.2E-32; score: 123.0
IPR002035 | von Willebrand factor, type A | SMART | SM00327 | VWA_4 | coord: 275..456; e-value: 2.7E-27; score: 106.7
IPR002035 | von Willebrand factor, type A | SMART | SM00327 | VWA_4 | coord: 36..215; e-value: 5.0E-37; score: 139.0
IPR002035 | von Willebrand factor, type A | SMART | SM00327 | VWA_4 | coord: 1537..1729; e-value: 1.6E-29; score: 114.1
IPR002035 | von Willebrand factor, type A | GENE3D | 3.40.50.410 |  | coord: 1533..1731; e-value: 3.7E-25; score: 88.4
IPR002035 | von Willebrand factor, type A | PFAM | PF00092 | VWA | coord: 2125..2297; e-value: 1.1E-26; score: 94.0
IPR002035 | von Willebrand factor, type A | PFAM | PF00092 | VWA | coord: 1859..2027; e-value: 6.4E-32; score: 111.1
IPR002035 | von Willebrand factor, type A | PFAM | PF00092 | VWA | coord: 277..448; e-value: 1.7E-32; score: 113.0
IPR002035 | von Willebrand factor, type A | PFAM | PF00092 | VWA | coord: 525..671; e-value: 2.2E-17; score: 63.8
IPR002035 | von Willebrand factor, type A | PFAM | PF00092 | VWA | coord: 1539..1717; e-value: 8.0E-28; score: 97.8
IPR002035 | von Willebrand factor, type A | PFAM | PF00092 | VWA | coord: 38..212; e-value: 1.6E-40; score: 139.2
IPR002035 | von Willebrand factor, type A | GENE3D | 3.40.50.410 |  | coord: 24..222; e-value: 6.6E-41; score: 139.9
IPR002035 | von Willebrand factor, type A | GENE3D | 3.40.50.410 |  | coord: 271..459; e-value: 1.4E-29; score: 102.9
IPR002035 | von Willebrand factor, type A | GENE3D | 3.40.50.410 |  | coord: 1856..2034; e-value: 2.6E-27; score: 95.4
IPR002035 | von Willebrand factor, type A | GENE3D | 3.40.50.410 |  | coord: 2124..2310; e-value: 4.4E-27; score: 94.6
IPR002035 | von Willebrand factor, type A | GENE3D | 3.40.50.410 |  | coord: 510..703; e-value: 3.1E-13; score: 49.5
IPR002035 | von Willebrand factor, type A | PROSITE | PS50234 | VWFA | coord: 1859..2035; score: 27.053
IPR002035 | von Willebrand factor, type A | PROSITE | PS50234 | VWFA | coord: 1539..1728; score: 27.793
IPR002035 | von Willebrand factor, type A | PROSITE | PS50234 | VWFA | coord: 38..214; score: 31.333
IPR002035 | von Willebrand factor, type A | PROSITE | PS50234 | VWFA | coord: 517..693; score: 16.381
IPR002035 | von Willebrand factor, type A | PROSITE | PS50234 | VWFA | coord: 2125..2303; score: 25.451
IPR002035 | von Willebrand factor, type A | PROSITE | PS50234 | VWFA | coord: 277..452; score: 25.556
IPR002035 | von Willebrand factor, type A | SUPERFAMILY | 53300 | vWA-like | coord: 229..458
IPR002035 | von Willebrand factor, type A | SUPERFAMILY | 53300 | vWA-like | coord: 1523..1726
IPR002035 | von Willebrand factor, type A | SUPERFAMILY | 53300 | vWA-like | coord: 2109..2299
IPR002035 | von Willebrand factor, type A | SUPERFAMILY | 53300 | vWA-like | coord: 484..700
IPR002035 | von Willebrand factor, type A | SUPERFAMILY | 53300 | vWA-like | coord: 1842..2033
IPR002035 | von Willebrand factor, type A | SUPERFAMILY | 53300 | vWA-like | coord: 26..219
IPR008197 | WAP-type 'four-disulfide core' domain | SMART | SM00217 | wap2 | coord: 222..269; e-value: 8.1E-4; score: 25.9
IPR008197 | WAP-type 'four-disulfide core' domain | SMART | SM00217 | wap2 | coord: 459..509; e-value: 2.9E-4; score: 29.8
IPR008197 | WAP-type 'four-disulfide core' domain | PFAM | PF00095 | WAP | coord: 459..508; e-value: 9.1E-6; score: 25.9
IPR008197 | WAP-type 'four-disulfide core' domain | PROSITE | PS51390 | WAP | coord: 456..509; score: 13.834
IPR008197 | WAP-type 'four-disulfide core' domain | PROSITE | PS51390 | WAP | coord: 219..269; score: 12.195
IPR008160 | Collagen triple helix repeat | PFAM | PF01391 | Collagen | coord: 1233..1289; e-value: 1.6E-6; score: 27.7
IPR008160 | Collagen triple helix repeat | PFAM | PF01391 | Collagen | coord: 2499..2554; e-value: 2.1E-6; score: 27.3
IPR008160 | Collagen triple helix repeat | PFAM | PF01391 | Collagen | coord: 2038..2091; e-value: 2.9E-7; score: 30.0
IPR008160 | Collagen triple helix repeat | PFAM | PF01391 | Collagen | coord: 2643..2700; e-value: 4.3E-7; score: 29.5
IPR008160 | Collagen triple helix repeat | PFAM | PF01391 | Collagen | coord: 1299..1353; e-value: 6.5E-8; score: 32.1
IPR016187 | C-type lectin fold | SUPERFAMILY | 56436 | C-type lectin-like | coord: 2799..2904
IPR016187 | C-type lectin fold | SUPERFAMILY | 56436 | C-type lectin-like | coord: 2906..3020
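
The alignment coordinates in the table above can be turned into domain lengths or subsequences. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual InterProScan convention, which this page does not state); `domain_length` and `extract_domain` are hypothetical helper names.

```python
def domain_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def extract_domain(sequence: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) using a Python slice."""
    return sequence[start - 1:end]


# First 50 residues of the polypeptide, copied from the Sequence section.
n_terminus = "MTSSRSNAIGRFLVLFCAVALIRAQTPTTTEICDQYIDLVFAIDASGSVK"

# The CDD WAP hit at coord 220..268 spans 49 residues.
print(domain_length(220, 268))
# Residues 36..50: the start of the SMART VWA hit at coord 36..215.
print(extract_domain(n_terminus, 36, 50))
```

The same slice applied to the full 3056-residue sequence would return a complete domain for any row of the table.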

Blast
BLAST of Collagen protein vs. Swiss-Prot (Human)
Match: CO4A6 (Collagen alpha-6(IV) chain OS=Homo sapiens GN=COL4A6 PE=1 SV=3)

HSP 1 Score: 395.201 bits (1014), Expect = 1.714e-111
Identity = 320/811 (39.46%), Positives = 400/811 (49.32%), Query Frame = 0
Query: 2327 GDQGPVGSIGNTGRKGDQGDPG----------------FGPKGEPGEAGGEGFPGRDGEKG-------------------FRGRDGTVGP------RGPIGDKGSEGPSGPKGGMGEDGFIGFNGLEGPPGPKGQRGDAVDEGQK-------GLKGEPGEPGLKGPRGDQGSLPPGNT-LKDLRGEEGNQGPAGDEGGEGPEGGLKVAPGPLGLAGEKGEKGVAGDFGERGPLGPLGPDGFKGEEGPTGEKGDSGSPGFFGKIGEKGVTGDGGEPGVSLKGEPGVPGEPGSIGDLGPEGEVGLRGFYGMEGGKGEVGEIGPPGRQQLPEEGKDQEMKGRVGEKGFSGDFGPDGEPGEPGILL-----------------------------------ERQGEFGSKGLKGAQGPKGATGERGIGGLIGMIGKDGLKGRTGLDGDPGAAGPQG------------AEGEMGPVIGGGEKGDFGGLGDV-------GLTGRPGLKGVKGFIGDIGPKGDLGIKGGQGDSGFPG------------MIGDQGPPGGVGTPGPRGEEGPLGLEGPGGPAGPMGSAKNFGHNIVRHSQNSFVPSCPRDYMVLWEGYSLMYTVGNGYAHSQDLGDAGSCSRSFSTLPFLFCSLTGTCQYASRNDFTYWLAGDVQQPMMPVTNDAIEPFISRCVVCKAPDINMAVHSQDVVLPSCPVGYESLWSGFSFMMVAAAGNTGSGQSLGSSGSCLEEFRPKPFIECQGARGTCHFFSDKFSFWLTTIDPSRQF-EIPTPLTLIETKGDDLSSRVSRCRVCLRS 3021
            G  GP G  G  G KGD+G+PG                 G KG  G AG  GFPG  G+KG                    +G  G  GP      RG  G KGS G +G  G  GE G  G  G  G PG  G  G   D GQ        G KG+PGE G KG +G  G +  GN      +GE+G  G +GD G       L  APG  G+AG +GE G+ G  G +G +GPLG  G        G KG  G PG  G  G  G  G  G PG S+ G PG  G PG      P+GE G  G             IG PG+  L         +G+ G++GF G  GP G PG PGI L                                     QG+ G  G  G  GPKG  G++GI G  G+ G+ GLKG   + G+PG  G  G             +G+ GP    G +GD G            G  G PG+ G+ G  GD G +G +G++G +G  G PG             +GD G PG     GP G EG  G +GP G  G  G +   G+ +V+HSQ+  VP CP     LW GYSL++  G   AH+QDLG AGSC   FST+PF++C++   C YA RND +YWL+     PMMPV+   I  +ISRC VC+AP   +AVHSQD+ +P CP+G+ SLW G+SF+M  AAG  G GQSL S GSCLE+FR  PFIEC GARGTCH+F++K+SFWLTT++  +QF E+P   TL   K   L +RVSRC+VC++S
Sbjct:  930 GKMGPSGRAGTPGEKGDRGNPGPVGIPSPRRPMSNLWLKGDKGSQGSAGSNGFPGPRGDKGEAGRPGPPGLPGAPGLPGIIKGVSGKPGPPGFMGIRGLPGLKGSSGITGFPGMPGESGSQGIRGSPGLPGASGLPGLKGDNGQTVEISGSPGPKGQPGESGFKGTKGRDGLI--GNIGFPGNKGEDGKVGVSGDVG-------LPGAPGFPGVAGMRGEPGLPGSSGHQGAIGPLGSPGL------IGPKGFPGFPGLHGLNGLPGTKGTHGTPGPSITGVPGPAGLPG------PKGEKGYPGI-----------GIGAPGKPGL---------RGQKGDRGFPGLQGPAGLPGAPGISLPSLIAGQPGDPGRPGLDGERGRPGPAGPPGPPGPSSNQGDTGDPGFPGIPGPKGPKGDQGIPGFSGLPGELGLKG---MRGEPGFMGTPGKVGPPGDPGFPGMKGKAGPRGSSGLQGDPGQTPTAEAVQVPPGPLGLPGIDGIPGLTGDPGAQGPVGLQGSKGLPGIPGKDGPSGLPGPPGALGDPGLPG---LQGPPGFEGAPGQQGPFGMPGMPGQSMRVGYTLVKHSQSEQVPPCPIGMSQLWVGYSLLFVEGQEKAHNQDLGFAGSCLPRFSTMPFIYCNINEVCHYARRNDKSYWLSTTAPIPMMPVSQTQIPQYISRCSVCEAPSQAIAVHSQDITIPQCPLGWRSLWIGYSFLMHTAAGAEGGGQSLVSPGSCLEDFRATPFIECSGARGTCHYFANKYSFWLTTVEERQQFGELPVSETL---KAGQLHTRVSRCQVCMKS 1690          

HSP 2 Score: 182.57 bits (462), Expect = 1.321e-45
Identity = 325/995 (32.66%), Positives = 411/995 (41.31%), Query Frame = 0
Query:  737 RGRPGNYGGRGAPGPRGPLGSAGLA------GKQGPRGDPGLNGMVGERGPKGLIGLPGFPGFDGIPGVPGVEG--RQGEMGGCGLPGQPGEAGKAGLGGFTGVSGRKGPSGTKGRKGEPAPDNPDLEILPGDDGDPGEGGDGGFKGQKGEIGMLGMIGDDGVDGSEG---PKGFMGEMGDEGVVTL-----EKGPKGFTGDRGEAGSPGVAEAT------SGELLQGPEGSSGPKGEPGMKGQVGQNSDVKGIQGERGEQGFIGQTGVEGGPGPAGEKGPSGELGFQGKKGQQGEAGMDGLTGEKGQQGE--PGFEQGAPGPKGKRGNPAEPTSDASPKGAKGPLGDDGDPGPNGAIGAEGEEGPKGERGMSK-----------------------------MGNSGDDGAVGPRGPQGKNGMQGEAGMGGLRGPKGIKGDIGAKGDTGRNAKA--ISGDPGMDGPPGQKGVFGSLGNPGEVGPRGPTMQLEQNCIENVCPQNRKGDPGQQGDTGPIGPAGEKGVNGSKGYPGGPCPTCPSGDDGISGDFGEPGKVGEKGELGSAGDPGPAGVKGQRGEKGPSGFRGEGGIKGLQGDAGD--IGGKGPDALGESLAEEGDKG----PPGSDGINGTRG------ADGPKGLPGKKGE------LGEQGKRGENGLKGE------------------SGPAGNKGQPGLGGSKGEKGDQGEDSRGLTGEGGELGPEGRQGAVGEVGLKGRKGEAGEEPTEEEKEAMKNKMAGDEGVRGADGEPGEPGRNSTTPGERGPRGPKGKMGTVGPKGYGGIKGDNGFNGRSGEDGDPGPAGFPGKNGPRGQS------GEPGKAGKEGIAPKGQKGETGGQGPNGERGPDAPPPPGCFNFGDAADVGFIM---DSSKSVNEADFNRQKEFIDQILQQFNIGPKQTQAAVIKYGRTASVEINFDDFYTYPSLKNKIDNIEYDSARESRLDLALKLARDDMF 1631
            RG PG+ G  G PG +G  GS G+       G  GP G PG  G  G +G +GL G PG PG  G  G PG  G     E     LPG PG  G+ GL GF G+ G+ G  G  G  G P       +I   ++G PGE G  G  G KG +G  G+ G  GV G  G   PKG  G  G  G V         GP G  G  G  G+PG    +           +GP GS   KG PG+KG  G N  + G++G  G  G  G   + G   P GEKG  G +GF G  G  G  G  GL G  G  G+  P    G PG KG RGNP  P    SP+     L   GD G  G+ G+ G  GP+G++G +                              MG  G  G  G  G  G  GM GE+G  G+RG  G+ G  G  G  G N +   ISG PG  G PG+ G  G+ G  G +G              N+      G PG +G+ G +G +G+ G+ G+ G+PG           G+ G+ G PG  G +G +G  G PG  G KG  G  G  G  G  G KG  G  G    G  GP  L     E+G  G     PG  G+ G +G        GP GLPG  G        G+ G  G  GL GE                   G  G+ G PG+ G KG KGDQG    G +G  GELG +G +G  G +G  G+             +     M G  G RG+ G  G+PG+  T    + P GP   +G  G  G  G+ GD G  G  G  G  G  G PGK+GP G        G+PG  G +G  P G +G  G QGP G   P  P        G +  VG+ +     S+ V        + ++   L        Q +A     G   S       F T P +   I+ + + + R  +       A   M 
Sbjct:  630 RGLPGDKGKDGLPGQQGLPGSKGITLPCIIPGSYGPSGFPGTPGFPGPKGSRGLPGTPGQPGSSGSKGEPGSPGLVHLPE-----LPGFPGPRGEKGLPGFPGLPGKDGLPGMIGSPGLPGSKGATGDIFGAENGAPGEQGLQGLTGHKGFLGDSGLPGLKGVHGKPGLLGPKGERGSPGTPGQVGQPGTPGSSGPYGIKGKSGLPGAPGFPGISGHPGKKGTRGKKGPPGSIVKKGLPGLKGLPG-NPGLVGLKGSPGSPGVAGLPALSG---PKGEKGSVGFVGFPGIPGLPGIPGTRGLKGIPGSTGKMGPSGRAGTPGEKGDRGNPG-PVGIPSPRRPMSNLWLKGDKGSQGSAGSNGFPGPRGDKGEAGRPGPPGLPGAPGLPGIIKGVSGKPGPPGFMGIRGLPGLKGSSGITGFPGMPGESGSQGIRGSPGLPGASGLPGLKGDNGQTVEISGSPGPKGQPGESGFKGTKGRDGLIG--------------NI------GFPGNKGEDGKVGVSGDVGLPGAPGFPG---------VAGMRGEPGLPGSSGHQGAIGPLGSPGLIGPKGFPGFPGLHGLNGLPGTKGTHGTPGPSITGVPGPAGLPGPKGEKGYPGIGIGAPGKPGLRGQKGDRGFPGLQGPAGLPGAPGISLPSLIAGQPGDPGRPGLDGERGRPGPAGPPGPPGPSSNQGDTGDPGFPGIPGPKGPKGDQGIP--GFSGLPGELGLKGMRGEPGFMGTPGK--------VGPPGDPGFPGMKGKAGPRGSSGLQGDPGQTPTAEAVQVPPGP---LGLPGIDGIPGLTGDPGAQGPVGLQGSKGLPGIPGKDGPSGLPGPPGALGDPGLPGLQG--PPGFEGAPGQQGPFGM--PGMP--------GQSMRVGYTLVKHSQSEQVPPCPIGMSQLWVGYSLLFVE---GQEKAHNQDLGFAGSC---LPRFSTMPFIYCNINEVCHYARRNDKSYWLSTTAPIPMM 1554          

HSP 3 Score: 169.859 bits (429), Expect = 9.775e-42
Identity = 299/812 (36.82%), Positives = 356/812 (43.84%), Query Frame = 0
Query:  770 PGLNGMVGERGPKGLIGLPGFPGFDGIPGVPGVEGRQGEMGGCGLPGQPGEAGKAGLGGFTGVSGRKGPSGTKGRKGEPAPDNPDLEILPGDDGDPGEGGD--------GGFKGQKGEIGMLGMIGDDGVDGSEGPKGFMGEMGDEGVVTLEKGPKGFTGDRGEAGSPGVAEATSGELLQGPEGSSGPKGEPGMKGQVGQNSDVKGIQGERGEQGFIGQTGVEGGPGPAGEKGPSGELGFQGKKGQQGEAGMDGLTGEKGQQGEP-GFEQGAPGPKGKRGNPAEPTSDASPKGAKGPLGDDGDPGPNGAIGAEGEEGPKGERGMS----KMGNSGDDGAVGPRGPQGKNGMQGEAGMGGLRGPKGIKGDIGAKGDTGRNAKAISGDPGMDGPPGQKGVFGSLGNPGEVGPRGPTMQLEQNCIENVCPQNRKGDPGQQGDTGPIGPAGEKGVNGSKGYPGGP-CPTCP-----SGDDGISGDFGEPGKVGEKGELGSAGDPGPAGVKGQR---------GEKGPSGFRGEGGIKGLQGDAGDIGGKGPDALGESLAE-------EGDKGPPGSDGINGT---RGADGPKGLPGKKGELGEQGKRGENGLKGESGPAGNKGQPG----LGGSKGEKGDQGEDSRGLTGEGGELGPEGRQGAVGEVGLKGRKGEAGEEPTEEEKEAMKNKMAGDEGVRGADGEPGEPGRNSTTPGERGPRGPKGKMGTVGPKGYGGIKGDNGFNGRSGEDGD-----------PGPAGFPGKNGPRGQSG----EPGKAGKEGIAPKGQKGETGGQGPNGERGPD 1524
            PGL G  G+RG  G  G  G PG  G  G  G +G++GE     + G PG+ G +G  GF GV G  G  G  G  G P       +  PG+ G PG  G+            G  G  G  G+ GD G DG  G +G  G  G    +TL     G  G  G  G+PG           GP+GS   +G PG  GQ G +    G +GE G  G +    + G PGP GEKG  G  G  GK G  G  G  GL G KG  G+  G E GAPG +G +G            G KG LGD G PG  G  G  G  GPKGERG      ++G  G  G+ GP G +GK+G+ G  G  G+ G  G KG  G KG  G   K   G PG+ G PG  G+ G  G+PG                           PG  G     GP GEKG  G  G+PG P  P  P      G  G +G  G  G+ G  GE G  G+PGP G+   R         G+KG  G  G  G  G +GD G+ G  GP  L  +           G  GPPG  GI G    +G+ G  G PG  GE G QG RG  GL G SG  G KG  G    + GS G KG  GE         G  G +GR G +G +G  G KGE G+       +       G  GV G  GEPG PG +    G +G  GP G  G +GPKG+ G  G +G NG  G  G            PGPAG PG  G +G  G     PGK G  G   KG +G  G QGP G  G  
Sbjct:  509 PGLKGARGDRGSGGAQGPAGAPGLVGPLGPSGPKGKKGEPILSTIQGMPGDRGDSGSQGFRGVIGEPGKDGVPGLPGLPGLPGDGGQGFPGEKGLPGLPGEKGHPGPPGLPGNGLPGLPGPRGLPGDKGKDGLPGQQGLPGSKG----ITLPCIIPGSYGPSGFPGTPG---------FPGPKGS---RGLPGTPGQPGSS----GSKGEPGSPGLVHLPELPGFPGPRGEKGLPGFPGLPGKDGLPGMIGSPGLPGSKGATGDIFGAENGAPGEQGLQGL----------TGHKGFLGDSGLPGLKGVHGKPGLLGPKGERGSPGTPGQVGQPGTPGSSGPYGIKGKSGLPGAPGFPGISGHPGKKGTRGKKGPPGSIVK--KGLPGLKGLPGNPGLVGLKGSPG--------------------------SPGVAGLPALSGPKGEKGSVGFVGFPGIPGLPGIPGTRGLKGIPGSTGKMGPSGRAGTPGEKGDRGNPGPVGIPSPRRPMSNLWLKGDKGSQGSAGSNGFPGPRGDKGEAGRPGPPGLPGAPGLPGIIKGVSGKPGPPGFMGIRGLPGLKGSSGITGFPGMPGESGSQGIRGSPGLPGASGLPGLKGDNGQTVEISGSPGPKGQPGES--------GFKGTKGRDGLIGNIGFPGNKGEDGK--VGVSGDVGLPGAPGFPGVAGMRGEPGLPGSS----GHQGAIGPLGSPGLIGPKGFPGFPGLHGLNGLPGTKGTHGTPGPSITGVPGPAGLPGPKGEKGYPGIGIGAPGKPGLRGQ--KGDRGFPGLQGPAGLPGAP 1246          

HSP 4 Score: 150.984 bits (380), Expect = 4.992e-36
Identity = 265/767 (34.55%), Positives = 329/767 (42.89%), Query Frame = 0
Query:  730 LKGPKGSRGRPGNYGGRGAPGPRGPLGSAGLAGKQGPRGD--------PGLNGMVGERGPKGLIGLPGFPGFDGIPGVPGVEGRQGEMGGCGLPGQPGEAGKAGLGGFTGVSGRKG------------------------PSGTKGRKGEPA----PDNPDLEILPGDDGDPGEGGD---GGFKGQKGEIGMLGMIGDDGVDGSEGPKGFMGEMGDEGVV--TLEKGPKGFTGDRGEAGSPGVAEA---TSGELLQG---PEGSSGPKGEPGMKGQVGQNSDVKGIQGERGEQGFIGQTGVEGGPGPAGEKGPSGELGFQGKKGQQGEAGMDGLTGEKGQQGEPGF--EQGAPGPKGKRGNPAEPTSDASPKGAKGPLGDDGDPGPNGAIGAEGEEGPKGERGMSKMGNSGDDGAVGPRGPQGKNGMQGEAGMGGLRGPKGIKGDIGAKGDTG-RNAKAISGDPGMDGPPGQKGVFGSLGNPGEVGPRGPTMQLEQNCIENVCPQNRKGDPGQQGDTGPIGPAGEKGVNGSKGYPGGPCPTCPSGDDGISGDFGEPGKVGE---------------------KGELGSAGDPGPAGVKGQRGEKGPSGFRGEGGIKGLQGDAGDIGGKG---PDALGESLAEEGDKGPPGSDGINGTRG----------ADGPKGLPGKKGELGEQGKRGENGLKGESGPAGNKGQPGLGGSKGEKGDQGED----------SRGLTGEGGELGPEGRQGAVGEVGLKGRKGEAGEE 1402
              GP+G +G PG  G  G  G  G +GS GL G +G  GD        PG  G+ G  G KG +G  G PG  G+ G PG+ G +GE G  G PGQ G+ G  G  G  G+ G+ G                        P G+  +KG P     P NP L  L G  G PG  G     G KG+KG +G +G  G  G+ G  G +G  G  G  G +  +   G  G  GDRG  G  G+       S   L+G    +GS+G  G PG +G  G+                    GV G PGP G  G  G  G +G  G  G  GM G +G +G +G PG     G PG KG  G   E +    PKG  G  G  G  G +G IG  G  G KGE G  K+G SGD G  G  G  G  GM+GE G+ G  G +G  G +G+ G  G +      G  G++G PG KG  G+ G P   G  GP   L     E   P    G PG+ G  G  G  G  G+ G  G PG P  + PS   G  GD G PG  GE                      G+ G  G PGP G KG +G  G SG  GE G+KG++G+ G +G  G   P         +G  GP GS G+ G  G            GP GLPG             +G+ G +G  G +G  GL GSKG  G  G+D          + G  G  G  GP G +GA G+ G  G  G  G+ 
Sbjct:  713 FPGPRGEKGLPGFPGLPGKDGLPGMIGSPGLPGSKGATGDIFGAENGAPGEQGLQGLTGHKGFLGDSGLPGLKGVHGKPGLLGPKGERGSPGTPGQVGQPGTPGSSGPYGIKGKSGLPGAPGFPGISGHPGKKGTRGKKGPPGSIVKKGLPGLKGLPGNPGLVGLKGSPGSPGVAGLPALSGPKGEKGSVGFVGFPGIPGLPGIPGTRGLKGIPGSTGKMGPSGRAGTPGEKGDRGNPGPVGIPSPRRPMSNLWLKGDKGSQGSAGSNGFPGPRGDKGEAGRPGPPGLPGAPGLPGIIKGVSGKPGPPGFMGIRGLPGLKGSSGITGFPGMPGESGSQGIRGSPGLPGASGLPGLKGDNGQTVEISGSPGPKGQPGESGFKGTKGRDGLIGNIGFPGNKGEDG--KVGVSGDVGLPGAPGFPGVAGMRGEPGLPGSSGHQGAIGPLGSPGLIGPKGFPGFPGLHGLNGLPGTKGTHGTPG-PSITGVPGPA-GLPGPKGEKGYPGIGIGAPGKPGLRGQKGDRGFPGLQGPAGLPGAPGISLPSLIAGQPGDPGRPGLDGERGRPGPAGPPGPPGPSSNQGDTGDPGFPGIPGPKGPKGDQGIPGFSGLPGELGLKGMRGEPGFMGTPGKVGPPGDPGFPGMKGKAGPRGSSGLQGDPGQTPTAEAVQVPPGPLGLPGI------------DGIPGLTGDPGAQGPVGLQGSKGLPGIPGKDGPSGLPGPPGALGDPGLPGLQGPPGFEGAPGQQGPFGMPGMPGQS 1463          

HSP 5 Score: 132.494 bits (332), Expect = 2.244e-30
Identity = 320/966 (33.13%), Positives = 402/966 (41.61%), Query Frame = 0
Query:  734 KGSRGRPGNYGGRGAPGPRGPLGSAGLAGKQGPRGDPGLNGMVGERGPKGLIGLPGFPGFD---------GIPGVPGVEGRQGEMGGCGLPGQPGEAGKAGLGGFTGVSGRKG-----PSGTKGRKGEPAPDNPDLEILPGDDGDPGEGGDG--------------------------GFKGQKGEIGMLGMIGDDGVDGSEGPKGFMGEMGDEGVVTLEKGPKGFTGDRGEAGSPGVAEATSGELL-----------QGPEGSSGPKGEPGMKGQVGQ-----NSDVKGIQGERGEQGFIGQ-----------TGVEGGPGPAGEKGPSGELGFQGKKGQQGEAGMDGLTGEKGQQGEPGFEQGAPGPKGKRGNPAEPTSDASPKGAKGPLGDDGDPGP---------------NGAIGAEGEEGPKGERGMSKM-GNSG---DDGAV------------------------GPRGPQGKNGMQGEAGMGGLRGPKG---------------IKGDIGAKGDTGRNA-KAISGDPGMD-----------------GPPGQKGVFGSLGNPGEVGPRGPTMQLEQNCIENVCPQNRKGDPGQQGDTGPIGPAGEKGVN---------GSKGYPGGPCPTCPSGDDGISGDFGEPGKVGEKGELGSAG---DPGPAGVKGQRGEKGPSGF---RGEGGIKGLQGDAGDIGGKGPDALGESL-AEEGDKGPPGSDGINGTRGADGPKGLPGKKGELGEQGKRGENGLKGESGPAGNKGQPGLGGSKGEKGDQGEDSRGLTGEGGELGPEGRQGAVGEVGLKGRKGEAGEEPTEEEKEAMKNK-------MAGDEGVRGADGEPGEPGRNSTTPGERGPRGPKGKMGTV---GPKGYGGIKGDNGFNGRSGEDGDPGPAGFPGKNGPRGQSGEPGKAG-------KEGIAPKGQKGETGGQGPNGERGP 1523
            KG+RGRPG  G +G  GP+G  GS GL+G +G RG PGL G  G +G KG +G+PGF G +         G  G PG++G  G  G  G PG  G  G  G  G  G  G KG     P   KG KG+P    P L+ + G  G PG  G                            GF+G+KG  G +G+ G  G   S G   FMG    +     E GPKGF G  G  G PG+                    +GP GS G +G PG +G+ G       +  +GI+G++G+ G  G            +G  G PG  G  G  G+ G QG +G  G  G+  L+G  G  G     QG PG KG +GNP   T  A+    +  L     P                 +G  G  GE+GPKG  G+  + G+SG    DG V                        G RG +G  G QG AG  GL GP G               I+G  G +GD+G    + + G+PG D                 G PG+KG+    G PGE G  GP             P+   GD G+ G  G  G  G KG+          G  G+PG P    P G  G+ G  G+PG  G KGE GS G    P   G  G RGEKG  GF    G+ G+ G+ G  G  G KG  A G+   AE G  G  G  G+ G +G  G  GLPG KG  G+ G  G  G +G  G  G  GQPG  GS G  G +G+               G  GA G  G+ G  G+ G    +    ++  K       + G+ G+ G  G PG PG  +  P   GP+G KG +G V   G  G  GI G  G  G  G  G  GP+G  G  G +G  G PG  G          +  KG KG  G  G NG  GP
Sbjct:   46 KGARGRPGPIGIQGPTGPQGFTGSTGLSGLKGERGFPGLLGPYGPKGDKGPMGVPGFLGINGIPGHPGQPGPRGPPGLDGCNGTQGAVGFPGPDGYPGLLGPPGLPGQKGSKGDPVLAPGSFKGMKGDPG--LPGLDGITGPQGAPGFPGAVGPAGPPGLQGPPGPPGPLGPDGNMGLGFQGEKGVKGDVGLPGPAGPPPSTGELEFMGFPKGKKGSKGEPGPKGFPGISGPPGFPGLGTTGEKGEKGEKGIPGLPGPRGPMGSEGVQGPPGQQGKKGTLGFPGLNGFQGIEGQKGDIGLPGPDVFIDIDGAVISGNPGDPGVPGLPGLKGDEGIQGLRGPSGVPGLPALSGVPGALG----PQGFPGLKGDQGNPGRTTIGAAGLPGRDGLPGPPGPPGPPSPEFETETLHNKESGFPGLRGEQGPKGNLGLKGIKGDSGFCACDGGVPNTGPPGEPGPPGPWGLIGLPGLKGARGDRGSGGAQGPAGAPGLVGPLGPSGPKGKKGEPILSTIQGMPGDRGDSGSQGFRGVIGEPGKDGVPGLPGLPGLPGDGGQGFPGEKGL---PGLPGEKGHPGPPGLPGNGLPGLPGPRGLPGDKGKDGLPGQQGLPGSKGITLPCIIPGSYGPSGFPGTPGFPGPKGSRGLPGTPGQPGSSGSKGEPGSPGLVHLPELPGFPGPRGEKGLPGFPGLPGKDGLPGMIGSPGLPGSKG--ATGDIFGAENGAPGEQGLQGLTGHKGFLGDSGLPGLKGVHGKPGLLGPKGERGSPGTPGQVGQPGTPGSSGPYGIKGKS--------------GLPGAPGFPGISGHPGKKGTRGKKGPPGSIVKKGLPGLKGLPGNPGLVGLKGSPGSPGV-AGLPALSGPKGEKGSVGFVGFPGIPGLPGIPGTRGLKGIPGSTGKMGPSGRAGTPGEKGDRGNPGPVGIPSPRRPMSNLWLKGDKGSQGSAGSNGFPGP 985          

HSP 6 Score: 117.472 bits (293), Expect = 7.941e-26
Identity = 179/504 (35.52%), Positives = 218/504 (43.25%), Query Frame = 0
Query: 2312 GPRGPSGRPGGDG------PPGDQGPVGSIGNTGRKGDQGDPGF-------------GPKGEPGEAGGEGFPGRDGEKGFRGRDGTVGPRGPIGDKGSEGPSGPKGGMGEDGFIGFNGLEGPPGPKGQRGDAVDEGQKGLKGEPGEPGLKGPRGDQGSL---PPGNTLKD--LRGEEGNQGPAGDEGGEGPEG------------------------GLKVAPGPLGLAGEKGEKGVAGDFGERGPLGPLGPDGFKGEEGPTGEKGDSGSPGFFGKIGEKGVTGDGGEPGVSLKGEPGVPGEPGSIGDLGPEGEVGLRGFYGMEGGKGEVGEIGPPGRQQLPEEGKDQEMKGRVGEKGFSGDFGPDGEPGEPGILLERQGEFGSKGLKGAQGPKGATGERGIGGLIGMIGKDGLKGRTGLDGDPGAAGPQGAEGEMGPVIGGGEKGDFGGLGDVGLTGRPGLKGVKGFIG-DIGPKGDLGIKGGQGDSGFPGMIGDQGPPGGVG 2766
            GP+G  G PG  G       PG  GP G  G +G  G  G PG              GP G   + G  G  G  G  G  G  G+ G  G  G     GP G KG +G  GF G  GL G PG +G +G     G+ G  G  G PG KG RG+ G +    P   + +  L+G++G+QG AG  G  GP G                        G+   PGP G  G +G  G+ G  G           GF G  G +G +G  GSPG  G  G  G+ GD G+  V + G P            GP+G+ G  GF G +G  G +G IG PG             KG  G+ G SGD G  G PG PG+   R    G  GL G+ G +GA G  G  GLIG  G  G  G  GL+G PG    +G  G  GP I           G  G  G PG KG KG+ G  IG  G  G++G +GD GFPG+ G  G PG  G
Sbjct:  795 GPKGERGSPGTPGQVGQPGTPGSSGPYGIKGKSGLPGAPGFPGISGHPGKKGTRGKKGPPGSIVKKGLPGLKGLPGNPGLVGLKGSPGSPGVAGLPALSGPKGEKGSVGFVGFPGIPGLPGIPGTRGLKGIPGSTGKMGPSGRAGTPGEKGDRGNPGPVGIPSPRRPMSNLWLKGDKGSQGSAGSNGFPGPRGDKGEAGRPGPPGLPGAPGLPGIIKGVSGKPGPPGFMGIRGLPGLKGSSGI---------TGFPGMPGESGSQGIRGSPGLPGASGLPGLKGDNGQ-TVEISGSP------------GPKGQPGESGFKGTKGRDGLIGNIGFPGN------------KGEDGKVGVSGDVGLPGAPGFPGVAGMR----GEPGLPGSSGHQGAIGPLGSPGLIGPKGFPGFPGLHGLNGLPGT---KGTHGTPGPSI----------TGVPGPAGLPGPKGEKGYPGIGIGAPGKPGLRGQKGDRGFPGLQGPAGLPGAPG 1247          

HSP 7 Score: 97.4413 bits (241), Expect = 1.096e-19
Identity = 200/576 (34.72%), Positives = 237/576 (41.15%), Query Frame = 0
Query: 2310 EGGPRGPSGRPGGDGPPGDQGPVGSIGNT---------GRKGDQGDPGF-GPKGEPGE--------------------AGGEGFPGRDGEK-----------------------GFRGRDGTVGPRGPIGDKGSEGPSGPKGGMGEDGFIGFNGLEGPPGPKGQRGDAVDEGQKGLKGEPGEPGL---------KGPRGDQGSLP--PGNTLKD-LRGEEGNQGPAGDEGGEGPEGGLKV-APGPLGLAGEKGEKGVAGDFGERGPLGPLGPDGFKGEEGPTGEKGDSGSPGFFGKIGEKGVTGDGGEPGVSLK-GEPGVPGEPGSIGD---------LGPEGEVGLRGFYGMEGGKGEVGEIGPPGRQQLPEEGKDQEMKGRVGEKGFSGDFGPDGEPGEPGILLERQGEFGSKGLKGAQGPKGATGERGIGGLIGMIGKDGLKGRTGLDGDPGAAGPQGAEGEMGPVIGGGEKGDFGGLGDVGLTGRPGLKG----------------------VKGFIGDIGPKGDLGI------KGGQGDSGFPGMIGDQGPPGGVGTPGPRGEEGPLGLEG 2781
             GG +GP+G PG  GP G  GP G  G           G +GD G  GF G  GEPG+                     G +G PG  GEK                       G +G+DG  G +G  G KG   P    G  G  GF G  G  GP G +G  G     G  G KGEPG PGL          GPRG++G LP  PG   KD L G  G+ G  G +G  G   G +  APG  GL G  G KG  GD G        G  G  G+ G  G KG+ GSPG  G++G+ G  G  G  G+  K G PG PG PG  G           GP G +  +G  G++G  G  G +G  G    P       + G  GEKG  G  G  G PG PGI        G++GLK            GI G  G +G  G  G  G  GD G  GP G      P+     KGD G  G  G  G PG +G                      +KG  G  GP G +GI      KG  G +GFPGM G+ G  G  G+PG  G  G  GL+G
Sbjct:  520 SGGAQGPAGAPGLVGPLGPSGPKGKKGEPILSTIQGMPGDRGDSGSQGFRGVIGEPGKDGVPGLPGLPGLPGDGGQGFPGEKGLPGLPGEKGHPGPPGLPGNGLPGLPGPRGLPGDKGKDGLPGQQGLPGSKGITLPCIIPGSYGPSGFPGTPGFPGPKGSRGLPGTPGQPGSSGSKGEPGSPGLVHLPELPGFPGPRGEKG-LPGFPGLPGKDGLPGMIGSPGLPGSKGATGDIFGAENGAPGEQGLQGLTGHKGFLGDSGLP------GLKGVHGKPGLLGPKGERGSPGTPGQVGQPGTPGSSGPYGIKGKSGLPGAPGFPGISGHPGKKGTRGKKGPPGSIVKKGLPGLKGLPGNPGLVGLKGSPGSPGVAGLPALSGPKGEKGSVGFVGFPGIPGLPGIP-------GTRGLK------------GIPGSTGKMGPSGRAGTPGEKGDRGNPGPVGIPSPRRPMSNLWLKGDKGSQGSAGSNGFPGPRGDKGEAGRPGPPGLPGAPGLPGIIKGVSGKPGPPGFMGIRGLPGLKGSSGITGFPGMPGESGSQGIRGSPGLPGASGLPGLKG 1069          

HSP 8 Score: 95.5153 bits (236), Expect = 4.503e-19
Identity = 172/505 (34.06%), Positives = 219/505 (43.37%), Query Frame = 0
Query: 2363 FPGRDGEKGFRGRDGTVGPRGPIGDKGSEGPSGPKGGMGEDGFIGFNGLEGPPGPKGQRGDAVDEGQKGLKGE-----------------------PGEPGLKGPRGDQGSLPPGNTL----------KDLRGEEGNQGPAGDEGGEGPEGGLKVAPGPLGLAGEKGEKGVAGDFGERGPLGPLGPDGFKGEEGPTGEKGDSGSPG---------FFGKIGEKGVTGDGGEPG----VSLKGEPGVPGEPGSIGDL--GPEGEVGLRGFYGMEGGKGEVGEIGPPGRQQLPEEGKDQEMKGRVGEKGFSGDFGPDGEPGEPGILLE--RQGEFGSKGLKGAQGPKGATGERGIGGLI------------GMIGKDGLKGRTGLDGDPGAAGPQGAEGEMG----PVIGG--GEKGDFGGLGDVGLTGRPG------LKGVKGFIGDIGPKGDLGIKGGQGDSGFPGMIG---------------DQGPPGGVGT---PGPRGEEG 2775
             PG  G +G RG  G  GP G  G  G  GPSGPKG  GE       G+ G       RGD+  +G +G+ GE                       PGE GL G  G++G   P              + L G++G  G  G +G  G +G + +   P  + G  G  G  G  G  GP G  G  G  G+ G +G KG+ GSPG         F G  GEKG+ G  G PG      + G PG+PG  G+ GD+     G  G +G  G+ G KG +G+ G PG            +KG  G+ G  G  G  G PG PG + +    G  G  G+KG  G  GA G  GI G              G I K GL G  GL G+PG  G +G+ G  G    P + G  GEKG  G +G  G+ G PG      LKG+ G  G +GP G  G  G +GD G PG +G               D+G  G  G+   PGPRG++G
Sbjct:  508 LPGLKGARGDRGSGGAQGPAGAPGLVGPLGPSGPKGKKGEPILSTIQGMPG------DRGDSGSQGFRGVIGEPGKDGVPGLPGLPGLPGDGGQGFPGEKGLPGLPGEKGHPGPPGLPGNGLPGLPGPRGLPGDKGKDGLPGQQGLPGSKG-ITL---PCIIPGSYGPSGFPGTPGFPGPKGSRGLPGTPGQPGSSGSKGEPGSPGLVHLPELPGFPGPRGEKGLPGFPGLPGKDGLPGMIGSPGLPGSKGATGDIFGAENGAPGEQGLQGLTGHKGFLGDSGLPG------------LKGVHGKPGLLGPKGERGSPGTPGQVGQPGTPGSSGPYGIKGKSGLPGAPGFPGISGHPGKKGTRGKKGPPGSIVKKGLPGLKGLPGNPGLVGLKGSPGSPGVAGLPALSGPKGEKGSVGFVGFPGIPGLPGIPGTRGLKGIPGSTGKMGPSGRAGTPGEKGDRGNPGPVGIPSPRRPMSNLWLKGDKGSQGSAGSNGFPGPRGDKG 990          

HSP 9 Score: 88.1965 bits (217), Expect = 6.386e-17
Identity = 199/565 (35.22%), Positives = 240/565 (42.48%), Query Frame = 0
Query: 2312 GPRGPSGRPGGDGPPGDQGPVGSIGNTGRKGDQGDPG-FGPKGEPGE-----------AGGEGFPGRDGEKGFRGRDGTVGPRGPIGDKGSEGPSGPKGGMGEDGFIGFNGLEGPPGPKGQRGDAVDEGQKGLKG--------------------------EPGEPGLKGPRGDQGSLPPGNTLKDLRGEEGNQGPAGDEGG-----------EGPEGGLKVAPGPLGLAGEKGE---KGVAGDFGERGPLGPLGPDGFKGE------EGPTGEKGDSGSPGFFGKIGEKGVTGDG--------------GEPGVSLKGEPGVPGEPGSI-----------------GDLGPEGEVGLRGFYGMEGGKGEV------GEIGPPGRQQLPEEGKDQEMKGRVGEKGFSGDFGPDGEPGEPGILLERQGEFGSKGLKGAQGPKGATGERGIGGLIGMIGKDGLKGRTGLDGDPGAAGPQGAEGEMGPVIGGGEKGDFGGLGDVGLTGRPGLKGVKGFIGDIGPKGDLGIKGGQGDSGFPGMIGDQGPPGGVGTPGPRGEEGPLGLEG 2781
            GPRGP G  G  GPPG QG  G++G  G  G QG  G  G  G PG            +G  G PG  G  G +G +G  G RGP G  G    SG  G +G  GF G  G +G PG           G  GL G                          E G PGL+G +G +G+L        L+G +G+ G    +GG                GL   PG  G  G++G    +G AG  G  GPLGP GP G KGE      +G  G++GDSGS GF G IGE G  G                G PG   KG PG+PGE G                   G  G +G+ GL G  G+ G KG        G  GP G    P     +  +G  G  G  G  G  GEPG PG++   +       L G  GP+G   E+G+ G  G+ GKDGL    G+ G PG  G +GA G+    I G E G  G  G  GLTG       KGF+GD G  G  G+ G  G  G  G  G  G PG VG PG  G  GP G++G
Sbjct:  300 GPRGPMGSEGVQGPPGQQGKKGTLGFPGLNGFQGIEGQKGDIGLPGPDVFIDIDGAVISGNPGDPGVPGLPGLKGDEGIQGLRGPSGVPGLPALSGVPGALGPQGFPGLKGDQGNPG-------RTTIGAAGLPGRDGLPGPPGPPGPPSPEFETETLHNKESGFPGLRGEQGPKGNL-------GLKGIKGDSGFCACDGGVPNTGPPGEPGPPGPWGLIGLPGLKGARGDRGSGGAQGPAGAPGLVGPLGPSGPKGKKGEPILSTIQGMPGDRGDSGSQGFRGVIGEPGKDGVPGLPGLPGLPGDGGQGFPGE--KGLPGLPGEKGHPGPPGLPGNGLPGLPGPRGLPGDKGKDGLPGQQGLPGSKGITLPCIIPGSYGPSGFPGTPGFPGPKGSRGLPGTPGQPGSSGSKGEPGSPGLVHLPE-------LPGFPGPRG---EKGLPGFPGLPGKDGL---PGMIGSPGLPGSKGATGD----IFGAENGAPGEQGLQGLTGH------KGFLGDSGLPGLKGVHGKPGLLGPKGERGSPGTPGQVGQPGTPGSSGPYGIKG 825          
The following BLAST results are available for this feature:
BLAST of Collagen protein vs. Swiss-Prot (Human)
Analysis Date: 2018-01-31 (Blastp Clytia hemisphaerica v1.0 proteins vs SwissProt (Homo sapiens))
Total hits: 1
Match Name | E-value | Identity | Description
CO4A6 | 1.714e-111 | 39.46 | Collagen alpha-6(IV) chain OS=Homo sapiens GN=COL4...
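
The identity value in the summary table matches the percentage reported for HSP 1. A small sketch of that arithmetic, assuming the percentage is simply the match count divided by the alignment length and rounded to two decimals; `percent` is a hypothetical helper.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of aligned columns as a percentage, rounded to two decimals."""
    return round(100.0 * matches / alignment_length, 2)


# HSP 1 reports Identity = 320/811 and Positives = 400/811.
print(percent(320, 811))
print(percent(400, 811))
```

The same function reproduces the percentages of the other HSPs from their match counts and alignment lengths.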