TCONS_00016953-protein (polypeptide) - C. hemisphaerica

Overview
NameTCONS_00016953-protein
Unique NameTCONS_00016953-protein
Typepolypeptide
OrganismClytia hemisphaerica (Jellyfish)
Sequence length2130

Sequence
The following sequences are available for this feature:

polypeptide sequence

>TCONS_00016953-protein ID=TCONS_00016953-protein|Name=TCONS_00016953-protein|organism=Clytia hemisphaerica|type=polypeptide|length=2130bp
MEIKWELTLLLLFAIKGISGWWGQIYWSQRYGRYGINYRNWRGFWNDESS
YPGYKRDSESKTITSGCIVDEVTKTLDCSRSTRIEIDVVKQTTFTYIPEN
AFDARVKEIRISNEELKTIHPKAFHNLVNLENLKIVSTKLNHIPDVSRCS
KLKNLDLFDNAIEVYRHNVTNLPSSLEEVILISNKIYKFPVRYFNLPKLR
YMGLALNVMEHFPGDAFGNVQALEYLGVDDNKIDSISRRELEPLMNSPSF
RHLNISNNRINYIAPKALKVLKNLVVLELHNNLIVNLPKYALWRIPKLVH
IDIDHNKIAVLTSNFITDCPELRNLYLHSQVDKMRSVYFDSLRNLTKLTQ
VYLGSNALAQFPHPVFSEEVFPSLSSLYLDNNQIPSLSDYSKDEFPPSAQ
VHYLTKKDTFKPFKNLPVLTTLDVKSNSIMEIKKTDLEYARSLQVLFIQG
NVLPEGSIHEDAFVKATSLVTLDLSSQRGTPRIQYIPKALQKHTMPNLGT
LLLYDNKITFILKGAFIKIPKLEYLSLSANQIVAIDDDAFPNSLSTLLLS
SNKFAFTNYKPFFGLTKLESLNLQGNKIKVIPDVGLHGLERIKNLKLSAN
KIGRLKKIHFKDIKIMTELDFDSNDIAFIEDGTFAENRPSKVINYMLFRV
NRVTQLPKTDFWNMHINILSFQFNRITKLHKGDFNNTKVDDKVVLFQNRI
ERIESYAVVNLNAVNFQIGGQQGNNPLQIIQQYAFVDVRVGNELDLSYHS
VTTLESYSFNIKGIQGTLYVNNGQLKAIETFAFNLGRTNGVINLSKNKIE
TIARKIFVDGTTFRDLNLYGNKLRSIDDYAFLGSSMTGLLDFYQNELTIF
PASAIKIYTSMKHLKMSSNRIEAIPAGSLDTLTNLQIFEMWQTDLKKIPN
NLFINNAKLQTIKLEDNKITTLEDDYLKMKGNSDLKELRIDQNPITHVPK
FNQSYLRLEKFWAKNVQTVSPELFGSNMPALRYFSLDTTRLECECFWYDT
ILGYLSRSNEYVEKTIQCQSPPPLRGFSAFRDASAIRGRKHQFICIPSNI
VVSAPADYTIQLNYDYPARLYPEENVNLATDKLYFDATCKLENKDVFLTA
SVLNQKQITITGNDRVLAGRNYWCHMTMTYNRNNQGNVTSARSQSILITT
LERKSKATSCSRGDAYECQTLESHITCVTNAGCDIDASNALRKSSCDQSG
CCLHCFKSCIKCNNTVVDTQTIDITYYDFSRDDPDFAKSRYTPDIARPKF
MASPYETWLANPAIEDPLLDSFSGWFQSITGRNKVVKSTITLSSEGKSDT
VNNKDLFTFWDNDFWAVNGRGFTAEGQKDCKGLELINFGFTSAIRSGFIF
GGGEHMQFAGGEDLWVYINKQLVVSIVIPQGYDGKQVICKTLHLQNTTTS
GLLIPHVGIVDQNTRTCINTRPLMEEAVTINLKENIMYSLDIFHVERFRC
TSEFLLATSGVVFAGTEQQKDIVDYIFEPPEDLHLKAIVGEFLVSDLYNS
NSPYTVTVETGNSEDRYDIVRNSTQAHHDSATKPEATIYKNFTLNGEVII
VCPNNTNASANNNFEHTITSPQAFPNIDTKYALLLLKQPLDYETTSRYQL
NLKVEDHGGRLGYITILILVVDKSDNCPIIRDNVFPHFTPLPPLQQNPLL
TIVATDADSGLNGKISYVSQLISKIPTLKHTPFDNNGKTVWENKTVEWTI
HVNLYAIDNGSPKRGDFFSISMSYAPSCDKTGRIVVNETSGQVFFAAPGM
TTNDTSKFRYGEDQCYECTAGYFCPGNGQELNCVQNEETKHYFSYGFASS
CSPCPEGWLCHNGILRKCAENTYVKCNTTWCPQSCFECQPGYYCYEGIRQ
SCRPGTFSKGKGPCQLCAPGSFSNSSNSIQCTCCPSGYGSTYEKTSCEPC
QFKEFSTGECTMCKTCVPPGSCGCDRNPCYRSVSCFNTGESGASFQCGKC
PNGYSGDGTSCQDVDECTNNNVCYNSACINMSPGFKCAGCPPGFTGNAPH
GVGGEVVHERQFCRDINECENPELHSCDPNAECINTLGSYRCGKCKPGYV
GDGYLGCKSGDYCATGQHNCHINATCIPLGSRKFTCVCKEGYAGNGQICG
LDHDYDGVTSFGAACIENSCKLLIHAWKFI
Run BLAST on NCBI
Gene-mRNA-Prot
This polypeptide comes from the following gene feature:
Feature NameUnique NameSpeciesType
XLOC_009493XLOC_009493Clytia hemisphaericagene
This polypeptide derives from the following transcript feature(s):
Feature NameUnique NameSpeciesType
TCONS_00016953TCONS_00016953Clytia hemisphaericatranscript
Annotated Terms
The following terms have been associated with this polypeptide:
Vocabulary: INTERPRO
TermDefinition
IPR002126Cadherin
IPR003591Leu-rich_rpt_typical-subtyp
IPR011641Tyr-kin_ephrin_A/B_rcpt-like
IPR001881EGF-like_Ca-bd_dom
IPR000742EGF-like_dom
IPR032675L_dom-like
IPR001611Leu-rich_rpt
IPR024731EGF_dom
IPR026906LRR_5
IPR013032EGF-like_CS
IPR018097EGF_Ca-bd_CS
IPR015919Cadherin-like
Vocabulary: Cellular Component
TermDefinition
GO:0016020membrane
Vocabulary: Molecular Function
TermDefinition
GO:0005509calcium ion binding
GO:0005515protein binding
Vocabulary: Biological Process
TermDefinition
GO:0007156homophilic cell adhesion via plasma membrane adhesion molecules
GO Annotation
GO Assignments
This polypeptide is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0007156 homophilic cell adhesion via plasma membrane adhesion molecules
cellular_component GO:0016020 membrane
molecular_function GO:0005509 calcium ion binding
molecular_function GO:0005515 protein binding
InterPro
Analysis Name: InterPro Annotations of C. hemisphaerica v1.0
Date Performed: 2017-06-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002126CadherinPRINTSPR00205CADHERINcoord: 1580..1606
score: 25.28
coord: 1612..1629
score: 33.14
coord: 1234..1247
score: 28.01
IPR002126CadherinSMARTSM00112CA_2coord: 1553..1628
e-value: 2.2E-6
score: 37.2
IPR003591Leucine-rich repeat, typical subtypeSMARTSM00369LRR_typ_2coord: 271..294
e-value: 0.49
score: 19.5
coord: 220..243
e-value: 120.0
score: 4.3
coord: 196..219
e-value: 40.0
score: 8.4
coord: 416..439
e-value: 130.0
score: 4.1
coord: 295..318
e-value: 180.0
score: 2.9
coord: 519..542
e-value: 1.4
score: 18.0
coord: 127..149
e-value: 370.0
score: 0.4
coord: 613..636
e-value: 130.0
score: 4.2
coord: 814..833
e-value: 75.0
score: 6.1
coord: 248..270
e-value: 390.0
score: 0.2
coord: 495..518
e-value: 6.1
score: 15.0
coord: 371..393
e-value: 21.0
score: 10.6
coord: 858..881
e-value: 76.0
score: 6.0
coord: 345..368
e-value: 16.0
score: 11.6
coord: 565..588
e-value: 0.022
score: 23.9
coord: 882..905
e-value: 68.0
score: 6.4
IPR011641Tyrosine-protein kinase ephrin type A/B receptor-likeSMARTSM01411GCC2_GCC3_2coord: 1855..1900
e-value: 0.0036
score: 26.6
IPR001881EGF-like calcium-binding domainSMARTSM00179egfca_6coord: 2015..2058
e-value: 4.3E-10
score: 49.5
coord: 2059..2100
e-value: 0.57
score: 3.8
coord: 1925..1962
e-value: 0.37
score: 5.3
coord: 1963..2014
e-value: 6.3E-6
score: 35.7
IPR001881EGF-like calcium-binding domainPFAMPF07645EGF_CAcoord: 2015..2050
e-value: 4.9E-9
score: 36.1
coord: 1963..1997
e-value: 5.2E-5
score: 23.3
IPR000742EGF-like domainSMARTSM00181egf_5coord: 2062..2100
e-value: 0.0068
score: 25.6
coord: 2018..2058
e-value: 4.4E-4
score: 29.6
coord: 1851..1898
e-value: 72.0
score: 6.2
coord: 1966..2014
e-value: 0.032
score: 23.4
coord: 1923..1962
e-value: 0.8
score: 18.7
IPR000742EGF-like domainPROSITEPS50026EGF_3coord: 2059..2100
score: 13.644
IPR000742EGF-like domainPROSITEPS50026EGF_3coord: 2015..2058
score: 9.296
IPR000742EGF-like domainPROSITEPS50026EGF_3coord: 1963..2000
score: 7.305
NoneNo IPR availableSMARTSM00365LRR_sd22_2coord: 565..586
e-value: 26.0
score: 12.2
coord: 519..540
e-value: 180.0
score: 5.4
coord: 271..289
e-value: 210.0
score: 4.8
coord: 371..392
e-value: 290.0
score: 3.7
coord: 906..927
e-value: 210.0
score: 4.8
coord: 149..174
e-value: 290.0
score: 3.7
coord: 589..610
e-value: 770.0
score: 0.2
NoneNo IPR availableGENE3D2.40.155.10coord: 2040..2068
e-value: 3.9E-11
score: 42.4
NoneNo IPR availableGENE3D2.60.40.60coord: 1627..1758
e-value: 0.018
score: 14.9
NoneNo IPR availableGENE3D2.10.25.10coord: 1965..1987
e-value: 1.7E-7
score: 30.6
NoneNo IPR availableGENE3D2.60.40.60coord: 1473..1626
e-value: 1.2E-9
score: 37.7
NoneNo IPR availableGENE3D2.10.25.10coord: 1923..1964
e-value: 9.9E-7
score: 28.1
NoneNo IPR availableGENE3D2.10.50.10coord: 1759..1851
e-value: 0.001
score: 18.9
coord: 1852..1916
e-value: 0.0018
score: 18.1
NoneNo IPR availableGENE3D2.10.25.10coord: 2069..2100
e-value: 4.7E-9
score: 35.8
NoneNo IPR availableGENE3D2.10.25.10coord: 1988..2014
e-value: 4.4E-7
score: 29.2
NoneNo IPR availableGENE3D2.10.25.10coord: 2015..2039
e-value: 9.9E-10
score: 37.7
NoneNo IPR availablePROSITEPS50268CADHERIN_2coord: 1530..1638
score: 12.589
NoneNo IPR availableCDDcd00054EGF_CAcoord: 2061..2094
e-value: 0.00537549
score: 36.0754
NoneNo IPR availableCDDcd00054EGF_CAcoord: 1963..1996
e-value: 1.41106E-6
score: 46.861
NoneNo IPR availableCDDcd11304Cadherin_repeatcoord: 1648..1667
e-value: 0.00909877
score: 35.7522
NoneNo IPR availableCDDcd00185TNFRSFcoord: 1864..1968
e-value: 3.21452E-5
score: 43.3519
NoneNo IPR availableCDDcd00054EGF_CAcoord: 2015..2052
e-value: 2.06127E-8
score: 51.8686
NoneNo IPR availableCDDcd11304Cadherin_repeatcoord: 1577..1626
e-value: 1.22834E-8
score: 53.8566
NoneNo IPR availableSUPERFAMILY57196EGF/Laminincoord: 1956..1996
IPR032675Leucine-rich repeat domain, L domain-likeGENE3D3.80.10.10coord: 256..401
e-value: 2.7E-25
score: 88.2
IPR032675Leucine-rich repeat domain, L domain-likeGENE3D3.80.10.10coord: 640..763
e-value: 0.018
score: 13.1
IPR032675Leucine-rich repeat domain, L domain-likeGENE3D3.80.10.10coord: 764..1034
e-value: 7.7E-30
score: 103.0
IPR032675Leucine-rich repeat domain, L domain-likeGENE3D3.80.10.10coord: 402..639
e-value: 3.0E-33
score: 114.2
IPR032675Leucine-rich repeat domain, L domain-likeGENE3D3.80.10.10coord: 73..255
e-value: 3.2E-26
score: 91.1
IPR032675Leucine-rich repeat domain, L domain-likeSUPERFAMILY52058L domain-likecoord: 411..632
IPR032675Leucine-rich repeat domain, L domain-likeSUPERFAMILY52058L domain-likecoord: 89..388
IPR032675Leucine-rich repeat domain, L domain-likeSUPERFAMILY52058L domain-likecoord: 770..967
IPR032675Leucine-rich repeat domain, L domain-likeSUPERFAMILY52058L domain-likecoord: 614..772
IPR001611Leucine-rich repeatPFAMPF13855LRR_8coord: 543..602
e-value: 3.5E-10
score: 39.4
coord: 105..162
e-value: 2.1E-6
score: 27.3
coord: 251..308
e-value: 1.4E-10
score: 40.7
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 418..439
score: 5.163
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 591..612
score: 5.34
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 175..196
score: 4.732
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 812..833
score: 7.312
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 908..929
score: 6.387
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 497..518
score: 5.918
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 273..294
score: 7.196
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 151..173
score: 5.409
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 347..368
score: 4.67
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 222..243
score: 6.018
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 521..542
score: 6.996
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 934..955
score: 6.18
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 249..270
score: 5.979
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 860..883
score: 5.386
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 297..318
score: 4.978
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 665..686
score: 5.055
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 615..636
score: 4.678
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 129..150
score: 5.687
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 373..394
score: 7.758
IPR001611Leucine-rich repeatPROSITEPS51450LRRcoord: 567..588
score: 7.981
IPR024731EGF domainPFAMPF12947EGF_3coord: 2063..2099
e-value: 2.7E-8
score: 33.8
IPR026906Leucine rich repeat 5PFAMPF13306LRR_5coord: 705..835
e-value: 1.0E-5
score: 25.4
coord: 860..974
e-value: 0.0022
score: 17.8
IPR013032EGF-like, conserved sitePROSITEPS01186EGF_2coord: 2086..2099
IPR018097EGF-like calcium-binding, conserved sitePROSITEPS01187EGF_CAcoord: 2015..2042
IPR015919Cadherin-likeSUPERFAMILY49313Cadherin-likecoord: 1539..1630

Blast
BLAST of TCONS_00016953-protein vs. Swiss-Prot (Human)
Match: TSP4 (Thrombospondin-4 OS=Homo sapiens GN=THBS4 PE=1 SV=2)

HSP 1 Score: 182.956 bits (463), Expect = 1.808e-46
Identity = 95/200 (47.50%), Postives = 116/200 (58.00%), Query Frame = 0
Query: 1924 CDRNPCYRSVSCFNTGESGASFQCGKCPNGYSGDGTSCQDVDECTNNNVCYNSACINMSPGFKCAGCPPGFTGNAPHGVGGEVVHE-RQFCRDINECENPELHSCDPNAECINTLGSYRCGKCKPGYVGDGYLGCKSGDYCATGQHN-CHINATCIPLGSRKFTCVCKEGYAGNGQICGLDHDYDGVTSFGAACIENSCK 2121
            CD NPC+R V C ++ +    FQCG CP GY+G+G +C DVDEC  +       CIN+SPGF+C  CP GFTG    GVG       +Q C DI+EC N    +C PN+ C+NTLGSYRCG CKPGY GD   GCK+   C   + N C +NA CI       TCVC  G+AG+G ICG D D D        C   +CK
Sbjct:  290 CDSNPCFRGVQCTDSRDG---FQCGPCPEGYTGNGITCIDVDECKYHPCYPGVHCINLSPGFRCDACPVGFTGPMVQGVGISFAKSNKQVCTDIDECRN---GACVPNSICVNTLGSYRCGPCKPGYTGDQIRGCKAERNCRNPELNPCSVNAQCIEERQGDVTCVCGVGWAGDGYICGKDVDIDSYPDEELPCSARNCK 483          
The following BLAST results are available for this feature:
BLAST of TCONS_00016953-protein vs. Swiss-Prot (Human)
Analysis Date: 2018-01-31 (Blastp Clytia hemisphaerica v1.0 proteins vs SwissProt (Homo sapiens))
Total hits: 1
Match NameE-valueIdentityDescription
TSP41.808e-4647.50Thrombospondin-4 OS=Homo sapiens GN=THBS4 PE=1 SV=... [more]
back to top