ZNF292
Conservation




A comparison of the canonical amino acid sequences from Homo sapiens, Pan troglodytes, Mus musculus, Rattus norvegicus, Erinaceus europaeus and Xenopus laevis shows ZNF292 to be highly conserved across species. In particular, the zinc finger sequences and their relative positions are very well conserved, though many of the intervening sequences are not. Much of the N-terminal region up to the first zinc finger is also highly conserved. The C-terminal region after the last zinc finger is slightly less well conserved.

H. sapiens to    Full length    N-Terminal    Zinc Fingers    C-Terminal
p.t.             99             99            99              99
r.n.             80             92            79              75
m.m.             79             91            78              71
e.e.             86             95            85              82
x.l.             37             73            39              38
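As a rough illustration of how figures like those in the table can be derived, the sketch below computes percent identity between two rows of an existing alignment. The function name `percent_identity` and the choice to skip gapped columns are assumptions made for this example; the page does not state which convention was used for the table. The two sequences are the human and mouse rows for positions 49-98 of the ClustalW alignment further down this page.

```python
def percent_identity(a, b, gap="-"):
    """Percent of identical residues over columns where neither row has a gap.

    Assumption: gapped columns are ignored. Other tools divide by the
    alignment length or by the shorter sequence instead.
    """
    if len(a) != len(b):
        raise ValueError("aligned rows must have the same length")
    pairs = [(x, y) for x, y in zip(a, b) if x != gap and y != gap]
    if not pairs:
        return 0.0
    matches = sum(x == y for x, y in pairs)
    return 100.0 * matches / len(pairs)


# Positions 49-98 of the alignment shown later on this page.
human = "DYCQQLCQTLLEYAEKWKTSEDPLPLLEVYTVAIQSYVKARPYLTSECEN"
mouse = "AYCRQLCQTLLEYAEKWKTSEDPLPLLEVYTVAIQSYVKARPYLTSECES"

print(percent_identity(human, mouse))  # 47 of 50 columns match -> 94.0
```

Applied region by region (N-terminal, zinc fingers, C-terminal), the same calculation would give one value per column of the table.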


Conservation of Key Domains

Zinc Fingers

In general, the zinc fingers are highly conserved in sequence and relative position. When the zinc finger sequences vary, they often still have predicted position weight matrices (PWMs) that correlate well across species. The PWMs were predicted using zf.princeton.edu. The spacings between ZF1 and ZF2 and among ZF4-ZF7 are consistent with DNA-binding modules in which the zinc fingers would bind adjacent trinucleotides. The spacers between ZF2 and ZF3 and between ZF3 and ZF4 are consistent in length and very well conserved, but also much larger than would be expected for binding to adjacent nucleotide sequences.

By contrast, the sequences between ZF7-ZF8, ZF8-ZF9 and ZF11-ZF12 are quite variable in length and poorly conserved across species. The spacing between ZF9 and ZF10 is better maintained across species, but the sequence itself is highly variable. The spacings between ZF12-ZF16 are also well conserved, but their sizes are not consistent with canonical DNA-binding zinc finger modules. The spacer sequences are only moderately well conserved.
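The spacing argument can be made concrete with a small sketch that locates C2H2-like motifs and measures the linker between them. The regular expression is a simplified, assumed pattern (C-X2-4-C-X11-12-H-X3-5-H), not the method used by SMART or zf.princeton.edu, and the helper names are hypothetical. The segment is the human ZF1/ZF2 region copied from the alignment on this page.

```python
import re

# Assumed, simplified C2H2 pattern; real predictors use profile models.
C2H2 = re.compile(r"C.{2,4}C.{11,12}H.{3,5}H")


def find_fingers(seq):
    """Return (start, end) spans of non-overlapping C2H2-like matches."""
    return [(m.start(), m.end()) for m in C2H2.finditer(seq)]


def linker_lengths(fingers):
    """Residues between the last His of one finger and the first Cys of the next."""
    return [nxt[0] - prev[1] for prev, nxt in zip(fingers, fingers[1:])]


# Human ZF1/ZF2 region from the alignment on this page.
segment = "AYMQYCVLCDKEFLGHRIVRHAQKHYKDGIYSCPICAKNFNSKETFVPHVTLHVKQ"

fingers = find_fingers(segment)
print(fingers)                  # [(5, 25), (32, 53)]
print(linker_lengths(fingers))  # [7]
```

A linker of about seven residues between the last histidine of one finger and the first cysteine of the next is what tandem C2H2 arrays that read adjacent trinucleotides typically show, so the ZF1-ZF2 spacing fits that picture. Running the same measurement across the full sequence would show the much longer spacers described above.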

It is interesting to note that ZF9, the only C2HC zinc finger motif, is preceded by a highly conserved stretch of ~40 aa. Perhaps it is part of a larger conserved domain.

For more details on ZNF292 zinc fingers see Playing with Fingers.

Zinc finger similarity to h.s., prediction scores and variant counts

Zinc Finger   p.t.   r.n./m.m./e.e.   x.l.   zf.princeton score   SMART E-Value   gnomAD Control missense   in α:in β   DNA binding
1             1.0    0.98             0.98   2.2                  55.4            1                         0:1         0
2             1.0    0.97             0.85   18.5                 0.101           3                         2:1         2
3             1.0    1.0              0.93   23.5                 0.352           5                         2:3         1
4             1.0    0.99             0.96   17.5                 0.153           5                         3:2         2
5             1.0    0.985            0.91   8.9                  1.23            3                         1:2         0
6             1.0    0.97             0.87   15.9                 10.8            2                         2:0         0
7             1.0    0.88             0.74   29.8                 0.000369        5                         4:1         1
8             1.0    1.0              0.89   21                   0.0767          3                         2:1         0
9             1.0    1.0              0.93   11.7                 193             1                         1:0         1
10            1.0    0.97             0.82   10.9                 0.044           1                         1:0         1
11            1.0    0.99             0.93   27.9                 0.0472          3                         2:1         1
12            1.0    1.0              0.96   16.3                 0.498           5                         1:4         1
13            1.0    1.0              0.72   15.0                 0.164           9                         5:4         2
14            1.0    0.90             0.58   10.2                 3.72            6                         5:1         1
15            1.0    1.0              0.99   23                   0.0118          5                         2:3         1
16            1.0    1.0              0.90   7.6                  0.286           3                         1:2         0

Entropies and PWM Correlations (four values per zinc finger, in the order listed)

              Entropies                         PWM Correlations
Zinc Finger   1       2       3       mean      1       2        3       mean
1             0.149   0.341   0.833   0.441     0.880   0.6281   0.578   0.695
2             1.827   1.65    1.994   1.824     1       1        1       1
3             1.476   0.947   1.721   1.381     0.675   0.644    0.729   0.683
4             1.436   1.662   1.479   1.526     0.919   0.840    0.916   0.892
5             1.299   1.241   1.763   1.434     1       1        1       1
6             1.567   0.665   0.711   0.981     1       1        1       1
7             1.519   1.98    1.96    1.843     1       1        1       1
8             1.387   1.612   0.846   1.282     1       1        1       1
9             1.868   1.722   1.862   1.817     1       1        1       1
10            1.812   1.578   1.726   1.705     0.822   0.806    0.914   0.847
11            1.652   1.781   1.614   1.682     0.905   0.877    0.812   0.865
12            1.615   1.872   1.521   1.669     0.769   0.678    0.728   0.725
13            1.555   1.011   1.649   1.405     0.928   0.988    0.939   0.951
14            1.391   1.179   1.219   1.263     1       1        1       1
15            1.788   1.181   1.808   1.592     0.766   0.721    0.933   0.807
16            0.175   1.23    0.461   0.622     1       1        1       1
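For readers unfamiliar with the entropy and PWM correlation rows, the sketch below shows one common way such numbers can be produced from a position weight matrix. The page does not state how its values were computed, so treating the entropy as Shannon entropy in bits and the correlation as a per-position Pearson correlation is an assumption, and the two small PWMs are invented purely for illustration. The helper `summarize` reproduces the pattern visible in the data, where each group of four values looks like three values followed by their mean.

```python
import math


def column_entropy(column):
    """Shannon entropy in bits of one PWM column (probabilities of A, C, G, T)."""
    return -sum(p * math.log2(p) for p in column if p > 0)


def pearson(x, y):
    """Pearson correlation of two equal-length vectors (undefined for flat vectors)."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def summarize(values):
    """Per-position values followed by their mean, rounded to 3 decimals."""
    return [round(v, 3) for v in values] + [round(sum(values) / len(values), 3)]


# Hypothetical PWMs for one finger in two species (rows = positions, columns = A, C, G, T).
pwm_a = [[0.70, 0.10, 0.10, 0.10], [0.40, 0.30, 0.20, 0.10], [0.05, 0.05, 0.85, 0.05]]
pwm_b = [[0.60, 0.20, 0.10, 0.10], [0.10, 0.40, 0.40, 0.10], [0.05, 0.05, 0.85, 0.05]]

print(summarize([column_entropy(col) for col in pwm_a]))
print(summarize([pearson(a, b) for a, b in zip(pwm_a, pwm_b)]))

# The first group of PWM correlations on this page follows the same pattern:
print(summarize([0.880, 0.6281, 0.578])[-1])  # 0.695
```

Under this reading, an entropy near 2 bits marks a position with almost no base preference, and a value near 0 marks a position that strongly prefers one base.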


Coiled-Coils

The putative coiled-coil region after the last zinc finger predicted by MultiCoil2 is absent in Mus musculus, Rattus norvegicus and Erinaceus europaeus, and very weak in Xenopus laevis. It is present in many other mammals, such as Aotus nancymaae, Felis silvestris and Leporidae. It is not universally found in primates either: for example, the potential coiled-coil in Tarsius syrichta (the Philippine tarsier) has a score of only 0.025. The inconsistent presence of the coiled-coil region across species calls into question its functionality.

UniProt predicts a coiled coil at 1827-1854 that is not predicted by MultiCoil2. UniProt’s documentation says that COILS by Lupas, with the window length set to 28, is used for predictions, but an independent COILS run predicts only a weak coiled coil in this area, at window length 21, and a stronger one at 2523-2550, which overlaps both the MultiCoil2 prediction and the COILS window-length-28 prediction.

Low Complexity Regions

The locations of low complexity regions align closely across species, the most prominent being in the N-terminal region and between zinc fingers 2 and 3. Most are conserved across mammals.
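To give a feel for what "low complexity" means here, the sketch below scores sliding windows by the Shannon entropy of their amino acid composition. This is only an illustration of the idea: the window size of 16 and the 2.5-bit cutoff are arbitrary assumptions, and the page does not say which tool produced its LCR annotations. The test sequence is the human row for positions 647-696 of the alignment on this page, which carries an lcr label.

```python
import math
from collections import Counter


def shannon_entropy(window):
    """Shannon entropy in bits of the residue composition of a window."""
    counts = Counter(window)
    n = len(window)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def low_complexity_windows(seq, size=16, threshold=2.5):
    """Start positions (0-based) of windows whose entropy is at or below the cutoff.

    Both `size` and `threshold` are illustrative assumptions.
    """
    return [
        i
        for i in range(len(seq) - size + 1)
        if shannon_entropy(seq[i:i + size]) <= threshold
    ]


# Human sequence, positions 647-696 of the alignment on this page.
seq = "VFNDNDGSDDENDDKDKSYEPEVIPVQKPVPVNEFNCPVTFCKKGFKYFK"

acidic_run = seq[2:18]     # NDNDGSDDENDDKDKS
finger_start = seq[34:50]  # FNCPVTFCKKGFKYFK

print(round(shannon_entropy(acidic_run), 3))
print(round(shannon_entropy(finger_start), 3))
print(low_complexity_windows(seq))
```

The D/N-rich run between zinc fingers 2 and 3 scores lower than the start of ZF3, which is the kind of contrast an LCR annotation captures.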

Nuclear Localization Signals

All species studied have clear NLS sequences as predicted by cNLS Mapper (score 7.8 or greater). The strongest bipartite sequences appear before the first zinc finger, approximately 427-430 aa from the N-terminus of the canonical sequence. A second strong bipartite sequence appears between zinc fingers 15 and 16, placing it much closer to the C-terminus.


Multi-species Sequence Alignments

57 Species Alignment from Clustal Omega and MVIEW

The amino acid sequences from 57 species with intact N-terminal domains were aligned with Clustal Omega and passed through MVIEW. Only the human sequence is shown here, together with the consensus rows. Full MVIEW output
Consensus key: =100% =90% =80% =70% =<%70 blank=no match -=insert Variation key: =1 =1…10 =10…100 >100 blank=none h.s. -------------------------------------------------------------------------------- 100% ................................................................................ 90% ................................................................................ 80% ................................................................................ 70% ................................................................................ h.s. -------------------------------------------------------------------------------- 100% ................................................................................ 90% ................................................................................ 80% ................................................................................ 70% ................................................................................ LCR cons ----------------------------------------- vars ----------------------------------------- h.s. ------------------------------------MADEEAEQERLSCGEGGCVAELQRLGERLQELELQLRES----- 100% ....................................Mt-.pA..........s.........Epl.pht...ppt..... 90% ....................................MAD-EAEpEphu...GGshhELpRLtEpLpELEptLcES..... 80% ....................................MADEEAEQERLStttGGCsAELQRLGERLQELERpLRES..... 70% ....................................MADEEAEQERLSpGtGGClAELQRLGERLQELERQLRES..... LCR cons ---- vars ---- h.s. ----RVPAVEAATDYCQQLCQTLLEYAEKWKTSEDPLPLLEVYTVAIQSYVKARPYLTSECENVALVLERLALSCVELLL 100% .........psus.ah.phh.TLh.Yut+WKh.-Ds.sLlEVYTsAl.SaspstPaLoSpCEpVshVLERLsLSChpLLL 90% ....t.PAVpAAo-YCpQLCQTLLEYAEKWKTSEDPLPLLEVYTVAIpSYVKARPYLTSECENVAhVLERLALSCVELLL 80% ....RsPAVEAAT-YCQQLCQTLLEYAEKWKTSEDPLPLLEVYTVAIQSYVKARPYLTSECENVALVLERLALSCVELLL 70% ....RVPAVEAAT-YCQQLCQTLLEYAEKWKTSEDPLPLLEVYTVAIQSYVKARPYLTSECENVALVLERLALSCVELLL cons vars h.s. 
CLPVELSDKQWEQFQTLVQVAHEKLMENGSCELHFLATLAQETGVWKNPVLCTILSQEPLDKDKVNEFLAFEGPILLDMR 100% sL..-lspt.WpphQs.lphAp..L.ppGs.pLphL..lspEpGsWtNshL..Ihs.p..D..p.....s.cGshLL-MR 90% CLPlELs-ppWEpFQshVQVAHcpLMENGSsELphLuTLuQEoGVWKNPVLssILSQEPLDp-KVNEFLshEGPlLLDMR 80% CLPVELSDKQWEQFQoLVQVAHEKLMENGSCELHFLATLAQETGVWKNPVLCTILSQEPLDKDKVNEFLAFEGPILLDMR 70% CLPVELSDKQWEQFQoLVQVAHEKLMENGSCELHFLATLAQETGVWKNPVLCTILSQEPLDKDKVNEFLAFEGPILLDMR cons vars h.s. IKHLIKTNQLSQATALAKLCSDHPEIGIKGSFKQTYLVCLCTSSPNGKLIEEISEVDCKDALEMICNLESEGDEKSALVL 100% lKpLhK.tpl.pAs.LA+hCutH.Ehu.pGtFpQhYLsCLssssPp.hhhpElutVDC+DAL-MICNlES-GDEK.uh.L 90% IKHLlKspQLsQATALAKLCSDHPEIusKGsFKQTYLVCLCouSPNEKLhEEIuEVDCKDALEMICNLES-GDEKsALlL 80% IKHLIKTpQLSQATALAKLCSDHPEIGoKGSFKQTYLVCLCTSSPNEKLIEEISEVDCKDALEMICNLESEGDEKSALVL 70% IKHLIKTsQLSQATALAKLCSDHPEIGTKGSFKQTYLVCLCTSSPNEKLIEEISEVDCKDALEMICNLESEGDEKSALVL cons vars h.s. CTAFLSRQLQQGDMYCAWELTLFWSKLQQRVEPSIQVYLERCRQLSLLTKTVYHIFFLIKVINSETEGAGLATCIELCVK 100% CsuFLoRQL..G-MYCAWELTLFWSKL.pRh-sShQlaL-+CRQhSlLsKTVYHIhFhIKVlpSEh-ssGLssCIEhCl+ 90% CsAFLSRQLQQG-MYCAWELTLFWSKLQQRVEPSlQVYLERCRQLSLLTKTVYHIFFLIKVINSEhEGAGLATCIELCVK 80% CTAFLSRQLQQGDMYCAWELTLFWSKLQQRVEPSIQVYLERCRQLSLLTKTVYHIFFLIKVINSETEGAGLATCIELCVK 70% CTAFLSRQLQQGDMYCAWELTLFWSKLQQRVEPSIQVYLERCRQLSLLTKTVYHIFFLIKVINSETEGAGLATCIELCVK NLS cons vars h.s. ALRLESTENTEVKISICKTISCLLPDDLEVKRACQLSEFLIEPTVDAYYAVEMLYNQPDQKYDEENLPIPNSLRCELLLV 100% AL+h-stEsspsKholCKTlSCLLPpDLEVKRACQLoEFLlEPTVDuYYAVEhLaNpPDQKh-EEsLPlPNSLRCELLLV 90% ALRLESoENs-VKISICKTISCLLPDDLEVKRACQLSEFLlEPTVDAYYAVEMLYNQPDQKYDEENLPIPNSLRCELLLV 80% ALRLESTENTEVKISICKTISCLLPDDLEVKRACQLSEFLIEPTVDAYYAVEMLYNQPDQKYDEENLPIPNSLRCELLLV 70% ALRLESTENTEVKISICKTISCLLPDDLEVKRACQLSEFLIEPTVDAYYAVEMLYNQPDQKYDEENLPIPNSLRCELLLV cons - vars - h.s. 
LKTQWPFDPEFWDWKTLKRQCLALMGEEASIVSSIDELNDSEVYEKVVDYQEESKETSMNGLS-GGVGANSGLLKDIGDE 100% hKTpWPFDPEFWDWKTLKRpCLtLMGtEASIVSSIDELNDsEl.-t..-.pt..t...hs.........s........-- 90% LKTQWPFDPEFWDWKTLKRQCLALMGEEASIVSSIDELNDSEVYEK..DhQ--.K-TShN....ushstsouhLpshtDE 80% LKTQWPFDPEFWDWKTLKRQCLALMGEEASIVSSIDELNDSEVYEK.sDYQEEtKETShNGLs.GGlGssSuLL+DIsDE 70% LKTQWPFDPEFWDWKTLKRQCLALMGEEASIVSSIDELNDSEVYEKssDYQEEsKETShNGLS.GGlGANSGLLKDIsDE NLS ZF1 ZF2 cons vars h.s. KQKKREIKQLRERGFISARFRNWQAYMQYCVLCDKEFLGHRIVRHAQKHYKDGIYSCPICAKNFNSKETFVPHVTLHVKQ 100% +pK.+phKph+-tGalSARFRNWQAYMQYClLCDKEFLGHRIVRHAQpHhKsGhYsCPICAppasoK-.hVPHVs.HVKp 90% KQKK+EIKpLRERGFISARFRNWQAYMQYCVLCDKEFLGHRIVRHAQKHYKDGlYSCPICAcNFNSKEsFVPHVTLHVKQ 80% KQKKREIKpLRERGFISARFRNWQAYMQYCVLCDKEFLGHRIVRHAQKHYKDGIYSCPICAKNFNSKETFVPHVTLHVKQ 70% KQKKREIKQLRERGFISARFRNWQAYMQYCVLCDKEFLGHRIVRHAQKHYKDGIYSCPICAKNFNSKETFVPHVTLHVKQ LCR cons ---- vars ---- h.s. SSKERLAAMKPLRRLGRPPKITTTNENQKTN--TVAKQEQRPIKKNSLYSTDFIVFNDNDGSDDEND--DKDKSYEPEVI 100% SsKERLtsMKs.++lupssKhss...s.p.......p..pRPIhKsp.......VhNDsDhopsppt......s...... 90% SSKERLAAMKPLRRLGRPPKIssspENQKTN..sVsKQEQRPIKKNSLYSTDFIVFNDNDGSDDEsD..DKDKsYtPEll 80% SSKERLAAMKPLRRLGRPPKIoTssENQKTN..sVsKQEQRPIKKNSLYSTDFIVFNDNDGSDDEsD..DKDKSYEPEVI 70% SSKERLAAMKPLRRLGRPPKITToNENQKTN..sVuKQEQRPIKKNSLYSTDFIVFNDNDGSDDEND..DKDKSYEPEVI ZF3 ZF4 ZF5 cons vars h.s. PVQKPVPVNEFNCPVTFCKKGFKYFKNLIAHVKGHKDNEDAKRFLEMQSKKVICQYCRRHFVSVTHLNDHLQMHCGSKPY 100% .h.c...hpEasCPVt.C+KtFKYF+NLIAHs+uHKss--ApRFLEhQS+KVlCQYCRRpFVplsHLNDHLQhHCGspPY 90% PVQKPlPVNEFsCPVoFCKKGFKYFKNLIAHVKGHKDNE-AKRFLEMQSKKVICQYCRRHFVSVTHLNDHLQMHCGSKPY 80% PVQKPVPVNEFNCPVTFCKKGFKYFKNLIAHVKGHKDNEDAKRFLEMQSKKVICQYCRRHFVSVTHLNDHLQMHCGSKPY 70% PVQKPVPVNEFNCPVTFCKKGFKYFKNLIAHVKGHKDNEDAKRFLEMQSKKVICQYCRRHFVSVTHLNDHLQMHCGSKPY ZF6 ZF7 cons vars h.s. 
ICIQMKCKAGFNSYAELLTHRKEHQVFRAKCMFPKCGRIFSEAYLLYDHEAQHYNTYTCKFTGCGKVYRSQGELEKHLDD 100% ICIQhpCpAuF.o.s-LLsHR+EHphF+A+ChFPpCGRlFptAYhLaDHEAQHYpTaTC+..sCGKla+SQ.ph-hH.pt 90% ICIQMKCKAGFNSYAELLoHRKEHQVFRAKCMFPKCGRlFSEAYLLYDHEAQHYNTYTCKFTGCGKVYRSQsELEKHl-D 80% ICIQMKCKAGFNSYAELLTHRKEHQVFRAKCMFPKCGRIFSEAYLLYDHEAQHYNTYTCKFTGCGKVYRSQuELEKHLDD 70% ICIQMKCKAGFNSYAELLTHRKEHQVFRAKCMFPKCGRIFSEAYLLYDHEAQHYNTYTCKFTGCGKVYRSQuELEKHLDD cons vars h.s. HSTPPEKVLPPEAQLNSSGDSIQPSEVNQNTAENIEKERSMLPSENNIENSLLADRSDAWDKSKAESAVTKQDQISASEL 100% H....p......t.....t...................t.........t.s............................. 90% Hsp.PEps.ssEsphp.ss..hp..phs.sst.sh.cpt.h.sst.s.pssh..spstsWsp.+sEsssscps.hSsS.L 80% HsT.PEKVLsPEsQhssoGsslQPScsNpsst.shcKEpShLPSENshENoh.sDcSssWDKSKsESsVo+QsplSsSEL 70% HST.PEKVLPPEsQLNSSG-slQPScVNpNTttsscKEcShLPSENNIENol.sDRSsuWDKSKuESsVTKQDQISASEL cons ------------------------ vars ------------------------ h.s. R-QANGPLSNGLENPAT-TP-LLQSSEVAVSIKVSLNQGI-----------------EDNFGKQENSTVEGS----GEAL 100% ............................s..h...hpphh.................p.p.................... 90% t..sss..ssG.ppsss.s..h.psstsusshpsulNptl.................cssFsKptp.sspss....scsh 80% p.pssusLssGLENsss.ss.LLQusEVAVSIKVSLNQGI.................EDNFGKQENsslpGs....uEsL 70% R.QusGPLSNGLENsss.oP.LLQuSEVAVSIKVSLNQGI.................EDNFGKQENSoVEGo....GEuL cons ------------------------------------------- vars ------------------------------------------- h.s. VTDLHTPVEDTCNDLCHPGFQERK----------EQDCFND---------------------------------AHVTQN 100% ...h...h........h..................p...p........................................ 90% ssplpsss.t.sss.ChsshpE+K..........tpsChsp.................................sp.hps 80% VTsLposVtssCNDLC+suFQERK..........cpsCFN-.................................uQhsQN 70% VTsLHTPVEDsCNDLC+PGFQERK..........EQ-CFNE.................................AQlTQN cons ------------- vars ------------- h.s. 
SLVNSETLKIGDLTPQNLERQVNNLMTFSVQNQAAFQ--N-----------NLPTSKFECGDNVKTSSNLYNLPLKTLES 100% ........p...ht...ht.pls.h.sh..pp.......s...............sth...ts.ps..t.hsLsl.h... 90% ..ssS-sLK.tsLsspsLERQVsslhsFohQNQsuap..N...........slshsKhEhtsslKsusslYsLPLKTLES 80% oLVsSEsLKIsDLsPQNLERQVNsLMTFSlQNQAGFp..N...........sLPsuKaECussVKTSSsLYNLPLKTLES 70% SLVNSETLKIGDLTPQNLERQVNNLMTFSVQNQAGFQ..N...........sLPsSKFECGsNVKTSSsLYNLPLKTLES ZF8 cons - vars - h.s. IAFVPPQSDLSNSLGTPSVPPKAPVQKFSCQVEGCTRTYNSSQSIGKHMKTAHPDQYAAFKMQRKSKKGQKANNL-NTPN 100% ........s.......ssss.ts.hp+apCthEsCTR.YsS.pSlsKHhKsAHP-.YsthKht+Kspt..ts.......s 90% IsFlPsQss.ssslsssssP.pAPsQKFsCQVEGCTRTYNSSQSIGKHMKsAHPDQYAAFKhQRKsK+spKusNL.NsPs 80% IsFVPsQPs.SsSLGoPSVPPKAPVQKFSCQVEGCTRTYNSSQSIGKHMKTAHPDQYAAFKMQRKsKKGQKuNNL.NTPN 70% IsFVPPQPNLSsSLGTPSVPPKAPVQKFSCQVEGCTRTYNSSQSIGKHMKTAHPDQYAAFKMQRKsKKGQKuNNL.NTPN cons --------------------- vars --------------------- h.s. NGKFVYFLPSPVNSS-NP---FFTSQTKANG-----------NP----ACSAQLQHVSP-PIFPAHLASVSTP-LLSSME 100% ttp...hh....tt..t....hht.Q.pss............p........s...h..s..h.s.p.tsh..s.hls.ht 90% sGKhVYhLPS.Vsss.ss...hFTsQsKsss...........Ns....sCSsQlQHlSs.slFPsHLtslusP.lLsshE 80% NGKhVYFLPS.VsSSsNA...FFTsQTKAsG...........NP....sCSsQLQHVSP.slFPAHLssVSsP.LLsShE 70% NGKFVYFLPSQVsSSNNA...FFTsQTKANG...........NP....TCSsQLQHVSP.sIFPAHLsoVSsP.LLsShE cons -------------------------------------------------------------- vars -------------------------------------------------------------- h.s. SVINPNIT------------SQDKNEQG-GM------------------------------------------------- 100% s..s.s.......................th................................................. 90% SVhsPsls............opsKs-.t.uh................................................. 80% SVINPNIs............SQDKsEQs.uh................................................. 70% SVINPNIs............SQDKNEQG.Gh................................................. 
cons ----------------------------------------------------- vars ----------------------------------------------------- h.s. -----------------------------------------------------LCSQMENLPSTALPAQMEDLTKTVLPL 100% .....................................................hsu.htsh....ls..htslsp.hhP. 90% .....................................................lCSQMENLssssLPuQhEDLsKTVhPL 80% .....................................................LCSQMENLssTsLPAQMEDLTKTVLPL 70% .....................................................LCSQMENLssTsLPAQMEDLTKTVLPL cons ---- vars ---- h.s. NIDSGSDPFLPLPAESSSMSLFPSPADSGTNSVFSQL-ENN-TNHYSSQIEGNTNSSFLKGGNGENAVFPSQVNVANN-- 100% .h-s.pDPhhs...Ess..sh.s.......sssFsp...s..tp..s...tts...............F........... 90% NIDuGSDPFLPLPsEsushSLFPSPu-sssNSVFSQl.ENs.sNpasSQhEGNssSsF.Kttss-pslFsSpssssss.. 80% NIDSGSDPFLPLPAEoSSMSLFPSPADsGsNSVFSQL.ENN.TNHaSSQhEGNTNSoFLKGusuENslFPSQVssuss.. 70% NIDSGSDPFLPLPAESSSMSLFPSPADSGsNSVFSQL.ENN.TNHaSSQhEGNTNSSFLKGGNGENAVFPSQVNVAss.. ZF9 cons ---- vars ---- h.s. ----FSSTNAQQSAPEKVKKDRGRGPNGKERKPKHNKRAKWPAIIRDGKFICSRCYRAFTNPRSLGGHLSKRSYCKPLDG 100% ................p....ttps.p.p.++s+pspRsKhPAII+DGKFICsRCaR.FTsP+SLGGHLSKRuhCKPhpt 90% ....hstossQpsu.EKVKKDRGpGsNGKERKPKHNKRAKWPAIIRDGKFICSRCaRsFTNPRSLGGHLSKRSYCKPL-G 80% ....FsuTssQQSAsEKVKKDRGRGPNGKERKPKHNKRAKWPAIIRDGKFICSRCYRAFTNPRSLGGHLSKRSYCKPLDG 70% ....FsuTssQQSAPEKVKKDRGRGPNGKERKPKHNKRAKWPAIIRDGKFICSRCYRAFTNPRSLGGHLSKRSYCKPLDG cons -- vars -- h.s. AEIAQELLQSNGQPSLLASMILSTNAV-NLQQPQQSTFNPEACFKDPSFLQLLA-ENRSPAFLPNTFPRSGVTNFNTSVS 100% .-hs..h....h.....s.hl.Ssps..t.....t.shs.thsh+t.s.....t.p.p...h..shhsp........... 90% uEIutElLQsNGQsSLLASMILSosul.N.QQPppSsFsPtsCFKDPSFLQLLusENRs.sFL.shFPRssVosFsssss 80% AEIAQELLQsNGQPSLLASMILSTNAV.NlQQPQQSsFsPEACFKDPSFLQLLAuENRSssFLPsTFPRsuVosFsTuVS 70% AEIAQELLQsNGQPSLLASMILSTNAV.NLQQPQQSTFNPEACFKDPSFLQLLAAENRSssFLPNTFPRsuVTNFNTSVS cons ----- vars ----- h.s. 
QEGSEIIKQALETAGIPSTFEGAEMLSH-VST---GCVSDASQVNATVMPNPTVP-PLLHTVCHPNTLLTNQNRTSNSKT 100% pps.tllcps..sssh...hp..t...............ts...................ths..s...tpt........ 90% pEGscIIKQALETAGIPSTF-ss-hLsp.Vss...uClosss.lsuslhssssss.sLLpoVCpssshhTsQspT.NsKh 80% QEGSEIIKQALETAGIPSTFEuuEhLS+.Vss...uCVSDssQVNAsVhPNPsVP.PLLQTVCHPNsLLTsQNRTsNSKs 70% QEGSEIIKQALETAGIPSTFEGAEMLSH.VsT...GCVSDssQVNATVMPNPsVP.PLLQTVCHPNsLLTsQNRTsNSKT NLS cons ------ vars ------ h.s. SSIEE-CSSLPVFPTNDLLLKTVENGLCSSSFPNSGGPSQNF--TSNSSRVSVISGPQNTRSSHLN-KKGNSASKRR--K 100% ..............................s..ps....p.h....sssthSl.s..ps..s...p....ts..hh+... 90% sslpE.CpsLPlFssp-LhLKTlENGLCSsSasssss.sQsF..hsNSoRVSVISuPpNstuspLN.KKGsSuSK++..+ 80% sol-E.CsSLPVFssNDLLLKTVENGLCSuSFsNSsusSQNF..ssNSSRVSVISGPQNTRSSHLN.KKGNSASKRR..K 70% SSIEE.CsSLPVFPTNDLLLKTVENGLCSSSFPNSuGPSQNF..soNSSRVSVISGPQNTRSSHLN.KKGNSASKRR..K cons ------ vars ------ h.s. KVAPPLIAPNASQNLVTSD-LTTMGLIAKSVE----IPTTNLHSNVIPTCEPQSLVENLTQKL-NNVNNQLFMTDVKENF 100% ..s.s......s.t..stt.h.shs..spth.....h..t............p.hh.p.sp...sshpt.....shp.ph 90% KsssPllssNssQslssss.lsshGLlAKpl-....lsssshpssllssCpsQslVENLsQKL.sNlsNpLFhsslK-NF 80% KVsPPLIAPNuSQNLVsoD.LTshGLlAKslE....IPsoNl+SslIPsCEPQuLVENLTQKL.NNVsNQLFhTDVKENF 70% KVsPPLIAPNuSQNLVTSD.LTsMGLIAKSlE....IPTTNLHSNVIPsCEPQuLVENLTQKL.NNVsNQLFhTDVKENF cons ---------- vars ---------- h.s. KTSLESHTVL--APLTLKTENGDSQMMALNS--CTTSINSDLQISEDNVIQNFEKTLEIIKTAMNSQILEVKSG------ 100% ptth.st..................................................t...ts.soph..h.s....... 90% KsslEuHshL..ssLolKsENGDSQMMshNS..Cs..hNS-lQISEDNVhQNFEKTLEIIKoAMNSQhLEVKot...... 80% KTslEoHTlL..APLTLKTENGDSQMMALNS..CTsulNSDLQISEDNVIQNFEKTLEIIKoAMNSQILEVKoG...... 70% KTsLESHTVL..APLTLKTENGDSQMMALNS..CTsSlNSDLQISEDNVIQNFEKTLEIIKoAMNSQILEVKSG...... cons ------------------- vars ------------------- h.s. 
SQGAGETS-------QNAQINYNIQ------LPSVNTVQN----NKLPDSSPFSSFIS-VMPTKSNIPQSEVSHK-EDQI 100% .pt....s.........s..s.s................t....s..sp.s............p.....s....t...ph 90% .pshstso.......tpsphs.shQ......lsosNssts....sKLssssphusahs.lhssKss.s.s-h.pK.-sQl 80% SpGsGETS.......QNuQlNYNlQ......LPSVNolQN....NKLPDSoQFSSFlu.VhPsKsNlPpSEl.HK.EDQl 70% SQGsGETS.......QNAQINYNIQ......LPSVNoVQN....NKLPDSSQFSSFlu.VhPsKoNIPQSEl.HK.EDQI ZF10 cons --------------- vars --------------- h.s. QEILEGLQKLKLENDLSTPA------SQCVLINTSVTLTPTPVK--------STADITVIQPVSEMI-NIQFNDKVNKPF 100% .-I.thlppLpL.pp............t...........................s....p............cp..KPF 90% .EILEGLQ+LKLENDhs.ss......sQC..hsp.sshs..Phh........sh.shsllQ.s.p.h.pIQhs-+VNKPF 80% QEILEGLQKLKLENDLSsPs......sQClLlNTSVoLoPtPVK........shsslsllQPVSEhI.pIQFsD+VNKPF 70% QEILEGLQKLKLENDLSsPA......sQCVLINTSVTLTPTPVK........shsslTVVQPVSEMI.NIQFsD+VNKPF ZF11 cons vars h.s. VCQNQGCNYSAMTKDALFKHYGKIHQYTPEMILEIKKNQLKFAPFKCVVPTCTKTFTRNSNLRAHCQLVHHFTTEEMVKL 100% lCpp.sCsYpAMTKDALFKHhu+.H.YT.EhI.-IKKpphKaAPF+C....CsKTFTRNSNLRAHCQ.hHpFo.-pMlKh 90% VCQNpGCNYSAMTKDALFKHYGKlHQYTsEMILEIKKpQLKaAPFKCVVsTCsKTFTRNSNLRAHCQLVHHFToEEMVKL 80% VCQNQGCNYSAMTKDALFKHYGKIHQYTPEMILEIKKNQLKFAPFKCVVPTCTKTFTRNSNLRAHCQLVHHFTTEEMVKL 70% VCQNQGCNYSAMTKDALFKHYGKIHQYTPEMILEIKKNQLKFAPFKCVVPTCTKTFTRNSNLRAHCQLVHHFTTEEMVKL cons ----------------------------------------------- vars ----------------------------------------------- h.s. KIKRPYGRKSQSENVP-ASRST---Q------------VKKQLAMTEE-------------------------------N 100% KltRsas++s..p................................................................... 90% KIKRPYGRKSQsEs.s.ssp.s...p............lKp..shs.E...............................s 80% KIKRPYGRKSQsEN.s.usR.s...Q............VK+Q.shsEE...............................N 70% KIKRPYGRKSQSENlS.APR.T...Q............VK+QLshTEE...............................N cons -------------------------------------------------- vars -------------------------------------------------- h.s. 
KKE-------------------SQPALELRAE---------------T-QN-------------THSNVAVIPEKQLV-- 100% .......................................................................h.t...... 90% Kpt....................p.shplt...................pp.............s..phshlsEp.l... 80% KpE....................QPsl-Lts................p.ps.............shsNlAllPEKQLh.. 70% K+E....................QPALELtu................p.ps.............shuNlAVIPEKQLl.. cons ------------------- vars ------------------- h.s. --EKKSPDKTES------------SLQVITVT-SEQCNTNALTN--TQ-TKGRKIRRHKKEKEEKKRKKPVSQSL-EFPT 100% ...p..s.....................hs..........s.ts........p..tp..Kchp-h..............t 90% ..EKppP-+hEp............s.Qhlsls..EQhssts.ss..hp.sKsRKhRRp+KEKEE++c++Plopuh.EhPT 80% ..EKKSP-KhEs............S.QllTVo.sEQtNosuhoN..hQ.TKGRKlRRH+KEKEE+KRKKPVSpSl.EFPT 70% ..EKKSP-KsES............S.QVITVT.SEQsNTNuLTN..lQ.TKGRKlRRHKKEKEEKKRKKPVSpSL.EFPT ZF12 ZF13 cons -- vars -- h.s. RYSPYRPYRCVHQGCFAAFTIQQNLILHYQAVHKSDLPAFSAEVEEESEAGKE-SEE-TETKQTLKEFRCQVSDCSRIFQ 100% passY+PYpCVHpGChAAFTIQpNLILHYpAlHpS....F............-...E..t.t...pEFRC...sCSRIF. 90% RYSPYRPYRCVHQGCFAAFTIQQNLILHYQAVHKS-LPsFSAEVEEEsEssK-.pEE.hETK.sh+EFRCpVSDCSRIFQ 80% RYSPYRPYRCVHQGCFAAFTIQQNLILHYQAVHKSDLPAFSAEVEEESEAGKE.SEE.hETKQTlKEFRCQVSDCSRIFQ 70% RYSPYRPYRCVHQGCFAAFTIQQNLILHYQAVHKSDLPAFSAEVEEESEAGKE.SEE.hETKQTlKEFRCQVSDCSRIFQ ZF14 ZF15 cons vars h.s. AITGLIQHYMKLHEMTPEEIESMTASVDVGKFPCDQLECKSSFTTYLNYVVHLEADHGIGLRASKTEEDGVYKCDCEGCD 100% tlsuLlQHYhphHph..-phtsh.s..phGpF.CDQ.pCt..Fhs.hpYl.HlE..Ht.t......t.-tha+C-sEGCD 90% tlTuLIQHYMKLHEMoPE-ItoMpuul-lG+FsCDQ.pCKSSFTsYLsYllHLEsDHGlth+ssKsE-DGlaKCDCEGCD 80% AITGLIQHYMKLHEMTPEEIESMTAuVDVGKFPCDQ.ECKSSFTTYLNYVVHLEsDHGIGhRsSKsEEDGlYKCDCEGCD 70% AITGLIQHYMKLHEMTPEEIESMTASVDVGKFPCDQLECKSSFTTYLNYVVHLEADHGIGhRuSKTEEDGlYKCDCEGCD NLS LCR cons ------ vars ------ h.s. 
RIYATRSNLLRHIFNKHNDKHKAHLIRPRRLTPG-QENMSSKANQEKS-KSK-HRGTKHSRCGKEGIKMP-KTKRKK--K 100% +hYATRSNLLRHhhpKHpD.aK.pLhp.R+...t..-phs....tccs...p........t.t................. 90% RIYATRSNLLRHIFNKHND+HKsHLIRPR+LTsG.QENhSSKANQEKs.KuK..RGhK..RsG+EG.+hs.KsKRKK..p 80% RIYATRSNLLRHIFNKHNDKHKAHLIRPRRLTPG.QENhSSKANQEKo.KSK.aRGhKa.RsGKEGhKhs.KTKRKK..K 70% RIYATRSNLLRHIFNKHNDKHKAHLIRPRRLTPG.QENhSSKANQEKo.KSK.HRGTKH.RsGKEGhKMP.KTKRKK..K ZF16 COIL cons vars h.s. NNLENKNAKIVQIEENKPYSLKRGKHVYSIKARNDALSECTSRFVTQYPCMIKGCTSVVTSESNIIRHYKCHKLSKAFTS 100% ....ptp.c......pp.h.hKhG+....lKshptA.s.Copph.hQYPCMl+sCpoVVoSEpsIh+HYhpHtLuttah. 90% .NLEsKsuKhhQlpENKsYSLKRGKHVYSIKARNDALSECTS+FlTQYPCMIKGCoSVVTSESNIIRHYKCHKLSKAFTS 80% sNLENKsAKIVQIEENKPYSLKRGKHVYSIKARNDALSECTSRFVTQYPCMIKGCTSVVTSESNIIRHYKCHKLSKAFTS 70% sNLENKsAKIVQIEENKPYSLKRGKHVYSIKARNDALSECTSRFVTQYPCMIKGCTSVVTSESNIIRHYKCHKLSKAFTS cons ---------- vars ---------- h.s. QHRNLLIVFKRCCNSQVKET-SEQE-------GA-KNDVKDSDTCVS-ESNDNSRTTATVSQKEVEKNEKDEMDELTELF 100% ppps.h.h.Kphs..p.p...............t.h...pt.....................pt....cKD.hsE.sEh. 90% QHRNLLIV.K+psssphK-s..EQE.......st.Ks-hKps-ssl..pssss....sshsQpE.tKsEKDEhDELTELF 80% QHRNLLIVFK+CssSQlK-s.oEQE.......ss.KsDVKsSDsslo.pssDsStp.sslsQ+EsEKsEKDEhDELTELF 70% QHRNLLIVFKRCCNSQlKET.SEQE.......ss.KsDVK-SDoslS.EoNDNStT.uTVsQKEsEKNEKDEMDELTELF NLS cons -- vars -- h.s. ITKLINEDST-SVETQANTSSNVSNDFQEDNLCQSERQKA-SNLKRVNKEKNVSQNKKRKVEKAEPASAAELSSVRKEEE 100% .sK..s-Dss............h.tp...p..t.s...t.....pch.pppt...........s....s...tthptt.. 90% ITKLINEDss.ssEsQs.ooSslspDhQEssss.sE+QKs.sNLKRssKEKslsQsK+R+h-KsE...ssthsshp+EEE 80% ITKLINEDso.osETQA.TSSsVssDFQEsNsCQSE+QKs.uNLKRVNKEKNVSQNKKRKlEKsEss.ss-lSSh+KEEE 70% ITKLINEDsT.SVETQApTSSNVSNDFQEsNPCQSE+QKu.SNLKRVNKEKNVSQNKKRKVEKAEPssAsELSSs+KEEE cons ------- vars ------- h.s. TA---VAIQTIEEHPASFDWSSFKPMGFEVSFLKFLEESAVKQKKNTDKDHPNTGNKKGSHSNSRKNIDKTAVT-SG--- 100% .....ht..s.ppp..shshSoFKPMGFEsSFLKFLEtSs.p.pcp...c...st..ht...........h.h....... 
90% TA...VulQTsEEpPuoFDWSSFKPMGFEVSFLKFLEESAVKQKKs.-+Da.soGsK+GSHSsuR+s.-KTuls.ss... 80% TA...VAIQTTEEHPASFDWSSFKPMGFEVSFLKFLEESAVKQKKN.D+DHssoGsK+GSHSNSRKshDKTAVo.SG... 70% TA...VAIQTTEEHPASFDWSSFKPMGFEVSFLKFLEESAVKQKKNoDKDHPNoGsKKGSHSNSRKNhDKTAVT.SG... cons --- vars --- h.s. --NHVCPCKESETFVQFANPSQLQCSDNVKIVLDKNLKDCTELVLKQLQEMKPTVSLKKLEVHS-NDPDMSVMKDISIGK 100% ..ph....ttpp.hl.FtNP.ph.s.tslplVhspthpph.-hhlKQLpph+PhV.Lp+..................... 90% ..NhhhsCpEoEhaVpFANPSpLpCu-NVKIVLDKsLKcCoELVLKQLQEMKPsVSLpKLEsc..sss-hohhK.lshGp 80% ..NHlCsCKESETFVQFANPSQLQCSDNVKIVLDKsLKDCTELVLKQLQEMKPTVSLKKLEV+S.NDsDhSlhK-lShGK 70% ..NHlCsCKESETFVQFANPSQLQCSDNVKIVLDKsLKDCTELVLKQLQEMKPTVSLKKLEVHS.NDPDhSVhK-IShGK cons ---------------------------------- vars ---------------------------------- h.s. ATGRGQY---------------------------------- 100% ......................................... 90% tpGcup................................... 80% ATGRGQ................................... 70% ATGRGQY..................................

MVIEW consensus codes:
alcohol => o { S, T }
aliphatic => l { I, L, V }
aromatic => a { F, H, W, Y }
charged => c { D, E, H, K, R }
hydrophobic => h { A, C, F, G, H, I, K, L, M, R, T, V, W, Y }
negative => - { D, E }
polar => p { C, D, E, H, K, N, Q, R, S, T }
positive => + { H, K, R }
small => s { A, C, D, G, N, P, S, T, V }
tiny => u { A, G, S }
turnlike => t { A, C, D, E, G, H, K, N, Q, R, S, T }
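The 100%, 90%, 80% and 70% rows in the MVIEW output are threshold consensus lines. The sketch below shows the basic idea in a simplified form: a residue is reported for a column only if at least the given fraction of sequences share it. It deliberately ignores the property classes listed in the consensus codes above, so it is not a reimplementation of MVIEW, and the function name is hypothetical. The input is the first 12 columns of the six-sequence ClustalW alignment shown later on this page, not the 57-species data.

```python
from collections import Counter


def threshold_consensus(rows, threshold):
    """Simplified consensus: keep a residue if its column frequency >= threshold.

    Assumption: no property-class fallback, so columns below the
    threshold are written as '.'.
    """
    consensus = []
    for column in zip(*rows):
        residue, count = Counter(column).most_common(1)[0]
        if residue != "-" and count / len(column) >= threshold:
            consensus.append(residue)
        else:
            consensus.append(".")
    return "".join(consensus)


# First 12 alignment columns for h.s., p.t., m.m., r.n., e.e. and x.l.
rows = [
    "MADEEAEQERLS",
    "MADEEAEQERLS",
    "MADDEAEQERLS",
    "MADDEAEQERLS",
    "MADEEAEQERLN",
    "MADGEAERENAP",
]

print(threshold_consensus(rows, 1.0))  # MAD.EAE.E...
print(threshold_consensus(rows, 0.8))  # MAD.EAEQERL.
```

Lowering the threshold fills in more columns, which is why the 70% row in the MVIEW output is usually closer to the human sequence than the 100% row.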

Sequence Alignments for 5 species

The amino acid sequences from six species (H. sapiens and five comparison species) were aligned with ClustalW and annotated. Though the zinc fingers align and are highly similar, there are significant differences in low complexity regions, predicted coiled-coils and possible nuclear localization signals.

CLUSTAL 2.1 Multiple Sequence Alignments

Sequence type explicitly set to Protein
Sequence format is Pearson
Sequence 1: sp|O60281|ZN292_HUMAN            2723 aa
Sequence 2: tr|H2QTD2|H2QTD2_PANTR           2723 aa
Sequence 3: sp|Q9Z2U2|ZN292_MOUSE            2698 aa
Sequence 4: tr|A0A1L8G393|A0A1L8G393_XENLA   2670 aa
Sequence 5: tr|D3ZXZ1|D3ZXZ1_RAT             2706 aa
Sequence 6: tr|A0A1S3A3I8|A0A1S3A3I8_ERIEU   2725 aa

Sequences (1:2) Aligned. Score: 99.7062
Sequences (1:3) Aligned. Score: 79.5033
Sequences (1:4) Aligned. Score: 36.6667
Sequences (1:5) Aligned. Score: 80.5248
Sequences (1:6) Aligned. Score: 86.0815
Sequences (2:3) Aligned. Score: 79.5404
Sequences (2:4) Aligned. Score: 34.0449
Sequences (2:5) Aligned. Score: 80.5617
Sequences (2:6) Aligned. Score: 86.1183
Sequences (3:4) Aligned. Score: 39.176
Sequences (3:5) Aligned. Score: 92.55
Sequences (3:6) Aligned. Score: 56.9681
Sequences (4:5) Aligned. Score: 42.1348
Sequences (4:6) Aligned. Score: 35.3184
Sequences (5:6) Aligned. Score: 76.7923

There are 5 groups
Start of Multiple Alignment
Aligning...
Group 1: Sequences: 2 Score:42519
Group 2: Sequences: 2 Score:44862
Group 3: Sequences: 3 Score:41088
Group 4: Sequences: 5 Score:42460
Group 5: Sequences: 6 Score:19602
Alignment Score 175599

CLUSTAL 2.1 multiple sequence alignment 1 48 h.s. MADEEAEQERLSCGEGGCVAELQRLGERLQELELQLRE--SRVPAVEAAT lcr p.t. MADEEAEQERLSCGEGGCVAELQRLGERLQELERQLRE--SRVPAVEAAT lcr ********************************* ****--********** m.m. MADDEAEQERLS--GGGCAAELRRLGERLQELERRLCE--SREPAVEAAA lcr r.n. MADDEAEQERLS--GGSCAAELRRLGERLQELERRLCE--SREPAVEAAA lcr e.e. MADEEAEQERLNRGGGGCVAELQRLGERLQELERQLRE--SRVPAVEAAT lcr ***:*******. *.*.***:********** :* *--** ******: x.l. MADGEAERENAP-PGPDVIAEMPHLEESLRELERLLQEESSKGEAAQASS *** ***:*. . **: :* * *:*** * * *: *.:*:: 49 98 h.s. DYCQQLCQTLLEYAEKWKTSEDPLPLLEVYTVAIQSYVKARPYLTSECEN p.t. DYCQQLCQTLLEYAEKWKTSEDPLPLLEVYTVAIQSYVKARPYLTSECEN ************************************************** m.m. AYCRQLCQTLLEYAEKWKTSEDPLPLLEVYTVAIQSYVKARPYLTSECES r.n.
AYCRQLCQTLLEYAEKWKTSEDPLPLLEVYTVAIQSYVKARPYLTSECES e.e. EYCQQLCQTLLEYAEKWKTSEDPLPLLEVYTVAIQSYVKARPYLTSECEN **:*********************************************. x.l. EFCQRFCQTLFEYAEKWKAPEDSLSLLEVYTVAIESYSKARPYLTSECEN :*:::****:*******:.**.*.*********:** ***********. 99 148 h.s. VALVLERLALSCVELLLCLPVELSDKQWEQFQTLVQVAHEKLMENGSCEL lcr p.t. VALVLERLALSCVELLLCLPVELSDKQWEQFQTLVQVAHEKLMENGSCEL lcr ************************************************** m.m. VALVLERLALSCVELLLCLPVELSDKQWEQFQTLVQVAHETLMESGSCEL lcr r.n. VALVLERLALSCVELLLCLPVELSDKQWEQFQTLVQVAHETLMESGSCEL lcr e.e. VALVLERLALSCVELLLCLPVELSDKQWEQFQTLVQVAHEKLMENGSCEL lcr ****************************************.***.***** x.l. VALVLERLALSCVQLLLCLPHELPDDHWEKFQCSIKAAQMLLTENGSYEL lcr *************:****** **.*.:**:** ::.*: * *.** ** 149 198 h.s. HFLATLAQETGVWKNPVLCTILSQEPLDKDKVNEFLAFEGPILLDMRIKH p.t. HFLATLAQETGVWKNPVLCTILSQEPLDKDKVNEFLAFEGPILLDMRIKH ************************************************** m.m. QFLATLAQETGVWKNAVLSTILSQEPLDKEKVNEFLAFEGPILLDMRIKH r.n. HFLATLAQETGVWKNAVLSTILSQEPLDKEKVNEFLAFEGPILLDMRIKH e.e. HFLATLAQETGVWKNPVLCTILSQEPLDKDKVNEFLTFEGPILLDMRIKH :**************.**.**********:******:************* x.l. SILYILSQETGVWKNPVLTMIMNLDPLDQYQVDQFLVSEGSTLLEMRIKQ :* *:********.** *:. :***: :*::**. **. **:****: 199 248 h.s. LIKTNQLSQATALAKLCSDHPEIGIKGSFKQTYLVCLCTSSPNGKLIEEI p.t. LIKTNQLSQATALAKLCSDHPEIGIKGSFKQTYLVCLCTSSPNEKLIEEI ******************************************* ****** m.m. LIKTNQLSQATALAKLCSDHPEIGTKGSFKQTYLVCLCTSSPSEKLIEEI r.n. LIKTNQLSQATALAKLCSDHPEIGTKGSFKQTYLVCLCTSSPSEKLIEEI e.e. LIKTNQLSQATTLAKLCSNHPEIGTKGSFKQTYLVCLCTSSPNEKLIEEI ***********:******:***** *****************. ****** x.l. LLKLGKVASATSLAKLCSGHHEMSKKGNFTQLYLTCLCAASPNIKLIEEI *:* .:::.**:******.* *:. **.*.* **.***::**. ****** 249 298 h.s. SEVDCKDALEMICNLESEGDEKSALVLCTAFLSRQLQQGDMYCAWELTLF p.t. SEVDCKDALEMICNLESEGDEKSALVLCTAFLSRQLQQGDMYCAWELTLF ************************************************** m.m. 
SEVDCKDALEMICNLESEGDEKSALVLCTAFLSRQLQQGDMYCAWELTLF r.n. SEVDCKDALEMICNLESEGDEKSALVLCTAFLSRQLQQGDMYCAWELTLF e.e. SEVDCKDALEMICNLESEGDEKSALVLCTAFLSRQLQRGDMYCAWELTLF *************************************:************ x.l. AKVDCKDALDMICNLESEGDEKTSLILCAAFLSRQLQFGEMYCAWELTLF ::*******:************::*:**:******** *:********** 299 348 h.s. WSKLQQRVEPSIQVYLERCRQLSLLTKTVYHIFFLIKVINSETEGAGLAT p.t. WSKLQQRVEPSIQVYLERCRQLSLLTKTVYHIFFLIKVINSETEGAGLAT ************************************************** m.m. WSKLQQRVEPSVQVYLERCRQLSLLTKTVYHIFFLIKVINSETEGAGLAT r.n. WSKLQQRVEPSVQVYLERCRQLSLLTKTVYHIFFLIKVINSETEGAGLAT e.e. WSKLQQRVEPSVQVYLERCRQLSLLTKTVYHIFFLIKVINSETEGAGLAT ***********:************************************** x.l. WSKLQRRVDPSIQIYLERCRQLSLLTKTVYHIFFLIKVIQSETEGAGLPT *****:**:**:*:*************************:********.* 349 398 h.s. CIELCVKALRLESTENTEVKISICKTISCLLPDDLEVKRACQLSEFLIEP p.t. CIELCVKALRLESTENTEVKISICKTISCLLPDDLEVKRACQLSEFLIEP ************************************************** m.m. CIELCVKALRLESTENTEVKISICKTISCLLPEDLEVKRACQLSEFLIEP r.n. CIELCVKALRLESTENTEVKISICKTISCLLPEDLEVKRACQLSEFLIEP e.e. CIELCVKALRLESTENTEVKVSICKTISCLLPDDLEVKRACQLSEFLIEP ********************:***********:***************** x.l. CIELCVRALRLESSENAKVKISICKTISCLLPDDLEVKRACQLTEFLLEP ******:******:**::**:***********:**********:***:** 399 448 h.s. TVDAYYAVEMLYNQPDQKYDEENLPIPNSLRCELLLVLKTQWPFDPEFWD nls 7.8 p.t. TVDAYYAVEMLYNQPDQKYDEENLPIPNSLRCELLLVLKTQWPFDPEFWD nls 7.8 ************************************************** m.m. TVDAYYAVEMLYNQPDQKYDEENLPIPNSLRCELLLVLKTQWPFDPEFWD nls 7.8 r.n. TVDAYYAVEMLYNQPDQKYDEENLPIPNSLRCELLLVLKTQWPFDPEFWD nls 7.8 e.e. TVDAYYAVEMLYNQPDQKYDEENLPIPNSLRCELLLVLKTQWPFDPEFWD nls 7.8 ************************************************** x.l. TVDAYYAVEMLYNQPDQKYDEESLPVPNSLRCELLLVLKTRWPFDPEFWD nls 9.4 **********************.**:**************:********* 449 498 h.s. WKTLKRQCLALMGEEASIVSSIDELNDSEVYEKVVDYQEESKETSMNGLS nls p.t. 
WKTLKRQCLALMGEEASIVSSIDELNDSEVYEKVVDYQEESKETSMNGLS nls ************************************************** m.m. WKTLKRQCLALMGEEASIVSSIDELNDSEVYEKVDYQGERGDTSVNGLSA nls r.n. WKTLKRQCLALMGEEASIVSSIDELNDSEVYEKVDYQGERGDTSVNGLSA nls e.e. WKTLKRQCLALMGEEASIVSSIDELNDSEVYEKVADYQGDIKETSVNGLS nls ********************************** . : . : x.l. WKTLKRQCLALMGAEASIVSSIDELNDNEVYDQTDDYQEVTKISCLNGLD nls ************* *************.***::. . : . 499 548 h.s. GGVGANSGLLKDIGDEKQKKREIKQLRERGFISARFRNWQAYMQYCVLCD zf1 p.t. GGVGANSGLLKDIGDEKQKKREIKQLRERGFISARFRNWQAYMQYCVLCD zf1 ************************************************** zf1 m.m. AGLGTDSGLLMDTGDEKQKKKEIKELKDRGFISARFRNWQAYMQYCLLCD lcr/zf1 r.n. G-LGTDSSLLIDTGDEKQKKKEIKELKDRGFISARFRNWQAYMQYCLLCD lcr/zf1 e.e. GGIGANSGFLKDMCDDKQKK-EMKHLKEGGFISARFRNWQAYMQYCVLCD . :*::*.:* * *:**** *:*.*:: *****************:*** x.l. CFDNVNN----VEEEEKQKKKNIKKMRERGYVSARFRNWQAYMQYCVLCD lcr/zf1 ..:. ::**** ::*.::: *::**************:*** 549 598 h.s. KEFLGHRIVRHAQKHYKDGIYSCPICAKNFNSKETFVPHVTLHVKQSSKE zf1/zf2 p.t. KEFLGHRIVRHAQKHYKDGIYSCPICAKNFNSKETFVPHVTLHVKQSSKE zf1/zf2 ************************************************** zf1=1.0/zf2=1.0 m.m. KEFLGHRIVRHAQKHYKDGIYSCPICAKNFNSKDSFVPHVTLHVKQSSKE zf1/zf2 r.n. KEFLGHRIVRHAQKHYKDGVYSCPICAKNFNSKESFVPHVTLHVKQSSKE zf1/zf2 e.e. KEFLGHRIVRHAQKHYKDGVYSCPICAKNFNSKETFVPHVTLHVKQSSKE zf1/zf2 *******************:*************::*************** zf1=.98/zf2=.97 x.l. KEFLGHRIVRHAQKHFKDGIYSCPICAQQYSSKENFVPHVTFHVKQSCKE ***************:***:*******:::.**:.******:*****.** zf1=.98/zf2=.85 599 646 h.s. RLAAMKPLRRLGRPPKITTTNENQ--KTNTVAKQEQRPIKKNSLYSTDFI p.t. RLAAMKPLRRLGRPPKITTTNENQ--KTNTVAKQEQRPIKKNSLYSTDFI ************************--************************ m.m. RLAAMKPLRRLGRPPKITATHENQKTNINTVAKQEQRPIKKNSLYSTDFI r.n. RLAAMKPLRRLGRPPKITATHENQKTNTNTVAKQEQRPIKKNSLYSTDFI e.e. RLAAMKPLRRLGRPPKITTVNENQ--KINAVTKQEQRPIKKNSLYSTDFI ******************:.:*** : *:*:****************** x.l. 
RLETMKPLRRVGKPPKKAPTSKSK--KPVAVSKQE-RPIKKNSLYLDDFI lcr ** :******:*:*** :.. :.: : :*:*** ********* *** 647 696 h.s. VFNDNDGSDDENDDKDKSYEPEVIPVQKPVPVNEFNCPVTFCKKGFKYFK lcr/zf3 p.t. VFNDNDGSDDENDDKDKSYEPEVIPVQKPVPVNEFNCPVTFCKKGFKYFK lcr/zf3 ************************************************** m.m. VFNDNDGSDDENDDKDKSYEPEVIPVQKPVPVNEFNCPVTFCKKGFKYFK lcr/zf3 r.n. VFNDNDGSDDENDDKDKSYEPEVIPVQKPVPVNEFNCPVTFCKKGFKYFK lcr/zf3 e.e. VFNDNDGSDDENDDKDKSYEPEVIPVQKPVPVNEFNCPVSFCKKGFKYFK lcr/zf3 ***************************************:********** x.l. VFNDNDKSEDDEKDN----QPDIAQKEEQTLVNEFACPVHLCKKGFKYFK lcr/zf3 ****** *:*::.*: :*:: :: . **** *** :********* 697 746 h.s. NLIAHVKGHKDNEDAKRFLEMQSKKVICQYCRRHFVSVTHLNDHLQMHCG zf3/zf4 p.t. NLIAHVKGHKDNEDAKRFLEMQSKKVICQYCRRHFVSVTHLNDHLQMHCG zf3/zf4 ************************************************** zf3=1.0/zf4=1.0 m.m. NLIAHVKGHKDSEDAKRFLEMQSKKVICQYCRRHFVSVTHLNDHLQMHCG zf3/zf4 r.n. NLIAHVKGHKDSEDAKRFLEMQSKKVICQYCRRHFVSVTHLNDHLQMHCG zf3/zf4 e.e. NLIAHVKGHKDNEDAKRFLEMQSKKVICQYCRRHFVSVTHLNDHLQMHCG zf3/zf4 ***********.************************************** zf3=.99/zf4=1.0 x.l. NLIAHVRGHKGDEEATRFLEIQSKKVVCQYCRRQFVSLAHLNDHLQMHCG zf3/zf4 ******:***..*:*.****:*****:******:***::*********** zf3=.93/zf4=.96 747 796 h.s. SKPYICIQMKCKAGFNSYAELLTHRKEHQVFRAKCMFPKCGRIFSEAYLL zf5/zf6 p.t. SKPYICIQMKCKAGFNSYAELLTHRKEHQVFRAKCMFPKCGRIFSEAYLL zf5/zf6 ************************************************** zf5=1.0 m.m. SKPYICIQMKCKAGFNSYAELLAHRKEHQVFRAKCLFPKCGRIFSQAYLL zf5/zf6 r.n. SKPYICIQMKCKAGFNSYAELLAHRKEHQVFRAKCLFPKCGRIFSQAYLL zf5/zf6 e.e. SKPYICIQMKCKAGFNSYAELLTHRKEHQVFRAKCMFPKCGRIFSEAYLL zf5/zf6 **********************:************:*********:**** zf5=.985 x.l. DQPYICIQMKCKASFETYADLLSHRKEHRVFRARCMFPKCGRIFSAAYML zf5/zf6 .:***********.*::**:**:*****:****:*:********* **:* zf5=.91 797 846 h.s. YDHEAQHYNTYTCKFTGCGKVYRSQGELEKHLDDHSTPPEKVLPPEAQLN zf6/zf7 p.t. 
YDHEAQHYNTYTCKFTGCGKVYRSQGELEKHLDDHSTPPEKVLPPEAQLN zf6/zf7 ************************************************** zf6=1.0/zf7=1.0 m.m. YDHEAQHYNTYTCKFTGCGKVYRSQSEMEKHQDGHSHP-ETGLPPEDQLQ zf6/zf7 r.n. YDHEAQHYNTYTCKFTGCGKVYRSQSEMEKHLDDHSAP-EKVLPPEDQLT zf6/zf7 e.e. YDHEAQHYNTYTCKFTGCGKVYRSQSELEKHLDDHSTP-EKSLPPEDHFN zf6/zf7 *************************.*:*** *.** * *. **** :: zf6=.97/zf7=.88 x.l. FDHEAQHYNTFTCKYVGCGKIYHSQLQLEKHLSEHVPE---ETPLESKFN zf6/zf7 :*********:***:.****:*:** ::*** . * * * :: zf6=.87/zf7=.74 847 896 h.s. SSGDSIQPSEVNQNTAENIEKERSMLPSENNIENSLLADRSDAWDKSKAE p.t. SSGDSIQPSEVNQNTAENIEKERSMLPSENNIENSLLADRSDAWDKSKAE ************************************************** m.m. PSGNDVNPDSGATAAGG---------RSENSIDKNLGSNRSADWEKNRAE r.n. PSGNEVTQNSEGTTGEG---------RSEHSIEKSEGPDRSGDWEKNKAE e.e. SSGESVQPSKVNESTEGNTAKESPLLPSESSIENTLSADRNNSWDKSSTE lcr .**:.: .. ** .*::. .:*. *:*. :* x.l. TAHQQTAMLGIKHVMANKQPTSVCPVDNVNAASGNSVNERADEKDSNLSS .: :. . . . :* :.. :. 897 942 h.s. SAVTKQDQISASELRQANGPLSNGLENPATTPLLQSSE----VAVSIKVS p.t. SAVTKQDQISASELRQANGPLSNGLENPATTPLLQSSE----VAVSIKVS **************************************----******** m.m. PAVTKHGQISAAELRQANIPLSNGLETRDNTTVLRTNE----VAVSIKVS r.n. SAVTTHSQISASELRQADIPLSDGLGNPGDTTVLQTNE----VAVSIKVS e.e. SLVTKQDQISTSEFRQLDGPLSNGLENSVTTPLLQASE----VAVSIKVS . **.:.***::*:** : ***:** . *.:*::.*----******** x.l. AVLPQKEESLVSNGIQQTNLLDHHLPPTTEVLPFQSLLNQMLPSSSAILN . :. : : .:: * *.. * . ::: : * :. 943 978 h.s. LNQGIEDNFGKQENSTVEGSGEALVTDLHTPV--------------EDTC p.t. LNQGIEDNFGKPENSTVEGSGEALVTDLHTPV--------------EDTC ********************************--------------**** m.m. VNHGVEGDFGKQENLTMEGTGEPLITDVHKPG--------------IGAG r.n. VNHGIEDDFGKQENLTMEGTGEPLITDVHKPG--------------EGAG e.e. LNQGIEDNFGKQDNSAVEGNSESLVTDLKASV--------------EGTC :*:*:*.:*** :* ::**..*.*:**:: . -------------- .: x.l. QTLPLEADILNPSNLPVHPSSNVKLTSVPDECPDIVLKQRKLLATDNTNN . :* :: : .* .:. ..: :*.: 979 1018 h.s. NDLCHPGFQERKEQDCFNDAHVTQNSL----------VNSETLKIGDLTP p.t. 
NDLCHPGFQERKEQDCFNDAHVTQNSL----------VNSETLKIGDLTP ***************************----------************* m.m. VQLCHPGFQEKKGHECLN---EAQNSL----------SNSESLKMDDLNP r.n. VHFRHPGFQEKKDHECLNKAQTAQNSL----------ANSESLKMGDLNP e.e. NDLCHPNFQERK-QSCFNEAQVTSDSS----------VNSETLKIDDLTS .: **.***:* :.*:* :.:* ---------- ***:**:.**.. x.l. CNKNKPLTRLHFEEPCLSVYEGCKTSCPIEVNKDLVSSNAVENKMEVLSE . :* : : . *:. . * *: *: *. 1019 1068 h.s. QNLERQVNNLMTFSVQNQAAFQNNLPTSKFECGDNVKTSSNLYNLPLKTL p.t. QNLERQVNNLMTFSLQNQAAFQNNLPTSKFECGDNVKTSSNLYNLPLKTL **************:*********************************** m.m. QSLERQVNTLMTFSVQNEAGLEDNSQICKFECGGDVKTSSSLYDLPLKTL r.n. QSLERQVNTLTTFSVQNEGGLEDSSQICKFECGGDVKTSSSLYDLPLKTL e.e. QNLERQVNSLMTFSVQNQTGFQNNLPTPKADCGDSDKASTTLYNLPLKTL *.******.* ***:**: .:::. * :**.. *:*:.**:****** x.l. NFVERQVSSLMPFETNNQASQLCENNLDMSGYDENCKPENALS------- : :****..* .*. :*: . . . . *... * 1069 1118 h.s. ESIAFVPPQSDLSNSLGTPSVPPKAPVQKFSCQVEGCTRTYNSSQSIGKH zf8 p.t. ESIAFVPPQSDLSKSLGTPSVPPKAPVQKFSCQVEGCTRTYNSSQSIGKH zf8 *************:************************************ m.m. ESITFVQSQPDLSSPLGSPSVPPKAPGQKFSCQVEGCTRTYNSSQSIGKH lcr/zf8 r.n. ESVAFVPSQPDLSSPLGAPSVPPKAPGQKFSCQVEGCTRTYNSSQSIGKH zf8 e.e. ETITFASSQPNLSSSVGTPSVPPKAPVQKFSCQVEGCTRTYNSSQSIGKH zf8 *:::*. .*.:**..:*:******** *********************** x.l. LSLETSSMLTNVLPSVSNPGPVQTSPAQKFKCSVEGCTRIYNSVQSIGKH lcr/zf8 :: .:: .:. *. .:* ***.*.****** *** ****** 1119 1167 h.s. MKTAHPDQYAAFKMQRKSKKGQKANNLNTPNNGKFVYFLPSPVNSSN-PF zf8 p.t. MKTAHPDQYAAFKMQRKSKKGQKANNLNTPNNGKFVYFLPSPVNSSN-PF zf8 ***********************************************-** zf8=1.0 m.m. MKTAHPDQYAAFKLQRKTKKGQKSNNLNTPNHGKCVYFLPSQVSSSNHAF zf8 r.n. MKTAHPDQYAAFKLQRKTKKGQKSNNLSTPDHGKCVYFLPSQVSSSNHAF zf8 e.e. MKTAHPDQYATFKLQRKNKKGQKPNNLNKPNNGKFVYFLPSQVSSSNNAF zf8/lcr **********:**:***.*****.***..*::** ****** *.*** .* zf8=1.0 x.l. MKTAHPEHYDAFKMERKNRKKFKCANSVLPSTEDKPTYCILQDEGCSNPV zf8 ******::* :**::**.:* * * *. . : .... .. zf8=.89 1168 1217 h.s. 
FTSQTKANGNPACSAQLQHVSPPIFPAHLASVSTPLLSSMESVINPNITS p.t. FTSQTKANGNPACSAQLQHVSPPIFPAHLASVSTPLLSSMESVINPNITS ************************************************** m.m. FTPQTKANGNPACSAQVQHVSPSIFPAHLASVSAPLLPSVESVLSPNIPS r.n. FTPQTKASGNPACSAQLQHVSPSIFPAHLASVSTSLLPSVESVLSPNMPS lcr e.e. FTPQTKASGTSTCSDQLQHISPSVFPAHLASVSAPLLPTVESVIDPNIPS **.****.*..:** *:**:**.:*********:.**.::***:.**:.* x.l. FQPQVQSGSNSNFCNQLQHIPNPVITSHLENLN-PILSSVESIISQSLSK lcr * .*.::.... . *:**:. .::.:** .:. .:*.::**::. .:.. 1218 1266 h.s. QDKNEQGG-MLCSQMENLPSTALPAQMEDLTKTVLPLNIDSGSDPFLPLP p.t. QDKNEQGG-MLCSQMENLPSTALPAQMEDLTKTVLPLNIDSGSDPFLPLP ************************************************** m.m. QDKHGQDG-ILCSQMENLSNAPLPAQMEDLTKTVLPLNIDSGSDPFLPLP r.n. QDKHVQDG-MLCSQMENLSNAALPAQMEDLTKTVLPLNIDSGSDPFLPLP e.e. QDKSEQGGGMLCAQMENLTSTTLPAQMEDLTKTVLPLNIDSGSDPFLSLP *** *.* :**:*****..:.*************************.** x.l. NVADSLLG----SDVGNVTSTTLTSQLEDLAKVVLPLKFENGSDPFLPMP : * ::: *:..:.*.:*:***:*.****:::.******.:* 1267 1315 h.s. AESSSMSLFPSPADSGTNSVFSQLENNTNHYSSQIEGNTN-SSFLKGGNG p.t. AESSSMSLFPSPADSGTNSVFSQLENNTNHYSSQIEGNTN-SSFLKGGNG ****************************************-********* m.m. TENS--SLFSSPADSENNSVFSQLENSTNHYPSQTDGNIN-SSFLKGGSS r.n. TENS--SLFSSPAHSENNSVFSQLESSTNHYPSPTEGNLN-SSFLKGGSS e.e. AENSSMSLFPSPADSGTNSGFSQLGNNTNHFPSQIEGNTN-SSFLKGGNG :*.* ***.***.* .** **** ..***:.* :** *-*******.. x.l. TENDPVPVMSSLSG---ATVFTQLGSNTNHDQVLSVGEAANSVFLKEETD :*.. .::.* : : *:** ..*** *: * *** .. 1316 1360 h.s. ENAVFPS-----QVNVANNFSSTNAQQSAPEKVKKDRGRGPNGKERKPKH p.t. ENAVFPS-----QVNVANNFSSTNAQQSAPEKVKKDRGRGPNGKERKPKH *******-----************************************** m.m. ENGVFPS-----QVSSADDFSSTSAQPSTPKKVKKDRGRGPNGKERKPKH r.n. ENGAFPS-----HVSTADDLSSTSAQQSAPKKVKKDRGRGPNGKERKPKH e.e. ENTVFPS-----QVSVVNDFNGTNTQQSAPEKVKKERGRGPNGKERKPKH ** .***-----:*. .:::..*.:* *:*:****:************** x.l. AESSFSKPADNADIDASFNLDKSATMLSNTIENKNGNGRTSRNRERKPRH : *.. .:. ::. : : * . : *: .** ...:****:* 1361 1410 h.s. 
NKRAKWPAIIRDGKFICSRCYRAFTNPRSLGGHLSKRSYCKPLDGAEIAQ zf9 p.t. NKRAKWPAIIRDGKFICSRCYRAFTNPRSLGGHLSKRSYCKPLDGAEIAQ zf9 ************************************************** zf9=1.0 m.m. NKRAKWPAIIRDGKFICSRCYRAFTNPRSLGGHLSKRSYCKPLDGAEIAQ zf9 r.n. NKRAKWPAIIRDGKFICSRCYRAFTNPRSLGGHLSKRSYCKPLDGAEIAQ zf9 e.e. NKRAKWPAIIRDGKFICSRCYRAFTNPRSLGGHLSKRSYCKPLDRAEIAQ zf9 ******************************************** ***** zf9=1.0 x.l. SSRPKCPAIIRDGKFICSRCFRAFTNPRSLGGHLSKRAVCKPHNEYDHSQ zf9 ..*.* **************:****************: *** : : :* zf9=.93 1411 1459 h.s. ELLQSNGQPSLLASMILSTNAVNLQQPQQSTFNPEACFKDPSFLQLLAE- p.t. ELLQSNGQPSLLASMILSTNAVNLQQPQQSTFNPEACFKDPSFLQLLAE- *************************************************- m.m. ELLQTNRQPSLLASMILSTSAVNMQQPQQSNFNPETCFKDPSFLQLLNVE r.n. ELLQTNRQPSLLASMILSTSTVNMQQPQQSNCNPETCFKDPSFLQLLSVE e.e. ELLQNNGQPSLLASMILSTNAINLQQPQQSTFNPETCFKDPSFLQLLAAE ****.* ************.::*:******. ***:*********** x.l. LVPQIDGQASVLASMILSSSPKHNLQSPSQTYNHEVSIKEEPFFPTISSN lcr : * : *.*:*******:.. : *. ... * *..:*: .*: : 1460 1509 h.s. NRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEML p.t. NRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEML ************************************************** m.m. NRPT-FLPSTFPRCDVSNFNASVSQEGSEIIKQALETAGIPSTFESAEML r.n. NRPN-FLPSTFPRCDVSNFNAGVSQEGSEIIKQALETAGIPSTFESAEML e.e. NRSSTFLPNTFPRTSVTNFNTNVSQEGSEIIKQALETAGIPSTFEGAEVL **. ***.**** .*:***:.***********************.**:* x.l. NQTNQFFSSFVTR---------DMDETKKVIETPEIVANRPLSLN----- lcr *:. *:.. ..* :* .::*: . .*. * ::: 1510 1558 h.s. SHVS-TGCVSDASQVNATVMPNPTVPPLLHTVCHPNTLLTNQNRTSNSKT p.t. SHVS-TGCVSDASQVNATVMPNPTVPPLLHTVCHPNTLLTNQNRTSNSKT ****-********************************************* m.m. SQVVPIGSVSDAAQVSAAGMPGPPVTPLLQTVCHPNTSPSNQNQTPNS-- r.n. SQVVPIGSVSDAAQVNAEGMPGPTVTPLLQTVCHTNTSPSSQNQTPNSKT e.e. PHVP-VSCVSDTAQVNATLIPNPTVPPLLQTVCPPNSLLTNQNGTQNSKP .:* .***::**.* :*.*.*.***:*** .*: :.** * ** x.l. ------------MTKDGSLNVSSDTTPLTAVLSQISPTSTKMEESRKILH .. .. ..** .:. .. :. : : : 1559 1605 h.s. 
SSIEECS-SLPVFPTNDLLLKTVENGLCSSSFPNSGGPSQNFTS--NSSR p.t. SSIEECS-SLPVFPTNDLLLKTVENGLCSSSFPNSGGPSQNFTS--NSSR *******-************************************--**** m.m. KTLKECN-SLPLFTTNDLLLKTIENGLCSNSFSSSTEPPQNFTN--NSAH r.n. SNLKECN-SLPIFTTNDLLLKTIENGLCSNSFSSSTEPPQNFTN--NSTH e.e. SSIEECCPSLPVFPSNDLLLKTVENGLCSTPFPNPSGPSQNFTN--NNAR ..::** ***:*.:*******:******..*... *.****.--*.:: x.l. SELNNCN-------KAATVTDNVENGLCSNASLNAGNENNCLSTPVSSTR . :::* . : ..:******.. .. : ::. ..:: 1606 1655 h.s. VSVISGPQNTRSSHLN-KKGNSASKRRKKVAPPLIAPNASQNLVTSDLTT nls p.t. VSVISGPQNTRSSHLN-KKGNSASKRRKKVAPPLIAPNASQNLVTSDLTT nls ****************-********************************* m.m. VSVISGPQNTRSSHLN-KKGNSASKKRKKVAPAVSVSNTSQNVLPTDLPV lcr r.n. VSVISGPQNTRSNHLN-KKGNSASKKRKKVTPAVIVSNTSQNVVPTDLTM lcr e.e. VSVISGPQNTKSIHLNNKKGNSASKRRKKVVPPLIVPNTSQNLVTSDLTA **********:* *** ********:****.*.: ..*:***::.:**. x.l. VSVISGPQNPSSTATK-KKGGGTRKKKKTTDVPTEISTSHELAKNLLAAV lcr *********. * : ***..: *::*.. . ..: : . 1656 1705 h.s. MGLIAKSVEIPTTNLHSNVIPTCEPQSLVENLTQKLNNVNNQLFMTDVKE p.t. MGLIAKSVEIPTTNLHSNVIPTCEPQSLVENLTQKLNNVNNQLFMTDVKE ************************************************** m.m. G-LPAKNLPVPDTNTRSDMTPDCEPRALVENLTQKLNNIDNHLFITDVKE r.n. G-LASKNLTVPDTNTR-DMIPDCEPQALVENLTQKLNNIDNHLFLTDVKE e.e. MGLIAKSIEIPTTNLHPNVIPNCEPQGLVGNLTEKLNNVDNQLFMTDVKE * :*.: :* ** : :: * ***:.** ***:****::*:**:***** x.l. GGLPQHVGGSLQMPTEMQPNLLSYSSELIENIAKHLNNTDKQLFLSCIND * : . : . . *: *::::*** :::**:: ::: 1706 1755 h.s. NFKTSLESHTVLAPLTLKTENGDSQMMALNSCTTSINSDLQISEDNVIQN p.t. NFKTSLESHTVLAPLTLKTENGDSQMMALNSCTTSINSDLQISEDNVIQN ************************************************** m.m. NCKASLEPHTMLTPLTLKTENGDSRMMPLSSCTPVN-SDLQISEDNVIQN r.n. SCKASLEPHTMLTPLTLKTENGDSRMMPLNSCTPVS-SDLQISEDNVIQN e.e. NFKSTLESNAVLAPLTLKAENDDSQMMALNTCTTSS-SDLQISEDNVIQN . *::**.:::*:*****:**.**:**.*.:**. ************* x.l. TIKGHADAN-VISQTPDKPEKFDSVAMPTFTLNSKEHSEVQIP-TELIGS . * :.: ::: . *.*: ** *. : .. *::**. ::* . 1756 1805 h.s. 
FEKTLEIIKTAMNSQILEVKSGSQGAGETSQNAQINYNIQLPSVNTVQNN p.t. FEKTLEIIKTAMNSQILEVKSGSQGAGETSQNAQINYNIQLPSVNTVQNN ************************************************** m.m. FEKTLEIIKTAMNSQILEVKSGSQGTGETTQNAQINYSMQLPSVNSIPDN r.n. FEKTLEIIKTAMNSQILEVKSGSQGTGETTQNAQINYSMQLPSVNSIPDN e.e. FEKTLEIIKTAMNSQMLDVKNESRAVGGTSQNAQINYNIQLPSVNTVQNN ***************:*:**. *:..* *:*******.:******:: :* x.l. HLQGLNSQAAEVFSVNGNAESPCFSLPRSQNTVNTDLDILSRDVHNIISA . : *: : : * :.:. . . : :..: : .: .*:.: . 1806 1855 h.s. KLPDSSPFSSFISVMPTKSNIPQSEVSHKEDQIQEILEGLQKLKLENDLS p.t. KLPDSSPFSSFISVMPTKSNIPQSEISHKEDQIQEILEGLQKLKLENDLS *************************:************************ m.m. KLPDASQCSSFLTVMPTKS-----EALHKEDQIQDILEGLQNLKLENDTS r.n. KLPDSSQCSSFLTVMPTKHNIPQSEALHKEDQIQDILEGLQNLKLETDIS e.e. RLPDPPQFSSFIGAMPTKSNIPQSEILHKGDQVQEILEGLQKLKLENNLS :***.. ***: .**** * ** **:*:******:****.: * x.l. ANNDSPNDTLNIPICSASP------TQEADAHILEIMDLVQKLQLVDSVI *.. : : .:. . :: :*:: :*:*:* . 1856 1901 h.s. TP--ASQCVLINTSVTLTPTPVKSTADITVIQPVSEMIN-IQFNDKVNKP p.t. PP--ASQCVLINTSVTLTPTPVKSTADITVIQPVSEMIN-IQFNDKVNKP .*--***********************************-********** m.m. AP--ASQSMLMNKSVALSPTPTKSTPNIVVQP--VPEVIHVQLNDRVNKP r.n. AP--PSQATLINKSVALSPTPTKSTPNIIVQP--VSEVIHVQLNDRVNKP e.e. TP--VSHCVLINTSVTLTPTPAKLIPNATVVQPVSEMINNIQFNDRVNKP .*-- *:. *:*.**:*:***.* .: * : :*:**:**** x.l. FENIGIDSVPGCPSDYLPMTSSVIVPNPSLNVVEEPDDSKSLLDKCNAKP .. * *. *. .: : ::. ** 1902 1951 h.s. FVCQNQGCNYSAMTKDALFKHYGKIHQYTPEMILEIKKNQLKFAPFKCVV zf10/11 p.t. FVCQNQGCNYSAMTKDALFKHYGKIHQYTPEMILEIKKNQLKFAPFKCVV zf10/11 ************************************************** zf10=1.0 m.m. FVCQNQGCNYSAMTKDALFKHYGKIHQYTPEMILEIKKNQLKFAPFKCVV zf10/11 r.n. FVCQNQGCNYSAMTKDALFKHYGKIHQYTPEMILEIKKNQLKFAPFKCVV zf10/11 e.e. FVCQNQDCNYSAMTKDALFKHYGKIHQYTPEMILEIKKNQLKFAPFKCVV zf10/11 ******.******************************************* zf10=.97 x.l. FICQETDCTYCAMTKDALFKHYAKVHYYTQEKIMDIKKHQLKFAPFKCVV zf10/11 *:**: .*.*.***********.*:* ** * *::***:*********** zf10=.82 1952 2001 h.s. 
PTCTKTFTRNSNLRAHCQLVHHFTTEEMVKLKIKRPYGRKSQSENVPASR zf11 p.t. PTCTKTFTRNSNLRAHCQLVHHFTTEEMVKLKIKRPYGRKSQSENVPASR zf11 ************************************************** zf11=1.0 m.m. PSCTKTFTRNSNLRAHCQLVHHFTIEEMVKLKIKRPYGRKSQSENLSSPQ zf11 r.n. PTCTKTFTRNSNLRAHCQLVHHFTIEEMVKLKIKRPYGRKSQGENLSSPQ zf11 e.e. PTCTKTFTRNSNLRAHCQLVHHFTTEEMVKLKIKRPYGRKSQNENSPAPQ zf11 *:********************** *****************.** .:.: zf11=.99 x.l. PSCTKTFTRNSNLRAHCQSMHNFTPEQMIKLKIKRAYGRKSEITCANIQE zf11 *:**************** :*:** *:*:******.*****: . zf11=.93 2002 2050 h.s. STQVKKQLAMTEENKKESQPALELRAET-QNTHSNVAVIPEKQLVEKKSP p.t. STQVKKQLAMTEENKKESQPALELRAET-QNTHSNVAVIPEKQLVEKKSP ****************************-********************* m.m. NNQVKKQPSMAEETKTESQPAFKVPAATGDAALANATVIPEKQLAEKKSP lcr r.n. INQVKKQLPMAEEAKTESQPAFEVPAVAGEDALDNVAVIPENQLAEKKSP lcr e.e. VTQVKRQLDTTEENKREFQSPLELGTVK-ENALSDVSVIPEKELAEKKSP .***:* :** * * *..::: : : : :.:****::*.***** x.l. LPHTIPEMEHSVDKLASVRQKQHSEQHQEMPAVERVEAPKEQEPISPQSL :. : : : . : . : . . *:: . :* 2051 2098 h.s. DKTESSLQVITVTS-EQCNTNALTNTQTKGRKIRRHKKEKEEKKRKKPVS lcr p.t. DKTESSLQVITVTS-EQCNTNALTNTQTKGRKIRRHKKEKEEKKRKKPVS lcr ************************************************** m.m. EKPESSSQPVTSSAEQYNAN--LANLKTKGRKNKRHRKEKEEKREKNPVS lcr r.n. EKPESSSQPVTASTEQYNAN--LGNLKTKGRKNKRHRKEKEEKREKNPVS lcr e.e. ENTENSSQVITVTSSEQCDTTSLTSTQTKGRKVRRNRKEKEEKKRKKPVS lcr ::.*.* * :* :: : . * . :***** :*::******:.*:*** x.l. GKPSS------------HVLQALSQKCKKGSKVRR--KTKEVTDKVDIQK :... * . .** * :* * ** . . . . 2100 2149 h.s. QSLEFPTRYSPYRPYRCVHQGCFAAFTIQQNLILHYQAVHKSDLPAFSAE zf12/lcr p.t. QSLEFPTRYSPYRPYRCVHQGCFAAFTIQQNLILHYQAVHKSDLPAFSAE zf12/lcr ************************************************** zf12=1.0 m.m. QAFELPTKYSSYRPYCCVHQGCFAAFTIQQNLILHYQAVHKSNLPTFSAE zf12/lcr r.n. QSFEFPTRYSSYRPYCCVHQGCFAAFTIQQNLILHYQAVHKSNLPTFSAE zf12/lcr e.e. QSLEFPAKYSPYRPYRCVHQGCFAAFTIQQNLILHYQAVHKSDLPAFSAE zf12/lcr *::*:*::**.**** **************************:**:**** zf12=1.0 x.l. 
DIKPAVQEFNSYKPYQCVHQGCTAAFTIQQNLILHYQAVHKS-------- zf12 : .:..*:** ****** ******************* zf12=.96 2150 2199 h.s. VEEESEAGKESEETETKQTLKEFRCQVSDCSRIFQAITGLIQHYMKLHEM lcr/zf13 p.t. VEEESEAGKESEETETKQTLKEFRCQVSDCSRIFQAITGLIQHYMKLHEM lcr/zf13 ************************************************** zf13=1.0 m.m. VEEESEAVKESEETEPKQSMKEFRCQVSDCSRIFQAITGLIQHYMKLHEM lcr/zf13 r.n. VEEESEAVKESEETEPKQSMKEFRCQVSDCSRIFQAITGLIQHYMKLHEM lcr/zf13 e.e. VEEESETGKDSEEIETKHTVKEFRCQVSDCSRIFQAITGLIQHYMKLHEM lcr/zf13 ******: *:*** *.*:::****************************** zf13=1.0 x.l. --VSKFSLEEGEECENGFSQKEFRCMEIDCSRIFQEVGSLVQHYMKFHEM zf13 .. : ::.** * : ***** ******* : .*:*****:*** zf13=.72 2200 2249 h.s. TPEEIESMTASVDVGKFPCDQLECKSSFTTYLNYVVHLEADHGIGLRASK zf14 p.t. TPEEIESMTASVDVGKFPCDQLECKSSFTTYLNYVVHLEADHGIGLRASK zf14 ************************************************** zf14=1.0 m.m. TPEEIESMTAAVDVGKFPCDQLECKLSFTTYLSYVVHLEVDHGIGTRTSK zf14 r.n. TPEEIESMTAAVDVGKFPCDQLECKSSFTTYLSYVVHLEVDHGIGTRTSK zf14 e.e. TPEEIESMTASVNVGKFSCDQLECKSSFTTYLNYVVHLEVDHGIGIKGSK zf14 **********:*:****.******* ******.******.***** : ** zf14=.90 x.l. IAEEIENVLSLINIGQFKCDQLNCALLFTSCTSYIEHLEEVHEIKMRHIK zf14 .****.: : :::*:* ****:* **: .*: *** * * : * zf14=.58 2250 2299 h.s. TEEDGVYKCDCEGCDRIYATRSNLLRHIFNKHNDKHKAHLIRPRRLTPGQ zf15 p.t. TEEDGVYKCDCEGCDRIYATRSNLLRHIFNKHNDKHKAHLIRPRRLTPGQ zf15 ************************************************** zf15=1.0 m.m. AEEDGIYKCDCEGCDRIYATRSNLLRHIFNKHNDKHKAHLIRPRKLT-GQ zf15 r.n. TEEDGIYKCDCEGCDRIYATRSNLLRHIFNKHNDKHKAHLIRPRKLT-GQ zf15 e.e. SEEDGVYKCDCEGCDRIYATRSNLLRHIFNKHNDKHKAHLIRPRRLTPGQ zf15 :****:**************************************:** ** zf15=1.0 x.l. VEGEEMYKCDCEGCDRVYATRSNLLRHIFNKHNDKHKEHLIRPRKISLSD nls 7.5 * : :**********:******************** ******::: .: zf15=.99 2300 2349 h.s. ENMSSKANQEKSKSKHRGTKHSRCGKEGIKMPKTKRKKKNNLENKNAKIV nls 10.2/lcr p.t. ENMSSKANQEKSKSKHRGTKHSRCGKEGIKMPKTKRKKKNNLENKNAKIV nls 10.2/lcr ************************************************** m.m. 
ENISSKANQEKSKSKHRTTKPNRSGKDGMKMPKTKRKKKSNLENKSAKVV nls 10.7/lcr r.n. ENISSKANQEKSKSKHRAIKHNRFGKDGMKGPKTKRKKKSNLESKSAKIV nls 10.7/lcr e.e. ENMSSKANQEKTKSKYRGTKHSRSGKEGIKLPKTKRKKRTNLENKNAKIV nls 12.5 **:********:***:* * .* **:*:* *******:.***.*.**:* x.l. QDTTDEK-PLKEKPKVAKQKCETEGNDVLKAKRSK----GCLIDKDIKIE nls :: :.: * *.* * . *:: :* ::* * .*. *: 2350 2399 h.s. QIEENKPYSLKRGKHVYSIKARNDALSECTSRFVTQYPCMIKGCTSVVTS zf16 p.t. QIEENKPYSLKRGKHVYSIKARNDALSECTSRFVTQYPCMIKGCTSVVTS ************************************************** m.m. QIEENKPYSLKRGKHVYSIKARNDALAECTSKFVTQYPCMIKGCTSVVTS r.n. QIEEDKPYSLKRGKHVYSIKARNDALSECTSKFVTQYPCMIKGCTSVVTS e.e. HIEENKPYSLKRGKHVYSIKARNDALSECTSRFVTQYPCMIKGCTSVVTS :***:*********************:****:****************** x.l. QGQTKQETTLKYGRHTYSLKPKDAAFVECSSNLAKQYPCMVRGCTSVVTS : : .: :** *:*.**:*.:: *: **:*.:..*****::******** 2400 2449 h.s. ESNIIRHYKCHKLSKAFTSQHRNLLIVFKRCCNSQVKETSEQEGAKNDVK zf16/coiled p.t. ESNIIRHYKCHKLSKAFTSQHRNLLIVFKRCCNSQVKETSEQEGAKNDVK zf16/coiled ************************************************** zf16=1.0 m.m. ESNIIRHYKCHKLSRAFTSQHRNILIVFKRYGNPQGKEISEQEDEKNDKK zf16 r.n. ESNIIRHYKCHKLSKAFTSQHRNILIVFKRYGNPQEKEVSEQEDEKNDKK zf16 e.e. ESNIIRHYKCHKLSKAFTSQHRSLLVVSKQCCDSQVKETCEQEVGKSDMK zf16 **************:*******.:*:* *: :.* ** .*** *.* * zf16=1.0 x.l. ERSIIRHYKCHKLSGAFILKHRDSLIVCKRRARPKGKAESIIIKAGEDAK zf16 * .*********** ** :**. *:* *: .: * . .* * zf16=.90 2450 2496 h.s. DSDTC---VSESNDNSRTTATVSQKEVEKNEKDEMDELTELFITKLINED p.t. DSDTC---VSESNDNSRTTATVSQKEVEKNEKDEMDELTELFITKLINED *****---****************************************** m.m. DPDSS---VLEKNDNSEPAAAP-QEEGRKGEKDEMDELTELFITKLINED r.n. DPDSS---VLEKNDNSQPASIP-QEEGMKGEKDEMDELTELFITKLINED e.e. DPDTC---VLEG-TDSTTLASGPQKEVEKHEKDEMDELTELFITKLINED *.*:.---* * :. *: .*:* * ******************** x.l. NEISSQKESVPSLLAFIDESSLSTSQPSENEKDEMDELAELFISKLSNED : :. : .: : ********:****:** *** 2497 2546 h.s. STSVETQANTSSNVSNDFQEDNLCQSERQKASNLKRVNKEKNVSQNKKRK coiled/nls p.t. 
STSVETQANTSSNVSNDFQEDNLCQSERQKASNLKRVNKEKNVSQNKKRK coiled/nls ************************************************** m.m. STNAENQGNTTLKGNNEFQEHDSCTSERQKPGNLKRVYKEKNTVQSKKRK nls r.n. STNTETQVSTTLKVNNDFQEHDSCISERQKTGNLKRVYKEKNITQNKKRK nls e.e. NTNVETQAPTSSNIDTDFQENNPSQPEKQKASNLKRVNKEKSVSQNKKRK nls .*..*.* *: : ..:***.: . .*:**..***** ***. *.**** x.l. STGSDSQARTSSFVSSDFHETSSCQSEPQKETSNLKRTNKQKHVLNRKRK .*. :.* *: ..:*:* . . .* ** . : :::. .:*** 2547 2696 h.s. VEKAEPASAAELSSVRKEEETAVAIQTIEEHPASFDWSSFKPMGFEVSFL nls p.t. VEKAEPASAAELSSVRKEEETAVAIQTTEEHPASFDWSSFKPMGFEVSFL nls *************************** ********************** m.m. IDKTEPEVSLVVNNTRKEEEPAVAVQTTEEHPASFDWSSFKPMGFEASFL nls r.n. IDKTEPEVPLVVNKTHKEEETAVAVQTTEEHPASFDWSSFKPMGFEASFL nls/lcr e.e. IEKTGPAPAVEVSSMHKEEETAIGIQTTDDHPASFDWSSFKPMGFEASFL nls ::*: * . :.. :****.*:.:** ::****************.*** x.l. VDKSDMQSTETSSTVSGEE--IIATLPTTEEPPALDLSSFKPMGFEVSFL lcr ::*: . .. ** :. . :.*.::* *********.*** 2597 2646 h.s. KFLEESAVKQKKNTDKDHPNTGNKKGSHSNSRKNIDKTAVTSGNHVCPCK p.t. KFLEESAVKQKKNTDKDHPNTGNKKGSHSNSRKNIDKTAVTSGNHVCPCK ************************************************** m.m. KFLEESAVKQKKNSDRDHSNSGSKRGSHSSSRRHVDKAAVAGSSHVCSCK lcr r.n. KFLEESAVKQKKNTDRDHSHSGSKRGSHSNSRKNVDKTAVTGGNHVCSCK lcr e.e. KFLEESAVKQKKNTDKDHPNTGNKKGSHSNSRKSADKTAVTSGNHVCPCK *************:*:**.::*.*:****.**: **:**:...***.** x.l. KFLEASATQEEESVDLDDFKAESNVDLHLIKKK--MRLSVDEDELSLGSS nls/lcr **** **.::::. * *. :: .: . * .:: : :* .. .. 2647 2696 h.s. ESETFVQFANPSQLQCSDNVKIVLDKNLKDCTELVLKQLQEMKPTVSLKK p.t. ESETFVQFANPSQLQCSDNVKIVLDKNLKDCTELVLKQLQEMKPTVSLKK ************************************************** m.m. DSEIFVQFANPSKLQCSENVKIVLDKTLKDRSELVLKQLQEMKPTVSLKK r.n. ESEIFVQFANPSKLQCSENVKIVLDKTLKDCSELVLKQLQEMKPTVSLKK e.e. ESETFVQFANPSQLQCSDNVKIVLDKTLKDCTELVLKQLQEMKPTVSLKK :** ********:****:********.*** :****************** x.l. ESDPYLLFANPLHFPDMDNIKLVLDQKFSDYIDLVVKQLNELKPVVVLRR lcr :*: :: **** :: :*:*:***:.:.* :**:***:*:**.* *:: 2697 2723 h.s. 
LEVHSNDPDMSVMKDISIGKATGRGQY p.t. LEVHSNDPDMSVMKDISIGKATGRGQY *************************** m.m. LEVLSNNPDRTVLKEISIGKATGRGQY r.n. LEVLSNDPDRTVLKELSLGKATGRGQY e.e. LEVHSNDLDVSVMKEISLGKATGRGQY *** **: * :*:*::*:********* x.l. HEMLRNELTSTSKENVLAVAEEATA-- *: *: : ::: . .
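The consensus rows in the alignment above use the standard Clustal symbols: `*` for a fully conserved column, `:` for strongly similar residues, and `.` for weakly similar residues. A minimal Python sketch for tallying those symbols in one consensus row follows. The function name `summarize_consensus` and its `width` parameter are hypothetical and are not part of the original analysis. Runs of spaces in the consensus rows appear to have been collapsed in this page's text, so `width` is offered as a way to pad a copied row back to the block width (50 columns for most blocks here).

```python
from collections import Counter


def summarize_consensus(consensus, width=None):
    """Tally Clustal consensus symbols in a single consensus row.

    consensus: the symbol row copied from under an alignment block.
    width:     optional block width; shorter rows are padded with spaces,
               which are counted as unconserved columns.
    """
    if width is not None:
        consensus = consensus.ljust(width)
    counts = Counter(consensus)
    total = len(consensus)
    identical = counts["*"]
    strong = counts[":"]
    weak = counts["."]
    return {
        "columns": total,
        "identical": identical,
        "strong": strong,
        "weak": weak,
        # Spaces, gap dashes and anything else count as unconserved.
        "other": total - identical - strong - weak,
    }


if __name__ == "__main__":
    # Illustrative row only, not taken from the alignment.
    print(summarize_consensus("***:*.* **", width=12))
```

Dividing `identical` by `columns` gives a rough per-block identity fraction, which is only as reliable as the padding assumption.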

Partial Amino Acid Sequence Alignment Scores: N-Terminal

Amino acid sub-sequences (N-terminus to zinc finger 1, zinc finger 1 through zinc finger 16, and the end of zinc finger 16 to the C-terminus) from human and the five comparison species were aligned with ClustalW to calculate alignment scores for comparison.

CLUSTAL 2.1 Multiple Sequence Alignments
Sequence type explicitly set to Protein
Sequence format is Pearson
Sequence 1: sp|O60281|ZN292_HUMAN 570 aa
Sequence 2: tr|H2QTD2|H2QTD2_PANTR 570 aa
Sequence 3: sp|Q9Z2U2|ZN292_MOUSE 568 aa
Sequence 4: tr|A0A1L8G393|A0A1L8G393_XENLA 567 aa
Sequence 5: tr|D3ZXZ1|D3ZXZ1_RAT 567 aa
Sequence 6: tr|A0A1S3A3I8|A0A1S3A3I8_ERIEU 569 aa
Start of Pairwise alignments
Aligning...
Sequences (1:2) Aligned. Score: 99
Sequences (1:3) Aligned. Score: 91
Sequences (1:4) Aligned. Score: 73
Sequences (1:5) Aligned. Score: 92
Sequences (1:6) Aligned. Score: 95
Sequences (2:3) Aligned. Score: 91
Sequences (2:4) Aligned. Score: 73
Sequences (2:5) Aligned. Score: 93
Sequences (2:6) Aligned. Score: 95
Sequences (3:4) Aligned. Score: 71
Sequences (3:5) Aligned. Score: 99
Sequences (3:6) Aligned. Score: 92
Sequences (4:5) Aligned. Score: 71
Sequences (4:6) Aligned. Score: 72
Sequences (5:6) Aligned. Score: 92
Guide tree file created: [clustalw.dnd]
There are 5 groups
Start of Multiple Alignment
Aligning...
Group 1: Sequences: 2 Score:9211
Group 2: Sequences: 2 Score:9354
Group 3: Sequences: 3 Score:9081
Group 4: Sequences: 5 Score:8781
Group 5: Sequences: 6 Score:8657
Alignment Score 46133
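The pairwise scores in a ClustalW log like the one above can be pulled out programmatically, for example to fill in a summary table of each species against human. The sketch below is one way to do that in Python. The function names `parse_clustalw_log` and `scores_versus` are hypothetical, and the regular expressions assume the log wording shown on this page (`Sequence N: <id> <length> aa` and `Sequences (i:j) Aligned. Score: s`).

```python
import re

# Patterns assume the ClustalW 2.1 log wording reproduced on this page.
NAME_RE = re.compile(r"Sequence (\d+): (\S+)\s+(\d+) aa")
PAIR_RE = re.compile(r"Sequences \((\d+):(\d+)\) Aligned\. Score:\s*(\d+)")


def parse_clustalw_log(log_text):
    """Return ({index: sequence id}, {(i, j): pairwise score})."""
    names = {int(i): name for i, name, _length in NAME_RE.findall(log_text)}
    scores = {(int(a), int(b)): int(s) for a, b, s in PAIR_RE.findall(log_text)}
    return names, scores


def scores_versus(reference, names, scores):
    """Map each other sequence id to its pairwise score against `reference`."""
    result = {}
    for (a, b), score in scores.items():
        if a == reference:
            result[names[b]] = score
        elif b == reference:
            result[names[a]] = score
    return result


if __name__ == "__main__":
    # Short excerpt of the N-terminal log, used only to exercise the parser.
    sample = (
        "Sequence 1: sp|O60281|ZN292_HUMAN 570 aa\n"
        "Sequence 4: tr|A0A1L8G393|A0A1L8G393_XENLA 567 aa\n"
        "Sequences (1:4) Aligned. Score: 73\n"
    )
    names, scores = parse_clustalw_log(sample)
    print(scores_versus(1, names, scores))
```

Because both patterns are applied with `findall`, the parser works whether the log is one entry per line or run together on a single line.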

Partial Amino Acid Sequence Alignment Scores: Zinc finger region

CLUSTAL 2.1 Multiple Sequence Alignments
Sequence type explicitly set to Protein
Sequence format is Pearson
Sequence 1: sp|O60281|ZN292_HUMAN 2153 aa
Sequence 2: tr|H2QTD2|H2QTD2_PANTR 2153 aa
Sequence 3: sp|Q9Z2U2|ZN292_MOUSE 2130 aa
Sequence 4: tr|A0A1L8G393|A0A1L8G393_XENLA 2103 aa
Sequence 5: tr|D3ZXZ1|D3ZXZ1_RAT 2139 aa
Sequence 6: tr|A0A1S3A3I8|A0A1S3A3I8_ERIEU 2156 aa
Start of Pairwise alignments
Aligning...
Sequences (1:2) Aligned. Score: 99
Sequences (1:3) Aligned. Score: 78
Sequences (1:4) Aligned. Score: 39
Sequences (1:5) Aligned. Score: 79
Sequences (1:6) Aligned. Score: 85
Sequences (2:3) Aligned. Score: 78
Sequences (2:4) Aligned. Score: 39
Sequences (2:5) Aligned. Score: 79
Sequences (2:6) Aligned. Score: 85
Sequences (3:4) Aligned. Score: 37
Sequences (3:5) Aligned. Score: 91
Sequences (3:6) Aligned. Score: 74
Sequences (4:5) Aligned. Score: 38
Sequences (4:6) Aligned. Score: 38
Sequences (5:6) Aligned. Score: 75
Guide tree file created: [clustalw.dnd]
There are 5 groups
Start of Multiple Alignment
Aligning...
Group 1: Sequences: 2 Score:33329
Group 2: Sequences: 2 Score:35508
Group 3: Sequences: 3 Score:32046
Group 4: Sequences: 5 Score:32766
Group 5: Sequences: 6 Score:14239
Alignment Score 129426
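The pairwise "Score" values in these ClustalW logs are, as far as the ClustalW documentation describes them, percent identities: the number of identical residues divided by the number of residue pairs compared, with gap positions excluded. The Python sketch below implements that definition for two rows of an existing alignment. The function name `percent_identity` is hypothetical. ClustalW derives its scores from its own pairwise alignments, so values computed from rows of a multiple alignment may differ somewhat from the logged scores.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two equal-length aligned sequences.

    Columns where either sequence has a gap are skipped, so the
    denominator is the number of residue pairs actually compared.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue
        compared += 1
        if res_a == res_b:
            identical += 1
    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


if __name__ == "__main__":
    # Human and mouse rows from the 599-646 block of the alignment on this page.
    human = "RLAAMKPLRRLGRPPKITTTNENQ--KTNTVAKQEQRPIKKNSLYSTDFI"
    mouse = "RLAAMKPLRRLGRPPKITATHENQKTNINTVAKQEQRPIKKNSLYSTDFI"
    print(round(percent_identity(human, mouse), 1))
```

Excluding gapped columns keeps insertions and deletions from lowering the score, which is one reason regions with long indels can still report moderate identity.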

Partial Amino Acid Sequence Alignment Scores: C-Terminal

CLUSTAL 2.1 Multiple Sequence Alignments
Sequence type explicitly set to Protein
Sequence format is Pearson
Sequence 1: sp|O60281|ZN292_HUMAN 313 aa
Sequence 2: tr|H2QTD2|H2QTD2_PANTR 314 aa
Sequence 3: sp|Q9Z2U2|ZN292_MOUSE 312 aa
Sequence 4: tr|A0A1L8G393|A0A1L8G393_XENLA 310 aa
Sequence 5: tr|D3ZXZ1|D3ZXZ1_RAT 312 aa
Sequence 6: tr|A0A1S3A3I8|A0A1S3A3I8_ERIEU 312 aa
Start of Pairwise alignments
Aligning...
Sequences (1:2) Aligned. Score: 99
Sequences (1:3) Aligned. Score: 71
Sequences (1:4) Aligned. Score: 38
Sequences (1:5) Aligned. Score: 75
Sequences (1:6) Aligned. Score: 82
Sequences (2:3) Aligned. Score: 71
Sequences (2:4) Aligned. Score: 39
Sequences (2:5) Aligned. Score: 75
Sequences (2:6) Aligned. Score: 83
Sequences (3:4) Aligned. Score: 36
Sequences (3:5) Aligned. Score: 88
Sequences (3:6) Aligned. Score: 66
Sequences (4:5) Aligned. Score: 36
Sequences (4:6) Aligned. Score: 34
Sequences (5:6) Aligned. Score: 72
Guide tree file created: [clustalw.dnd]
There are 5 groups
Start of Multiple Alignment
Aligning...
Group 1: Sequences: 2 Score:4802
Group 2: Sequences: 2 Score:5107
Group 3: Sequences: 3 Score:4597
Group 4: Sequences: 5 Score:4665
Group 5: Sequences: 6 Score:2060
Alignment Score 17299

© 2024 WJ Sunderland, PhD "ZNF292 Oxen Dad"
contact: wjsunderland@gmail.com