kite proteins: a superfamily of smc/kleisin partners ... volume 23 supplemental information kite...
TRANSCRIPT
Structure, Volume 23
Supplemental Information
Kite Proteins: a Superfamily of SMC/Kleisin
Partners Conserved
Across Bacteria, Archaea, and Eukaryotes
Jan J. Palecek and Stephan Gruber
Figure S1, related to Figure 1 Structural alignment of bacterial and human kite WHA and WHB subdomains.
(A) PDB structures were aligned using FATCAT algorithm. Respective P‐values are given for ScpB structures ranked
among the top‐five hits (number in brackets denotes the rank). The best ranked result not belonging to the family of
kite and MAGE proteins are given in the right column. (B) Example of WHA (left panel) and WHB (right panel) domain
superimposition (human NSE3 compared to Geobacillus stearothermophilus ScpB; 3W6JB).
WHA 1 2 3 1 2 extension ScpB---HHHHHHHHHHHHHHH-------------HHHHHHHHH---------------HHHHHHHHHHHHHHHH---------SSSSSS--------SSSSSS-----HHHHHHHHHHH S.pn---MSTLAKIEALLFVAG---EDGIR-----VRQLAELLS----LP---------PTGIQQSLGKLAQKYE---KDPDSSLALIET-SG-----AYRLVT-KPQ-FAEILKEYSKA G.st---KPAKAIVEALLFAAG---DEGLS-----LSQIAAVLE----VS---------ELEAKAVIEELQQDCR---REERG-IQLVEL-GG-----VFLLAT-KKE-HAPYLKKLVEA C.te-QRQQLLRSLEALIFSS----EEPVN-----LQTLSQITA--HKFT---------PSELQEAVDELNRDYEATGRT----FRIHAI-AG-----GYRFLT-EPE-FADLVRQLLAP M.tu--ADELKRVLEALLLVI----DTPVT-----ADALAAATE----QP---------VYRVAAKLQLMADELTG--RDSG--IDLRHT-SE-----GWRMYT-RAR-FAPYVEKLLLD B.su---VNWKAIVEALLYAAG---DEGLT-----KKQLLTVLE----IE---------EPELNTIMADVADEYRGDTRG----IELIEY-AD-----TYMLST-KKD-FAPYLKKLIEV M.ge---ANLVAAIYGLLFVSG---EKGLT-----LAELNRVLR---KVG---------LEKIKAALVQLERKLSL--DDESG-IEIKKF-GH-----SFRLVT-KME-IKDFIHRYLPN F.nu---MSIKNQVEAIIFLGG----DENK-----IKDLAKFFK----IS---------IEDMLKILLELKDD-----RKDMG-INIEID-SE-----IVYLST-NPL-YGEIINNYFEQ P.ae---HELATLLEGILLAAG----KPLS-----LERLAELFD---EAERPE------PGQFRDALAILALS--CAGRS----FELKEV-AS-----GYRLQI-RER-FSPWVGRLWEE T.pa---APDLALLEAILFVEG----VRLS-----YACLARKLG----LS---------EQAVGECVARLGEALASGARGGGG-LELHCN-EQ-----GVALLP-AAT-VRERLATLYGK T.ma---MQLKAAIEALIFASN-----GIT-----LERLIKILE----KD---------PEEIKRALEELKKEYED-EAHG---VVLREV-NG-----RYRFFT-KPE-YAGFVSKLSGR P.ma---ISLPAKLEAVLYLKG----KPLS-----LSEMAELVN----ET---------EDITEQALFELMAGYSQ--RDTA--LEINEK-KG-----KYSLQL-KTG-LGELVKNLLPV T.el--LRTLTMRVEAILYLKA----QPLT-----LTELATLAG----TD---------REAIELALIELLNDYAH--RQTA--LEIVQI-DD-----KYSLQL-KSA-FQELVQSLVPV C.ag---VTLLQLIEAVLFVAG----EPVT-----LEQLARVLE----VS---------PEQIEAAIEELSASYAQ--RG----IRLQRH-GD-----QLMLVS-APE-AAPVIRRFLGS A.fu---MELKKIVEAILFSSS----EPVD-----ARELRKITG----KD---------KVEILNAIGELIKDYES--RDTS--IEIIKV-GE-----KYLMRV-KPQ-YAEYVERFTVR P.ab---LEDKALVEAALFVAG----RPLS-----VKELSKALG----IKS--------LDYLEKLIELIASEYSE--RKSA--IEIVKVAGD-----KWVMQV-KQE-YSQKVIHLMPK T.vo---MDDETKVEAILYATR----NPLS-----VRSISLILG----IE---------AGAISRIIKKLRLEYKK--RNTS--LEIAKI-GN-----KYRIQL-KKE-YYDFAYRVMEP Nse1----HHHHHHHHHHHHH------------HHHHHHHHHHHHH----------------HHHHHHHHHHHHHHH------SSSSS---------SSSSS-------HHHH S.c.----ATAKYLLQYILSA-----RGICHE-NALILALMRLETD-ASTLNTEWS------IQQWVDKLNDYINAI-VKLx-DYAVLQSIVLPESNRFFVYVNLASTEETKL S.p.----DKHKFILQYIMCR------TAGV--DNEQVRELVQEQY-GETAT----------VEDVINELNNSLHNF------DFKIKRVQDQLDGRLTLHFQNLSGDPVSQM D.d.----NAQHRLLLDQFTK-----RRIIS---TETLKKLVMTVNRITQVNIS--------LDDYINSLNSKIMEV---GL-QIKILN--TDGIN--DYILTNLKPDECARY M.o.----NLHRGFLQAMMAR-----GSMTLE-EAQPILSSLHNAE-KSVGAAGIx------LEAVLSMIREAISPL------DYDIKKHRHQTTKEEVWAFININSDLSTQL D.r.----DGHRLFLQNMMTN-----GIVSAA-QAGMLHKKCCELH---GGQEK--------IDDFINVVNTHLQPL------FMHIRKGMSEEDGQEHFVLVNMAETDITRM X.t.----ESHQRFLQVLMSH-----GIMESS-LVRALHRHCCEVH-KVNYMHDN-------LDDFVGVLNKHLQPL------FMKIEKGVGEEDGLTYYALVNRVENDITKM G.g.----DAHRRFLQVLMSH-----GIMEGA-EARKLHRCCCEIS-KAYYAQDK-------LDDFVSTINNQLQPL------FMQIRKGMSEVDGRTHYALVNMAETEITKM O.a.----DAHRRFLQLLMSH-----GIMEGS-EARKLHRHCCEKH-KVYYAHDK-------LDDFIGIINSLLQPL------FMEIRKGMSEEEGKPYYALVNLTETEITKM M.d.–---DSHRQFLQVLMSN-----GIIDAP-EARRVHRLFCEQH-KVYYAHEK-------LDEFVGVINTHLHPL------FMEIRKGRSEDNGKMFYALVNLAITEATKL D.n.----DVHRRFLQLLMTH-----GVLEEC-DVKRLQKHCYKVH-DCNATVEK-------LEDFINNINSVLESL------YIEIKKGVTEDDGRPIYALVNLATTSVSRM M.m.----DVHRRFLQLLMTH-----GVLEEW-EVRRLQNHCYQVH-DRNATVDK-------LEDFINNINSVLESL------YIEIKKGVTEDDGRPIYALVNLATTSVSKM H.s.----DVHRRFLQLLMTH-----GVLEEW-DVKRLQTHCYKVH-DRSATVDK-------LEDFINNINSVLESL------YIEIKRGVTEDDGRPIYALVNLATTSISKM Nse3-HHHHHHHHHHHHHHHHHH-----------HHHHHHHHHH--------------HHHHHHHHHHHHHHHH----------SSSSS----------SSSSS----HHHHH S.c.-KENPVARKMVRYILSRGESQNSIIT----RNKLQSVIHE---AAREENIAKPSFSKMFMDINAILYNVY------G---FELQG-----xQK--FILL----xTDRDL S.p.-NFQLLVRNVVRYAICSQT-SHNTIT----RKDIVQKAFP---EGTSRNL----FQSVFEEADRQLQLSF------G---FRLVA-----xHR--YWVLR---xKDSRL D.d.-ERYKLVYEYVRLLLFSNR-KKVPIT----KTEINKIILA---RFKDKSL----QGFVYKAGREYLKEFF------G---YEVVE-----xST--YILK----xQLDSI M.o.---TQLIKKLVRYALACEY-SRTPIR----REGIRDKVLG----AHGRS-----FRHVFDGAQKQLRAVF------G---MEMVE-----xNT--YILVT---xRTAAI D.r.-QIDHKVAEVVQFILIKDQ-KKIPIR----RADIGKHVIK---DYKHI------YAEVMNRVCRTFEQVF------G---LKLVE-IDLKQHI--YILIN---xRGQTV X.t.-QINLKVGEVVQYLLIKDQ-KKLPIK----RADIVRNVVK---EYKDI------YPEIFRRAQIALQQVF------G---FQLEE-IDTKSHI--YILTT---xQGDGM G.g.-EINRKVTELVQFLLVKDQ-KKIPIK----RVDILKKVIR---EYKDV------YSEIVNRAGRTLQQVF------G---LQMVE-IDTKHHI--YILTS---xEGENL O.a.–QVDQKVNELVQYLLVKDQ-KKLPIK----RADILRNVIK---EYKGV------SSEIVKGAGQVLEKVF------G---LHLKE-IDQKNHV--YIIVN---xEGDNM M.d.-QIQQKVNELVQFLLVKDQ-KKVPIR----RADMVKTVLQ---DYKDM------ASVIIERAGQTLEEVF------G---LQLTE-IDRKHHA--YILIN---xEGDGM D.n.-QLDLKVGELVQFLLIKDQ-KKIPIK----RTDILKHVIG---DYYKDV-----FPDLLKLAAERLQYVF------G---YRLVK-LEPKHNT--YVLIN---xGDAEM M.m.-QLELKVAELVQFLLIKDQ-KKIPIK----RTDILKHVVG---DYRDV------YPNLLKLAAERLQYVF------G---YKLVE-LEPKSHS--YILIN---xEDAEM H.s.-QLELKVSELVQFLLIKDQ-KKIPIK----RADILKHVIG---DYKDI------FPDLFKRAAERLQYVF------G---YKLVE-LEPKSNT--YILIN---xEDAEM MAGE-HHHHHHHHHHHHHHHHHHH----------HHHHHHHH----------------HHHHHHHHHHHHHHHH----------SSSSSS--------SSSSSS------HHH A3--AALSRKVAELVHFLLLKYRA-REPVT----KAEMLGSV---VGNWQYF------FPVIFSKASSSLQLVF------G---IELMEV-DPIGH--LYIFAT-----CLGL A4---ALSNKVDELAHFLLRKYRA-KELVT----KAEMLERV---IKNYKRC------FPVIFGKASESLKMIF------G---IDVKEV-DPASN--TYTLVT-----CLGL B18--PLNKKVVSLVHFLLQKYET-KEPIT----KGDMIKFV---IRKDKCH------FNEILKRASEHMELAL------G---VDLKEV-DPIRH--YYAFFS-----KLDL C2--YTLDEKVAELVEFLLLKYEA-EEPVT----EAEMLMIV----IKYKDY------FPVILKRAREFMELLF------G---LALIEV-GPDHF--CVFANT------VGL D1--ALLQERANKLVKYLMLKDYT-KVPIK----RSEMLRDI---IREYTDV------YPEIIERACFVLEKKF------G---IQLKEIDKEEHL--YILIST--PESLAGI E2a--PLEDRSIALVNFMRMKSQT-EGSIQ----QSEMLEFL----REYSDQ------FPEILRRASAHLDQVF------G---LNLRVIDPQADT---YNLVS---KRGFQI E2b--TMNDKANDLVQLAISVTEE-MLPIH----QDELLAHT---GKEFEDV------FPNILNRATLILDMFY------G---LSLIEVDTSEHI---YLLVQ---xEQVML F1---RLNRTVAELVQFLLVKDKK-KSPIT----RSEMVKYV---IGDLKIL------FPDIIARAAEHLRYVF------G---FELKQFDRKHHT---YILIN-----KLKP L2---PLDERANALVQFLLVKDQA-KVPVQ----RSEMVKVI---LREYKDE------CLDIINRANNKLECAF------G---YQLKEIDTKNHA---YIIIN-----KLGY Ndn--QLVQKAHELMWYVLVKDQK-KMIIW----FPDMVKDV---IGSYKKW------CRSILRRTSLILARVF------G---LHLRLTSLHTME---FALVK-----ALEP MukE--HHHHHHHHHHHHHHHHH-------------HHHHHHHHH---------------HHHHHHHHH---------------SSSSS----------SSSSS-----LOOP- E.co--QALANPLFPALDSALRS--GRHIGLDE---LDNHAFLMD---FQ----------EYLEEFYAR----------YN---VELIR-APEGF----FYLRP----RSTTLI H.du--IAIANPIFPQLDSQLRA--GRHISIEM---LDEHAFLMD---FQ----------TELESFYRR----------YH---VDLIR-APEGF----FYLRP----KASTLI V.ch--KAIANPLFPALDSLLRA--GRHVSSDD---LDNHAFLSD---FE----------PDLALFYQR----------YH---TELVR-APEGF----FYLRP----RSTSLI T.au--QAIANPLFPKLDTALRS--GKHISADD---LDSHSYLLD---YH----------DELETFYNR----------YQ---VELIK-APEGF----FYLRP----RSTSEI Oce---QAIANPLFPALDNQLRS--GRHITADE---LEQHSLLQE---YY----------SELDAFYQR----------YQ---AELVR-APEGF----YYLRP----RSTSEL Y.pe--QALANTLFPALDSQLRA--GRHIGIDE---LDNHAFLMD---FQ----------EQLEEFYAR----------YN---VELIR-APEGF----FYLRP----RSTTLI H.in--VAIANPIFPAVDSLLRS--GRHISTEH---LDNHAFLMD---FQ----------NELDGFYRR----------YN---VELIR-APEGF----FYLRP----KATTLI A.sa--EAIANPLFPRIDTALRS--GRHISADD---FEQHSALVE---YH----------NELEIFYGR----------YQ---VELIK-APEGF----FYLRP----RPSADI P.mi--QALANSLFPELDSQLRA--GRHIGIDS---LDNHAFLMD---FQ----------DELTDFYAR----------YN---VELIR-APEGF----FYLRP----RSTTLI S.ce--AAIADEHFPEVDLMLRR--GRHIGRDD---GTAYDYLAD---AQ----------AILEGFYRR----------FG---CELVQ-QSDGY----FYLLP----SGERLG MksE------HHHHHHHHHHH-------------HHHHHHH--------------HHHHHHHHHHHHHHH--------------SSSSS----------SSSSS-----HHHH Bac-------MEVVINYLFSH-----NFL-----LKEFQRE------KYQL----AVRNKDIIKRYLKVI----------G---WDFLV--DEKHGC--IVIASPHYEHRLKL Eub-------RKTIQDLLRQT---CILQMKCDP-VTLIQRD----NPRYQV----CLRNREFISDYLAVL----------D---CELVH--DQQEHL--FRIT----GDGVML Geo-------LRRAASIALDR----QFLFGDKSRDQRSFHQ--------------ILDAEDYYRNLFDAL----------N---LELIC--DRTAGY--VGVV--PRESHLTV Pse-------APIFRELFKGY------HISHR--DPELYTQ--------------LSSHQDQYRGLFRAM----------G---FELVC--DTRGF---YYFV--PEQVGAQV Aci-------AELAGRLLASG---VVWREHSRP-EAALYDD--------------AIQCEQLLREWFACI----------G---FVLVH--DSDARL--LRLY---PPGEGGG Cor-------RKALVQLLKGP---MVNALQ----HVEVWRA--------------ITTDQDALNAVLNNL----------F---LELVL--DEDAG---VAFT----xQEVLV Pho-------RRVLVSLLRQG-----VILSSQ--KAKLFEL--------------LCRYQSAVRKHLSEV----------Y---LRLVL--DEKAG---VAFI-------AGF
Figure S2, related to Figure 2, Part 1 Structure based sequence alignment of kite WHA domains. See "part 2" for abbreviations of species names.
WHB 1 2 3 1 2 4 ScpB-------HHHHHHHHHHHHHHH--------HHHHHHHH-------------------HHHHHHHHHHH------SSSSSS-----------SSSSS---------HHHHHHHH- S.pn-------SRAALETLSIIAYKQ---PIT--RIEIDAIR----GVN------------SSGALAKLQAF---DL-IKEDGK-KEVLGRP---NLYVT--T------DYFLDYMG G.st------LSQAALETLAIIAYRQ---PIT--RAEIEEIR----GVK------------SDKPLQTLMAR---AL-IKEVGR-AEGTGRP---ILYGT--T------PEFLDYFG C.te--IQRRLSRSMLEVLAVVAWHQ---PVT--KGEIQQIR----GAS------------PDYSIDRLLAR---GL-IEVRGR-ADSPGRP---LQYGT--T------EVFLDLFH M.tu---RTKLTRAALETLAVVAYRQ---PVT--RARVSAVR----GVN------------VDAVMRTLLAR---GL-ITEVGT-DADTGA----VTFAT--T------ELFLERLG B.su--------QASLEVLAIVSYKQ---PIT--RAEIEEIR----GVK------------SERILHSLVAK---AL-LCEVGR-ADGPGRA---ILYGT--T------PTFLEQFG M.ge--------SKTMEVLAIIAYNQ---PCT--RPRINEIR----GAD------------SFQIVDDLLEK---EL-IVELGR-KDTPGRP---FIYEV--S------PKFYDLFG F.nu--------SASIETLSIIAYKQ---PIT--KSEIESIR----GVS------------VDRIISNLEER---KF-VRNCGK-QETGRRA---NLYEV--T------SKFLSYLG P.ae--------RALLETLVLIAYRQ---PIT--RGEIEEIR----GVAV-----------NTQIVKTLMER---EW-IRIVGY-REVPGRP---AMLAT--T------KAFLDYFN T.pa--------RAAMETLSIVAYAQ---PVT--RAEIEAIR----GVGA------------DTMIRLLSER---RL-ICEVGK-KDIPGKP---AQYGT--T------EEFLTAFR T.ma--------DTQMEVVALLLISG---PIP--KSEIDAFR----GKDS------------SAVLSSLQRM---GI-VRKKRK-----GKS---YLYQL--S------PSFVESTM P.ma--------GATLRTLGTIALKK---RIL--QSELVDLR----GSSA------------YEHIKDLVEK---DF-VERKRQ---REGRS---YWLTL--S------EKFHRTFS T.el--------VAAQRTLALIALRG---PIR--QPEVIALR----GANA------------YQHIQELLTL---GF-IRRRRD---SQSRS---YILQV--T------ERFHQYFQ C.ag--------HAALETLAIIAYRQ---PIT--RAQIEAIR----GVDS------------SAALRALLAR---DL-ICEVGR-LETLGRP---ILYAT--T------PMFLQQFG A.fu--------RGTLRTLAVIALKQ---PIT--LAKVAKIR----GNKC------------YEHVKKLQER---GL-VKAEKK-----GRS---TILTT--T------EEFATYFG P.ab--------AGELKTLALIAYLQ---PVE--QSKIVKLR----GSQA------------YEHIKRLLEM---GL-IYAEPY-----ERT---KLLGT--T------EKFAELYG T.vo--------KYETGFLATVALNE---GAS--LSFFRKRY----GSRT------------DDMISKLKTM---SL-IRTSKK-----GNGT--AIYLG---------ENFEKVFG Nse1----HHHHHHHHHHHHHHHHHH--------HHHHHHHHH---------------HHHHHHHHHHHHHH------SSSS------------SSSS-------HHHHHHHHHHHHHHH S.c.----NQNEIEFMKWAIEQFMISG-xIVKEVNRILVAAT---xTNLFQFQELT--ATDIEDLLLRLCEL---KW-FYRTQ-----EG----KFGID------LRCIAELEEYLTSMY S.p.----PPVQIELMRKIIEWIMKCDDYQYSLTTLQIQKLS-----RKEMGLAP----SVIESHLHTFERD---GW-LRQR------EG----IWTFT------NHALAELDAYLHNEY D.d.----SGDELKFFKLILKMFIESR-VGLK--KNDILTLG-----RDELKIKL----SDADNLFRKFAED---GW-LRLSA-----SK----SFTTL-----TNRALSDMAPLLD--- M.o.----TADEMSFIKRLLDAIFDTY-xLMCITADQARKLS---xQSATDKGLK---HSEVDALMASLTEE---GW-LEKSA-----AG----FYSLA------PRALLELWSWMVESY D.r.----AENELELFRKIMDLIVESDSGSAS--STAILNSAD----KLISKKLK---KKEAELVLNKFVQD---KW-LKEQ------DG----EYTLS------VRCIVEMEPYMRTIY X.t.----AENELELFRKTMELIIISENGFAP--SISILNLAD----ELQSKKMK---KKEVEQLLQSFVQD---KW-LIGR------NG----EYTLH------TRCIMELEHYILNTY G.g.----AENELELFRKTMDLIILSENGFAS--STDILNSAD----QLKTKKMK---KKEAEQVLKIFVDD---KW-LSER------NG----EYTLH------TRCIMEMEQYILSTY O.a.----AENELELFKKTMDLIIISENGFAP--SMSILNLSD----QLQTKKMK---KKEVEQLLHNFVRD---KW-LSER------GG----EYTLH------TRCIMEMEQYIRHSY M.d.----AENELELFKKTMDLIVESESGYVS--STSILNLSD----KLQSKKMK---KKEVEQVLQMFVQD---KW-LSEK------QG----DYTLH------TRCIMELDQYICEMY D.n.----AENELDLFRKALDLIIDSETGFAS--STNILNLVD----QLKGKKMR---KKEAEHVLQKFVQN---KW-LIKE------EG----EFTLH------SRAILEMEQYIRETY M.m.----AENELDLFRKALELIVDSETGFAS--STNILNLVD----QLKGKKMR---KKEAEQVLQKFVQS---KW-LIEK------EG----EFTLH------GRAILEMEQFIRESY H.s.----AENELDLFRKALELIIDSETGFAS--STNILNLVD----QLKGKKMR---KKEAEQVLQKFVQN---KW-LIEK------EG----EFTLH------GRAILEMEQYIRETY Nse3----HHHHHHHHHHHHHHHHHH-------HHHHHHHHHHH----------------HHHHHHHHHHHH------SSSSSS---------SSSSS----------HHHHHHHHHHHH S.c.--------GVLSVILCIVFFSK--NNIL-HQELIKFLETF-GIPSDGSKIAILNI-TIEDLIKSLEKR---EY-IVRLEEKSDTDGEV-ISYRI--GRRx----LESLEKLVQEIM S.p.--------GFLMTVIAFIAVSH--CSVG-HSELQSFLQEL---LTEEETTPLHLD--ITRSLSLLVRQ---GY-LDRVK---DDTHNQ-FVYYI--GSRx----IEGLKSFVTEFF D.d.--------TLLTIILSIIFLEN--GHVE-SPQLLQFLSVL---GFSQNEPHPVYGD-LEKLLEKFCRE---QY-LTRRKN--VVDNQ--IIWVYEMGQRx----KRFILNSISDIY M.o.--------GLYSMIVTIIQLNR--GELS-DPKLKRYLQRL---NAETNTPVEK----TDLLLQRLIRQ---NY-IVKTVERNAQGDDDAITWRV--GPRx----DEAMASIVRDVY D.r.----NPKMGLLFVILSVIFMK--GGTIK-ENLVWNTLKKL--RLDPGEKHDEFGD-VKKVVTEEFVRQ---KY-LEYGKI-PHTEPVE-YEFRW--GLRx----KLKLLEFVGELF X.t.----TSKLGLLMVILSLIFMK--GNTAK-ESAVWEMLRRL--RIEPAEKHSDFGD-VKKLITEEFVKQ---KY-LEYSKV-LHTDPVE-YEFRW--GQRx----KMQVLEFVSKIQ G.g.----TAKLGLLIVILSFIFMK--GNSAK-DSAVWEFLRRL--RVHPGEKHEVFGD-VKKLVMEEFVRQ---KY-LEITPI-PLTDPPE-FNFQW--GPRx----KKDILSFVAKMQ O.a.----TAKMGLLMVILSLIFMK---GSATNESVIWETLRKL--RVDTRERHEVFGD-VKKLVTEEFVRQ---KY-LEYNRI-PHTEPVE-FEFQW--GARx----KMQVLNFVAKGP M.d.----VAKMGLLMVILSLIFMK--GNSAR-ESLVWDVLKKL--RVDPEKRHKTFGD-VKKLVKDEFVRQ---KY-LEYIRV-PHSEPPE-YEFLW--GPRx----KMQVLRFVAKIQ D.n.----QPTTGLLMIILGLIFMK--GNCIK-ESELWRFLRRL--GVYPTKKHLVFGD-PKKLITGEFVRQ---RY-LKYQRL-PHTDPVD-YELEW--GPRx----KMKALKFVAKIH M.m.----TPISGLLMIVLGLIFMK--GNTIT-ETEVWDFLRRL--GVYPTKKHLIFGD-PKKLITEDFVRQ---RY-LEYRRI-PHTDPVD-YELQW--GPRx----KMKVLKFVAKVH H.s.----TPTTGLLMIVLGLIFMK--GNTIK-ETEAWDFLRRL--GVYPTKKHLIFGD-PKKLITEDFVRQ---RY-LEYRRI-PHTDPVD-YEFQW--GPRx----KMKVLKFVAKVH MAGE-------HHHHHHHHHHHHHHH-------HHHHHHHHHH-----------------HHHHHHHHHHHH------SSSSSS---------SSSSS----------HHHHHHHHHHHH A3--------KAGLLIIVLAIIARE--GDCAP-EEKIWEELSVL--EVFEGREDSILGD-PKKLLTQHFVQE---NY-LEYRQV-PGSDPAC-YEFLW--GPRx----YVKVLHHMVKIS A4--------KTGLLIIVLGTIAME--GDSAS-EEEIWEELGVM--GVYDGREHTVYGE-PRKLLTQDWVQE---NY-LEYRQV-PGSNPAR-YEFLW--GPRx----YVKVLEHVVRVN B18-------KTGLLMIALGVIFLN--GNRAP-EEAVWEIMNMM--GVYADRKHFLYGD-PRKVMTKDLVQL---KY-LEYQQV-PNSDPPR-YEFLW--GPRx----KMKVLEFVAKIH C2--------ENSLLIIILSVIFIK--GNCAS-EEVIWEVLNAV--GVYAGREHFVYGE-PRELLTKVWVQG---HY-LEYREV-PHSSPPY-YEFLW--GPRx----KKKVLEFLAKLN D1--------KLGLLLVILGVIFMN--GNRAS-EAVLWEALRKM--GLRPGVRHPLLGD-LRKLLTYEFVKQ---KY-LDYRRV-PNSNPPE-YEFLW--GLRx----KMKVLRFIAEVQ E2a-------KASLLALVLGHILLN--GNRAR-EASIWDLLLKV---xKPQRINNLFGN-TRNLLTTDFVCM---RF-LEYWPV-YGTNPLE-FEFLW--GSRx----KMEALKFVSDAH E2b-------TQEYVMPILGLIFLM--GNRVK-EANVWNLLRRF-----SVDVGRKHSI-TRKLMRQRYLEC---RP-LSYSN--PVE-----YELLW--GPRx----KMKVLEYMARLY F1--------RLGLLMMILGLIYMR--GNSAR-EAQVWEMLRRL--GVQPSKYHFLFGY-PKRLIMEDFVQQ---RY-LSYRRV-PHTNPPE-YEFSW--GPRx----KMEVLGFVAKLH H1----------SLLMSILALIFIM--GNSAK-EALVWKVLGKL--GMQPGRQHSIFGD-PKKIVTEEFVRR---GY-LIYKPV-PRSSPVE-YEFFW--GPRx----KLKVMHFVARVR L2--------KFGLLMVVLSLIFMK--GNCVR-EDLIFNFLFKL--GLDVRETNGLFGN-TKKLITEVFVRQ---KY-LEYRRI-PYTEPAE-YEFLW--GPRx----KMLVLRFLAKLH Ndn-------MTGLLLMILSLIYVK--GRGAR-ESAVWNVLRIL--GLRPWKKHSTFGD-PRKLITEEFVQM---NY-LKYQRV-PYVEPPE-YEFFW--GSRx----KMQIMEFLARVF MukE--------HHHHHHHHHHHHHHH-------HHHHHHHH------------HHHHHHHHHHHHHHHHHH------SSSSS-----------SSSSS------------HHHHHHHH- E.co------ELDMMVGKILCYLYLSP-xIFTQ-QELYDELL---xGSDVD---RQKLQEKVRSSLNRLRRL---GM-VWFMG----HDSS---KFRIT--ES--------VFRFGADV H.du------EMEMLVGKVLCYLYLSP-xIFSQ-DDVYEELL---xGSDLD---RAKLAEKVGGALRRLARI---GI-ITRVG---EQNSK---KFIIS--EA--------VFRFGADV V.ch------ELDMLVGKVLCFLYLSP-xIFTN-QELYDELL---xGSDLD---REKLFEKVRTSLRRLRRL---GM-VITIG-----DTA---KFRIT--EA--------VFRFGADV T.au------ELDMLVGKVLCYLYLSP-xIFSL-QDLQEEIV---xGTDLD---KKKLQERIRTSMRRLRRL---GM-VTALG-----TGD---KFRVN--EA--------VFRFAADV Oce-------ELEMLVGKVLCYLYLSP-xVFSV-EDLQEEIL---xGSDLD---KRKLADKLKSAIRRLKRM---GM-VSSVG-----SQD---KFRIT--EA--------VFRFAADV Y.pe------ELDMMVGKILCYLYLSP-xIFSQ-QELYEELL---xGSDLD---KQKLQEKVRTSLNRLRRL---GM-IYFMG----NDST---KFRIT--EA--------VFRFGADV H.in------ELEMLVGKVLCYLYLSP-xIFST-QEVYDELL---xGSDLD---KQKLAEKVRAAIGRLRRL---GM-IQTVG---EQNSG---KFTIS--ES--------VFRFGAEV A.sa------ELDMLVGKVLCYLFLSP-xVFAM-GELQEEVL---xGTDLD---KKKLLEKIRTSMRRLRRL---GM-ITALG-----NSD---KFRVN--ES--------VFRFAADV P.mi------EMDMLVGKILCYLYLSP-xIFTV-QELFDELR---xGSDLD---LQKLQEKMRTSLNRLRRL---GM-ISFLP----NDTQ---RFSIT--ES--------VFRFGADV S.ce------AGEMLVGQTLALLYLDP-xLVAR-EALLQRLS---xDERVA---AETVRAQVGEALRRLADL---GF-VDLLD-----EA----RLRLR--PA--------LMRFAEPV MksE------HHHHHHHHHHHHHHH--------HHHHHHHHHH------------------HHHHHHHHHHH------SSSSS-----------SSSSS----------HHHHHHHHH Bac-------KDETIWLLVLRLIYE--xPFTT-LQEIKGKYET----FRLTFVS-------KTKLRELVQMGKQNQL-LRPID--NDIELDDC-RFQLF----------HSCIHVLQQ Eub-------LLTARIVIIMKIIYR---xTTN-LAEIREYGRN--TNLITRKLT-------NQEWSDALLLMKTHQM-IELPG-AIANLEDNTPIYIYG---------TVNIFCSAMD Geo-------TEHSLFLLVLRVIYE---xFTD-SEVMLDTFVA--HTGRKRPG--------LVRLREILRTFSRQGL-LEIDE---DEDKAI--RFRIR-----------PSIRDIVT Pse-------RLALFTFILVEHLAD---xPLL--EKYRDLFLQ------AEVQT-------QEELEEKVMRR-LTQL-GFASE---DSG-----VYRFM---PP-----MHRFLDVCL Aci-------RDFVAAVIALRFLYT---xAIS-LEELSQAVVS--LLAHKLPNAASE----RMVLLRELRKH---RV-LHFVE-GDDAGDMQMGLAVLR---PVMSFVSDEALEEALR Cor-------HFDTLIILILRQELT---xIVD-REEIREQVLL---YRVDEERDEAKL---AKRFDAAFRRI--VDYSLAKKT----ETPE---RFEVS---PALRQ--IFDADTVAG Pho-------LYDTLLLLVLRKHYQ---xIID-IERIESHLTP--FLPLTNSTKSDRRK--LKGALDKMVTK---KI-LSSVR-----GSED--RFEIT---PVIRY—-VVSAEFLES
Streptococcus pneumoniae = S.pn, Geobacillus stearothermophilus = G.st, Chlorobium tepidum = C.te, Mycobacterium tuberculosis = M.tu, Bacillus subtilis = B.su, Mycoplasma genitalium = M.ge, Fusobacterium_nucleatum = F.nu, Pseudomonas_aeruginosa = P.ae, Treponema_palladium = T.pa, Thermotoga_maritima = T.ma, Prochlorococcus_marinus = P.ma, Thermosynechococcus_elongatus = T.el, Chloroflexus_aggregans = C.ag, Archaeroglobus_fulgidus =A.fu, Pyrococcus_abyssi = P.ab, Thermoplasma_volcanium = T.vo
Escherichia coli = E.co, Haemophilus ducreyi = H.du, Vibrio_cholerae = V.ch, Tolumonas_auensis = T.au, Oceanimonas GK1 = Oce, Yersinia_pestis = Y.pe, Haemophilus_influenzae =H.in, Aeromonas_salmonidiae = A.sa, Proteus_mirabilis = P.mi, Sorangium_cellulosum = S.ce,
Schizosaccharomyces pombe = S.p., Saccharomyces cerevisiae = S.c., Dictyostelium discoideum = D.d., Magnaporthe oryzea = M.o., Danio rerio = D.r., Xenopus tropicalis = X.t., Gallus gallus = G.g., Ornithorhynchus anatinus = O.a., Monodelphis domesticus = M.d., Dasypus novemcinctus = D.n., Mus muscullus =M.m., Homo sapiens = H.s.
Bacillus_cereus_B4264_Type2 = Bac, Eubacterium_rectale_ATCC33656_Type3 = Eub, Geobacter_metallireducens_GS-15_Type4 = Geo, Pseudomonas_aeruginosa_PAO1_Type1 = Pse, Acidovorax_delafieldii_2AN_Type7 = Aci, Corynebacterium_glutamicum_ATCC13032_Type5, = Cor, Photorhabdus_luminescencs_laumondii_TT01_Type6 = Pho
ScpB, MukE, NSE1, NSE3 alignments are based on crystal structures, MksE structure was predicted using I-TASSER, hydrophobic pattern from crystal structures (highlighted hydrophobic residues are mostly intramolecular keeping the WH fold) have been used as master pattern for alignment of the other sequences (based only on secondary structure)
Figure S2, related to Figure 2, Part 2 Structure based sequence alignment of kite WHB domains.
Figure S3, related to Figure 3 Comparison of kleisin/kite and kleisin/heat interactions.
Cartoon representation of (A) a ScpAB (PDB: 3W6K), (B) a MukEF (PDB: 3EUH) and (C) a Scc1/Scc3 (PDB: 4PJU) sub‐complex illustrating extensive interactions of kite and heat proteins with an extended kleisin‐peptide.