Appendix 20f for Section 3.4.2.1 - Alignment of DsHps2 protein with best BlastP matches

DsHps2 was aligned with closest BlastP matches to confirm the gene model in Section 3.4.2.1. The alignment suggested that the gene model is correct.


Ds	MP--YTNEPIAVVGSGCRFPGGSTNPSRLWSLLQEPRDVLRKIDRFKAENFYNEDGHYHG

Ko	MPPSYKNEPIAVIGSGCRFPGGSTSPSRLWDLLRDPRDVLRKIDRFRADNFYNTDGHHHG

Od	MP--YTNEPIAVVGSGCRFPGNSSSPSKLWDLLQDPRDVQRKIDRFRADNFYNKDGHYHG

Ve	MA--YTNEPIAVVGSGCRFPGESSNPSKLWALLQDPRDVQQKIDRFRADNFYNKDGHYHG

	1                                                          60

Ds	ASNVLHAYLMAEDVKVFDHQFFNIPQSEAEAIDPQQRLLMETVYESLESAGLSISSLSGS

Ko	ASNVLSAYLLAEDTKLFDTQFFNIPLSEAEAIDPQQRLLMETVYESLDTAGLSMESLSGS

Od	ASNVLSAYLLDEDPKLFDSQFFNIPLSEAESIDPQQRVLMETVYESLEAAGISVESLSES

Ve	ASNVLAAYLLSEDTKLFDAQFFNIPLSEAEAIDPQQRLLMETVYESLETAGISMETLSGS

	61                                                         120

Ds	NTGVYVGVMCDDFNQIAFGDPEHVPTYAATGTSRSILSNRISYFFNWRGPSMTIDTACSS

Ko	NTAVYVGVMCDDFSQIVYGDSENVPTYAATGSARSILSNRVSYFFNWHGPSMTIDTACSS

Od	NTAVYVGVMCNDFEQIVYGDPENVPTYASTGSHRSILSNRISYFFNWHGPSMTIDTACSS

Ve	TTAVYVGVMCNDFEQIVYGDSENIPT----------------------------------

	121                                                        180

Ds	SLIAVHQAVQVLRSGECPVAIAAGSNLILGPTMFVAESNLNMLSPHGRSRMWDASANGYA

Ko	SLIAVHQAVQVLRSGEAPVAVAAGTNLIFGPTMFVAESNLNMLSPTGRSQMWDANANGYA

Od	SLVAVHQAVQALRSGEAPVAVAAGSNLIFTPTMFVSESNLNMLSSTGRSRMWDAGANGYA

Ve	-LIAVHQAVQALRSGESPVAIAAGTNLIFGPTMFVAESNLNMLSKDGRSRMWDAGANGYA

	181                                                        240

Ds	RGEGVASVVLKTLSAALRDGDHIECIIRETGVNQDGRTPGITM-------PSADSQATLI

Ko	RGEGVGSIVMKTLSAALRDGDHIEYIIRETGINQDGKTPGITM-------PSSSLQAALI

Od	RGEGVGSIIMKTLSSALRDGDHIEYIIRETGINQDGKTPGITM-------PSSELQATLI

Ve	RGEGVGSIVMKTLSAALRDGDHIEYIIRETGVNQDGKTPGITMRKICIGKSSSSMQAALI

	241                                                        300

Ds	RDTYARAGLDPTKSTDRCQFFEAHGTGTPAGDPQEASAIYKVFFESKDQRLQDSTDVAAK

Ko	RDTYARAGLDPSKRGERCQYFEAHGTGTPAGDPQEAGAIFKAFFSETDAKA--DGDEASE

Od	RDTYARAGLDPLKRRDRCQYFEAHGTGTPAGDPQEAGAIYRAFYADKKD----ASD---H

Ve	RDTYARAGLDPTKRRERCQYFEAHGTGTPAGDPQEAGAIYEAFFANKKDKLAASLD---D

	301                                                        360

Ds	DETKADDVLYVGSIKTVVGHTEGTAGIAGLLKASLAVQNKCIPPNMHFSKLNPNIKPFYG

Ko	AGFEGENILYVGSIKTIVGHTEGTAGIAGLMKASLGIQHKMIPPNMFFNRLNPAIEPYYG

Od	DDFDDEDILHVGSIKTIIGHTEGTAGIAGMMKAGLAIQNKAIPPNMHFTKLNPDIEPFYS

Ve	EGFEGEDILYVGSVKTIIGHTEGTAGIAGLMKASLAIQNKTIPPNMHFTRLNPDLEPYYG

	361                                                        420

Ds	NLKIPVRALEWPDLPSGTPRRCSVNSFGFGGANAHAIIESFEPALHEGAPSPSQLQI-TE

Ko	NLKVPVDLQEWPQLPPGVPRRASVNSFGFGGANAHAIIESYEPASP------APVKALLP

Od	NLQVPVQLRDWPDIPIDVPRRASVNSFGFGGANAHAIIESYEPEKA------APRRA-IT

Ve	NLKVPVQLRDWPDLPAGVPRRASVNSFGFGGANAHCIIENYEPSFH------TQAKT-IE

	421                                                        480

Ds	KAASTSPWPFVFSASSEKSLVAQLKSYMDFIDEGPDIELSTLSWSLFRRTAFNFRVAFSA

Ko	AASSPVPYPFVFSATSERSLAAQLRSMASFLDQNHDFDLCNLSWSLFKKTAFNFRISFSA

Od	ATLSPTPYSFVFSGTSEKSLVAQLRTYLAFLDDNLNFDLGRLSWTLFRRTALSYRVAFSA

Ve	TSSASTPYTLVFSATSEKSLAAQLKSYLGFLKDNPDFDVGSLSWSLFRRTAFNFRATFSA

	481                                                        540

Ds	TSFKALAVQIEKALGDFEGKKGPLGVRVNPKTSRDILGVFTGQGAQWATMGKELILASPI

Ko	HTAVELITKIHAALDEAEANKAALGVRVNPKTPRQILGVFTGQGAQWATMGRELVLSSHM

Od	DSVSSLAAKIAEQLEKAE-KKEDWCSRANPKTPHEILGVFTGQGAQWATMGRELIIASHF

Ve	NSVSSLISQIKEALERTETKKNPLGIRVNPKASREILGVFTGQGAQWATMGRELILVSHF

	541                                                        600

Ds	AASVIDNLERSLSALPDGPAWSLRAEIFATPDKSRIGEAAISQPLCTAIQIMVVDLLSL-

Ko	AESVMDELEKSLAELPDGPDWSLKQELFAPKEQSRIAQAAFAQPLCTAVQIMVVDLLRKS

Od	AESIVNDLEASLAELSDAPEWSLKAEMLASKENSRIAEGVISQPLCLAVQVMVVELLRQ-

Ve	AESIIDDLESSLAELPDGPEWSLKAEMVASKQESRISEGVISQPLCTAVQIMVVELLRQ-

	601                                                        660

Ds	AGVRFSAVVGHSSGEIACAYVSGFLSATDAVRVAYYRGKFAPLAKG-----GAMVAAGTD

Ko	SGVTFNTIVGHSSGEIGCAYAAGFLSARDAIRVAYYRGKFAKLAKAESGKPGAMIAAGTD

Od	AGVRFSAVVGHSSGEIACAYVSGLLSAKDAIRVAFYRGKYTPLAKG-----GSMIASGTD

Ve	AGIRFSAVVGHSSGEIACAYVSGFLSAKDAIRIAYYRGKYTQLTKG-----GSMIAAGTD

	661                                                        720

Ds	TLDALDLCSLPKLKGRAQLAADNSSASVTISGDADAIDLVEIIMKDESKFARKLKVDTAY

Ko	MDDVNYFCNLPKLKGRAQLAASNSSASVTISGDADAIDLIEVVMKDESKFARKLLVDTAY

Od	MQDAIDLCSLPKLKGRAQFAASNSSSSVTISGDADAIDLVEMVMQDESKFVRKLKVDTAY

Ve	MDDAIDLCSLPKLKGRAQFAASNSSASITISGDSDAIDLVEMVMQDESKFVRKLKVDTAY

	721                                                        780

Ds	HSHHMRVCSDPYLEALNKCDIIISRPSLDAPRWHSSVVRGNCQVTSAMSASLKGPYWRDN

Ko	HSRHMLPCGEPYMDSLRRCDIQILTPSEDAPAWYTSVHADNQRVTMEMARELKTTYWRDN

Od	HSFHMRVCSEPYIESLEKCGIQILDPAPDACPWYSSVTDGNERVTMDAASVLQSTYWRDN

Ve	HSFHMSVCSEPYVKSLEQCSIRIQTPSADACPWYSSVTEDNARVTMDSASSLKDIYWRDN

	781                                                        840

Ds	MLHPVLFAPALRAAVEISGCPGLVLEVGPHPALKGPATLTIEETTGSDAPYHGTLSRGQN

Ko	MVQPVLFSQALKAAVA-DGAPGLVLEVGPHPALKGPASLTIEEATSTDVPYFGTLARGKN

Od	MLNPVLFSQALQAAVASNGIPGLVLEVGPHPALKGPASTTIEEAVGSSVPYFGTLARGQN

Ve	MIMPVLFSQALRAAITATGTPGLVLEVGPHPALKGPASMNIEDVLGSSVPYFGTLTRGRD

	841                                                        900

Ds	DALAMTSTLGSIWSVLG-ASDITFQEFQRAFDAKATFALSKALPGFSWDHEKVIWNETRG

Ko	DALALASFIGDIWATQGAAAGINLQGFLRAFIKDATFEVSKELPTYVWDHERVVWNETRI

Od	DAIALTNTIGSIWTILG-ASDIDFQGYHRAISKDAMFEVSKKLPTYTWDHEKAIWGESRV

Ve	DGLALAATLGSIWTILG-PSGIDLQSFHRAFVKDATFEISKMLPAYAWDHDKTVWNEARV

	901                                                        960

Ds	SKAHRQRATPKHLLLGVRSVFETEGELCWRNYLQPKEMPWLKGHQIQGQIVFPAAGFATM

Ko	SKAHRLRSHATHELLGAHSVDDVEGEHRWRNYLKPKEMPWLKGHQIQGQMVFPAAGFAVM

Od	SKTHRLRTETKHELLGVRLSDEVEGELRWRNYLKPKEMPWTTGHKIQGQMVFPAAGFATM

Ve	SKAHRLRSNAKHELLGVRLVDEVEGELRWRNYLKPKEMPWLTGHQIQGQMVYPAAGFATM

	961                                                        1020

Ds	AFEGARSLAPFDTIRSMQLQNFVIHKALSFMDENSSIEHVFKLFNINKAADAIEATFSCY

Ko	ALEAARTLAPFDTIRLMELQSFSIHKALSFMDENASIETIFSLANVQEEGPYKTADFACY

Od	ALEAARNLAPFETIRLMELQNFSIHKGLSFFDENSSVETIFILSNILKQDGAITADFGCN

Ve	ALEAARNLAPFETMRLMELKNFSIHKGLSFLDESSNVEIIFVLSNVRSKGDSITADFVCN

	1021                                                       1080

Ds	ACIGKESLDIGLVADGLLRLELGTPSADALPSRARWADHFVPTQTDLFYKSLADTGYGYT

Ko	ACMNKDAGEFTSMASGVVKLTLGARAENALPERPRWVNNFVDTNVEFFYDSLATLGYGYT

Od	ACLNKDAGDFSSMASGKVLLTIGEPSKDALPERPHWPNNFVDTNVEYFYEELANLGYGYE

Ve	ACLNKDTGEFSSMASGGINIILGEPAYEALPERPHWPNNFIDTNVEYFYEELAALGYGYT

	1081                                                       1140

Ds	SMFKGITDLQRTNDGSRGVITIPQDEDSTPLTWVIHPAILDVAFQGVFAAVGAPGDGRMW

Ko	GMFQGVTELQRTNGGSKGTIVIPWDEDSAPQQWVIHPATLDVAFQAVFAAVGAPGDGRLW

Od	GMFQGVTELHRTNSGSKGVLNIPQDPDSAVQNWVVHPATLDVAFQAVFAAVGAPGDGRLW

Ve	GMFRGVTDLKRTNGGSKGILSIPQDENSTPHNWIIHPATLDVAFQAIFAAVGAPGDGRLW

	1141                                                       1200

Ds	TLHLPTTVESITINPSACELASGGVDVPLPFDAVHHSVEGHVNDIPGDVDVYDEDGLHAI

Ko	TMHVPTMIESITINPSVCEISS-GVETPLPFDACL--AQAEQGGIAGHVDLYDEDGKHAV

Od	TLHVPTMINSITVNPSAFEDTS-GVETPLPFDACL--VNAIDEGIAGDIDVYEEGGKQTI

Ve	TLHVPTMINSITVNPSALRFSS-GIESLLPFDAFL--VDAIDDGIAGDVDIYDEKESHAI

	1201                                                       1260

Ds	IQVQWLHASPIRKHTVADDRETFAAMTWGLALPDLSEDWTPPTMTIDEEKIAVFAERLSL

Ko	VQVQNLHVTPLQKPTTADDRDTFAATTWAPAYADLVTDWTEWKYNNNEEKVASFAERLSL

Od	VQIQGLNVTPLSKPTPDDDRETFAAMDWEPALPSLATNWTEAKPTADEEKIANFAERLSL

Ve	VQIQGLNVTPLSKPSSADDRQTFAAMEWDLAYPDLTVNWAPAVISDDQEKVAKFAERLSL

	1261                                                       1320

Ds	VILRQLCEAASSEHIESNGTEHQRAILSWAKDVVATTHAGAHATCHRNWLLDTWDLLAPA

Ko	YIIRHLADTVTNESIE-EGTEHQRAVLEWAQHVVETVRSEKHPTCPKQWLADTWEVLKAP

Od	LVLRDLCKETTINDVRTNGTENQRAFVDWAEHIIDTVRAGEHPTCPKQWLADTWEILQEP

Ve	FVLQELCDTVSVEQVEGTGNEHQRSILEWAQHVIQSVRYGKHPTCAKAWLADTWEVLERP

	1321                                                       1380

Ds	AERLAVSNPQIRCCLQAKDRLTDLLYNDSSAPTESDVSISQPGDDLYATLPHVGEYTSRL

Ko	AERLAKINPQVRHCLWVKQRLEPFVRGTLDI--EDELRTTKAMESLYLGIPGFRAYNKRM

Od	AERYAPSNPQVQACLDAKSRLRLYMKGRLSA---DDTGF---GQGFLACIPFHQAYVERL

Ve	AKRLAQINPQVSLCLWIKERLGPFVQGEVDI--EKELET---IQDLFDTIPFQQTYIERL

	1381                                                       1440

Ds	ATLVEQITFRHRNLRVLELGTGKSNTTRALLDVLKQNFTSYTYTATSDSGFEEVKASLSP

Ko	AQLVDQLAFRHHNLRILEIGFDKGVTTEAVLKVLGDNVMSYTCTDLEKGNFDDIRARLPE

Od	GDLVNQISFRHRNMRILEIGTGDGFLATKILGTLGDNFTSYTCTNVDEAHFDNIRSQLDG

Ve	CGLVEQIGFKHRNMRVLEIGTGKGDLTSAVLDVLDDNFTSFTCTDIEGRHFDGLRSRFAG

	1441                                                       1500

Ds	DQLPRVFFESLQIEEDLADQDVAIGAYDLVIAANVVHRAVDIGQTLRNVRSLLRPGGFLA

Ko	DQAEKFVFKPLDIEQDPAEQGFTAGYYDLVIASNTLHRAPVLSDALAHARALLRPGGYLA

Od	PYSARVLSMDLDVGEDPTEQGFNKGYYDLVLAGNTLHDARDLRQSLIHIRSLMRPGAHLA

Ve	PRADRMMFKSLDLEQDPKEQGFVLGYYDIVIASNSLHNSPDLEQSLSHARSLLRPGGYLA

	1501                                                       1560

Ds	FQEPTNSDSLAIAIRGSVSPSWFSCIENYRVQSPIISQIQWNALLRGAGFSGIDTATPEA

Ko	LLEPTSDRSLALGLSACLQPSWLAGIEEHRKFSPLCSQKTWDNLLRDSGFGGIDTATPPE

Od	FLEPTSSKSLAVALGGCLSSNWFAGVEEERKHSPLLSQQVWDDFLRDAGFSGIDTSTPEE

Ve	LLEPTNNKSLAVSLGGCLRPGWFSGIEGDRRYSPHISQKAWDSVLRDAGFSGIDTATPEE

	1561                                                       1620

Ds	SVLTTPFSVMCATAEDSQMSIINNPLSFAGKEYMDADLLILGGESIPTLRLVQELKTTLA

Ko	HTFWVPYSVMCAAAVDAQMAVVRDPLAYAGEKKLDADLLIIGGQRIQTSRLVRGLTNLLS

Od	RTHAVPFSVMCSMATDGEMDLIRDPLAFTGQKKFNNSVLIIGGATMRTKQLVRGVEKALS

Ve	GTFIVPFSVICSMAVNKEMEFIRDPLAFAGQEQFKGDLLIIGGQTMRTCRLVRGLKKLFA

	1621                                                       1680

Ds	PFFDEIITLSRLTDLDDSIIDRSPTVISLVELDEPVFKPFTEAKLKAVVRLCDSLKRILW

Ko	PFFNNVINFETLAEVDDETLASNPTTISLLELDEPLFKPFTEEKFKAIVKVCDSLSRMLW

Od	PFFQDVVVHETMVTVDAQTIASAPTTINLGELDEPLFRPFTEEKFKAAVKLCDNLQSMLW

Ve	PFFQSIVHAETLVEVDDATIAGQPTTISLTELDEPLFQPFTEDKFKAIVKLCDNLQSMLW

	1681                                                       1740

Ds	VTKGSRGEDPYMSMMTAVGRCLVGETPTLRLHFLSFDGGARPTSSVLAHHALQVHLTYNI

Ko	VTVGSRGENPYMNMMVGVGRCLVGEMPNLRLQFLNFDGADRPTPSAVAHHLLVLHLTHGF

Od	VTVGSRGEDPYMNMMTAVGRCLEGEMPSLRLQFLNFDGNDRPTPETLAYHLLRLHMTHGI

Ve	VTAGSRGENPYMNMMVAAGRCLEGEMPHLRLQFLNFDGGDKPTPNLLAHHLLRLRLTNQL

	1741                                                       1800

Ds	SGE--PHKTTEPLCTIEREMSYQNSRLLIPRYLPARAINDRINSERRYITRDVELATTVI

Ko	SND--SIKLHDPLFTNERELSVSNGVLLLPRYLPVEAINTRLNSDKRLITHHLDHTKTPV

Od	NARPGTKKAGEPLYTIERELTIQNGTLLIPRYIFSDTINTRLNSDRRLITHNVEQDHTAV

Ve	SGM--SKKPGVPLYTIERELTIRNGTLLIPRYLPADAINTRLNSDRRLITRQADQSKVPI

	1801                                                       1860

Ds	RAALSNTGFHDGTSYDLMEV----------PQLLVDADQR--KIAVHKSNFAAVLVGNAG

Ko	ALDAAA------AGYKLLER----------LQTETVDAAASVTVHVEKSLLNAIRIGGAG

Od	ELDTSN------TAYKLLERELIGDHELFKNHP---ADIRSIRVTVTKGLLNAIKINAAG

Ve	ELDTLK------SSYRLLER----------IHP---PSDGSIKVTVSKSLVNAIQVSNAG

	1861                                                       1920

Ds	CLHVSVGRDVQSGRKVIALSDSLQNVVAVPSVNVAFTDTPDEDNPVLVQSIATELVARSL

Ko	CLVPVLGS-VGSGKKVVTISEGNQSIVTVARASAVEVDVADADAAGFLLHTAAGLLAVTI

Od	CFHVVIGR-TSEDSKVIALSEHNQSLISVPDSQVIEVDVDDGDEQNLLLQVAAELLAGSI

Ve	CLHVVLGR-TQNGKKVVGLSESNGSIVSVQPSHMVEIAAVE-DDANLLLQVASELLAATI

	1921                                                       1980

Ds	LSRA-PGAVLILNPGPLLAAICQSLACELGKSLLMLSTSPSLAGATYLHSASPDRVVSAA

Ko	LGSATAGAVLVHEPTAVLAASLKRLAAKMNKTLLMTSVSSDIAHVQQVHPSTTDRVLARV

Od	LSDA-TGSVLVHEPSQALANSLSGL-WHTGKSVTITSTSPTVRGATVIHPSSPDRALARA

Ve	LDGA-SGSILIYEPSSTLANSLTSLANETGKVVTISARSSVIRGAKIIHPSCPDRVLART

	1981                                                       2040

Ds	LPKDVTTFADLSEDLGLPGKSVLSAKIRKLLPASCTTLDASHLFCRQGFNIGSVQPD-VL

Ko	VPKGTTQFVDLSEAIESKG---IGARLAKLLPLGCECKTSSHLFSPRAFATGSVDASRTL

Od	VPQDTTLFADFSQN---------GSRLDSFLPIGCTPKTTTDYIRPSAYVVGTIDAD-VL

Ve	ISRGTTLFADLSKT--------GSSRFERFLPIGCATMKASDLYSPRAFFHSPIDVV-VL

	2041                                                       2100

Ds	RQAVISGSNRLSLYHGKIDVLPASRIISLPAPSLGMNVIDWLVDDQLPVIVAPAHDTIRF

Ko	GPVVQEALASLDKPQSADHVIRASELAAQDLRPTGLVVVDWKADTTLPVSIVPADETIRF

Od	PGAVRRAFLSYRRWTEEPDVTPASELHEKPIGALDAQIVDWTADKTLPVAVEPADVSTHF

Ve	NRAVQRALLSFRRRMDQPDITPASALLSNELFSPGLQIVDWASDKTLPVAVIPADESIQF

	2101                                                       2160

Ds	RGDRTYLLVGLAGQLGLRLTKWMVVRGARHIALASRNPQVDSDWLQDIQADGTAVRTFAM

Ko	RSDRTYFMVGLTGELGLQLTKWMVQHGARYLALSSRNPSINAEWLEVVQAEGAVVKTYAM

Od	RSNRTYFMVGLAGELGLQTIKWMVSRGAKYIALSSRNPKVDAGWLEYVASQGATVKLYAM

Ve	RGDRTYFMVGMAGELGLQTIKWMVLRGARYIALSSRSPKVDAGWLEHIQSQGAVVNLYAM

	2161                                                       2220

Ds	DVTSRKSVQSVCKQVSAEMPEIVGVAQGAMIIIDGLFANKTFADFEKTLKPKVDGTVYLD

Ko	DVTSRESVHAVHKQVCAEMPPVAGVANGAMILIDGLFANKTHAEFDKTLRPKVQGTVYLD

Od	DVTSRDSVRAVHKQICAEMPPIAGVMNGAMILIDALFSNNDFATFDKVLRPKVDGTVYLD

Ve	DVTSRASVRAIHKQICAEMPPIAGVMNGAMILIDALFANNDHATFDKVLRPKVDGTLFLD

	2221                                                       2280

Ds	EFFNKDTLDFFMVFSSLAATAGNVGQSAYATANQFMNSMVANRRMRGLAGSAINMPGIIG

Ko	ETFGQGDLDFFVVFSSLACVSGNMGQTAYAAANAFMCSLIAGRRMRGKAGSAINMPGIVG

Od	EVFNNNDLDFFIVTSSLASVSGNIGQTAYAAANAFMCSLIAGRRMRGLAGSALNMPGIVG

Ve	EIFSQDDLDFFIVTSSLASISGNIGQTAYAAANAFMCSLVAGRRMRGLVGSAMNMPGIVG

	2281                                                       2340

Ds	LGLLNRHATAAYHLKAAGYDHISEWDFYQFFSEAVFAGRPESGSNFEITAGLTACEVDTM

Ko	LGYLNRDPRKLDRLKNVGYVNISEWEFYQFLSEALVAGKPDSGMNPEITAGLQRIDMERN

Od	LGYLNRDPRKLWRLKKIGYVNISEWEFFQFFSEAIVAGRPDSGLNPEITAGLQRSDVSTV

Ve	LGYLNRDPRKLWRLKKVGYVNISEWEFFQFFSEAIVAGRPDSGRNPEITAGLQRSDVAGV

	2341                                                       2400

Ds	ENPPPWTKQARFNPLRKVRTDQGTAASHETSTVSVRAQLADMETEADVHDLLMEGLLDTL

Ko	PNPPIWVNTPRFGWLQMVKPSGEASGGGDKDGSSVRSQLAELTNEEDIHKLLLDGLLNTL

Od	EDPPHWFLTHRFSTLHKVPASGGDATDGDKDGATTRNKLAELTNEKDVNSTILEDLINVL

Ve	EDPPHWFLSSRFSTLIKVPASGNTIISSDTGSSSVRSKLAELTDVADVHDAVLDGLINIL

	2401                                                       2460

Ds	YSRLKMNPEERGITPDTAIVELGVDSLLAVDMRAWFTKELDLDMPVLKLLGGATVRDLVE

Ko	YSRLNMNPEERGITPDTAIVELGVDSLLAVDMRAWFTKELDLDMPVLKILGGATVNDLVD

Od	YTRLNMDPSANAVTSDTAIVELGVDSLLAVDIRAWFTKELDLDMPVLKILGGATVQELVD

Ve	CLRLNMDPAQRAITPDTAIVELGVDSLLAVDMRAWFTKELDLDMPVLKILGGATVEELVE

	2461                                                       2520

Ds	DAVQRLSPALTPKLAR-------ADDSANAEAEDGPVEGVDATEDAEAEAEVGEDVAEQA

Ko	DAIKRLAPELVPNLKRGGETPAAAADAADTDENAASGDAVDAEESGETPVEAGPEAEEDT

Od	DAVGRLDSQLVPKLARDGEKQTTSGDEPAAPTND----PAADAALAESTS---IEESSSA

Ve	DAVKRLSAEIVPNLARDG-KASTEPEEKYASTKD--GEPSKAIEVSQETSDVAIELNQPT

	2521                                                       2580

Ds	T---------EELPMTIEPLELPSFED--------------LGASQ-FKMVIELGESR--

Ko	TPPVEITPAPEEEPLVIAPVDLPYFGDQEDHEDVVAAVAAWQTRDTLWPGASYLTPGSDI

Od	E-------ELAEEVLTVAPLDLPVIEEADD-------LHSDVAEQD-W-STASDSPSS--

Ve	T-------KPEEESITIAPLDLPSFAESTEESTSVPYTQSPLAQQS-WNSSGYPSPSS--

	2581                                                       2640

Ds	-RQQDDGGASDQSSGSSLAGRTPDTSLPSVSTPGSDVHAV-------TKGILEHVEPSRG

Ko	ASFYGSASEDDQQSN-----GTPITSGSEIDAEEAKPDEE-------PEDEVEEIEMEEH

Od	-RGDDSSDAAESSSGT----QTPITEDTDSEPEEPKQKEENAAQQEVAEEVITSEERTKP

Ve	-RAYDSSEDADQISGT----QTPITSDSEADIEEVKCRTE-------ATGLGADIETLEP

	2641                                                       2700

Ds	VATLFSPLPTHSDEQPVFVKKVPMSYGASRFWYLSQWVQDPRVFNLITHFKFTGKINQKE

Ko	VAQLHSHLPERLEHPLEFVKKTKMSYGTSRFWFLLQYLEDPTTFNIMCQLKISGPMRFDD

Od	IAKLFSNLPIRPAGELEFVKKAQMSHGTSRFWFLMQYIQDPTSFNLLAHLKCTGHVDFNT

Ve	VAKLFSSLPEHSEEQPDYIKKTTMSYGASRFWFLMQYLQDPTTFNLLAHLKCTGSVDDVK

	2701                                                       2760

Ds	AERAVIDLGQRHEAFRTAFFADHENMNEPTMGVLEHTKLHLERRYDATEADFEAEMEELL

Ko	CDRCITELGNRHEVFRTAFFADPERMNEPTMGVLKQSPLRLERRSATSEEEIQAEVDELL

Od	AEKAVRELGNRHEIFRTAFFADETRGNEPTMGVMKESPLQLECRSPVTEGDIDAEVDELL

Ve	ADRAIRELGRRHEIFRTAFFADTERMNEPTMGVLKESPLRLERRGPVSSADIDAEVGELL

	2761                                                       2820

Ds	QYDFKLEQGETFRMKLVSLD-DNTHYAVFGFHHIAMDGFSFNIILSDINALYDRAPIDPV

Ko	NYEFKLEQGETIRVKMISLDNDQTHHVIFGFHHIVIDGFSFNRLLPELNSLYDNEPLEPV

Od	NYEFKLEQGETIRAKILSLD-ENTHHVLFAFHHIAMDGFSFNILLAEVNQLYDGQKLAPV

Ve	HYEFKLEQGETIRVKMLSLD-ENTHHVLFAFHHIAMDGFSFNVLLSEINKLYEDQPLGPV

	2821                                                       2880

Ds	RLQYSDYAIRQRAQVTDGTLDEDLQYWRAMYSSQAANGEVVGDFPEPLPLFKVAHGVRQP

Ko	ETTFSDFATRQRLAVTSGSLDKELEFWKNMYSVKLPSGEVKPDYPEALPLFSVAQSPRRS

Od	SMQFTDFADRQRKQIADGSLDGEFQFWKDMYSQKLPSGEIKPDFPEPLPLFNLAQSTRQS

Ve	TMQFTDFAIRQRQQVTDGTLSKDFQFWEDMYSIKLPSGEVQPDFPEPLPLFNLAQSPRKS

	2881                                                       2940

Ds	LDDYGFEEAKLILDTRTARQIRAQCRRHKLTTFHYFLSVLRTFLFRHLDIEDLVIGIADA

Ko	LSNYEFNECQLTLDLRTVRQIKAQCRRHKITSFHFFLGALRTFLFRHVDVDDLVIGIADA

Od	LDNYEFEEASVVLDSRILRQIKTQCKKHKITPFHFFLGVLRTFLFRQLDVDDLVIGIADA

Ve	LDNYESEESQLVLDARTVRQIKAQCRRHKITTFHFFLGVLRTFLFRHLDVNDLVIGIADA

	2941                                                       3000

Ds	NRIDADIQQTVGFMLNLLPLRFKTADEDADAAFKTIASAVRSKVYEALSHSKVPFDALLE

Ko	NRADRTLDTTVGFMLNLLPLRFRSKDQDKNTPFREIATQARDVAYSALAHSALPFDALLE

Od	NRTDDSVASTIGFMLNLLPLRFKNDVADQNISFKDVAAEARKTAYDGLAHSKLPFDALLE

Ve	NRTDKSVDNTMGFMLNLLPLRFKNDDHDKTATFKDIVTGARKTAYDALAHSKLPFDALLE

	3001                                                       3060

Ds	KLSIPRSTTHSPLFQAWMDYKMFQPGYRPKLFGAEVYGAANPGKNGYDLTLEIVEADRNE

Ko	KLDVPRSTTHSPLFQAWMDYRPFSSDKQLKLFGAEVDGHPTVGRNGYDLTLDVNEVAGSE

Od	KLNIPRSATHSPLYQVFLDFRPFRPEHMPTMFGGEASGDQTVGRNGYDLTLDVNEIDGSE

Ve	NLNIPRSATHSPLFQVWMDYRPFRPDYMPTMFGGEATGTQTVGRTGYDLTLDVNEINGTD

	3061                                                       3120

Ds	IHVALRMQKHLYSSSATQRLLDSYMQLVKAFAANFDSPVSSVGLWDPNTIEAVQKLGHGP

Ko	IRLSFRTQKYLYSPEATTLLFDSYLRLVKAFAGSFDTHVDSVPLWSAKDVERATALGRGP

Od	TRVSFRTQKYLYSADATQALFDSYMRLVRSFADDFEVPVGSVSLWEEKELESARTLGKGS

Ve	IRVSFRTQKYLYSASATRTLFDSYLRLVKAFAINFELNVDTVPLWDEKEIEGAKVLGRGP

	3121                                                       3180

Ds	CVQSTWPETLLERIELIAAERPNDQAVTNSRGKTYTYSDLLEQVHMISVALSAAGVKQGS

Ko	QMQSEWPETLSHRITDIATQNPNAEALKDGSGATLTYKQLQSRVQVLSEALTKAGVGARS

Od	EIERQWPETLSHRIANIASQWPDKEAVKDGNGKLCSYTQLQHRVQAISDTLTQAGVTQGS

Ve	ALQSEWPDTLSHRIADVAKQYSDREAVKDGSGNAYTYQQLQRRVQAISEALTKVGVKQSS

	3181                                                       3240

Ds	RVAVFQEPSSDWICSLLAIWHAGGTYLPLDLRNPLPRLASIVEAAKPTVILSHHETHDAV

Ko	RVAVFQQPSADWVCSLLAIWHAGGTYIPLDLRNSLPRLAAIAKEARPAVIISHAATADQV

Od	RVAVFQHPSVEWVCSLLGIWHAGGTYIPMDLKNSLPRLAAIAAAAKPAVILCHDETESLV

Ve	RVAVFQQPSADWVCSLLGIWHAGATYIPMDLRNSLPRLAAIAKTAKPTAIICHDETEAKV

	3241                                                       3300

Ds	ASLGTDAV-AMNIGTLESTKVH--KKSLANAKTPAVILFTSGSTGTPKGVVLSHSALRNT

Ko	PELASAAT-IVNASSLADVEPKQNTPTQAKAAAAAAILFTSGSTGTPKGVVLRHANFKNT

Od	PELKSSASHLVNVSSLQHAQPI--TKVQARANGTAAILFTSGSTGTPKGVILRHSAFRNT

Ve	PELKASAA-LVNISSLEHSQPI--TKTRAKATTAAAILFTSGSTGTPKGVILRHSSFQNA

	3301                                                       3360

Ds	IEGLVQHYDIGAERVLQQSAYSFDFSLDQILVALTNGGSVYVASKEERMDPVAIANIIKT

Ko	IEGLTRTYEMGAERVLQQSAFTFDFSLDQILCGLVNAGSVYVVSAEARADPVAICKIIAA

Od	IEGLTQQYAIGAEKVLQQSAFTFDFSLDQIMCGLVNGGTVYVATKQDRADPVAIAKIIAS

Ve	IEGLVKQYNIADERVLQQSAFTFDFSLDQMLCGLVNGGSVYVVSKENRGDPVAISNIIAT

	3361                                                       3420

Ds	HNISYTRATPTEYTNWITYGAEHLMESPSWSFAWAGGETMPHSLKQSFAALSLSGLRLYN

Ko	EGVTYTRATPSEYANWLTYGAADLMQATKWKMAFGGGETMPPSLRENIAGLGLD-VKLFN

Od	ENISYTRATPSEYASWIKYGAEHLTSATSWKFAWGGGETMPQSLPESIAGLGLQGLRLFN

Ve	EGITYTRATPSEYVSWIAYGAENLTTASNWKYAWGGGEVMPLSLREGIVNLGLHDLRLYN

	3421                                                       3480

Ds	SYGPAETVTCTKVEIPYGVDDIDD---EAEIPVGFPLPNYTVSIVDSRLDLVPQGVAGEI

Ko	SYGPAEAITCTKSEVQENLNEEGE---PQEVPAGFPLPNYSVYIVDRRLQIVPQGVAGEI

Od	SYGPAESITCTKTEVSLDFDADANGLNDTDIPAGYPLPNYSVYIVDRNLDLVPQGSTGEI

Ve	SYGPAESITCTKTEISLDHDAELDDL-DSNIPAGRPLPNYSLYIVDRNLELVPQGVTGEI

	3481                                                       3540

Ds	LVGGPSVGLGYLNNERLTDEKFV-N---LGDSGIVYRTGDVGYLRADGALMYQGRVDGDL

Ko	LIGGASVGAGYLNNEKLTESKFISNTYAPADGRLVYRTGDVGRLRADGALMFTGRIAGDT

Od	LIGGPSVSAGYLNQEKLSASKFISN---PWDTGIVYRTGDTGYLRPDGALMFKGRIAGDT

Ve	LIGGPSVSSGYLNQEKLSGIKFISN---PWDTGIVYRTGDTGYLRPDGNLMFQGRIAGDT

	3541                                                       3600

Ds	QVKVRGMRIDLQDIESCVLASAEGALDKAIVSVREGDLLVAHVQFAIGHHHEEDEAKAFL

Ko	QVKIRGIRIELEEIENSILATAGGALSQAVVTVR-GEMLVAHVQFAAGHYDDEQEQKAFL

Od	QIKIRGIRIDLQDIESCILEASDAALHKVIVSVRDGDVLVAHVQFADGQYDDEASQQAFL

Ve	QVKIRGIRIDLQDIEACILNAADGALHKAVVSVRSGDLLVAHVQFAADTYDNEQSQNAFL

	3601                                                       3660

Ds	RSLRLTLPLPTHMLPSIIIPLDMMPMSTHGKVDRATIRALGLPQLSS---RRGQE--ALT

Ko	RNLRFMLPLPVYMMPAVFVALEQMPVNAHGKTDRAAVKALPLPQANR---GRTGE--ALS

Od	RQMRFLLPLPVYMIPAVFVPLDQLPVTSHGKTDRRAVSSLPLPTAHSHTATRSDE--DLT

Ve	RSLRFVLPLPVYMVPSMFIPVNSLPVNAHGKTDRKSVQSLPLPTPSTITTSQGSRSGDLS

	3661                                                       3720

Ds	ETEEKLLKIWKEV-GLAAAPEAVPVDNQTTFFEMGGNSILLVKLQILISMHFNVKISLLD

Ko	ETERKLVEVWKEVIGDAEATGAVEVSDQASFFEFGGNSLLLVKLQILISMRFHAKLSLVD

Od	KTERKLIKIWMESVPKEMAGS-MVPSSQTSFFELGGNSLLLVSLQRAVEREFGVKLGIVD

Ve	EMEKELVEMWKEAVPKEMKDAVMSLNSQTSFFELGGNSLLLVKLQMLVNQRFGIKLSLVD

	3721                                                       3780

Ds	LFNAVSLGAMSGKIEAAPKADNIDWEAETTLEQDLPKLRQH-VAPDSIPA---------R

Ko	LLGAASLGAMAARIDTAPPADFIDWAAETQLDDDLMGLVEATTAGSSMPAVQDEKKTGHK

Od	LFEASSLGGMAGKIES--------------------------------------------

Ve	LFGASSLGAMAAKIET--------------------------------------------

	3781                                                       3840

Ds	RVLVTGATGFLGRRLVQKLVAADHVEEVHCVAVR--SRQSDLNTISDKVKVYAGNLAAPR

Ko	TVVVTGATGFLGRRLVARLVADESVAEIHCVAVRPNSKHRDALPASDKVRVHAGDLAAPR

Od	------------------------------------------------------------

Ve	------------------------------------------------------------

	3841                                                       3900

Ds	LGLSDAEIETLTAQVDLIIHAGVSRSVLDSYQTLRGANLCSTKALVKLAAARGIPFHFIS

Ko	LRLSEEAFKQLAYHADVIVHAGVSRSFMDAYQMLRGPNFEATKTLVRLATPRHVPIHFLS

Od	------------------------------------------------------------

Ve	------------------------------------------------------------

	3901                                                       3960

Ds	TGSLADL--DGAAPPTGGSLGYLASKWASEKYLDNAASQLDLPVTIHRIVGSDKAADDTL

Ko	SGSVASLVKDDATPPTAGAEGYLAAKWASEQYLGHAAAALGLPVAIHRVVAA--AAADEA

Od	------------------------------------------------------------

Ve	------------------------------------------------------------

	3961                                                       4020

Ds	TTSLVSEHFLSLSETLKVGPTFEGFSSLSLDLIQVDRLTAAIIASTTAAASED---GTNV

Ko	ANAAVLAELQALAAKLNSVPAPGGW-KVALDLTPADDLAQRVVAAATSQPAAEASAAPRV

Od	---------------------------------------------MMGEA----------

Ve	---------------------------------------------LTQQPSE--------

	4021                                                       4080

Ds	VEHSCEARLELQSVQRRFAELQSEGQARVPLPRWLARAKKAGLEWQMSSLDNFPL-----

Ko	SEYAAHTTVDMGDMVPHVADGANAALPTLPAMQWLARARDAGFGWQIASLDNAPFGDAEA

Od	------------------------------------------------------------

Ve	------------------------------------------------------------

	4081                                                       4140

Ds	-EG

Ko	ARA

Od	---

Ve	---

	4141 4143



Figure A20f. Alignment of DsHps2 with best reciprocal BlastP matches. Ds: D. septosporum, Ko: K. oryzae (protein ID: 424330), Od: O. disseminans (protein ID: 360096), Ve: V. enalia (protein ID: 565861).