Medical report: "Conservation and divergence of gene families encoding components of innate immune response systems in zebrafish"


Stein et al. 2007, Volume 8, Issue 11, Article R251
Open Access
Research

Conservation and divergence of gene families encoding components of innate immune response systems in zebrafish

Cornelia Stein*, Mario Caccamo†, Gavin Laird† and Maria Leptin*†

Addresses: *Institute for Genetics, University of Cologne, Zuelpicher Str 47, 50674 Cologne, Germany. †The Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1HH, UK.

Correspondence: Maria Leptin. Email: mleptin@uni-koeln.de

Published: 27 November 2007
Genome Biology 2007, 8:R251 (doi:10.1186/gb-2007-8-11-r251)
Received: 20 April 2007
Revised: 30 October 2007
Accepted: 27 November 2007

The electronic version of this article is the complete one and can be found online at http://genomebiology.com/2007/8/11/R251

© 2007 Stein et al.; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.

Immune system proteins in zebrafish
Analysis of gene families of several fish genomes reveals components of the innate immune system and identifies orthologous relationships between fish and mammals.

Abstract

Background: The zebrafish has become a widely used model to study disease resistance and immunity. Although the genes encoding many components of immune signaling pathways have been found in teleost fish, it is not clear whether all components are present or whether the complexity of the signaling mechanisms employed by mammals is similar in fish.

Results: We searched the genomes of the zebrafish Danio rerio and two pufferfish for genes encoding components of the Toll-like receptor and interferon signaling pathways, the NLR (NACHT-domain and leucine rich repeat containing) protein family, and related proteins. We find that most of the components known in mammals are also present in fish, with clearly recognizable orthologous relationships. The class II cytokines and their receptors have diverged extensively, obscuring orthologies, but the number of receptors is similar in all species analyzed. In the family of the NLR proteins, the canonical members are conserved. We also found a conserved NACHT-domain protein with WD40 repeats that had previously not been described in mammals. Additionally, we have identified in each of the three fish a large species-specific subgroup of NLR proteins that contain a novel amino-terminal domain that is not found in mammalian genomes.

Conclusion: The main innate immune signaling pathways are conserved in mammals and teleost fish. Whereas the components that act downstream of the receptors are highly conserved, with orthologous sets of genes in mammals and teleosts, components that are known or assumed to interact with pathogens are more divergent and have undergone lineage-specific expansions.

Background

With the sequence of the zebrafish genome as well as the sequences of two pufferfish genomes nearly completed, and in view of the widespread use of the zebrafish as a model to study immunity [1], it is both pertinent and feasible to determine which of the genes that encode components of the mammalian immune system are also found in
fish. In addition to being a prerequisite for using the zebrafish as a model system for the genetic analysis of human immunity, knowledge of components of immune defense systems in the zebrafish would also aid our understanding of the evolution of immunity.

The zebrafish is a member of the large group of teleost fish that, together with a small nonteleost sister group, constitute the ray-finned fishes. The ray-finned fishes diverged from the common ancestor of other bony vertebrates, which include tetrapods as well as lungfishes and coelacanths, 450 million years ago. They appear to have undergone a massive radiation about 235 million years ago, resulting in as many teleost species as there are species represented by all other vertebrates together (approximately 24,000 species in each case). One genetic event that has been regarded as associated with the radiation of the teleosts in particular is a whole genome duplication early in the teleost lineage. Although some genes or regions of the genome, most notably the Hox gene clusters, have been maintained in multiple copies, others have undergone re-diploidization. The availability of additional gene copies has been proposed to have facilitated the evolution of the high level of diversity in morphology and behavior in the teleost fish [2,3].

Components of the adaptive immune system have been studied intensively in many fish species and have been analyzed molecularly and genetically (for review, see [4]). Unlike the adaptive immune system, some of the systems that contribute to innate immunity are conserved throughout the animal kingdom. The presence of genes encoding components of these systems in the zebrafish and other fish was therefore not unexpected. In addition to the well studied adaptive immune genes, protein and gene families involved in innate immune mechanisms that have been analyzed in detail include the complement gene family (for review, see [5]), the Toll-like receptors (TLRs) [6,7], and two sets of receptor genes that encode proteins structurally similar to the immunoglobulin-type and C-type lectin domain-type of mammalian NK (natural killer cell) receptors [8-11]. Similarly, genes encoding tumor necrosis factors (TNFs), interleukins (ILs), interferons (IFNs), and their respective receptors have been identified in various fish species [12-18]. Together with studies on subsets of intracellular signaling molecules [19-23], these findings indicate that many components of innate immune signaling pathways known from mammals are conserved in the teleost fish.

However, it is not clear whether all components are present or whether, in general, the complexity of the signaling mechanisms employed by mammals is similar in fish. For example, whereas some members of the TLR family exhibit orthologous relationships between zebrafish and mammals, there are also expansions within the TLR gene family that are specific for the zebrafish or the mammals [6,7]. Similarly, the novel immune-type receptors, which share several common features with mammalian immunoglobulin-type natural killer cell receptors, exhibit species-specific expansions and diversifications [8,10].

This report concentrates on identifying those molecules known from mammalian innate immune signaling systems that are conserved between teleost fish and mammals. The study is restricted to the pathways that have not previously been studied extensively by others. It is likely that there are also nonconserved defense systems associated with the characteristic physiologies of fish and mammals (for example, skin defense peptides), and future genetic research may well reveal additional fish-specific molecules and mechanisms.

To be able to judge orthologous relationships properly, we also included protein family members that have not been shown to have immune signaling functions, in particular because it cannot be excluded that these
may have as yet unidentified roles in immune signaling, as has recently been discovered for TNF-receptor associated factor (Traf)3 [22]. We find that the families of intracellular signaling adaptors and enzymes are largely conserved. By contrast, the class II cytokines and their receptors have diverged significantly, and the NLR (NACHT-domain and leucine rich repeat containing) proteins exhibit extensive, species-specific gene amplification and diversification.

Results and discussion

As the basis for our search, we first assembled a set of sequences of mammalian genes that encode components of the TNF, IFN, and TLR pathways, and the NLR proteins in mice and humans (Figure 1). We then identified homologs of these genes in the zebrafish genome. We first checked whether Ensembl [24] or ZFIN [25] listed potential homologs and added these to our list. In cases in which putative homologs were not found in Ensembl or ZFIN, we used TBLASTN [26] to screen unfinished clones from the genome sequencing project and trace sequences from the whole genome shotgun project. If matching sequences were found, they were analyzed in detail in their genomic context and were manually annotated to generate a gene prediction, using the available mammalian sequences and any existing expressed sequence tags (ESTs) as evidence. Where gene predictions were available from the Tetraodon nigroviridis or Takifugu rubripes genomes, we also included these in our analyses, but we did not make any assemblies or annotations ourselves. A complete list of all sequences used in this study is provided in Additional data files 1-9.

We used MEGA software [27] to compare the encoded fish proteins with their mammalian counterparts. For some proteins, the annotated sequences were not complete and could not be completed because the available DNA sequence was not sufficiently reliable or had gaps. We therefore point out that the phylogenetic trees we present show relationships, but are not intended to show precise
evolutionary distances.

Figure 1. Components of the TLR and IFN signaling pathways and intracellular pattern recognition receptors. The molecules analyzed in this study are shown in color. For simplicity, not all members of each protein family are shown. IFN, interferon; TLR, Toll-like receptor.

For most of the core signal transduction components of each pathway we found clear orthologous relationships between the mammalian and the zebrafish genes (see Gene families with largely orthologous relationships between teleosts and mammals, below), as illustrated for example by the branches for Tollip (Toll-interacting protein) or Tab (Tak1-binding protein)3 in Figure 2. These branches reflect the known evolutionary relationships between the five species. Mouse and human exhibit the highest level of similarity, the two pufferfish are closely related to each other, and the zebrafish is more closely related to the pufferfish than to mammals and therefore shares a branch with the pufferfish on the phylogenetic tree. In several cases, for example Tab1 and Tab2, the Tetraodon sequences do not group with their counterparts from Takifugu. In most of these cases this is due to internal deletions or insertions, or terminal deletions or extensions in the Tetraodon genes, which are most easily explained by unreliable predictions for these genes based on faulty assembly of the genome (see below for specific cases). We have not investigated these cases further.

For the class II cytokine receptor family the orthology was less clear (see Class II cytokines and their receptors, below) or nonexistent, as has previously been noted [17]. For one group of proteins, those containing NLRs, our comparison reveals extensive, species-specific expansion of subfamilies (see
Intracellular pathogen sensors: the NACHT-domain family, below). Each of these groups of proteins is discussed individually below.

Gene families with largely orthologous relationships between teleosts and mammals

In the protein families of the immune kinases, the adaptors in the TLR signaling pathway, the interferon response factors (IRFs), the signal transducers and activators of transcription (Stats), and the Trafs we found orthologous genes in fish for almost all of the mammalian genes. This is summarized in Figures 2 to 6. However, there were also occasional duplications or losses either in the fish or in the mammalian lineage. The findings are briefly summarized below and in the figure legends.

Figure 2. Phylogenetic trees of the innate immune signaling adaptors and diagrams of their protein structures. The fish protein names are highlighted in blue (Dr [Danio rerio]) or green (Fr [Takifugu rubripes] and Tn [Tetraodon nigroviridis]). The numbers in the tree indicate the bootstrap values. Scale: interval of 0.1 amino acid substitutions. Hs, Homo sapiens; Mm, Mus musculus. Protein domains are shown as boxes based on identification by Pfam [55] or Smart [56]. Some domains were not recognized by these programs, although manual inspection indicated clear conservation of the domain within the protein family. These domains are also shown as boxes in the diagrams. The identities of the domains are listed at the bottom (CUE, zinc finger, RanBP2, coiled-coil, TIR, PP2C, IKI3, SAM, C2, and Death). Scale bar = 100 amino acids. The Tetraodon version of the Ikap (IKK [inhibitor of nuclear factor-κB kinase] complex associated protein) gene contains two full repeats of the IKI3 domain. It is not clear whether this prediction is due to an error in the genome assembly or whether the gene does indeed contain an internal duplication covering the whole length of the gene found in other species. The two halves of the predicted gene were treated as separate peptides in the phylogenetic tree and the diagram.

Kinases

The kinases were the family that exhibited the most apparent orthologies between fish and mammals. For all of the essential kinases involved in signal transduction mediated by TLR, TNF, and nucleotide oligomerization domain containing protein (Nod), we find orthologs in zebrafish and in most cases also in pufferfish. IL-1 receptor associated kinase (IRAK)2, which is thought to serve as an accessory protein in combination with IRAK1, was not found in any of the three fish. This suggests that it has arisen from a duplication event that occurred only within the mammalian lineage (Figure 3). The alternative, loss of IRAK2, for example in the teleost lineage (it is also absent in Medaka and stickleback), is less likely because a search of the ray and shark genomes did not identify any sequences for IRAK2. Conversely, we find duplications in the fish lineage for Jak2 (Janus kinase 2) and NLK (nuclear factor-κB [NF-κB] essential modulator-like kinase), and duplications in both pufferfish for IKKa (inhibitor of NF-κB kinase) and Ripk5 (receptor-interacting protein kinase 5).

Figure 3. Phylogenetic tree of the kinases. Details of the tree are as in Figure 2.

Adaptors

The adaptors that are involved in innate immune signaling cascades are well conserved in fish, as was previously observed for those interacting with the TLRs [6,7,23]. We find orthologous genes in each of the three fish species for Myd88 (myeloid differentiation factor 88), Sarm1 (sterile α and HEAT/armadillo motif containing protein 1), Tollip, IKAP (IKK complex associated protein), NEMO (NF-κB essential modulator), Tab1, Tab2, and Tab3, and in the zebrafish and Takifugu for Tirap (Toll/IL-1 receptor associated protein). For the mammalian Ticam (Toll-like receptor adaptor molecule)1 and Ticam2 (also named TRIF and TRAM) genes, there is only one homologous gene in each of the three fish, which is equally distant to Ticam1 and Ticam2, indicating a duplication of an ancestral gene in the mammalian lineage and subsequent divergence of the two copies (Figure 2). The alternative interpretation, that Ticam2 was lost specifically in the teleost lineage, does not fit with the fact that it is also not present in the genomes of Xenopus and chicken [28]. An apparent contradiction to our observation is a report of both Ticam1 and Ticam2 in Hydra [29]. However, cnidarians too have only one Ticam, because the gene cited as Tram is in fact not the TRAM (TRIF-related adaptor molecule) that is synonymous with Ticam2, but encodes an unrelated protein, the translocation-associated membrane protein, which has the same acronym.

IFN response factors

For IRF1, IRF3, and IRF5 to IRF9, clear orthologous relationships are found between mammals and fish. In each fish species we also find an additional gene, which we call IRF11 and which is equally distant to both IRF1 and IRF2. DrIRF4b, which is most closely related to the IRF4s found in the pufferfish, maps to a region of the genome that is syntenic with the region containing IRF4 in mammals and in the two pufferfish, indicating that these are orthologous genes. In addition to the homologs of the IRFs in mammals, we find an additional IRF in each of the fish, which we named IRF10, because it groups with a similar gene from chicken. It appears that this gene has been lost in mammals (Figure 4).

Figure 4. Phylogenetic tree of the interferon response factors. Details are as in Figure 2. The chicken (Gg [Gallus gallus]) IRF10 was included to show its relationship to fish IRF10, because no ortholog for this gene is found in mammals. IRF, interferon response factor.

Signal transducers and activators of transcription

Mammalian Stat2, Stat3, Stat4, and Stat6 have clear orthologs in all three fish species (Figure 5). Stat5 has been independently duplicated in mammals and in zebrafish [21]. The group of Stat1 genes contains one gene from each pufferfish with a good match to mammalian Stat1, but two genes from zebrafish that are surprisingly divergent but still resemble Stat1 more than the other Stats. The duplication event that led to this situation is recognizable in the genome, because the whole region containing the gene is duplicated and syntenic with the same region in human (Figure 7). The positions of flanking genes in the human and zebrafish genomes indicate a number of rearrangements. On chromosome 9 in the zebrafish these have been associated with a further duplication of part of the Stat1 gene, resulting in a pseudogene (ENSDARG00000040710; STAT1Ψ in Figure 7).

Figure 5. Phylogenetic tree of the STAT proteins. Details are as in Figure 2. STAT, signal transducer and activator of transcription.

Figure 7. Synteny between regions containing STAT4 and STAT1 genes on human chromosome 2 and Danio rerio chromosomes 22 and 9. Genes transcribed on the top or bottom strands are shown above and below the lines representing the chromosomes. Homologous regions are shown by colored arrows. A further duplication of the region containing HIBCH and GDF8 is found on Danio rerio chromosome 11. Numbers represent nucleotide positions in the genome in megabases based on the Zv6 assembly. Gene names are Swissprot, Zebrafish Information Network, or Ensembl identifiers.

TNF-receptor associated factors

All of the Traf protein family members Traf1 to Traf7 are represented in fish (Figure 6). For Traf3, Traf6, and Traf7 we find one gene in each of the three fish species, in all cases with the same protein structure and a high degree of similarity. Traf1 and Traf5 are present in zebrafish, but no predictions exist for these genes in the pufferfish genomes. It is interesting that zebrafish Traf1 differs from mammalian Traf1 in that, like the other family members, it contains a RING finger and a zinc finger (Figure 6), indicating that the absence of these domains in mammalian Traf1 is due to a loss that occurred specifically in the mammalian lineage. Traf4 is duplicated only in zebrafish [30], whereas there have been several duplication events in the fish lineage for Traf2.

Figure 6. Phylogenetic tree of the TRAFs and diagrams of their protein domain structure. Details are as in Figure 2, except that the scale shows 0.2 amino acid substitutions. Domains shown: RING, zinc finger, coiled-coil, MATH, and WD40. TRAF, tumor necrosis factor receptor-associated factor.

In summary, for the families described thus far, clear orthologies exist between the teleost and mammalian lineages, with a few duplications for some of the gene family members.

Class II cytokines and their receptors

Class II cytokine receptors

Mammals have two distinct, heterodimeric receptors for type I and type II IFNs, as well as a set of closely related receptors for other class II helical cytokines. Although a large group of this type is found in fish, there are no simple orthologies between the receptors of this class in mammals and teleost fish [16,17,31]. A previous analysis identified 11 genes in Tetraodon, named cytokine receptor family B (CRFB)1 to CRFB11 [17]. The authors found that the genomic region containing IFN-α receptor (IFNAR) chain 2, IL-10 receptor (IL10R) chain 2, IFNAR1, and IFN-γ receptor (IFNGR) chain 2 in mammals is syntenic with a region containing six class II cytokine receptor genes in Tetraodon [17] (see Figure 8). However, sequence comparison allowed no clear assignment of the fish genes to their mammalian counterparts, with
the exception of the genes encoding tissue factor (TF), which is duplicated in Tetraodon (TF1 and TF2). A subsequent study [31], which included all available sequences throughout the animal kingdom, came to a slightly different conclusion regarding the phylogenetic relationships. In this study the authors subdivided the genes into groups encoding ligand-binding and non-ligand-binding chains before conducting their phylogenetic analysis. However, the justification for the assignment of particular fish genes that have no clear orthologs in mammals to one or other group is not obvious, especially because no sequence data were given in this study that unambiguously identify the genes analyzed.

We therefore revisited the phylogeny of class II cytokine receptors in teleosts and mammals. The family is defined by the presence of the D200 domain, which consists of two immunoglobulin domain-like subdomains of the fibronectin type III class, SD100A and SD100B. As has previously been pointed out [17], the bioinformatic identification of class II cytokine receptor genes is not trivial, and it is therefore unsurprising that Ensembl [24] contained predictions for only ten such genes in zebrafish. Three of these do not code for class II cytokine receptors but for thrombopoietin and titins, which have similar domains. To identify further receptor genes we searched the zebrafish genome and all available zebrafish ESTs for the subdomains SD100A and SD100B (see Materials and methods, below). We identified 22 candidates, of which seven had incomplete D200 domains or exhibited only spurious resemblance to D200 domains. These and the three genes encoding the D200-containing proteins thrombopoietin and titin were eliminated from further analysis. Gene predictions were available for eight of the remaining 12 genes. Of the four genes that had not been predicted by automated annotation tools, two (CRFB15 and CRFB16) were found only in the as yet unplaced whole genome shotgun sequences. We re-annotated all 12 genes using the known gene structure of class II cytokine receptor genes and homology to known class II receptor genes as support.

Figure 8. Syntenic organization of class II cytokine receptor genes. (a) Structure of the mammalian receptor chains. The blue and green rectangles represent the SD100A and SD100B domains, the red rectangle the intracellular domain of the ligand binding chains, and the gray rectangle the intracellular domain of the non-ligand-binding chains and TF (after Renauld [57]). (b) Chromosomal organization. Synteny between regions containing class II cytokine receptor genes in mammals and fish. Fat horizontal lines indicate chromosomes in the four species. The brackets above the human genes show evolutionary relationships between the paralogs. Vertical broken lines indicate suggested evolutionary relationships between the genes in the different species, based on the tree in Figure 9. Color coding of names: red = long intracellular domain; black = short intracellular domain; blue = no intracellular domain; and pink = intermediate length intracellular domain. Circled names indicate ligand binding chains. Round brackets denote groups of genes in cases where there are no clear orthologous relationships of individual members with genes in the other species.

We used these sequences for a phylogenetic analysis, which, in addition to the mouse and human sequences, also included
Takifugu rubripes and Tetraodon nigroviridis CRFB1 to CRFB11 and IL20R2, as well as an additional gene, the product of which we shall call CRFB13 (Ensembl: NEWSINFRUG00000164405 and GSTENG0003154300). A set of recently described zebrafish class II cytokine receptor genes included two genes not identified by us (DrCRFB2 and DrCRFB6), which we have added to our analysis [18]. Finally, DrCRFB14 was found by Georges Lutfalla, who generously contributed its sequence for inclusion in this analysis.

The phylogram of the class II cytokine receptors (Figure 9) corroborates previous conclusions that this gene family has undergone independent gene duplications and divergence in teleost fish and mammals. Some of the fish genes cannot be matched to likely orthologs in mammals, and vice versa, with four exceptions in which high bootstrap values justify the interpretation of the genes sharing direct common ancestors. The genes encoding TF in mammals cluster with two genes from each fish. The phylogeny indicates independent duplication events in the pufferfish and zebrafish lineages. The other set of genes that reliably group together are those encoding IL20R1, IL20R2, and IL-22 binding protein (IL22BP), with one representative in each of the mammals and fish. For the other relationships between mammalian and fish genes the bootstrap values are so low that the relationships discussed below must be considered with caution.

Figure 9. Phylogenetic tree of the class II cytokine receptors. Details are as in Figure 2.

Several mammalian genes have no plausible orthologs in the three fish genomes analyzed here, and others have more than one. We therefore sought further evidence for evolutionary relationships by analyzing the genomic context of the genes. A summary is shown in Figure 8. Two sets of genes are linked both in mammals and in the two pufferfish. The first is the IFNAR2, IL10R2, IFNAR1, and IFNGR2 complex and its syntenic complex described by Lutfalla and colleagues [17] for Tetraodon. This synteny is also maintained in Takifugu and in all three cases continues outside the class II cytokine receptor complex, in that the gene neighboring IFNGR2 is Tm50b in all cases, followed by Nnp1. However, the corresponding genes in the zebrafish are no longer linked (although they all lie on the same chromosome). The synteny is roughly reflected in the sequence similarities, in that IFNAR2 is most similar to CRFB1 and CRFB2 and that the IL10R2/IFNAR1/IFNGR2 group clusters with the CRFB3/4/5/6/15 group. In particular, the IL10R2/IFNAR1/IFNGR2 and CRFB3/4/5/6/15 genes encode receptors with short cytoplasmic domains, whereas IFNAR2 and CRFB1 and CRFB2 have long cytoplasmic tails. However, within the group orthologies are not clear. It is therefore not possible to conclude whether the ancestral complex that existed before the split of the teleosts and tetrapods contained two genes (a precursor for IFNAR2 and a precursor of the IL10R2/IFNAR1/IFNGR2 group) with subsequent independent duplications in teleosts and mammals, or four genes, with fast divergence
in the IL10R2/IFNAR1/IFNGR2 and the CRFB3/4/5/6/15 groups obscuring their common origin.

The second region in which a syntenic arrangement of genes is retained is the one containing IFNGR1, IL20R1 and IL22BP in mammals, and CRFB9 and the previously undetected CRFB13 in Tetraodon and Takifugu. Again, the closest relatives of these genes (CRFB9 and CRFB13, respectively) are not syntenic in zebrafish. Notably, fish CRFB9 proteins share the absence of a transmembrane domain with the mammalian IL22BPs. In view of this and the syntenic arrangement, the most reasonable interpretation is a homology of IFNGR1/CRFB13 and IL22BP/CRFB9.

In summary, teleost fish have approximately the same number of class II cytokine receptors as mammals, but the genes have evolved rapidly and independently since the separation of the species. We shall leave the discussion at this point, because the current set of data does not support further speculation. A statement about which of these receptors are functionally equivalent will have to await experimental analysis, as has been conducted for two of the zebrafish CRFBs [18]. It will be interesting to determine whether fish distinguish between viral and bacterial induced IFN signaling pathways in the same way as mammals.

Class II cytokines

IFNs have been reported in several fish species with an ambiguous nomenclature [32-40]. We find ten class II cytokine genes in zebrafish, and five in each pufferfish (Figure 10). The large group of mammalian type I IFNs cluster together on one branch of the phylogenetic tree that does not include any fish cytokines. This fits with the view that the generally intronless type I IFN genes are the product of a retrotransposition event [17], which occurred after the split of teleosts and tetrapods. Apart from the clear fish orthologs of the mammalian type II IFNs, the remaining fish class II cytokines are more similar to the mammalian ILs and type III
IFNs. Like these, they are mostly encoded by genes with four phase introns, supporting the view that this constitutes the gene structure of the ancestral class II cytokine gene.

Figure 10. Phylogenetic tree for the class II cytokines. Details are as in Figure […]. See text for gene names.

Among these class II cytokines, IL-10 exhibits an apparent orthology between fish and mammals [41,42]. This is also supported by the genomic locations of the IL-10 genes, which are situated adjacent to and on the opposite strand of the Mapkap2 genes in all five species (Figure 11). The genes that had been annotated as IL-20 in the zebrafish (RefSeq: NP_001076424.1) and Tetraodon (UniProt: Q7SX60), and initially as IL-19 and then changed to IL-24 in Takifugu (Ensembl: SINFRUG00000154816), are equally related to mammalian IL-19 and IL-20. The previous automated naming of the fish genes should therefore be amended. In concordance with the nomenclature rules for vertebrate gene families, this gene has therefore been given the next available number in the IL series (IL-34). The fish IL-34 genes and the mammalian IL-19, IL-20, and IL-24 genes are located in the vicinity of
the IL-10 genes (in the zebrafish this gene has not yet been placed on a chromosome), but duplications and inversions have broken up the syntenic relationships downstream of IL-10. The phylogenetic tree argues for a common precursor for these genes that has duplicated in mammals, yielding IL-19 and IL-20. Whether IL-24 is the product of a second local duplication or of an older duplication of a larger segment of the genome is not clear, but it shows a higher degree of similarity to the class II cytokine genes found in a complex on a different chromosome in all five species (Figure 11).

A second group of class II cytokines exhibiting high sequence similarity are the mammalian IL-22, IL-24 and IL-26, and two pufferfish interleukins annotated as 'IL-24' in Tetraodon (UniProt: Q7SX82) and 'homologous to IL-24' in Takifugu (Ensembl: SINFRUG00000156387). Again, the phylogram shows that this name is problematic, because if anything these proteins are more similar to IL-22, and their genes exhibit the same syntenic relation to the flanking MDM1 gene as the IL-22 genes in mammals (Figure 11). However, the zebrafish gene in the same position (RefSeq: NP_001018628), annotated as IL-22 [33], is highly divergent in sequence. Because frequent duplications and loss of genes as well as rapid sequence divergence appear to operate within this family, originally orthologous genes may no longer be recognizable. This is further illustrated by the flanking IL-26 gene in the human genome. The mouse genome has lost this gene; in the zebrafish a class II cytokine gene described as IL-26 [33] is present in this position, but it does not cluster with the IL-22/24/26 group. Although the IL genes between MDM1 and IFN-γ are in apparently orthologous positions in all five species, there is no indication that the mammalian arrangement MDM1/IL-22/IL-26/IFN-γ represents the ancestral cluster, rather than the IL genes having arisen by independent duplications in mammals and teleosts. Because the names given to
the fish cytokines of this group are extremely confusing and suggest relationships for which there is no evidence, we again propose a new nomenclature, as shown in Figures 10 and 11 (IFN-ϕ6 for zebrafish IL-22, IFN-ϕ5 for zebrafish IL-26, and IL-35 for the pufferfish IL-24).

Figure 11. Genomic organization of two class II cytokine gene clusters. Chromosomes are shown as lines with the positions of the region marked in megabase pairs underneath. Genes transcribed on the top strand are shown above the line, and those transcribed in the opposite direction are shown below. Class II cytokine encoding genes are shaded in gray. In the left diagram the syntenic regions, duplications, and inversions surrounding the IL-10 locus are shaded in red and blue. The human IL-10 gene is located on chromosome […] and the region shows the same arrangement as in the mouse. The current zebrafish genome assembly Zv7 does not yet contain the recently sequenced clone CU459075, which places IL-34 into the interval between IL-10 and prolargin (IL-34 is included in Zv7 on the unplaced contig Zv7_NA1656). There are therefore no coordinates for the right end of the interval. The two pufferfish show the same arrangement both for the region around IL-10 and for the MDM1/cytokine/IFN-γ region. The names for the fish genes are explained in the text. IFN, interferon; IL, interleukin.

Four
of the remaining fish class II cytokine genes cluster with the mammalian IFN-γ genes and the rest do not group with any of the mammalian genes. The pufferfish each have one IFN-γ gene, whereas the zebrafish has two, namely IFN-γ1 and IFN-γ2 [33,34], which lie in tandem in a position in the genome that has retained its synteny between mammals and teleosts (Figure 11). Finally, a group of teleost class II cytokines, some of which had previously been called IFN-λ, cluster on a branch without mammalian cytokines. Because they are not more related to mammalian IFN-λ than to other cytokines, we call them IFN-ϕ1 to IFN-ϕ4. IFN-ϕ1 has previously been described as 'zebrafish interferon', 'IFNab', and 'IFN-λ' [17,18,32], and IFN-ϕ2 and IFN-ϕ3 as 'type I IFN 2' and 'type I IFN 3' [43]. Only one gene of this type, most closely related to the zebrafish IFN-ϕ1 gene, is found in the two pufferfish. This may be due to the difficulty in identifying these genes, and it would not be surprising if further class II cytokine genes were found in the pufferfish genomes.

In summary, like the receptors, the class II cytokine genes have duplicated and diverged independently in fish and mammals. It remains to be tested experimentally which class II cytokines are responsible for which immune function.

Intracellular pathogen sensors: the NACHT-domain family

A large family of cytoplasmic proteins, characterized by the presence of a nucleotide-binding domain, the NACHT domain [44,45] or the closely related NB-ARC domain [46], has been implicated in inflammation and innate immune signaling in animals and plants. Some of them have been shown to recognize intracellular pathogen-associated molecular patterns through their carboxyl-terminal leucine-rich repeats (LRRs). They differ in their amino-terminal effector domains (for example, CARD or pyrin domains), which mediate signal transduction to downstream targets, leading to the activation of NF-κB or the apoptotic pathway. An initial search in the fish genomes for
homologs of the known mammalian NLR proteins of the Nod subfamily found homologs for Nod3 and Nod9 in all three fish species, for Nod2 in zebrafish and Takifugu, and for Nod1 in Takifugu. Three genes in zebrafish, two in Takifugu, and one in Tetraodon were annotated as 'Nalps' (NACHT, leucine rich repeat and PYD containing proteins) but did not group with the mammalian Nalps on a phylogenetic tree.

Figure 12. Overview of a phylogenetic tree of 277 NLR proteins. Each sequence is assigned a background color to illustrate species relationships: pink = human, yellow = mouse, blue = zebrafish, green = Takifugu, and turquoise = Tetraodon. The 'canonical' proteins Nod1, Nod2, Nod3, Nod9, CIITA, and Apaf, which show clear homologous relationships between the five species, cluster at the top (rainbow colors). The mammalian Nalp proteins cluster together (pink/yellow region). Each fish has a large group of species-specific proteins (blue, green, and turquoise regions). In addition, Takifugu and Tetraodon share several apparently orthologous gene pairs (green and turquoise region). Branch labels in the tree: Nod1, Nod2, CIITA, Nod9, mammalian Nalp, Apaf, NACHT-P1, Ipaf, Nod3, Fr/Tn expansion, Dr expansion, Tn expansion, Fr expansion. Apaf, apoptotic protease activating factor; CIITA, major histocompatibility complex class II, transactivator; Nalp, NACHT, leucine rich repeat and PYD containing protein; NLR, nucleotide-binding domain/NACHT domain and leucine rich repeat containing family; Nod, nucleotide oligomerization domain containing protein.

We found no homologs for any of the mammalian Nalps in fish. We therefore screened the whole zebrafish genome for sequences encoding NACHT domains. This revealed a large number of additional sequences encoding NACHT domains. Most of these were not within genes
found by the automated gene prediction algorithms, because the number of and similarity between the genes was so high that they had been masked as repeats. We therefore annotated these genes manually using ESTs as guides and identified a large set of novel NACHT-domain containing genes. After we had completed our initial annotations, automated predictions for 205 NACHT-domain encoding genes were deposited at the National Center for Biotechnology Information (NCBI). These showed only a partial overlap with our sequences. Many were incomplete or contained two NACHT domains, indicating incorrect annotations. We therefore re-screened and re-annotated the zebrafish genome and have found more than 200 genes of this class (the complete list is given in Additional data file 9). These are numbered sequentially by chromosome number and by their order on the chromosome. We have not been able to produce perfect gene models for all of them. As discussed below, they have novel amino-terminal sequences, and in the absence of sufficient EST evidence we could not in every case draw reliable conclusions regarding the 5' end of the gene. Similarly, the LRRs in the carboxyl-terminal region are difficult to predict reliably. Extensive experimental work will be needed to characterize these genes. For our analysis here we have selected a set of 70 representative sequences.

We also searched the two pufferfish genomes for members of this gene family to find out whether the group we found in zebrafish was specific to this species, or whether the massive gene duplication had occurred early in the fish lineage. We found 70 members of this family among the annotated genes in the genome of Takifugu rubripes. A large number of matches found in the Tetraodon genome were not parts of predicted or annotated genes, as had been the case in the zebrafish. Again, these sequences had been masked as repeats. We manually assembled a set of sequences using
homology to the zebrafish and Takifugu sequences as guides. It is striking that the majority of the members of this gene family (40/49) are located within incompletely assembled contigs/scaffolds that have not been assigned to chromosomes (the 'Un_random' set). Initially, our searches for NACHT-domain encoding genes resulted in a number of predictions that spanned separate contigs, but which had additional fragments of genes of this family interspersed within their predicted introns. This suggests that these predictions were not correct, but were due to accidental occurrence of apparently spliceable gene fragments in neighboring contigs of this assembly that are in fact not located next to each other in the genome. This view is supported by the finding that three sequences, which are very closely related to consecutive parts of the other fish Nod2 genes, were positioned on widely separated contigs in the Un_random assembly. We have combined these three fragments into one sequence, which we call TnNod2. The high proportion of genes from this family in the nonassembled part of the genome might be an indication that the proper assembly of these contigs is made difficult or impossible precisely because of the repetitive nature of this family.

Phylogenetic relationships of NLR protein families in mammals and fish

A phylogenetic tree of all NLR containing predicted peptides from human, mouse, and the three fish species reveals the following relationships (Figure 13). The canonical Nod proteins Nod1, Nod2, Nod3 (recently renamed as Nlrc3) and Nod9 (recently renamed as NlrIX), as well as Apaf1 (apoptotic protease activating factor 1) and CIITA (major histocompatibility complex class II, transactivator), are present in all five species and exhibit clear orthologous relationships (Figure 14). The Nalp proteins (which have recently been renamed Nlrp) form a separate branch, representing a mammalian expansion of NLR proteins. For most of the genes on this branch, there are closely related pairs of mouse and human genes, but several cases of mouse-specific or human-specific duplications can also be found, notably the mouse Nalp4 genes. Two zebrafish sequences that cluster with this group, 2.03 and 2.05, encode only a NACHT domain with a divergent P-loop and should therefore not be considered Nalp-like proteins.

Most strikingly, the large groups of newly identified fish sequences lie on mostly species-specific branches. The majority of the zebrafish genes form a branch of their own, which includes no genes from either of the two pufferfish. Consistent with the closer relationship between the two pufferfish, the genes from these two species are less clearly separated. Whereas one branch contains exclusively a subset of genes from Takifugu, the branch that contains the majority of Tetraodon genes also includes several Takifugu genes. There are two branches with several cases of apparent orthologies between Takifugu and Tetraodon (genes from the two species that are more similar to each other than to any other gene in their own species), indicating the existence of these genes before the split of the two species and suggesting conservation of their function. We note again that the Tetraodon gene predictions are less reliable and are often incomplete, leading to spurious homology assignments. The relationship of these sequences to the other fish sequences therefore represents an approximate picture that must be interpreted with caution.

Whereas most of the novel fish NLR proteins are more related to each other than to mammalian NLR proteins, there are exceptions (apart from the canonical proteins mentioned above). One group of new fish proteins, which we named NACHT-P1, clustered with Apaf1. We wished to know whether this was a fish-specific NACHT protein and searched the mouse and human genomes for similar sequences. We found one ortholog in
each case, neither of which had been characterized previously. Their amino-terminal parts contain no motifs known from other proteins. Like the Apaf proteins, these sequences contain WD40 repeats instead of LRRs. FrNACHT-P2 and TnNACHT-P2 have an unusual amino-terminal addition, a filament domain. We found no other sequence in any organism that encodes a protein composed of a filament domain and a NACHT domain.

Fish-specific properties of novel fish NLR proteins

The large groups of novel, fish-specific NLR proteins are highly conserved in each species, indicating recent species-specific expansions (Figure 12 and Additional data file 10). Like other NLR proteins, they contain LRRs at the carboxyl terminus, but the majority do not contain any of the amino-terminal effector domains that have been found in conjunction with NACHT domains in mammals or plants (such as CARD, pyrin or TIR domains). However, the region immediately upstream of the NACHT domain is highly conserved in all of the fish proteins (Figure 14). To find out whether this region corresponded to other known peptide motifs, we used a hidden Markov model built from the zebrafish sequences for a BLAST search of the mammalian genomes. No good matches were found. We then searched the three fish genomes. In the zebrafish and in Takifugu we found only those genes we had already identified via their NACHT domains. In the Tetraodon genome many but not all of the matches we found were upstream of NACHT domains or were part of our previous gene predictions. As the remaining ones were again located mainly in the Un_random set, we did not attempt to link them to the predictions for the NACHT domains, for the reasons discussed above. As in the other two fish genomes, none of the matches were within gene predictions for other (non-NACHT-domain) genes. This indicates that this domain, which we will call the Fisna (fish-specific NACHT associated) domain, has been recruited specifically by a common ancestor of the novel NLR proteins in the
fish lineage. Confirming this view, a cursory search of other fish genomes showed highly similar sequences in catfish and Medaka, also associated with NACHT-domain encoding genes, which we did not follow up further.

Figure 13. Phylogenetic tree of NACHT proteins shared by mammals and fish and diagram of their protein structures. In addition to the known proteins Nod1, Nod2, Nod3, Nod9, CIITA, and Apaf, this tree shows that a new protein is shared by all five species, which we have named NACHT-P1. The protein domain structure diagram shown next to NACHT-P3 is representative of the majority of the novel fish proteins. Domain key in the diagram: fish-specific, Filament, BIR, PYD, CARD, NB-ARC, NACHT, LRR, WD40. Apaf, apoptotic protease activating factor; CIITA, major histocompatibility complex class II, transactivator; Nod, nucleotide oligomerization domain containing protein.

Although, as mentioned above, there is no evidence for the presence of this domain other than in fish, we noticed that a short peptide motif within this domain (LK/E/NQ/K/RYITE/D) is also found in mammalian Nod2 (LEDYITE), and another (LYIIEGESEGVNEEHEVLQ) just downstream of the first, in Nod3 (LLLVD/EGLSDLQQK/REHDLM/V/TQ). The region containing these sequences in Nod2 and Nod3 is neither part of the NACHT nor of the CARD domain and has not been assigned a cell biologic function. Their conservation in the new NLR gene families might indicate a shared origin and possibly shared functions.

A similar expansion of NLR-encoding genes was recently described in the sea urchin [47,48]. We compared the predicted sea urchin protein sequences with our sequences. In addition to sharing high similarity with the fish proteins in the NACHT domain and the LRRs, the sea urchin proteins also have a region upstream of the NACHT domain that is highly conserved among the sea urchin set of proteins, and includes sequence motifs similar to those in the fish proteins and in mammalian Nod2.

Peptide motifs in the amino-terminal part of the zebrafish NLR proteins

Further study of the amino-terminal regions of the new zebrafish NLR proteins showed that many of them contained considerable stretches of predicted peptide sequences upstream of the conserved fish domain, in some cases with multiple, related sequence repeats. Manual editing of the automated alignment created by ClustalW [49] revealed the following structure of the amino-terminal regions of this protein family (Figure 15). Based on sequence similarity in the NACHT domain, which is equally recognizable in the Fisna domain, the protein family can be subdivided into four groups (Figure 14). Each of
these groups has further shared motifs upstream of the Fisna domain (Figure 15). The amino-terminal sequences in group […] are highly conserved and not found in any of the other families (darker green shading in Figure 15). A comparison with mammalian proteins showed that they have significant similarity with the pyrin domain found in mammalian Nalp and MEFV (Mediterranean fever) proteins. Group […] has a 101 amino acid stretch upstream of the Fisna domain that is shared by all members of this group (lighter green shading in Figure 15). It shows a distant resemblance to the pyrin domain of group […]. The most amino-terminal sequences in this group contain motifs shared with members from groups […] and […]. A motif shared by members from these three groups is a repeat (different hues of blue shading in Figure 15 indicate different versions of the repeat), which occurs in one, two, or three copies per protein, or in one case, in ten copies. Group […] has a version of this repeat with a four-amino-acid insertion, which is also found in some members of group […]. These repeats are usually combined with a specific amino-terminal peptide of 14 amino acids (pink shading). Other conserved amino-terminal peptides (yellow or orange shading) are associated with a particular type of repeat. Group […] is the least homogeneous, showing divergence both within the group and in comparison with the other groups, in the repeats as well as in the Fisna and NACHT domains. No significant homologies to the repeat sequence are found in mammals.

In summary, the amino-terminal parts of the novel NLR proteins contain up to three different motifs, two of which are found only in fish. The Fisna domain is found in all of the proteins and is located immediately upstream of the NACHT domain. It is specific for this protein family in fish. Groups […] and […] contain a pyrin-related domain upstream of the NACHT domain. Members of groups […] to […] can in addition contain one or more copies of a motif that is also specific for the novel fish NLR proteins. Members of
groups […] and […] contain multiple variants of this motif but no pyrin-domain-like sequences.

Figure 14. The fish-specific domain upstream of the NACHT domain. (a) Alignment of a representative subset of the Fisna domain (the region upstream of the NACHT domain that is shared by all of the novel fish NLR proteins). The group names on the right refer to the subdivision of the Danio rerio groups according to similarities in the NACHT domain and the Fisna domain (also see Figure 15) or indicate which species form the group. Peptide motifs with similarity to Nod2 and Nod3 are underlined. (b) Hidden Markov model (HMM) logo representing the consensus sequence of the Fisna domain in all three fish species. The logo has been generated using the software HMMER [58,59] and visualized using the HMM-Logo web server [60,61]. Peptide sequences from human Nod2 and Nod3 with similarity to short stretches of the Fisna consensus, color coded to highlight conserved residues, are listed underneath, as are stretches from the regions upstream of the NACHT domain present in 140 sea urchin NLR proteins. NLR, nucleotide-binding domain/NACHT domain and leucine rich repeat containing family; Nod, nucleotide oligomerization domain containing protein.

Distribution in the genome

The genes encoding the novel proteins are distributed throughout the genome. Some chromosomes contain single genes, or a few, widely spaced genes, but many of the genes occur in large tandem clusters (Figure 16).

Conclusion

Our findings show that the components of the TLR and class II cytokine signaling systems known from mammals are also found in teleosts. Although all of the main constituents are present, there are differences in the degree to which the various functional groups are conserved. This is the case both for the divergence in sequence and for the creation of new genes by duplications.

The most highly conserved group of proteins are those involved in intracellular signal transduction downstream of the transmembrane receptors: the kinases, adaptors, Stats, Trafs and transcriptional regulators. They exhibit high sequence conservation and largely orthologous relationships, such that for each gene there is one copy in each species, and these genes are more closely related to each other than to other genes of the family. We see only a few cases of duplications. In some cases (Ticam-1, Ticam-2, and IRAK2) there appear to have been gene duplications only in mammals, but more often we find additional genes in the fish genomes. Additional copies of genes in the teleosts need not necessarily be generated by lineage-specific individual gene duplications, but may instead be remnants of the third whole genome duplication postulated for the teleost lineage [3]. We do not see as a general rule that for each mammalian gene there is more than one copy in the zebrafish genome. However, in the highly conserved gene groups we in fact see more duplications of fish genes than of mammalian genes (additional copies for 12 genes in the case of teleosts, although not always in all three species, and only three duplicates in the two mammals). This suggests that at least some of these may indeed be remnants of the third whole genome duplication in teleosts, as is supported by the syntenic organization of the duplicated genes and the flanking genes in the case of the Stat genes.

The family of the class II cytokine receptors is neither highly conserved, nor does it exhibit species-specific expansions. The five species we compared have approximately the same number of receptor chain genes, but the divergence is so great that no reliable orthologies can be established. A similar lack of orthology is seen for the ligands. Apart from the lineage-specific expansions of the type I IFNs, there are similar numbers of class II cytokine genes in the five species, but they cannot be assigned into orthologous groups (with the exception of IL-10 and IFN-γ). The strong divergence also prohibits speculations on which ligand might bind to which receptor in the zebrafish. For one pair this has recently been established experimentally; CRFB1 and CRFB5 are the receptor chains for IFN-ϕ1 and are involved in defense against viruses [18]. Similar studies will be necessary to determine the functions of the remaining ligands and receptors. The rapid evolution of the gene families for the class II cytokines and their receptors probably reflects the fact that the IFN system is frequently subverted by pathogens, resulting in the need for compensatory mutations to escape inactivation. Significantly, the receptor family member that is not primarily associated with pathogen defense, TF, does not exhibit this high level of divergence.

The greatest divergence is found in the NLR protein family, with lineage-specific expansions in each organism, as has also been found for this type of protein in echinoderms [47,48]. Similar, if less extreme, situations are found for the TLRs [6,7] and the novel immune-type receptors [8-10], gene families that also have sets of orthologous receptors in fish and mammals as well as fish-specific expansions. Thus, the elements of the systems that are directly involved in interactions with pathogen components are those that are most likely to diversify by undergoing lineage-specific expansions. Indeed, a study that specifically tested the role of lineage-specific gene families in five eukaryotic species found that the genes that were particularly prone to such expansions included those involved in responses to pathogens [50]. Furthermore, our results are in concordance with recent findings from a
comparison of three insect genomes that showed the following [51]: first, the genes associated with immune functions are on average more divergent than the rest of the genome; and second, the divergence occurs primarily in those genes whose products interact with the pathogen. This study found that in addition to pathogen recognition proteins, this was also the case for the effectors, a set of proteins we have not analyzed in the zebrafish. The expansion of gene families involved in pathogen recognition is likely to reflect adaptations of the species to new pathogen environments.

We have not yet tested whether there is a particularly high level of sequence variability associated with particular parts of the NLR proteins. The number of LRRs varies greatly, but it will be necessary to validate the gene models for each gene before any reliable conclusions can be drawn. It will also be interesting to see whether the genes are more polymorphic than other genes in the genome. The fact that the few ESTs that are available, which are derived from a different strain of zebrafish, do not correspond 100% to any of the gene models is a hint that this might be the case. The function of the NLR genes and the significance of their species-specific expansion will be an exciting topic for experimental analysis.

Alignment panel (a), gap columns omitted. Row labels visible in this part of the alignment: 4.08, 4.44, 4.41. Two aligned amino-terminal peptide sequences are readable:
MANAKQLLKNSLDELVDAELKEFQWYLINDHRDISKAEMENADRLKTVDKMVSCFGPKGAVKTTVDILRKINQNELAEELENKHKQGAVLETCKSPPFDYTNTSRELKKKLKE
MANVKQLLKKSLDELVEDTLKDFQWHLMNDHREISKAEMENADRRNTVDKLVSCFGSERAVKITVDTLRKLNQNQLAEDLENTQKQGAASETCKSPPVDYTHTSHELKEKLKE
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - M A N V K Q R L K D S L D E L K E D T L K D F Q WH L M I D H R E I S T G E L E N A D R R K T V D K L V S C F G S E R A V K I T V D T L R K I K Q N Q L A E E L E K K Q Q Q G A A S E T C K S P P V D Y T N T S H E L K E K L K E - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - MA N V K Q L L D N S L D E L L E A E L K K F Q R C L V N D H R D I S K A E ME N A D R L D T V D K MV S C F G S E R A V K I T V N T L R K I K Q N Q L A E E L E N T Q K Q G A A S E S C K S P P V D Y T N T S R E L K E K L K E - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - M A N V K Q L L D N S L D E L L E A E L K K F Q WC L V N D H N E I S K A E L E N A D R L D T V D K M V S C F G S E R A V K V T V D T L R K I K Q N D L A D Q L E N T Q N Q G T A L E N C K T L P L D Y T N I S H E L K K K L K E 14.14 23.05 4.42 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - M A N V K Q L L K N S L D E L R E A E L K E F Q WC L V N D H R E I S T A E L E N A 
D R L K T V D K L V S C F K P E R A L K T T V D T L R K I K Q N E L A E E L E N T Q K Q G A A S E T C N S P P V D Y T N T S R E L K K K L K E - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - M E N V K Q L L K N S L K E L V E V E L K E F Q WC L R N D Y R C I S K S E M E N A D R L E T V D K M E S C F G P E G A V K I T V D I L R K I N Q N D L A E K L E N T Q K Q - - A S E N S K T P - L D Y T N I S H E L K K N L K E 14.40 4.30 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - M A S V E E L L L K S L E D L E N P E L K K F Q WH L K K Y P K R I Y K C E M E K A D R L D T V D K M V E C F G A E D A V N N T V S I L R K I N Q N N L A E Q L E N E H K N Q G S A S A D S K Q V L Q E N - - S K R L K D K L K Q 4.14 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - M A E E R V K G S L S E K H - - - - - - - - - - - S V R S G S F V S S S V S L K S D WS K G G P P P D L R G K T P S S V K S - - - - - - - - - - V R S G S C V S S S V S L K S D WS M G H L P P D L R E K T P S S A - - - - - - - - - - - - - - - - - - - - - R R H L V A D L V S Y S - - - - - - - - D N L Q WI F Q N L E S K M I R F L K N Q L E N F R K I L Q H K N R Q E F I K E F I E N R S I L T E A A L D L T L F F L R E M K Q D Q A A D T L Q G - - - - - - - - - - - - - - - - - E L F F I N Q L K C S L K K - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - M A E E R V K D S L S E K H - - - - - - - - - - - S V R S G S R V S S S V S L K S D H S K D G R P P N F R E K T P S S V K S - - - - - - - - - - V R S G S R V S S S V S L K S D C S K G G P S P D L R E K T P S S A - - - - - - - - - - - - - - - - - - - - - R R H L V A D L V S Y S - - - - - - - - D N L Q WI F Q N L E S K M F R F L K N Q L E N F R K I L Q H K N R Q E F I K E F N E N R S G I T E A A L D L T L F F L R E M K Q D K A A D T L E G - - - - - - - - - - - - - - - - - E L F F I N H F K C S L K K - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - MA E E R V K D S L S E K H - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - S V R S G S C V S S S V S L K S D R S K D P P P G F S G E R A S S A Q S - - - - - - - - - - - - - - - - - - - V E Y E - S D S G D E T H R R H K S F T - D N L Q S I F Q N L E S K MI R F L K N E L E K F K K I L K E E N R Q E F V K E F N E N R S I I T E A A L D L T L L F L R E MK Q D Q A A D T L Q G - - - - - - - - - - - - - - - - - E L L F N N Q L K C S L K K - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - MA E E R V K D S L S E K H - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - S V R S G S C V S S S V S L K S D R S K D R P P E F S G E T P S S A Q S - - - - - - - - - - - - - - - - - - - V E Y E S A D S G D E T H R R H K S F T - D N L Q S I F Q N L E S K MI R F L K N Q L E N F K K I L Q E E N R Q E F V K E F N E N R S I I T E A A L D L T L F F L R E MT Q D Q A A D T L Q G - - - - - - - - - - - - - - - - - E L L Y I N Q L K C S L K K 4.31 4.21 4.46 4.50 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - S V S L K S D R S K Q G Y P P D L R E E T P S S V R S - - - - - - - - - - - - - - - - - - V E Y E A A D L G D E T H R R H K S F T - D N L Q S I F Q N L E S K MI R F L K N E L E K F K K I L Q E E N R E E F V K E F N E N R S I I T E A A L D L T L F F L R E MK Q D Q A A D T L Q G - - - - - - - - - - - - - - - - - E L F F I N Q L K C S L K K - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - V F S V R S G S - - S S S V S L K S D WS K D E - P P K F S E K S P S S A R S - - - - - - - - - - - - - - - - - - V E Y E L P Y S G D K T H R S H K G F T - K D L P R I F Q N L E S K I I R Y L K N E L E K F K K I L Q E E N R Q D F V K H F N E N R S I I T E A A L D L T L F F L R E M K Q D E A A D T L E G - - - - - - - - - - - - - - - - - E L F F I N Q L K C G L K K 4.12 4.49 4.19 4.38 4.34 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - M A E E R V K D S L S E K H - - - - - - - - - - - S V R S G S C - - - - - - M K S D WS K E D - P P E F S G E T P S P A Q I - - - - - - - - - - V D S G S Y V S S S V S L K S N WS K N H - P P T L T G K E P S S A K S - - - - - - - - - - - - - - - - - - V E Y E L P E P R H R T H R R H K S F T - D N L V WI F Q N L E N K I D R 
F L K N E L K K F K K I L Q E E N R Q D F V K N F N D N R C R I T E A A L D L T L F F L R D M K Q D E V A D T L Q G - - - - - - - - - - - - - - - - - E L F F I N Q L K C S M K K - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - ME D T H T - - D L D L S T G C - - S S V H Q K R A E A E P S C V S - - - - MK S D A S MT - P P V K F K S G N T G A A V S - - - S V H Q K R A E A E P S C V S - - - - MK S D V S MD I - P I K F R S K N T Q P A V S - - - - - - - - - - - - - - - - - - V E Y E S A D S G D E T H R R H K S F T - D N L Q S I F Q N L E S K MI R F L K N E L G N F K K I L Q E E N S Q E F V K E F T E N R S I I R E A A L D L T L F F L R E MK Q D Q A A D I L E D - - - - - - - - - - - - - - - - - E L F F I N Q L K C S L K R - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - MA E E R V K D S L S E K H - - - - - - - - - - - S V R S G S C V S S A V S L K S D G S MG P P P E L S E K S P S 
S A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Q S V D G D D Q T G D L Q Q D S L Q P E H D E L Q R V K E Q H K T S MK N - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - MA E E R V K D S L S E K H - - - - - - - - - - - S V R S G S R V C S S V S L K S N R S K D N P P I F R E K T Q S F A K S - - - - - - - - - - - V R S G S C V S S Y V S L K S D R S K D E - P P Y F R E K T - S S A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - K S V D G D D Q T G D L Q Q D S L Q P E H D E L Q R V K E Q H K T S MK N - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - V R S G S C V S S S V S MK S D G S MG H P P D L R E K T L S S A K S - - - - - - - - - - - V R S E S C V S S S V S L K S D G S MG K - P P D L R E K T P S S A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - E S V D G D D Q T G D L Q Q D S L Q P E H D E L Q R V K E Q H K T S MK N - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - R S G S C V S S S V S L K S D R S K D R P L N F R E K T P S S A K R Y F V - - - - - - - - - - - - - - - - - - - S V H Q K R A E A E P S C V S - - - - MK S D A S ME R P - I A F K S G N T G P A V S - - - S V H Q K R A E A E P S C L S - - - - MK S D Q S MG V - P I T F K S E N T R P A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - V S V D G D D Q T G D L Q Q D S L Q P E H D E L Q R V K E Q H K T S MK N - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - S V H Q K R A E A E P S C V S M - - - - K S D WS M D P P V K F K S G N T G - - - - - - - - - - - - - - - - - - - - - - - - - - A A V R R R A E A E P S C V S - - - - L K S D A S M G H P E N N F R S E H T P P A L S - - - S Q K R R R A E A E P S C V S - - - - L K S D A S M N H P K T D F R S E H T P P V V V - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - F S V D G D D Q T G D L Q Q D S L Q P E H D E L Q R V K E Q H K T S M K N Y V A Y L L S A P C P A I I L K C L L E F L L D I T L Y T S N - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - ME D T H S S G D Q D L S T G C - - S P R K R K R E E A E P S C V S - - - - MR S D Q S MG E P - L T F K T E N T Q P V V S - - - S C K R R R A E A E P S C V S - - - - L K S D Q S ML - T P L S F R S E H T Q P A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - V - - - R D D Q T G D L Q Q D L L Q P E H D E L Q R V K E Q H K T S MK N - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - MG N T H T A G D Q D L S T G C - - S S V H Q K R A E A E P S C V S - - - - MK S D K S MA L P - I N F K S G N T R P A V S - - - S V H Q K R A E A E P S C V S - - - - MK S D A S MN - P P - T F K N E N T G P A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - V S V D G D D Q T G D L Q Q D S L Q P E H D E L Q R V K E Q H K T S MK N 4.02 4.01 18.03 4.28 18.11 4.17 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - MG N T H T A G D Q D L S T G C - - S S V H Q K R A E A E P S C V S - - - - MK S D A S MH - P P I H F K S G N T G P A V S - - - L V H Q K R A E A E P S C V S - - - - MK S D K S MI - Q P I T F R S E N T R P A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - V S V D G D D Q T G D L Q Q D S L Q P E H D E L Q R V K E Q H K T S MK N - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - MQ K R A E A E P S C V S - - - - MK S D A S MG V P - I T F K S E N T G P A V S - - - S V H Q K R A E A E L S C V S - - - - MK S D A S MN - P P I E F K S G N T G P A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - V S V D R D G Q T G D L Q Q D S L Q P E H D E L Q R V K E Q H K T I MK N 4.15 4.09 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - ME D K H T P G D L D L S T G C - - S S V H Q N R A E A E P S C V S - - - - MK S D V S MD - P P T N F K S G N T G P 
A V S - - - S R K R K R A E A E P S C V S - - - - MK S D A S MD - P P L K F K S K N T Q P A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - V S V D R D D Q S G D L L Q D S L Q P E H D E L Q R V K E Q H K T T MK N - M F M E E R E T K L C E M S G S F F C D R A V N S Q L K R K R A A S P L S S C V S L K S N R S I R L P P D L S D G A V N S D S - - - - V V E H G G G F R T V S T L H K Y A C K L T L D P N T A N N N L V L S E E N R T V T R V R R K WS R S Y P D H S D R F D E I N P Q V V C K E S L T G R C Y WE A Q WS G S S A E I A V C Y Q G I K R K G G T D D C V F G R N D Q S WS L F C T D Q L F S V N H N N E R T D I S V N P D S S R T V G V F V D E S S G S L S F Y R V S D K L T Q L H I I N T T F T D T L H A G F R L G Y K - - - - S S V S L S S P L S S C V S L K S N Q S I G V R P D L S D G P V N S E S - - - - - - - - - - - - - - - - - - - - - - - - - - - - V R N G H R S P L S S C V S - - - - L K S N Q S I G V R P D L S D G A V N S D S V K S S K K - R K K A E S L L S S C V S - - - - L K S D Q S I G V R P D L S D G A V N S D S V K S S K K - R K K A E S L L S S C V S L K S D Q S I G V R P D L S D G P V N S E S V K S S K K R K K A E S L L S S C V S L K S N K S I G I R P D L S D G P V N S D S V S S S Y Q K H T S H Y K T E A Y I Q I E S Q Q P V N - - - - - - - - - - - - - - - - - - - - D D L Q R V K E Q H K F I M K N - - - M E R K A K H P C E M S - - V F E E R E R E A F I Q T V R A V S P L S S C V S M M C N H S I G L P P D R S D G A V S S D F V WE R T R S P L S S C V S M K S N N S M G L P P D L S D G A V N S E F - - - L K R K R E E S S L S S S M F I K S D H P F G L P L Y L S D R A V N S D S V - - - - - - - - - - - - - - - - - WE R T R S P L S S C V S M K S N N S M G L P P D L S D G A V N S E F - - - - - - - - - - - - - - L K R K R E E S S L S S S M F I K S D H P F G L P L Y L S D R A V N S D S V - - - - - - - - - - - 
[Figure residue: multiple sequence alignment of protein sequences; the residue-level alignment is not recoverable from the extracted text. A panel marker "(b)" appears with the sequence labels NALP4_MOUSE, NALP5_HUMAN, NALP7_HUMAN, MEFV_MOUSE, MEFV_RAT and MEFV_HUMAN. Further sequence labels in the figure: 4.08, 4.44, 4.41, 14.14, 23.05, 4.42, 14.40, 4.30, 4.14, 4.31, 4.21, 4.46, 4.50, 4.19, 4.12, 4.49, 4.38, 4.34, 4.02, 4.01, 18.03, 4.28, 18.11, 4.17, 4.15, 4.09, 3.06, 3.08, 1.25, 1.27, 1.26, 1.20, 1.28, 1.31, 1.13, 1.21, 1.18, 1.22, 1.24, 1.17, 1.01, up06, 15.07a, 25.01, 15.03, 15.04.]
- S S QK K P K V V A F V QS S K S K N E T C I I QE N T E T P L QR QA L E T GD L QR V K D H H K - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - S D A L D K P D Y E K L H - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - R I K D QH K - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - I K L QK I K D QH K - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - C QN QR H M - - - - - - - - - - - - I QE N T E T V L QR QT L E T GD L QR V K D QH K - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - R R E R L E S P E F S N V S V K S S R S ME QP MR F S GE P V T S D P H MMGF T S S F H Y K N QGH I - - - - - - - - - - - - S QD N T A T I L QMQT L E T V N L QR V K D QH K - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - H F D QE I I E P V F - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - QT QA L E T GD L QR I K D QY K - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - T P R L E S P E L S GV S V K S D WS I E R P P A F S N E P V T S D P R R C D I H E R C V N I T QS N I GS - - - - - - - - - - - - - - - - - - - - - QT L K T R D L QR V K D Q L K - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - NM LE K I K NK HK - - - - - - - - - - - - - - - - - - - - - - - - - - - GR 
K K E N L S QS QS R C GV S E S R P T E D I D C P P C R K R C R WGS S F S GS A H G I T R A D QQE N P G I L E K S V - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - H D E V QR V K D QH K - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HT E LT F S HDH - - - - - - - - - LHA E V HK T F R - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - R T E L R F S H GP - - - - - - - - - QH A E A T K T Y R - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - H V R S GD T E T D - - - - - - - - - L S H E A L N T F R - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - H V R S GD T E T D - - - - - - - - - L S H K A L N T F R - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - T K Y Y QD I L T A D S L P GR F I H QQD S E T H R QK I QR - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - F K - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - QQT K V S E I Q - - QA - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - L K Group Figure 15 Structure of the amino-termini of the new zebrafish NLR proteins Structure of the amino-termini of the new zebrafish NLR proteins ClustalW alignment of the set of 70 predicted NLR 
proteins was truncated four amino acids downstream of the start of the Fisna domain, and the alignment of the remaining amino-terminal sequences was edited manually using Jalview [62]. Sequences that did not extend significantly beyond the Fisna domain were deleted, as were some sequences in groups with many similar or identical sequences. The remaining sequences represent a set of characteristic compositions of motifs found in the amino-terminal part of this family of proteins. (a) Overview of the alignment with characteristic sequence motifs shaded in color: green = pyrin-like domain in groups 1 and 2; blue = repeated motif (different shades of blue mark different versions of the repeat); yellow/orange tones = conserved amino-terminal amino acids; and pink = specific amino-terminal peptide of 14 amino acids. (b) Details of the alignment in panel a, in which amino acid similarities and identities are highlighted in ClustalW colors. A set of mammalian PYD domains is aligned above the zebrafish group 1 and group 2 pyrin-like domains to illustrate the similarity. NLR, nucleotide-binding domain/NACHT domain and leucine rich repeat containing family.

Materials and methods

Software
Standard web-based programs were used for sequence comparisons, alignments, and phylogenies. The phylogenetic trees in the figures were generated using the MEGA software package [27]. In all phylogenetic trees presented in this study, complete sequences were used rather than only the conserved domains. The alignments for generating the phylogenetic trees were performed with ClustalW using the Blosum matrix with standard parameters. For the phylogenetic reconstruction, the neighbor-joining method [52] was used with a bootstrap test of 1,000 replicates. Gaps and missing data were treated as pair-wise deletions. Manual annotations of genes were carried out by the Havana group at the Sanger Institute, in accordance with human annotation workshop guidelines [53].

Search for class II cytokine receptor genes
To identify
class II cytokine receptor genes, we searched the zebrafish genome and all available zebrafish ESTs for the subdomains SD100A and SD100B, running the Prosite protein annotation [54] with the hidden Markov model matrices with accession numbers PS50299 (SD100A) and PS50300 (SD100B). The screen of genomic sequences encoding SD100A or SD100B domains identified 12 genes, of which two encoded titins, one encoded thrombopoietin, eight were cytokine class II receptor genes previously found to belong to the Interpro IPR000282 family, and one (GENSCAN0000036149) was a previously unidentified gene of this class.

Figure 16. Chromosomal locations of zebrafish NLR proteins. The 11 chromosomes containing the main clusters of NLR genes are shown (scale in Mbp). The number of NLR genes on each chromosome is listed below the chromosome number; in the order shown, the numbers of NACHT domain genes are 32, 9, 50, 10, 47, 11, 12, 11, 20, 10, and 25. A further 42 genes are distributed on 11 other chromosomes, and 20 genes are on as yet unplaced contigs. This list includes a compilation of all predictions (automated as well as manually annotated) and locations of hits from a TBLASTN search for NACHT domains. Future improvements of the genome assembly and further manual annotations will most likely result in minor changes of this map. Genes are denoted by lines on the right of the chromosome irrespective of orientation. NLR, nucleotide-binding domain/NACHT domain and leucine rich repeat containing family.

To screen the ESTs, we first translated every EST sequence in
the six possible frames and then searched for the subdomains. We followed a similar procedure with all the ab initio predictions (Genscan and Fgenesh) obtained in the analysis of the zebrafish Zv6 assembly [24]. From the EST analysis we obtained 69 different sequences, of which 14 encoded both subdomains. Comparison of the 69 sequences showed that they represented 20 different genes, for which we analyzed the known or predicted full-length sequences in more detail. One of the ESTs (accession CK692344) was not represented in the zebrafish genome (neither assembly Zv6 nor trace sequences) and turned out to correspond to a mouse gene. Three sequences had only spurious resemblances to SD100A or SD100B encoding sequences, often over very short stretches, and encoded known proteins with other functions. This left 16 potential candidates for cytokine class II receptor encoding genes, which we named zf1 to zf16. Six of these had also been identified by the genomic screen. Two candidates from the genomic screen were not in this group, because no ESTs exist for them. We named these candidates zf17 and zf18. We then assessed the annotations of zf1 to zf18, and annotated or re-annotated the sequences manually if no annotations existed (zf1, zf2, zf6, and zf14) or if the previous annotations appeared incomplete or incorrect. This analysis showed that twelve of the genes encoded proteins with the characteristics of class II cytokine receptors.

Search for new NLR proteins
For the manual annotation of NLR genes in the zebrafish genome, we initially used the ESTs with the accession numbers CF347458.1, CD284951.1, CO915312.1, CF266152.1, BM534859.1, and DT055906.1 as guides. The ESTs were not 100% identical to any of the genomic sequences we identified, which may be due to polymorphisms between the strains from which the genome sequence and the ESTs were derived. The NLR proteins were identified as follows. A TBLASTN search of the Ensembl zebrafish genome assembly Zv4 with the mammalian Nalp3 gene identified more than 200 sites in the genome encoding complete or partial NACHT domains. A collection of 170 NACHT-domain encoding zebrafish genes from the NCBI database, which only partly overlapped the set identified by TBLASTN, was also mapped onto the genome. The merged list of the two nonoverlapping sets of sites in the genome was sorted by chromosomal location, and each site was given a number (chromosome number plus numerical ordering). The regions containing the potential genes were then further refined using available ESTs and gene predictions as guides. The resulting sequences were blasted against the finished and unfinished clone sequences, and the hits on finished clones were finally manually annotated. For further refinement of annotations we also used the motifs identified in Figure 15, in particular to improve the predictions for the full amino-terminal extensions of the genes.

Abbreviations
Apaf1, apoptotic protease activating factor 1; CRFB, cytokine receptor family B; EST, expressed sequence tag; Fisna, fish-specific NACHT associated; IL, interleukin; IFN, interferon; IFNAR, interferon-α receptor; IFNGR, interferon-γ receptor; IL10R, interleukin-10 receptor; IL22BP, interleukin-22 binding protein; IRAK, interleukin-1 receptor associated kinase; IRF, interferon response factor; LRR, leucine-rich repeat; NF-κB, nuclear factor-κB; NLR, NACHT-domain and leucine rich repeat containing; Nalp, NACHT, leucine rich repeat and PYD containing protein; Nod, nucleotide oligomerization domain containing protein; Stat, signal transducer and activator of transcription; Tab, Tak1-binding protein; TF, tissue factor; Ticam, Toll-interleukin receptor domain (TIR) containing adaptor molecule; TLR, Toll-like receptor; TNF, tumor necrosis factor; Traf, TNF-receptor associated factor.

Authors' contributions
CS and ML conducted BLAST searches, made alignments and phylogenetic trees, made the figures and wrote the text. MC identified the cytokine and cytokine receptor genes and analyzed their genomic contexts. GL made the manual annotations of the novel NLR genes and cytokine receptor genes.

Additional data files
The following additional data are available with the online version of this paper. Additional data file 1 lists the kinase protein sequences in FASTA format. Additional data file 2 lists the adaptor protein sequences in FASTA format. Additional data file 3 lists the IRF protein sequences in FASTA format. Additional data file 4 lists the Stat protein sequences in FASTA format. Additional data file 5 lists the Traf protein sequences in FASTA format. Additional data file 6 lists the class II cytokine receptor protein sequences in FASTA format. Additional data file 7 lists the class II cytokine protein sequences in FASTA format. Additional data file 8 lists the NLR protein sequences in FASTA format, except for the zebrafish-specific NLRs. Additional data file 9 lists the zebrafish-specific NLR protein sequences in FASTA format. Additional data file 10 is a high-resolution version of the large phylogram of 277 NLRs presented in Figure 12.

Acknowledgements
This work was supported by the Wellcome Trust and the European Molecular Biology
Organization. ML thanks Richard Durbin, Kerstin Jekosch, and staff at the Sanger Center for providing space and a stimulating sabbatical environment. We thank our colleagues, in particular Jonathan Howard, for discussions and suggestions, Dale Richardson for assembling the set of NCBI NACHT-domain predictions, and Jane Parker and Jeff Dangl for comments on the manuscript. Jonathan Rast very kindly provided a file with the sequences of the sea urchin NACHT domain proteins. We are especially thankful to Georges Lutfalla and Dina Aggad for sharing ideas and information, and for generously providing the sequences of DrCRFB14 and IFN-ϕ4.

References
1. Trede NS, Langenau DM, Traver D, Look AT, Zon LI: The use of zebrafish to understand immunity. Immunity 2004, 20:367-379.
2. Venkatesh B: Evolution and diversity of fish genomes. Curr Opin Genet Dev 2003, 13:588-592.
3. Volff JN: Genome evolution and biodiversity in teleost fish. Heredity 2005, 94:280-294.
4. Traver D, Herbomel P, Patton EE, Murphey RD, Yoder JA, Litman GW, Catic A, Amemiya CT, Zon LI, Trede NS: The zebrafish as a model organism to study development of the immune system. Adv Immunol 2003, 81:253-330.
5. Nonaka M, Kimura A: Genomic view of the evolution of the complement system. Immunogenetics 2006, 58:701-713.
6. Meijer AH, Gabby Krens SF, Medina Rodriguez IA, He S, Bitter W, Ewa Snaar-Jagalska B, Spaink HP: Expression analysis of the Toll-like receptor and TIR domain adaptor families of zebrafish. Mol Immunol 2004, 40:773-783.
7. Jault C, Pichon L, Chluba J: Toll-like receptor gene family and TIR-domain adapters in Danio rerio. Mol Immunol 2004, 40:759-771.
8. Litman GW, Hawke NA, Yoder JA: Novel immune-type receptor genes. Immunol Rev 2001, 181:250-259.
9. Yoder JA, Mueller MG, Wei S, Corliss BC, Prather DM, Willis T, Litman RT, Djeu JY, Litman GW: Immune-type receptor genes in zebrafish share genetic and functional properties with genes encoded by the mammalian leukocyte receptor cluster. Proc Natl Acad Sci USA 2001, 98:6771-6776.
10. Yoder JA, Litman RT, Mueller MG, Desai S, Dobrinski KP, Montgomery JS, Buzzeo MP, Ota T, Amemiya CT, Trede NS, et al.: Resolution of the novel immune-type receptor gene cluster in zebrafish. Proc Natl Acad Sci USA 2004, 101:15706-15711.
11. Panagos PG, Dobrinski KP, Chen X, Grant AW, Traver D, Djeu JY, Wei S, Yoder JA: Immune-related, lectin-like receptors are differentially expressed in the myeloid and lymphoid lineages of zebrafish. Immunogenetics 2006, 58:31-40.
12. Bobe J, Goetz FW: Molecular cloning and expression of a TNF receptor and two TNF ligands in the fish ovary. Comp Biochem Physiol B Biochem Mol Biol 2001, 129:475-481.
13. Engelsma MY, Huising MO, van Muiswinkel WB, Flik G, Kwang J, Savelkoul HF, Verburg-van Kemenade BM: Neuroendocrine-immune interactions in fish: a role for interleukin-1. Vet Immunol Immunopathol 2002, 87:467-479.
14. Zou J, Secombes CJ, Long S, Miller N, Clem LW, Chinchar VG: Molecular identification and expression analysis of tumor necrosis factor in channel catfish (Ictalurus punctatus). Dev Comp Immunol 2003, 27:845-858.
15. Huising MO, Stet RJ, Savelkoul HF, Verburg-van Kemenade BM: The molecular evolution of the interleukin-1 family of cytokines; IL-18 in teleost fish. Dev Comp Immunol 2004, 28:395-413.
16. Reboul J, Gardiner K, Monneron D, Uze G, Lutfalla G: Comparative genomic analysis of the interferon/interleukin-10 receptor gene cluster. Genome Res 1999, 9:242-250.
17. Lutfalla G, Roest Crollius H, Stange-Thomann N, Jaillon O, Mogensen K, Monneron D: Comparative genomic analysis reveals independent expansion of a lineage-specific gene family in vertebrates: the class II cytokine receptors and their ligands in mammals and fish. BMC Genomics 2003, 4:29.
18. Levraud JP, Boudinot P, Colin I, Benmansour A, Peyrieras N, Herbomel P, Lutfalla G: Identification of the zebrafish IFN receptor: implications for the origin of the vertebrate IFN system. J Immunol 2007, 178:4385-4394.
19. Baoprasertkul P, Peatman E, Somridhivej B, Liu Z: Toll-like receptor and TICAM genes in catfish: species-specific expression profiles following infection with Edwardsiella ictaluri. Immunogenetics 2006, 58:817-830.
20. Ben J, Jabs EW, Chong SS: Genomic, cDNA and embryonic expression analysis of zebrafish IRF6, the gene mutated in the human oral clefting disorders Van der Woude and popliteal pterygium syndromes. Gene Expr Patterns 2005, 5:629-638.
21. Lewis RS, Ward AC: Conservation, duplication and divergence of the zebrafish stat5 genes. Gene 2004, 338:65-74.
22. Oganesyan G, Saha SK, Guo B, He JQ, Shahangian A, Zarnegar B, Perry A, Cheng G: Critical role of TRAF3 in the Toll-like receptor-dependent and -independent antiviral response. Nature 2006, 439:208-211.
23. van der Sar AM, Stockhammer OW, van der Laan C, Spaink HP, Bitter W, Meijer AH: MyD88 innate immune function in a zebrafish embryo infection model. Infect Immun 2006, 74:2436-2441.
24. Birney E, Andrews D, Caccamo M, Chen Y, Clarke L, Coates G, Cox T, Cunningham F, Curwen V, Cutts T, et al.: Ensembl 2006. Nucleic Acids Res 2006, 34:D556-561.
25. Sprague J, Doerry E, Douglas S, Westerfield M: The Zebrafish Information Network (ZFIN): a resource for genetic, genomic and developmental research. Nucleic Acids Res 2001, 29:87-90.
26. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol 1990, 215:403-410.
27. Kumar S, Tamura K, Nei M: MEGA3: Integrated software for molecular evolutionary genetics analysis and sequence alignment. Brief Bioinform 2004, 5:150-163.
28. Sullivan C, Postlethwait JH, Lage CR, Millard PJ, Kim CH: Evidence for evolving Toll-IL-1 receptor-containing adaptor molecule function in vertebrates. J Immunol 2007, 178:4517-4527.
29. Miller DJ, Hemmrich G, Ball EE, Hayward DC, Khalturin K, Funayama N, Agata K, Bosch TC: The innate immune repertoire in cnidaria - ancestral complexity and stochastic gene loss. Genome Biol 2007, 8:R59.
30. Kedinger V, Alpy F, Tomasetto C, Thisse C, Thisse B, Rio MC: Spatial and temporal distribution of the traf4 genes during zebrafish development. Gene Expr Patterns 2005, 5:545-552.
31. Krause CD, Pestka S: Evolution of the Class 2 cytokines and receptors, and discovery of new friends and relatives. Pharmacol Ther 2005, 106:299-346.
32. Altmann SM, Mellon MT, Distel DL, Kim CH: Molecular and functional analysis of an interferon gene from the zebrafish, Danio rerio. J Virol 2003, 77:1992-2002.
33. Igawa D, Sakai M, Savan R: An unexpected discovery of two interferon gamma-like genes along with interleukin (IL)-22 and -26 from teleost: IL-22 and -26 genes have been described for the first time outside mammals. Mol Immunol 2006, 43:999-1009.
34. Chen JY, You YK, Chen JC, Huang TC, Kuo CM: Organization and promoter analysis of the zebrafish (Danio rerio) interferon gene. DNA Cell Biol 2005, 24:641-650.
35. Milev-Milovanovic I, Long S, Wilson M, Bengten E, Miller NW, Chinchar VG: Identification and expression analysis of interferon gamma genes in channel catfish. Immunogenetics 2006, 58:70-80.
36. Long S, Milev-Milovanovic I, Wilson M, Bengten E, Clem LW, Miller NW, Chinchar VG: Identification and expression analysis of cDNAs encoding channel catfish type I interferons. Fish Shellfish Immunol 2006, 21:42-59.
37. Zou J, Carrington A, Collet B, Dijkstra JM, Yoshiura Y, Bols N, Secombes C: Identification and bioactivities of IFN-gamma in rainbow trout Oncorhynchus mykiss: the first Th1-type cytokine characterized functionally in fish. J Immunol 2005, 175:2484-2494.
38. Zou J, Yoshiura Y, Dijkstra JM, Sakai M, Ototake M, Secombes C: Identification of an interferon gamma homologue in Fugu, Takifugu rubripes. Fish Shellfish Immunol 2004, 17:403-409.
39. Long S, Wilson M, Bengten E, Bryan L, Clem LW, Miller NW, Chinchar VG: Identification of a cDNA encoding channel catfish interferon. Dev Comp Immunol 2004, 28:97-111.
40. Robertsen B, Bergan V, Rokenes T, Larsen R, Albuquerque A: Atlantic salmon interferon genes: cloning, sequence analysis, expression, and biological activity. J Interferon Cytokine Res 2003, 23:601-612.
41. Zhang DC, Shao YQ, Huang YQ, Jiang SG: Cloning, characterization and expression analysis of interleukin-10 from the zebrafish (Danio rerio). J Biochem Mol Biol 2005, 38:571-576.
42. Zou J, Clark MS, Secombes CJ: Characterisation, expression and promoter analysis of an interleukin 10 homologue in the puffer fish, Fugu rubripes. Immunogenetics 2003, 55:325-335.
43. Zou J, Tafalla C, Truckle J, Secombes CJ: Identification of a second group of type I IFNs in fish sheds light on IFN evolution in vertebrates. J Immunol 2007, 179:3859-3871.
44. Damiano JS, Oliveira V, Welsh K, Reed JC: Heterotypic interactions among NACHT domains: implications for regulation of innate immune responses. Biochem J 2004, 381:213-219.
45. Koonin EV, Aravind L: The NACHT family: a new group of predicted NTPases implicated in apoptosis and MHC transcription activation. Trends Biochem Sci 2000, 25:223-224.
46. van der Biezen EA, Jones JD: The NB-ARC domain: a novel signalling motif shared by plant resistance gene products and regulators of cell death in animals. Curr Biol 1998, 8:R226-227.
47. Hibino T, Loza-Coll M, Messier C, Majeske AJ, Cohen AH, Terwilliger DP, Buckley KM, Brockton V, Nair SV, Berney K, et al.: The immune gene repertoire encoded in the purple sea urchin genome. Dev Biol 2006, 300:349-365.
48. Rast JP, Smith LC, Loza-Coll M, Hibino T, Litman GW: Genomic insights into the immune system of the sea urchin. Science 2006, 314:952-956.
49. Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 1994, 22:4673-4680.
50. Lespinet O, Wolf YI, Koonin EV, Aravind L: The role of lineage-specific gene family expansion in the evolution of eukaryotes. Genome Res 2002, 12:1048-1059.
51. Waterhouse RM, Kriventseva EV, Meister S, Xi Z, Alvarez KS, Bartholomay LC, Barillas-Mury C, Bian G, Blandin S, Christensen BM, et al.: Evolutionary dynamics of immune-related genes and pathways in disease-vector mosquitoes. Science 2007, 316:1738-1743.
52. Saitou N, Nei M: The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol 1987, 4:406-425.
53. Havana [http://www.sanger.ac.uk/HGP/havana/hawk.shtml]
54. Prosite [http://www.expasy.org/prosite/]
55. Pfam [http://www.sanger.ac.uk/Software/Pfam]
56. Smart [http://smart.embl-heidelberg.de]
57. Renauld JC: Class II cytokine receptors and their ligands: key antiviral and inflammatory modulators. Nat Rev Immunol 2003, 3:667-676.
58. Eddy SR: Profile hidden Markov models. Bioinformatics 1998, 14:755-763.
59. HMMER [http://bioweb.pasteur.fr/seqanal/interfaces/hmmbuild.html]
60. Schuster-Bockler B, Schultz J, Rahmann S: HMM Logos for visualization of protein families. BMC Bioinformatics 2004, 5:7.
61. HMM logo web server [http://www.sanger.ac.uk/cgi-bin/software/analysis/logomat-m.cgi]
62. Clamp M, Cuff J, Searle SM, Barton GJ: The Jalview Java alignment editor. Bioinformatics 2004, 20:426-427.
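
Illustrative note on the EST screen
The six-frame translation step described under "Search for class II cytokine receptor genes" can be illustrated with a short Python sketch. It is only a minimal example under stated assumptions and not the pipeline used in this study: the function names are hypothetical, the standard genetic code is assumed, codons containing ambiguous bases are reported as X, and the subsequent profile search of the translated frames is not shown.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the standard table is appropriate for the ESTs being screened).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement; non-ACGT symbols are left unchanged."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons only; unknown codons become 'X'."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translations(seq):
    """Hypothetical helper: map frame labels (+1..+3, -1..-3) to peptides."""
    seq = seq.upper()
    frames = {}
    for strand, template in (("+", seq), ("-", reverse_complement(seq))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(template[offset:])
    return frames


if __name__ == "__main__":
    for frame, peptide in six_frame_translations("ATGGCCAAGTGA").items():
        print(frame, peptide)
```

Each EST yields six peptide strings, keyed +1 to +3 for the forward strand and -1 to -3 for the reverse complement; each string would then be scanned for the SD100A and SD100B subdomains.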

