• Login
    View Item 
    •   Home
    • Massey Documents by Type
    • Theses and Dissertations
    • View Item
    •   Home
    • Massey Documents by Type
    • Theses and Dissertations
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Factors affecting the performance of phylogenetic methods : a thesis presented in partial fulfilment of the requirements for the degree of Ph.D. in Mathematics at Massey University

    Icon
    View/Open Full Text
    02_whole.pdf (5.424Mb)
    01_front.pdf (1.202Mb)
    Export to EndNote
    Abstract
    This thesis comprises several computer simulation experiments in which the performance of a selection of phylogenetic methods was assessed. Data were generated according to a known model and used as input for the phylogenetic methods. Some new methods were introduced, and their performance compared with extant methods. Performance was judged by several criteria, being accuracy, consistency, efficiency, falsifiability and robustness. The experiments were designed to be biologically relevant, and yet computationally tractible. Hence the models of evolution used were simple, to allow a wide range of parameters to be tested for their effects within the bounds of available computing resources. The experiments were divided into two main types, the "small n" with up to 10 taxa, and the "large n" with from 10 to 30 taxa. Parameters which were allowed to vary in the "small n" case included number of taxa (n), sequence length, tree topology, edge length probability distribution, and purity of data. In the "large n" case, number of taxa, sequence length, and edge length probability distribution were varied. The simulation experiments show that the accuracy of phylogenetic methods decreases with increasing n, and that the mean number of internal edges of the generating tree which are incorrectly inferred increases at least linearly with n. The rate at which the sequence length must increase with n, to retain a fixed confidence in the inferred tree, is shown to be at least linear in n. All the methods are approximately as susceptible as each other to sampling error, which is exacerbated by the generating tree having very short or very long internal edges, and by finite sequence length. All the methods are susceptible to random error such as sequencing error, but provided such error is small, the effect is not great. One type of method, using edge lengths inferred by the Hadamard conjugation process, is shown to be much more robust to impure data and to sequencing error than are the other methods. With n ≥ 10 only the fastest methods were used. Increasing n again decreased the accuracy of the methods. Varying the "molecular-clockness" of the generating tree was shown to have a much greater effect upon those methods inconsistent with data which do not satisfy the molecular clock hypothesis. All the methods used are described algorithmically, and their computational complexity is discussed. New proofs are provided of the consistency /inconsistency of several methods with the models of evolution used. A notation is introduced to characterize all tree topologies, and used throughout this thesis. Pseudocode is provided for all the major algorithms used in the simulation experiments.
    Date
    1994
    Author
    Charleston, Michael A.
    Rights
    The Author
    Publisher
    Massey University
    URI
    http://hdl.handle.net/10179/4118
    Collections
    • Theses and Dissertations
    Metadata
    Show full item record

    Copyright © Massey University
    | Contact Us | Feedback | Copyright Take Down Request | Massey University Privacy Statement
    DSpace software copyright © Duraspace
    v5.7-2020.1-beta1
     

     

    Tweets by @Massey_Research
    Information PagesContent PolicyDepositing content to MROCopyright and Access InformationDeposit LicenseDeposit License SummaryTheses FAQFile FormatsDoctoral Thesis Deposit

    Browse

    All of MROCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    LoginRegister

    Statistics

    View Usage Statistics

    Copyright © Massey University
    | Contact Us | Feedback | Copyright Take Down Request | Massey University Privacy Statement
    DSpace software copyright © Duraspace
    v5.7-2020.1-beta1