Exploring biological sequence space : selected problems in sequence analysis and phylogenetics : a thesis presented in partial fulfilment of the requirements for the degree of Doctor of Philosophy in Computational Biology at Massey University

dc.contributor.authorMcComish, Bennet James
dc.date.accessioned2013-12-13T01:49:26Z
dc.date.available2013-12-13T01:49:26Z
dc.date.issued2012
dc.description.abstractAs the volume and complexity of available sequence data continues to grow at an exponential rate, the need for new sequence analysis techniques becomes more urgent, as does the need to test and to extend the existing techniques. These include, among others, techniques for assembling raw sequence data into usable genomic sequences; for using these sequences to investigate the evolutionary history of genes and species; and for examining the mechanisms by which sequences change over evolutionary time scales. This thesis comprises three projects within the Veld of sequence analysis. It is shown that organelle genome DNA sequences can be assembled de novo using short Illumina reads from a mixture of samples, and deconvoluted bioinformatically, without the added cost of indexing the individual samples. In the course of this work, a novel sequence element is described, that probably could not have been detected with traditional sequencing techniques. The problem of multiple optima of likelihood on phylogenetic trees is examined using biological data. While the prevalence of multiple optima varies widely with real data, trees with multiple optima occur less often among the best trees. Overall, the results provide reassurance that the value of maximum likelihood as a tree selection criterion is not often compromised by the presence of multiple local optima on a single tree. Fundamental mechanisms of mutation are investigated by estimating nucleotide substitution rate matrices for edges of phylogenetic trees. Several large alignments are examined, and the results suggest that the situation may be more complex than we had anticipated. It is likely that genome scale alignments will have to be used to further elucidate this question.en
dc.identifier.urihttp://hdl.handle.net/10179/4945
dc.identifier.wikidataQ112158878
dc.identifier.wikidata-urihttps://www.wikidata.org/wiki/Q112158878
dc.language.isoenen
dc.publisherMassey Universityen_US
dc.rightsThe Authoren_US
dc.subjectNucleotide sequenceen
dc.subjectPhlyogenyen
dc.subjectBioinformaticsen
dc.subjectMethodologyen
dc.subjectData processingen
dc.titleExploring biological sequence space : selected problems in sequence analysis and phylogenetics : a thesis presented in partial fulfilment of the requirements for the degree of Doctor of Philosophy in Computational Biology at Massey Universityen
dc.typeThesisen
massey.contributor.authorMcComish, Bennet Jamesen
thesis.degree.disciplineComputational Biologyen
thesis.degree.grantorMassey Universityen
thesis.degree.levelDoctoralen
thesis.degree.nameDoctor of Philosophy (Ph.D.)en
Files
Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
02_whole.pdf
Size:
7.15 MB
Format:
Adobe Portable Document Format
Description:
Loading...
Thumbnail Image
Name:
01_front.pdf
Size:
52.76 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
804 B
Format:
Item-specific license agreed upon to submission
Description: