Pātaka Rangahau Massey Research Online will be unavailable on Tuesday 11 November from 4:00pm to 8:00pm for a planned upgrade | For further assistance please contact the Library team, library@massey.ac.nz
 

Evaluation of a text-mining application for the rapid analysis of free-text wildlife necropsy reports

dc.citation.issue11 November
dc.citation.volume20
dc.contributor.authorSaverimuttu S
dc.contributor.authorMcInnes K
dc.contributor.authorWarren K
dc.contributor.authorYeap L
dc.contributor.authorHunter S
dc.contributor.authorGartrell B
dc.contributor.authorPas A
dc.contributor.authorChatterton J
dc.contributor.authorJackson B
dc.date.accessioned2025-12-12T02:13:01Z
dc.date.issued2025-11-25
dc.description.abstractThe ability to efficiently derive insights from wildlife necropsy data is essential for advancing conservation and One Health objectives, yet close reading remains the mainstay of knowledge retrieval from ubiquitous free-text clinical data. This time-consuming process poses a barrier to the efficient utilisation of such valuable resources. This study evaluates part of a bespoke text-mining application, DEE (Describe, Explore, Examine), designed for extracting insights from free-text necropsy reports housed in Aotearoa New Zealand’s Wildbase Pathology Register. A pilot test involving nine veterinary professionals assessed DEE’s ability to quantify the occurrence of four clinicopathologic findings (external oiling, trauma, diphtheritic stomatitis, and starvation) across two species datasets by comparison to manual review. Performance metrics—recall, precision, and F1-score—were calculated and analysed alongside tester-driven misclassification patterns. Findings reveal that while DEE (and the principals underlying its function) offers time-efficient data retrieval, its performance is influenced by search term selection and the breadth of vocabulary which may describe a clinicopathologic finding. Those findings characterized by limited terminological variance, such as external oiling, yielded the highest performance scores and the most consistency across application testers. Mean F1-scores across all tested findings and application testers was 0.63–0.93. Results highlight the utility and limitations of term-based text-mining approaches and suggests that enhancements to automatically capture this terminological variance may be necessary for broader implementation. This pilot study highlights the potential of relatively simple, rule-based text-mining approaches to derive insights natural language wildlife data in the support of One Health goals.
dc.description.confidentialfalse
dc.identifier.citationSaverimuttu S, McInnes K, Warren K, Yeap L, Hunter S, Gartrell B, Pas A, Chatterton J, Jackson B. (2025). Evaluation of a text-mining application for the rapid analysis of free-text wildlife necropsy reports. Plos One. 20. 11 November.
dc.identifier.doi10.1371/journal.pone.0337720
dc.identifier.eissn1932-6203
dc.identifier.elements-typejournal-article
dc.identifier.issn1932-6203
dc.identifier.numbere0337720
dc.identifier.urihttps://mro.massey.ac.nz/handle/10179/73940
dc.languageEnglish
dc.publisherPLOS
dc.publisher.urihttp://journals.plos.org/plosone/article?id=10.1371/journal.pone.0337720
dc.relation.isPartOfPlos One
dc.rightsCC BY 4.0
dc.rights(c) 2025 The Author/s
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.titleEvaluation of a text-mining application for the rapid analysis of free-text wildlife necropsy reports
dc.typeJournal article
pubs.elements-id608554
pubs.organisational-groupOther

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
608554 PDF.pdf
Size:
1.13 MB
Format:
Adobe Portable Document Format
Description:
Evidence

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
9.22 KB
Format:
Plain Text
Description:

Collections