Massey Documents by Type
Permanent URI for this communityhttps://mro.massey.ac.nz/handle/10179/294
Browse
4 results
Search Results
Item Digital Gazetteers: Review and Prospects for Place Name Knowledge Bases(Association for Computing Machinery, 2026-02) Wijegunarathna KI; Stock K; Jones CB; Thai MT; Tong HGazetteers typically store data on place names, place types, and the associated coordinates. They play an essential role in disambiguating place names in online geographical information retrieval systems for navigation and mapping, detecting and disambiguating place names in text, and providing coordinates. Currently, there are many gazetteers in use derived from many sources, with no commonly accepted standard for encoding the data. Most gazetteers are also very limited in the extent to which they represent the multiple facets of the named places yet they have potential to assist user search for locations with specific physical, commercial, social, or cultural characteristics. With a focus on understanding digital gazetteer technologies and advancing their future effectiveness for information retrieval, we provide a review of data sources, components, software and data management technologies, data quality and volunteered data, and methods for matching sources that refer to the same real-world places. We highlight the need for future work on richer representation of named places, the temporal evolution of place identity and location, and the development of more effective methods for data integration.Item Speaking of Location: Communicating about Space with Geospatial Natural Language(CEUR-WS.org, 2019-09-23) Stock K; Jones CB; Tenbrink T; Stock K; Jones CB; Tenbrink TSpeaking of Location 2019 is the second edition of the Speaking of Location workshop series, which aims to foster transdisciplinary research to address the problem of automatic interpretation and generation of geospatial natural language. This introduction to the workshop proceedings provides background, discussing the definition and nature of geospatial natural language, presenting the papers contained in the proceedings volume, and situating them within the theoretical framework of The Semantic Pyramid, which is also described.Item Predicting Distance and Direction from Text Locality Descriptions for Biological Specimen Collections(Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022-08-22) Liao R; Das PP; Jones CB; Aflaki N; Stock K; Ishikawa T; Fabrikant SI; Winter SA considerable proportion of records that describe biological specimens (flora, soil, invertebrates), and especially those that were collected decades ago, are not attached to corresponding geographical coordinates, but rather have their location described only through textual descriptions (e.g. North Canterbury, Selwyn River near bridge on Springston-Leeston Rd). Without geographical coordinates, millions of records stored in museum collections around the world cannot be mapped. We present a method for predicting the distance and direction associated with human language location descriptions which focuses on the interpretation of geospatial prepositions and the way in which they modify the location represented by an associated reference place name (e.g. near the Manawatu River). We study eight distance-oriented prepositions and eight direction-oriented prepositions and use machine learning regression to predict distance or direction, relative to the reference place name, from a collection of training data. The results show that, compared with a simple baseline, our model improved distance predictions by up to 60% and direction predictions by up to 31%.Item Detecting geospatial location descriptions in natural language text(Taylor and Francis Group, 2022) Stock K; Jones CB; Russell S; Radke M; Das P; Aflaki NReferences to geographic locations are common in text data sources including social media and web pages. They take different forms from simple place names to relative expressions that describe location through a spatial relationship to a reference object (e.g. the house beside the Waikato River). Often complex, multi-word phrases are employed (e.g. the road and railway cross at right angles; the road in line with the canal) where spatial relationships are communicated with various parts of speech including prepositions, verbs, adverbs and adjectives. We address the problem of automatically detecting relative geospatial location descriptions, which we define as those that include spatial relation terms referencing geographic objects, and distinguishing them from non-geographical descriptions of location (e.g. the book on the table). We experiment with several methods for automated classification of text expressions, using features for machine learning that include bag of words that detect distinctive words, word embeddings that encode meanings of words and manually identified language patterns that characterise geospatial expressions. Using three data sets created for this study, we find that ensemble and meta-classifier approaches, that variously combine predictions from several other classifiers with data features, provide the best F-measure of 0.90 for detecting geospatial expressions.
