RMIDDM: An unsupervised and interpretable concept drift detection method for data streams

dc.citation.issue6
dc.citation.volume39
dc.contributor.authorNeto R
dc.contributor.authorAlencar B
dc.contributor.authorGomes HM
dc.contributor.authorBifet A
dc.contributor.authorGama J
dc.contributor.authorCassales G
dc.contributor.authorRios R
dc.date.accessioned2026-02-23T22:40:21Z
dc.date.issued2025-11-01
dc.description.abstractTraditional machine learning techniques assume that data is drawn from a stationary source. This assumption is challenged in contexts with data streams for presenting constant and potentially infinite sequences whose distribution is prone to change over time. Based on these settings, detecting changes (a.k.a. concept drifts) is necessary to keep learning models up-to-date. Although state-of-the-art detection methods were designed to monitor the loss of predictive models, such monitoring falls short in many real-world scenarios where the true labels are not readily available. Therefore, there is increasing attention to unsupervised concept drift detection methods as approached in this paper. In this work, we present an unsupervised and interpretable method based on Radial Basis Function Networks (RBFN) and Markov Chains (MC), referred to as RMIDDM (Radial Markov Interpretable Drift Detection Method). In our method, RBF performs, in the intermediate layer, an activation process that implicitly produces groups of observations collected over time. Simultaneously, MC models the transitions between groups to support the detection of concept drifts, which happens when the active group changes and its probability exceeds a given threshold. A set of experiments with synthetic datasets and comparisons with state-of-the-art algorithms demonstrated that the proposed method can detect drifts at runtime in an efficient, interpretable, and independent way of labels, presenting competitive results and behavior. Additionally, to show its applicability in a real-world scenario, we analyzed new COVID-19 cases, deaths, and vaccinations to identify new waves as concept drifts and generate Markov models that allow understanding of their interaction.
dc.description.confidentialfalse
dc.description.notesAccepted for publication in Data Mining and Knowledge Discovery in August 2025. sortkey: 3d
dc.edition.editionNovember 2025
dc.identifier.citationNeto R, Alencar B, Gomes HM, Bifet A, Gama J, Cassales G, Rios R. (2025). RMIDDM: An unsupervised and interpretable concept drift detection method for data streams. Data mining and knowledge discovery. 39. 6.
dc.identifier.doi10.1007/s10618-025-01155-x
dc.identifier.eissn1573-756X
dc.identifier.elements-typejournal-article
dc.identifier.issn1384-5810
dc.identifier.number85
dc.identifier.piis10618-025-01155-x
dc.identifier.urihttps://mro.massey.ac.nz/handle/10179/74200
dc.languageEnglish
dc.publisherSpringer Science+Business Media, LLC
dc.publisher.urihttps://link.springer.com/article/10.1007/s10618-025-01155-x
dc.relation.isPartOfData mining and knowledge discovery
dc.rights(c) The author/sen
dc.rights.licenseCC BY 4.0 - CAUL Read and Publishen
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en
dc.subjectData stream
dc.subjectConcept drift
dc.subjectRadial basis function network
dc.subjectMarkov chain
dc.titleRMIDDM: An unsupervised and interpretable concept drift detection method for data streams
dc.typeJournal article
pubs.elements-id609779
pubs.organisational-groupOther

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
609779 PDF.pdf
Size:
3.04 MB
Format:
Adobe Portable Document Format
Description:
Published version.pdf

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
9.22 KB
Format:
Plain Text
Description:

Collections