Mining Meaningful Keys and Foreign Keys with High Precision and Recall

Date

2025-01-01

Publisher

VLDB Endowment

Rights

(c) 2025 The Author/s
CC BY-NC-ND 4.0

Abstract

We demonstrate a next-generation Entity/Relationship (E/R) Profiler that mines meaningful key/foreign key relationships from a given data repository. Core novelties include a strict hierarchy of key variants, ranging from candidate keys to SQL unique constraints, that represent different ways to identify incomplete entities; a measure of orthogonality that separates accidental from meaningful keys; and algorithms for mining approximate keys for all these variants under different thresholds of arity, completeness, dirtiness, and orthogonality. We showcase the high precision and recall achieved by our tool and how it helps users understand which entity and referential integrity constraints govern their data.
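
To make the two ends of the key hierarchy concrete, the following is a minimal sketch and not the authors' tool or algorithm. It assumes standard SQL semantics (a UNIQUE constraint ignores rows with a NULL in the constrained columns, while a candidate key also forbids NULLs). The function name `key_profile`, the sample rows, and the simple definitions of completeness (fraction of rows without NULLs in the chosen columns) and dirtiness (fraction of rows that would have to be deleted to restore uniqueness) are all assumptions made for illustration; the paper may define these measures differently.

```python
from collections import Counter


def key_profile(rows, cols):
    """Profile a column combination as a potential key (illustrative only)."""
    projections = [tuple(r.get(c) for c in cols) for r in rows]
    # Keep only rows that have no NULL (None) in the chosen columns.
    total = [p for p in projections if None not in p]
    counts = Counter(total)
    # Minimum number of rows to delete so the NULL-free rows become unique.
    violations = sum(n - 1 for n in counts.values())
    n = len(rows)
    return {
        # Assumed measure: share of rows with no NULL in cols.
        "completeness": len(total) / n if n else 1.0,
        # Assumed measure: share of rows violating uniqueness.
        "dirtiness": violations / n if n else 0.0,
        # SQL UNIQUE: duplicates are only checked among NULL-free rows.
        "sql_unique": violations == 0,
        # Candidate key: unique and no NULLs at all.
        "candidate_key": violations == 0 and len(total) == n,
    }


# Hypothetical sample data, not taken from the paper.
rows = [
    {"id": 1, "email": "a@x.org", "city": "Auckland"},
    {"id": 2, "email": None, "city": "Auckland"},
    {"id": 3, "email": None, "city": "Wellington"},
    {"id": 4, "email": "b@x.org", "city": "Dunedin"},
]

for cols in [("id",), ("email",), ("city",)]:
    print(cols, key_profile(rows, cols))
```

In this sample, `id` satisfies both notions, `email` satisfies SQL UNIQUE but is not a candidate key because half of its values are missing, and `city` would only qualify as an approximate key if a dirtiness of 0.25 were tolerated.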

Citation

Koehler H, Link S. (2025). Mining Meaningful Keys and Foreign Keys with High Precision and Recall. Palpanas T, Tatbul N. Proceedings of the VLDB Endowment. (pp. 5363-5366). VLDB Endowment.

Creative Commons license

Except where otherwise noted, this item's license is described as (c) 2025 The Author/s