Mining Meaningful Keys and Foreign Keys with High Precision and Recall
| dc.citation.issue | 12 | |
| dc.citation.volume | 18 | |
| dc.contributor.author | Koehler H | |
| dc.contributor.author | Link S | |
| dc.contributor.editor | Palpanas T | |
| dc.contributor.editor | Tatbul N | |
| dc.coverage.spatial | London, UK | |
| dc.date.accessioned | 2025-10-12T22:35:09Z | |
| dc.date.available | 2025-10-12T22:35:09Z | |
| dc.date.finish-date | 2025-09-05 | |
| dc.date.issued | 2025-01-01 | |
| dc.date.start-date | 2025-09-01 | |
| dc.description.abstract | We demonstrate a next-generation Entity/Relationship (E/R) Profiler that mines meaningful key/foreign key relationships from a given data repository. Core novelties include a strict hierarchy of key variants ranging from candidate keys to SQL unique constraints that represent different ways to identify incomplete entities, a measure of orthogonality that separates accidental from meaningful keys, and algorithms for mining approximate keys for all these variants under different thresholds of arity, completeness, dirtiness, and orthogonality. We showcase the high precision and recall achieved by our tool and how it facilitates the users’ understanding of which entity and referential integrity constraints govern their data. | |
| dc.description.confidential | false | |
| dc.format.pagination | 5363-5366 | |
| dc.identifier.citation | Koehler H, Link S. (2025). Mining Meaningful Keys and Foreign Keys with High Precision and Recall. Palpanas T, Tatbul N. Proceedings of the VLDB Endowment. (pp. 5363-5366). VLDB Endowment. | |
| dc.identifier.doi | 10.14778/3750601.3750672 | |
| dc.identifier.eissn | 2150-8097 | |
| dc.identifier.elements-type | c-conference-paper-in-proceedings | |
| dc.identifier.uri | https://mro.massey.ac.nz/handle/10179/73675 | |
| dc.publisher | VLDB Endowment | |
| dc.publisher.uri | http://dl.acm.org/doi/10.14778/3750601.3750672 | |
| dc.rights | (c) 2025 The Author/s | |
| dc.rights | CC BY-NC-ND 4.0 | |
| dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/ | |
| dc.source.journal | Proceedings of the VLDB Endowment | |
| dc.source.name-of-conference | 51st International Conference on Very Large Data Bases (VLDB 2025) | |
| dc.title | Mining Meaningful Keys and Foreign Keys with High Precision and Recall | |
| dc.type | conference | |
| pubs.elements-id | 503441 | |
| pubs.organisational-group | Other | |