Abstra: Toward Generic Abstractions for Data of Any Model - Département d'informatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2022

Abstra: Toward Generic Abstractions for Data of Any Model

Nelly Barret
Ioana Manolescu
Prajna Upadhyay
  • Fonction : Auteur
  • PersonId : 1110205

Résumé

Digital data sharing leads to unprecedented opportunities to develop data-driven systems for supporting economic activities, the social and political life, and science. Many open-access datasets are RDF (Linked Data) graphs, but others are JSON or XML documents, CSV files, Neo4J property graphs, etc. Potential users need to understand a dataset in order to decide if it is useful for their goal. While some published datasets come with a schema and/or documentation, this is not always the case. We demonstrate Abstra, a dataset abstraction system, which applies on a large variety of data models. Abstra computes a description meant for humans, and integrates Information Extraction to classify dataset content among a set of categories of interest to the user. Our abstractions are conceptually close to Entity-Relationship diagrams, but our entities can have deeply nested structure.
Fichier principal
Vignette du fichier
abstra-demo.pdf (1.46 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03767967 , version 1 (02-09-2022)

Identifiants

  • HAL Id : hal-03767967 , version 1

Citer

Nelly Barret, Ioana Manolescu, Prajna Upadhyay. Abstra: Toward Generic Abstractions for Data of Any Model. CIKM 2022 - 31st ACM International Conference on Information and Knowledge Management, Oct 2022, Atlanta, Georgia / Hybrid, United States. ⟨hal-03767967⟩
121 Consultations
128 Téléchargements

Partager

Gmail Facebook X LinkedIn More