Generate and use the Simplified Technical Dictionary Generic Ontology (STDGOnt) in OpenRefine operation JSON to verify, reconcile and enrich material master data

v1st-git/stdgont

stdgont

Short description of the project.

Add Semantic Web ontology/Knowledge Graph power to OpenRefine!

(Proof of concept/demonstration purposes only).

Generate and use the Simplified Technical Dictionary Generic Ontology (STDGOnt) as OpenRefine operation JSON, for example to reconcile and enrich corporate material master data.

Introduction. This project demonstrates a practical approach to reconciling and enriching messy source data processed with OpenRefine. It only illustrates the concept and is not intended for use in a production environment.

The project is partly inspired by a series of ECCMA webinars on data quality in the supply chain.

Many sources explain how material master data quality affects business, and a number of commercial and free solutions exist for improving it. This project demonstrates one more practical approach in this area, presented as one possible use case.

Source of poor material master data quality. This is explained in the ECCMA materials and, briefly, in the demonstration video.

Solution design. The proposed solution includes the following components:

  • OpenRefine (previously Google Refine) application
  • R Shiny application to generate or reuse Simplified Technical Dictionary Generic Ontology (STDGOnt) JSON files and OpenRefine operation JSON
  • GitHub repository to store and exchange STDGOnt JSON files (this repository/project)
  • Semantic Web publishing service/wrapper for SPARQL endpoint
  • Semantic Web fuzzy search service/wrapper for SPARQL endpoint
  • Generic OpenRefine reconciliation API service (minimal set of features)
  • OpenRefine reconciliation API service for QUDT units of measurement (minimal set of features)
  • SPARQL endpoint (internal service, not accessible from the Internet)

    Three basic use cases:

  • Use OpenRefine and the R Shiny STDGOnt application to create and publish an STDGOnt ontology for later use in master data reconciliation and enrichment based on fuzzy search
  • Use the R Shiny STDGOnt application to generate JSON for a set of OpenRefine operations based on the STDGOnt fuzzy search web service
  • Use OpenRefine to reconcile and structure master data by applying the JSON operations created by the R Shiny STDGOnt application
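    To make the second and third use cases more concrete, here is a minimal Python sketch of what generated OpenRefine operation JSON could look like. It is an illustration only, not the output of the R Shiny STDGOnt application. The operation type `core/column-addition-by-fetching-urls` is OpenRefine's "Add column by fetching URLs" operation; the service URL, the `q` query parameter and the column names are hypothetical placeholders.

```python
import json


def build_fetch_operation(base_column, new_column, service_url,
                          insert_index=1, delay_ms=500):
    """Return one OpenRefine 'Add column by fetching URLs' operation as a dict.

    service_url is a hypothetical fuzzy search endpoint; it must not contain
    single quotes, because it is embedded in a GREL string literal.
    """
    # GREL expression evaluated by OpenRefine for each cell: the cell value is
    # URL-escaped and appended as the (assumed) 'q' query parameter.
    url_expression = f"grel:'{service_url}?q=' + escape(value, 'url')"
    return {
        "op": "core/column-addition-by-fetching-urls",
        "engineConfig": {"facets": [], "mode": "row-based"},
        "baseColumnName": base_column,
        "urlExpression": url_expression,
        "onError": "set-to-blank",
        "newColumnName": new_column,
        "columnInsertIndex": insert_index,
        "delay": delay_ms,
        "cacheResponses": True,
        "description": (
            f"Create column {new_column} by fetching URLs "
            f"based on column {base_column}"
        ),
    }


def operations_to_json(operations):
    """Serialize a list of operations in the form accepted by Undo/Redo > Apply."""
    return json.dumps(operations, indent=2)


if __name__ == "__main__":
    # Placeholder column names and URL, for illustration only.
    op = build_fetch_operation(
        "Description", "stdgont_match", "https://example.org/stdgont/search"
    )
    print(operations_to_json([op]))
```

    The printed JSON array can be pasted into the "Apply" dialog of OpenRefine's Undo/Redo tab, which is the usual way of replaying an operation history on another project.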

    Solution demonstration video, explaining how the application can be used to solve a typical problem in material master data cleansing.

    OpenRefine reconciliation API demonstration video, explaining how the application can be used to reconcile an OpenRefine data column containing Wikidata property URIs.

    How to use the STDGOnt service video, showing how to verify and reconcile QUDT units of measurement through the OpenRefine/QUDT reconciliation API.
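    For readers unfamiliar with the reconciliation API, the following Python sketch shows the general shape of a batch of reconciliation queries and how the best candidate can be picked from a response. It follows the generic OpenRefine reconciliation service conventions (a `queries` form field holding JSON keyed by `q0`, `q1`, ...; candidates with `id`, `name`, `score` and `match`). It does not call the STDGOnt services, and the sample response, including its scores, is made up for illustration.

```python
import json
from urllib.parse import urlencode


def build_reconcile_queries(values, limit=3, type_id=None):
    """Build the query batch: one entry per cell value, keyed q0, q1, ..."""
    queries = {}
    for index, value in enumerate(values):
        query = {"query": value, "limit": limit}
        if type_id is not None:
            query["type"] = type_id  # optional type restriction
        queries[f"q{index}"] = query
    return queries


def encode_form(queries):
    """Encode the batch as a form body with a single 'queries' field."""
    return urlencode({"queries": json.dumps(queries)})


def best_matches(response):
    """For each query key, return the highest-scoring candidate or None."""
    best = {}
    for key, payload in response.items():
        candidates = payload.get("result", [])
        best[key] = (
            max(candidates, key=lambda c: c.get("score", 0))
            if candidates else None
        )
    return best


# Made-up response for illustration; scores are not real service output.
SAMPLE_RESPONSE = {
    "q0": {"result": [
        {"id": "http://qudt.org/vocab/unit/KiloGM", "name": "Kilogram",
         "score": 95, "match": True},
        {"id": "http://qudt.org/vocab/unit/GM", "name": "Gram",
         "score": 40, "match": False},
    ]},
    "q1": {"result": []},
}

if __name__ == "__main__":
    batch = build_reconcile_queries(["kg", "unknown unit"], limit=2)
    print(encode_form(batch))
    for key, candidate in best_matches(SAMPLE_RESPONSE).items():
        print(key, candidate["id"] if candidate else "no candidate")
```

    Keeping the query construction separate from the HTTP call makes it easy to inspect the batch before sending it to whichever reconciliation endpoint is in use.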
