Developing an open source data pipeline for participation in community knowledge bases

2 December 2021, 12:20 - 12:40

Lozana Rossenova (TIB – Leibniz-Informationszentrum Technik und Naturwissenschaften), Lucia Sohmen (TIB – Leibniz-Informationszentrum Technik und Naturwissenschaften)

Room: Tech Corner

How can users participate in Wikidata and other community knowledge bases at scale? We will showcase OpenRefine, an open source tool for manipulating, enriching and uploading large amounts of data at once.

Many communities maintain digital knowledge bases to store and curate their shared knowledge in one place. In Wikidata, the largest online community knowledge base, and in Wikibase, the software behind it, that knowledge is structured in a semantic, machine-readable way as linked open data. This presentation will showcase OpenRefine, an open source tool for contributing to Wikidata and Wikibase instances. It offers a user-friendly interface yet can still manipulate, enrich and upload large amounts of data at once. We will discuss several use cases based on a data upload pipeline for 3D data enrichment in NFDI4Culture, as well as a bring-your-own-data workshop held in November 2021. In addition, we will demo how to use OpenRefine to transform data, match it with existing items in public knowledge bases, use the information from those items to enrich our own data, and finally contribute new information back to the community knowledge base. The core principles behind our data.
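The four steps named in the abstract (transform, match, enrich, contribute back) can be illustrated with a small Python sketch. This is not OpenRefine's API and not the presenters' pipeline: OpenRefine performs these steps through its interface and reconciliation services. The in-memory `KNOWLEDGE_BASE`, its `Q-EXAMPLE-…` identifiers, the function names and the 0.85 similarity threshold are all hypothetical and chosen only for illustration.

```python
from difflib import SequenceMatcher

# Hypothetical stand-in for a community knowledge base.
# A real workflow would query Wikidata or a Wikibase instance instead.
KNOWLEDGE_BASE = {
    "Q-EXAMPLE-1": {"label": "Example Cathedral", "country": "Exampleland"},
    "Q-EXAMPLE-2": {"label": "Sample Castle", "country": "Samplestan"},
}


def transform(raw):
    """Step 1: clean a raw label by collapsing surplus whitespace."""
    return " ".join(raw.split())


def match(label, threshold=0.85):
    """Step 2: return the ID of the most similar item, or None below threshold."""
    best_id, best_score = None, 0.0
    for item_id, item in KNOWLEDGE_BASE.items():
        score = SequenceMatcher(None, label.lower(), item["label"].lower()).ratio()
        if score > best_score:
            best_id, best_score = item_id, score
    return best_id if best_score >= threshold else None


def enrich(record, item_id):
    """Step 3: copy information from the matched item into our own record."""
    enriched = dict(record)
    enriched["item_id"] = item_id
    if item_id is not None:
        enriched["country"] = KNOWLEDGE_BASE[item_id]["country"]
    return enriched


def plan_upload(records):
    """Step 4: unmatched records are candidates for new items to contribute."""
    return [r for r in records if r["item_id"] is None]


raw_rows = ["  Example   Cathedral ", "sample castle", "Unknown Chapel"]
records = []
for raw in raw_rows:
    label = transform(raw)
    records.append(enrich({"label": label}, match(label)))

new_items = plan_upload(records)
for record in records:
    print(record)
```

In this toy run the first two rows are matched to existing items and enriched with a country, while the third has no sufficiently similar item and ends up in `new_items` as a candidate contribution.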


🎥 Watch the recording of this talk on the TIB AV-Portal

