Estermann ENICPA Wiki Loves Performing Arts 20191022

beatestermann 193 views 31 slides Oct 25, 2019
Slide 1
Slide 1 of 31
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31

About This Presentation

Presentation at the ENICPA Round Table on 22 October 2019 in Prague on Wikidata and performing arts. Author: Beat Estermann, Bern University of Applied Sciences.


Slide Content

Wikidata & Performing Arts
Prof. Beat Estermann, Bern University of Applied Sciences
ENICPA Round Table, Prague, 22 October 2019
Unless otherwise noted, the content of this presentation is made available under the CC BY 4.0 license.
Photo: Phantom blacklight theatre(theatregroup HILT), User:Blacklight theatrePrague, Wikimedia Commons, CC BY-SA 4.0.

•Short introduction to Wikidata
•What is its purpose?
•How does it work?
•Wikidata+ Performing Arts
•Aim & vision
•Where do we stand?
•From “Sum of All GLAMs” to “Wiki Loves Performing Arts”
•What is Sum of All GLAMs?
•The role of Wikidatain the wider context of the Linked Open Data
Ecosystem for the Performing Arts
On the Programme Today

Short Introduction to Wikidata
•What is its purpose?
•How does it work?

Imagine a world in which every single human being can freely
share in the sum of all knowledge. That's our commitment.

Structured
Commons

Purpose of Wikidata
•Centralized Interwiki-Links [Example: Prague]
•Centralized Data Management for Infoboxes[Example: Bern Theatre]
•Centralized Data Management for Lists [Example: Listade pinturasde A. Norfini]
•Possibility of Querying the Data in a Standardized Format
[Example: Histropedia]
« The Sum of All Human Knowledge» as Linked Open Data
Multilingual
With Sourced Statements
Freely usable by anyone (CC Zero)

Wikidata + Performing Arts
•Aims & Vision
•Where do we stand?

•Realizean international performingartsdatabaseon the
basisofWikidata
•Providea powerful findingaidforperformingartsrelatedcontent
on Wikimedia Commons
•Promote Wikidata-poweredperformingartsrelatedinformationin
thevariouslanguageversionsofWikipedia
•Getheritageinstitutionstomaketheirperformingartsrelated
dataand contentavailablethroughWikidata& Wikimedia
Commons
The Vision: International Database for the Performing
Arts (Wikidata Project Performing Arts)
Role Model Projects for Inspiration:
•MusicBrainz(music recordings)
•IMDb (movies)
•IMSLP (music scores)
•Operabase(opera)

Thematic Projects
https://www.wikidata.org/wiki/Wikidata:WikiProject_Cultural_heritage[Example]

Wikidata & Performing Arts: Some Statistics
Class ofitems N items(Oct. 2019)ΔsinceApril 2019
musicalwork 450’000 +13’000
play 22’000 +650
choreographicwork 900 +7
characterrole 23’000 +12’000
performingartsbuilding 21’000 +1000
musician 270’000 +10’000
actor/actress 270’000 +16’000
musicalensemble 93’000 +5’900
theatretroupe 5’200 +150
dancetroupe 370 +28
performingartsproduction 21’000 +2’500

Core Aspects of Linked Data Publication
Source: eCH-0205 –Linked Open Data

Current Challenges
Source of the graphic: eCH-0205 –Linked Open Data
Data
scraping &
cleansing
Data Ingest (data
mapping &
matching)
Data
Modelling
Issues
✔ ✔
Manual Data
Entry
How to Overcome the
Chicken-and-Egg
Problem?

•Currentexamplesshowgreatpotential fortheinclusionof
Wikidata-poweredcontentin thefieldoftheperformingarts.
•Currentinitiatives maybenefitfromimprovedcoordination–
also acrosslinguisticborders.
•Examples...
Status Quo –Wikipedia

Example: List of Productions of « Les Galas Karsenty »
(French Wikipedia)

Example: Infobox « Das Land des Lächelns »
(German Wikipedia)

Example: Infobox « Aksandr Golovin »
(Russian Wikipedia)

Example: Listing of the scenographic work of
Aleksandr Golovin (Russian Wikipedia)

Example: List of theatrical works by William
Shakespeare (German Wikipedia)

Example: Infobox Opéra Bastille (French Wikipedia)

Example: List of Artistic Directors and Well-Known
Artists at Stadttheater Bern (German Wikipedia)

Status Quo –Wikipedia/Wikidata
Despite many examples of how structured data is used, the data is
usually not pulled from Wikidata.
Large parts of the structured data in Wikipedia related to the
performing arts is not available on Wikidata.

From “Sum of All GLAMs”
to Wiki Loves Performing Arts
•Presenting “Sum of All GLAMs”
•Wiki Loves Performing Arts

Various layers of information about heritage institutions
Sum of All GLAMs Project
Source: Fontenelle & Estermann (2019) An International Knowledge Base for All Heritage Institutions
Wiki Movement Brazil and OpenGLAM CH, with the
support of the MY-D Foundation
FindingGLAMs
Wikimedia Sweden, UNESCO and WMF, with the support of the
Swedish Postcode Foundation

Ensuring data quality (references!) and completeness

Wikidata-powered Infoboxes

Mbabel Tool: Automatically Creating Stub-articles
based on Wikidata entries

•Start by describing performing arts venues and organizations with the
goal of engaging people and organizations directly.
•Continue with further classes...
Wiki Loves Performing Arts
Phase 1
•Tackle data modelling issues
•Locate and ingest existing datasets
•Create infobox templates
•Start monitoring the data for quality and completeness
Phase 2
•Run crowdsourcing campaigns to complement the data,
targeting both Wikipedians and arts organizations

Breakdown of Tasks / Possibilities for Contribution
International
Coordination
Group
Wikidata
Team
Country-
specific
Teams
Language-
specific
Teams
(Wikipedia)
Arts organi-
zations;
heritage
institutions
Tackle data modelling issues
on Wikidata
Contribute Lead Contribute
Track data quality &
completeness
Contribute Lead ContributeContribute
Ingest data from existing
databases
Contribute Lead Contribute
Run campaigns to enhance
the data on Wikidata
Contribute Lead Contribute
Get heritage institutions to
curate their own data
Lead Contribute
Implement Infobox and Mbabel
templates on Wikipedias
(secure community buy-in)
Lead
Promote the use of the
templates
Contribute Lead Contribute
Promote other uses of the dataContribute Lead Contribute
Write guidelines & reach out to
people in further countries
Lead ContributeContributeContribute Contribute

Wikidata and classical LOD are complementary
Wikidata Classical LOD
Strengths
Fully-fledged crowdsourcing
platform; further parties can easily be
invited to contribute.
Data owners keep the control over
their «graphs»; data quality and
completeness remains under the
control of the data provider.
Immediate integration with the
worldwide linked data cloud
(reconciliation at the moment of data
ingest)
Data can be published in RDF format,
is linkable; reconcilation against other
databases can be done step by step.
Community-supported LOD service
with a certain level of reliability
Weaknesses
Monitoring data quality and
completeness is a permanent and
challenging task.
Third parties cannot readily fix issues
related to data quality or comple-
teness that are not taken care of by the
data provider.
Harmonization of data modelling
practices is a challenge.
Harmonization of data modelling
practices within one’s own «silo» is
straightforward, but might be a great
challenge to implement across «silos».
Perceived «loss of control» by data
owners
Introducing collaborative data
maintenance practices is difficult.
Many current LOD services are of
questionable reliability.

Wikidata in the wider context of the Linked Open
Data Ecosystem for the Performing Arts
When it comes to publishing data on Wikidata, priorityshould be
given to data:
•where it is unclear who would be the «natural» authorityin the
given area (on a global scale);
•where there is a high potential for enhancing data through
crowdsourcing approaches (including community or expert
sourcing);
•where data is likely to be reused in the context of Wikipedia;
•where international coordination to ensure semantic
interoperability of the data is unlikely to take place elsewhere.
Focus on base registers / authority files and controlled vocabularies
first; they facilitate further interlinking of datasets!

Questions / Feedback?
Contact
Prof. Beat Estermann
Bern University ofApplied Sciences
Institute for Public Sector Transformation
[email protected]
+41 31 848 34 38
https://www.wikidata.org/wiki/Wikidata:WikiProject_Performing_arts