Building a Semantic Knowledge Graph - web.pdf

rjw 247 views 87 slides Sep 17, 2024
Slide 1
Slide 1 of 143
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31
Slide 32
32
Slide 33
33
Slide 34
34
Slide 35
35
Slide 36
36
Slide 37
37
Slide 38
38
Slide 39
39
Slide 40
40
Slide 41
41
Slide 42
42
Slide 43
43
Slide 44
44
Slide 45
45
Slide 46
46
Slide 47
47
Slide 48
48
Slide 49
49
Slide 50
50
Slide 51
51
Slide 52
52
Slide 53
53
Slide 54
54
Slide 55
55
Slide 56
56
Slide 57
57
Slide 58
58
Slide 59
59
Slide 60
60
Slide 61
61
Slide 62
62
Slide 63
63
Slide 64
64
Slide 65
65
Slide 66
66
Slide 67
67
Slide 68
68
Slide 69
69
Slide 70
70
Slide 71
71
Slide 72
72
Slide 73
73
Slide 74
74
Slide 75
75
Slide 76
76
Slide 77
77
Slide 78
78
Slide 79
79
Slide 80
80
Slide 81
81
Slide 82
82
Slide 83
83
Slide 84
84
Slide 85
85
Slide 86
86
Slide 87
87
Slide 88
88
Slide 89
89
Slide 90
90
Slide 91
91
Slide 92
92
Slide 93
93
Slide 94
94
Slide 95
95
Slide 96
96
Slide 97
97
Slide 98
98
Slide 99
99
Slide 100
100
Slide 101
101
Slide 102
102
Slide 103
103
Slide 104
104
Slide 105
105
Slide 106
106
Slide 107
107
Slide 108
108
Slide 109
109
Slide 110
110
Slide 111
111
Slide 112
112
Slide 113
113
Slide 114
114
Slide 115
115
Slide 116
116
Slide 117
117
Slide 118
118
Slide 119
119
Slide 120
120
Slide 121
121
Slide 122
122
Slide 123
123
Slide 124
124
Slide 125
125
Slide 126
126
Slide 127
127
Slide 128
128
Slide 129
129
Slide 130
130
Slide 131
131
Slide 132
132
Slide 133
133
Slide 134
134
Slide 135
135
Slide 136
136
Slide 137
137
Slide 138
138
Slide 139
139
Slide 140
140
Slide 141
141
Slide 142
142
Slide 143
143

About This Presentation

Presentation to European Bibframe Workshop 2024 - Helsinki.
Description of the National Library Board Singapore semantic knowledge graph and its developments.


Slide Content

A Semantic Knowledge Graph at
National Library Board Singapore
BIBFRAME Workshop in Europe 2024
17
th
September 2024 -Helsinki
Richard Wallis
Evangelist and Founder
Data Liberate
[email protected]

Independent Consultant, Evangelist & Founder
W3C Community Groups:
•Bibframe2Schema (Chair) –Standardised conversion path(s)
•Schema Bib Extend (Chair) - Bibliographic data
•Schema Architypes (Chair) - Archives
•Financial Industry Business Ontology –Financial schema.org
•Tourism Structured Web Data (Co-Chair)
•Schema Course Extension
•Schema IoT Community
•Educational & Occupational Credentials in Schema.org
[email protected] — @dataliberate
40+ Years – Computing
30+ Years – Cultural Heritage technology
20+ Years – Semantic Web & Linked Data
Worked With:
•Google – Schema.org vocabulary, site, extensions. documentation and community
•OCLC – Global library cooperative
•FIBO – Financial Industry Business Ontology Group
•Various Clients – Implementing/understanding Linked Data, Schema.org:
National Library Board Singapore
British Library — Stanford University — Europeana
2

3
Agenda for today

4
Agenda for today
•National Library and their resources

5
Agenda for today
•National Library and their resources
•Knowledge Graph ambition

6
Agenda for today
•National Library and their resources
•Knowledge Graph ambition
•Linked Data Management System –the LDMS delivered

7
Agenda for today
•National Library and their resources
•Knowledge Graph ambition
•Linked Data Management System –the LDMS delivered
•Continued development

8
Agenda for today
•National Library and their resources
•Knowledge Graph ambition
•Linked Data Management System –the LDMS delivered
•Continued development
–Data sharing with the Entity Data Service

9
Agenda for today
•National Library and their resources
•Knowledge Graph ambition
•Linked Data Management System –the LDMS delivered
•Continued development
–Data sharing with the Entity Data Service
–User experience enrichment –the sidebar API

10
Agenda for today
•National Library and their resources
•Knowledge Graph ambition
•Linked Data Management System –the LDMS delivered
•Continued development
–Data sharing with the Entity Data Service
–User experience enrichment –the sidebar API
–Data quality enhancement utilizing external authorities

11
National Library Board Singapore
PublicLibraries
Networkof28PublicLibraries,
including2 partnerlibraries*
ReadingProgrammesandInitiatives
ProgrammesandExhibitions
targetedat Singaporecommunities
*Partnerlibrariesarelibrarieswhicharepartnerowned
and
fundedbutmanagedbyNLB/NLB’ssubsidiaryLibraries
and ArchivesSolutionsPteLtd.Library@Chinatownand
theLifelong LearningInstituteLibraryarePartnerlibraries.
NationalArchives
TransferredfromNHBtoNLBinNov
2012
CustodianofSingapore’s
Collective Memory:Responsible
forCollection, Preservationand
Managementof
Singapore’s Publicand Private
ArchivalRecords
PromotesPublicInterestinour
Nation’s
HistoryandHeritage
NationalLibrary
PreservingSingapore’sPrint
and LiteraryHeritage,and
Intellectual memory
ReferenceCollections
LegalDeposit(including
electronic)

12
Over
560,000
Singapore&
SEAitems
Over147,000
Chinese,Malay &
TamilLanguages
items
ReferenceCollection
Over62,000
SocialSciences
& Humanities
items
Over39,000
Science &
Technology
items
Over
53,000
Artsitems
Over19,000
RareMaterials
items
ArchivalMaterials
Over290,000
Governmentfiles&
Parliamentpapers
Over190,000
Audiovisual&sound
recordings
Over70,000
Maps & building
plans
Over
1.14m
Photographs
Over35,000
Oral history
interviews
Over55,000
Speeches&press
releases
Over
7,000
Posters
National Library Board Singapore
Over5m
printcollection
Over2.4m
musictracks
78
databases
Over7,400
e-newspapersand
e-magazinestitles
Over
8,000
e-learning
courses
Over1.7m
e-booksand
audiobooks
LendingCollection

13
National Library Board Online Services

15
The Ambition
•To enable the discovery & display of entitles from different sources
in a combined interface

16
The Ambition
•To enable the discovery & display of entitles from different sources
in a combined interface
•To bring together resources physical and digital

17
The Ambition
•To enable the discovery & display of entitles from different sources
in a combined interface
•To bring together resources physical and digital
•To bring together diverse systems across the National Library,
National Archives, and Public Libraries in a Linked Data Environment

18
The Ambition
•To enable the discovery & display of entitles from different sources
in a combined interface
•To bring together resources physical and digital
•To bring together diverse systems across the National Library,
National Archives, and Public Libraries in a Linked Data Environment
•To provide a staff interface to view and manage all entities, their
descriptions and relationships

19
The Ambition –Technical Challenges
•To produce a Knowledge Graph that is [daily] up to date

20
The Ambition –Technical Challenges
•To produce a Knowledge Graph that is [daily] up to date
•Not to replace current cataloging processes & practices

21
The Ambition –Technical Challenges
•To produce a Knowledge Graph that is [daily] up to date
•Not to replace current cataloging processes & practices
–Marc cataloguing in the ILS

22
The Ambition –Technical Challenges
•To produce a Knowledge Graph that is [daily] up to date
•Not to replace current cataloging processes & practices
–Marc cataloguing in the ILS
–TTE maintenance in authority control

23
The Ambition –Technical Challenges
•To produce a Knowledge Graph that is [daily] up to date
•Not to replace current cataloging processes & practices
–Marc cataloguing in the ILS
–TTE maintenance in authority control
–Dublin Core content management for CMS sites and Archives

24
The Ambition –Technical Challenges
•To produce a Knowledge Graph that is [daily] up to date
•Not to replace current cataloging processes & practices
–Marc cataloguing in the ILS
–TTE maintenance in authority control
–Dublin Core content management for CMS sites and Archives
•Data sharable with the world
–Linked Open Data
–Schema.org

25
The Ambition –Technical Challenges
•To produce a Knowledge Graph that is [daily] up to date
•Not to replace current cataloging processes & practices
–Marc cataloguing in the ILS
–TTE maintenance in authority control
–Dublin Core content management for CMS sites and Archives
•Data sharable with the world
–Linked Open Data
–Schema.org
•An aggregated source of truth

26
Contract Awarded
metaphactoryplatform
Low-code knowledge graph platform
Semantic knowledge modeling
Semantic search & discovery
AWS Partner
Public sector partner
Singapore based
Linked Data, Structured data, Semantic
Web, bibliographic meta data, Schema.org
and management systems consultant

27
Basic Data Model
•Linked Data
–BIBFRAME to capture detail of bibliographic records
–Schema.org to deliver structured data for search engines

28
Basic Data Model
•Linked Data
–BIBFRAME to capture detail of bibliographic records
–Schema.org to deliver structured data for search engines
–Schema.org representation of CMS, NAS, TTE data

29
Basic Data Model
•Linked Data
–BIBFRAME to capture detail of bibliographic records
–Schema.org to deliver structured data for search engines
–Schema.org representation of CMS, NAS, TTE data
–Schema.org enrichment of BIBFRAME

30
Basic Data Model
•Linked Data
–BIBFRAME to capture detail of bibliographic records
–Schema.org to deliver structured data for search engines
–Schema.org representation of CMS, NAS, TTE data
–Schema.org enrichment of BIBFRAME
•Schema.org as the ‘lingua franca’ vocabulary of the Knowledge graph

31
Basic Data Model
•Linked Data
–BIBFRAME to capture detail of bibliographic records
–Schema.org to deliver structured data for search engines
–Schema.org representation of CMS, NAS, TTE data
–Schema.org enrichment of BIBFRAME
•Schema.org as the ‘lingua franca’ vocabulary of the Knowledge graph
–All entities described using Schema.org as a minimum.

32
Data Data Data!
Data Source Source Records Entity CountUpdate Frequency
ILS 1.4m 7.9m Daily
CMS 82k 228k Weekly
NAS 1.6m 6.7m Monthly
TTE 3k 317k Monthly
3.1m 15.15m

33
Data Ingest Pipelines
•Triggered by data upload from source system

34
Data Ingest Pipelines
•Triggered by data upload from source system
•ILS –daily

35
Data Ingest Pipelines
•Triggered by data upload from source system
•ILS –daily
–MARC-XML parsed through Open Source scripts:
•Marc2bibframe2 –Library of Congress
•Bibframe2schema –Bibframe2Schema.org

36
Data Ingest Pipelines
•Triggered by data upload from source system
•ILS –daily
–MARC-XML parsed through Open Source scripts:
•Marc2bibframe2 –Library of Congress
•Bibframe2schema –Bibframe2Schema.org
•TTE Authorities –Monthly

37
Data Ingest Pipelines
•Triggered by data upload from source system
•ILS –daily
–MARC-XML parsed through Open Source scripts:
•Marc2bibframe2 –Library of Congress
•Bibframe2schema –Bibframe2Schema.org
•TTE Authorities –Monthly
–Bespoke CSV conversion

38
Data Ingest Pipelines
•Triggered by data upload from source system
•ILS –daily
–MARC-XML parsed through Open Source scripts:
•Marc2bibframe2 –Library of Congress
•Bibframe2schema –Bibframe2Schema.org
•TTE Authorities –Monthly
–Bespoke CSV conversion
•CMS & NAS –Weekly / Monthly

39
Data Ingest Pipelines
•Triggered by data upload from source system
•ILS –daily
–MARC-XML parsed through Open Source scripts:
•Marc2bibframe2 –Library of Congress
•Bibframe2schema –Bibframe2Schema.org
•TTE Authorities –Monthly
–Bespoke CSV conversion
•CMS & NAS –Weekly / Monthly
–Dublin Core to Schema.org

40
Technical Architecture (simplified)
Hosted on Amazon Web Services
Batch Scripts
import control
Etc.
SOURCE DATA
IMPORT

41
Technical Architecture (simplified)
Hosted on Amazon Web Services
Pipeline
processing
Batch Scripts
import control
Etc.
SOURCE DATA
IMPORT

42
Technical Architecture (simplified)
Hosted on Amazon Web Services
GraphDB
Cluster
GraphDB
Cluster
GraphDB
Cluster
GraphDB
Cluster
Pipeline
processing
Batch Scripts
import control
Etc.
SOURCE DATA
IMPORT

43
Technical Architecture (simplified)
Hosted on Amazon Web Services
EDS
GraphDB
Cluster
GraphDB
Cluster
GraphDB
Cluster
GraphDB
Cluster
Pipeline
processing
Batch Scripts
import control
Etc.
SOURCE DATA
IMPORT

44
Technical Architecture (simplified)
Hosted on Amazon Web Services
EDS
GraphDB
Cluster
GraphDB
Cluster
GraphDB
Cluster
GraphDB
Cluster
Pipeline
processing
Batch Scripts
import control
Etc.
SOURCE DATA
IMPORT
DMI

45
A need for entity reconciliation …..
•Lots (and lots and lots) of source entities –10 million entities

46
A need for entity reconciliation …..
•Lots (and lots and lots) of source entities –10 million entities
•Lots of duplication
–Lee, KuanYew –1
st
Prime Minister of Singapore
•160 individual entities in ILS source data

47
A need for entity reconciliation …..
•Lots (and lots and lots) of source entities –10 million entities
•Lots of duplication
–Lee, KuanYew –1
st
Prime Minister of Singapore
•160 individual entities in ILS source data
–Singapore Art Museum
•Entities from source data
•21 CMS, 1 NAS, 66 ILS, 1 TTE

48
A need for entity reconciliation …..
•Lots (and lots and lots) of source entities –10 million entities
•Lots of duplication
–Lee, KuanYew –1
st
Prime Minister of Singapore
•160 individual entities in ILS source data
–Singapore Art Museum
•Entities from source data
•21 CMS, 1 NAS, 66 ILS, 1 TTE
•Users only want 1 of each!

49
Adaptive Data Model Concepts
•Source entitles
–Individual representation of source data

50
Adaptive Data Model Concepts
•Source entitles
–Individual representation of source data
•Aggregation entities
–Tracking relationships between source entities for the same thing
–No copying of attributes

51
Adaptive Data Model Concepts
•Source entitles
–Individual representation of source data
•Aggregation entities
–Tracking relationships between source entities for the same thing
–No copying of attributes
•Primary Entities
–Searchable by users
–Displayable to users
–Consolidation of aggregated source data & managed attributes

52

53

54

55

56

57

58

59

60

66
The entity iceberg

67
The entity iceberg
Primary

68
The entity iceberg
Primary
Discovery

69
The entity iceberg
Primary
Aggregation
Discovery

70
The entity iceberg
Primary
Aggregation
Source
Ingestion
Pipelines
Discovery

71
The entity iceberg
Primary
Aggregation
Source
Ingestion
Pipelines
Discovery
Management

72
The NLB Knowledge Graph
•666M Triples

73
The NLB Knowledge Graph
•666M Triples
•10M Source Entities

74
The NLB Knowledge Graph
•666M Triples
•10M Source Entities
•5.8M Primary Entities

75
The NLB Knowledge Graph
•666M Triples
•10M Source Entities
•5.8M Primary Entities
–Aggregation of source derived entities

76
The NLB Knowledge Graph
•666M Triples
•10M Source Entities
•5.8M Primary Entities
–Aggregation of source derived entities
–Searchable

77
The NLB Knowledge Graph
•666M Triples
•10M Source Entities
•5.8M Primary Entities
–Aggregation of source derived entities
–Searchable
–Shared with world

78
The NLB Knowledge Graph
•666M Triples
•10M Source Entities
•5.8M Primary Entities
–Aggregation of source derived entities
–Searchable
–Shared with world

79
NLB Linked Data Management System (LDMS)
•Powered by the Knowledge Graph

80
NLB Linked Data Management System (LDMS)
•Powered by the Knowledge Graph
•Updated daily

81
NLB Linked Data Management System (LDMS)
•Powered by the Knowledge Graph
•Updated daily
•A new separate environment built on established systems

82
NLB Linked Data Management System (LDMS)
•Powered by the Knowledge Graph
•Updated daily
•A new separate environment built on established systems
•No changes in cataloguing practices

83
NLB Linked Data Management System (LDMS)
•Powered by the Knowledge Graph
•Updated daily
•A new separate environment built on established systems
•No changes in cataloguing practices
•No cataloguer retraining

84
NLB Linked Data Management System (LDMS)
•Powered by the Knowledge Graph
•Updated daily
•A new separate environment built on established systems
•No changes in cataloguing practices
•No cataloguer retraining
•Not just the bibliographic (MARC) data

85
NLB Linked Data Management System (LDMS)
•Powered by the Knowledge Graph
•Updated daily
•A new separate environment built on established systems
•No changes in cataloguing practices
•No cataloguer retraining
•Not just the bibliographic (MARC) data
•No replacement systems –to implement Linked Data

86
NLB Linked Data Management System (LDMS)
•Powered by the Knowledge Graph
•Updated daily
•A new separate environment built on established systems
•No changes in cataloguing practices
•No cataloguer retraining
•Not just the bibliographic (MARC) data
•No replacement systems –to implement Linked Data
–MARC based ILS swap out occurred mid project –without LDMS impact

87
NLB Linked Data Management System (LDMS)
•Powered by the Knowledge Graph
•Updated daily
•A new separate environment built on established systems
•No changes in cataloguing practices
•No cataloguer retraining
•Not just the bibliographic (MARC) data
•No replacement systems –to implement Linked Data
–MARC based ILS swap out occurred mid project –without LDMS impact
•Delivering Linked Data benefits back into the organization

88
Building on the Knowledge Graph
Entity Data Service
•Open Linked Data interface

89
Building on the Knowledge Graph
Entity Data Service
•Open Linked Data interface
•Dereferencing entity URIs

90
Building on the Knowledge Graph
Entity Data Service
•Open Linked Data interface
•Dereferencing entity URIs
•Content negotiation for RDF/XML / JSON-LD / Turtle / N-Triples

91
Building on the Knowledge Graph
Entity Data Service
•Open Linked Data interface
•Dereferencing entity URIs
•Content negotiation for RDF/XML / JSON-LD / Turtle / N-Triples
•Download formats RDF/XML / JSON-LD / Turtle / N-Triples

92
Building on the Knowledge Graph
Entity Data Service
•Open Linked Data interface
•Dereferencing entity URIs
•Content negotiation for RDF/XML / JSON-LD / Turtle / N-Triples
•Download formats RDF/XML / JSON-LD / Turtle / N-Triples
•Embedded Schema.org

93
Building on the Knowledge Graph
Entity Data Service
•Open Linked Data interface
•Dereferencing entity URIs
•Content negotiation for RDF/XML / JSON-LD / Turtle / N-Triples
•Download formats RDF/XML / JSON-LD / Turtle / N-Triples
•Embedded Schema.org
•Enhanced navigation

97
Building on the Knowledge Graph
Enriching the User Journey

98
Building on the Knowledge Graph
Enriching the User Journey
•Systems are often silos

99
Building on the Knowledge Graph
Enriching the User Journey
•Systems are often silos
•User search and navigation constrained to the data in the silo

100
Building on the Knowledge Graph
Enriching the User Journey
•Systems are often silos
•User search and navigation constrained to the data in the silo
•Knowledge Graph populated from several individual systems

101
Building on the Knowledge Graph
Enriching the User Journey
•Systems are often silos
•User search and navigation constrained to the data in the silo
•Knowledge Graph populated from several individual systems
•Entities aggregated and related across system sources

102
Building on the Knowledge Graph
Enriching the User Journey
•Systems are often silos
•User search and navigation constrained to the data in the silo
•Knowledge Graph populated from several individual systems
•Entities aggregated and related across system sources
•The fuel to explore between systems

103
Building on the Knowledge Graph
Enriching the User Journey
•Systems are often silos
•User search and navigation constrained to the data in the silo
•Knowledge Graph populated from several individual systems
•Entities aggregated and related across system sources
•The fuel to explore between systems
•Via a navigational interface sidebar –bridging the silos

104
Building on the Knowledge Graph
Enriching the User Journey
•Systems are often silos
•User search and navigation constrained to the data in the silo
•Knowledge Graph populated from several individual systems
•Entities aggregated and related across system sources
•The fuel to explore between systems
•Via a navigational interface sidebar –bridging the silos
•Plugged into user interface

105
Building on the Knowledge Graph
Enriching the User Journey
•Systems are often silos
•User search and navigation constrained to the data in the silo
•Knowledge Graph populated from several individual systems
•Entities aggregated and related across system sources
•The fuel to explore between systems
•Via a navigational interface sidebar –bridging the silos
•Plugged into user interface
•Powered by a JavaScript Sidebar API

Use of the JavaScript Sidebar API

Use of the JavaScript Sidebar API
→ API call to KG –article ID passed as parameter

Use of the JavaScript Sidebar API
→ API call to KG –article ID passed as parameter
← Description of associated Primary entity returned

Description includes list of ‘about’ related entity IDs
used to build display and navigation links
Use of the JavaScript Sidebar API
→ API call to KG –article ID passed as parameter
← Description of associated Primary entity returned

Clicking sidebar links trigger new API calls to rebuild
the sidebar display as entity relationships are
followed
Description includes list of ‘about’ related entity IDs
used to build display and navigation links
Use of the JavaScript Sidebar API
→ API call to KG –article ID passed as parameter
← Description of associated Primary entity returned

Clicking sidebar links trigger new API calls to rebuild
the sidebar display as entity relationships are
followed
Knowledge Graph navigation via a sidebar
Description includes list of ‘about’ related entity IDs
used to build display and navigation links
Use of the JavaScript Sidebar API
→ API call to KG –article ID passed as parameter
← Description of associated Primary entity returned

Clicking sidebar links trigger new API calls to rebuild
the sidebar display as entity relationships are
followed
Knowledge Graph navigation via a sidebar
Description includes list of ‘about’ related entity IDs
used to build display and navigation links
Use of the JavaScript Sidebar API
→ API call to KG –article ID passed as parameter
← Description of associated Primary entity returned

115
KG Quality Enhancement from Authorities
LCNAF URI Ingestion
•For Person / Organization entities with LCNAF URIs

116
KG Quality Enhancement from Authorities
LCNAF URI Ingestion
•For Person / Organization entities with LCNAF URIs
•Created via the marc2bibframe2 scripts -from $0subfield

117
KG Quality Enhancement from Authorities
LCNAF URI Ingestion
•For Person / Organization entities with LCNAF URIs
•Created via the marc2bibframe2 scripts -from $0subfield
•Create rdfs:labelvalues from the marc record
eg.700$a+ 700$d

118
KG Quality Enhancement from Authorities
LCNAF URI Ingestion
•For Person / Organization entities with LCNAF URIs
•Created via the marc2bibframe2 scripts -from $0subfield
•Create rdfs:labelvalues from the marc record
eg.700$a+ 700$d
•These values are not controlled –entity can have several different labels

119
KG Quality Enhancement from Authorities
LCNAF URI Ingestion
•For Person / Organization entities with LCNAF URIs
•Created via the marc2bibframe2 scripts -from $0subfield
•Create rdfs:labelvalues from the marc record
eg.700$a+ 700$d
•These values are not controlled –entity can have several different labels
•Use LCNAF authority data to introduce naming consistency

120
KG Quality Enhancement from Authorities
LCNAF URI Ingestion
•For Person / Organization entities with LCNAF URIs
•Created via the marc2bibframe2 scripts -from $0subfield
•Create rdfs:labelvalues from the marc record
eg.700$a+ 700$d
•These values are not controlled –entity can have several different labels
•Use LCNAF authority data to introduce naming consistency
•Lookup against LCNAF to identify & ingest authoritative version

121
KG Quality Enhancement from Authorities
LCNAF URI Ingestion
•For Person / Organization entities with LCNAF URIs
•Created via the marc2bibframe2 scripts -from $0subfield
•Create rdfs:labelvalues from the marc record
eg.700$a+ 700$d
•These values are not controlled –entity can have several different labels
•Use LCNAF authority data to introduce naming consistency
•Lookup against LCNAF to identify & ingest authoritative version
•LCNAF values take precedence in primary entity consolidation

Quality Enrichment from Authorities
LCNAF URI Ingestion
MARC XML:

Quality Enrichment from Authorities
LCNAF URI Ingestion
MARC XML:

Quality Enrichment from Authorities
LCNAF URI Ingestion
MARC XML:
Bibframe RDF:

Quality Enrichment from Authorities
LCNAF URI Ingestion
MARC XML:
Bibframe RDF:

Quality Enrichment from Authorities
LCNAF URI Ingestion
MARC XML:
Bibframe RDF:
MARC XML:

Quality Enrichment from Authorities
LCNAF URI Ingestion
MARC XML:
Bibframe RDF:
MARC XML:
Bibframe RDF:

Quality Enrichment from Authorities
LCNAF URI Ingestion
MARC XML:
Bibframe RDF:
MARC XML:
Bibframe RDF:
Entity result in Knowledge Graph
Which is correct?

Quality Enrichment from Authorities
LCNAF URI Ingestion
MARC XML:
Bibframe RDF:
MARC XML:
Bibframe RDF:
Entity result in Knowledge Graph
Which is correct?

Quality Enrichment from Authorities
LCNAF URI Ingestion
MARC XML:
Bibframe RDF:
MARC XML:
Bibframe RDF:
Entity result in Knowledge Graph
Which is correct?
Ingest from LCNAF and give precedence in consolidation

131
Quality Enhancement from Authorities
LCNAF Person & Organization Name Matching
•For all Person and Organization primary entities

132
Quality Enhancement from Authorities
LCNAF Person & Organization Name Matching
•For all Person and Organization primary entities
•Perform a string-matching LCNAF lookup for schema:namevalues

133
Quality Enhancement from Authorities
LCNAF Person & Organization Name Matching
•For all Person and Organization primary entities
•Perform a string-matching LCNAF lookup for schema:namevalues
•Automatic background process

134
Quality Enhancement from Authorities
LCNAF Person & Organization Name Matching
•For all Person and Organization primary entities
•Perform a string-matching LCNAF lookup for schema:namevalues
•Automatic background process
•If exact match
–Ingest LCNAF entity –takes precedence in consolidation

135
Quality Enhancement from Authorities
LCNAF Person & Organization Name Matching
•For all Person and Organization primary entities
•Perform a string-matching LCNAF lookup for schema:namevalues
•Automatic background process
•If exact match
–Ingest LCNAF entity –takes precedence in consolidation
•If close match
–Add to list of match candidates

136
Quality Enhancement from Authorities
LCNAF Person & Organization Name Matching
•For all Person and Organization primary entities
•Perform a string-matching LCNAF lookup for schema:namevalues
•Automatic background process
•If exact match
–Ingest LCNAF entity –takes precedence in consolidation
•If close match
–Add to list of match candidates
–[Human] curator either accepts as a match or not

Quality Enrichment from Authorities
LCNAF Person & Organization Name Matching

Quality Enrichment from Authorities
LCNAF Person & Organization Name Matching

139
•2 years in development
NLB Linked Data Management System (LDMS)

140
•2 years in development
•Live and operational for 1.5 years
NLB Linked Data Management System (LDMS)

141
•2 years in development
•Live and operational for 1.5 years
•Built on a 666M triple Knowledge Graph
NLB Linked Data Management System (LDMS)

142
•2 years in development
•Live and operational for 1.5 years
•Built on a 666M triple Knowledge Graph
•Automatically updated daily
NLB Linked Data Management System (LDMS)

143
•2 years in development
•Live and operational for 1.5 years
•Built on a 666M triple Knowledge Graph
•Automatically updated daily
•Using Bibframe & Schema.org
NLB Linked Data Management System (LDMS)

144
•2 years in development
•Live and operational for 1.5 years
•Built on a 666M triple Knowledge Graph
•Automatically updated daily
•Using Bibframe & Schema.org
•Built on –not replacing –established systems & practices
NLB Linked Data Management System (LDMS)

145
•2 years in development
•Live and operational for 1.5 years
•Built on a 666M triple Knowledge Graph
•Automatically updated daily
•Using Bibframe & Schema.org
•Built on –not replacing –established systems & practices
•A Linked Data Service for NLB
NLB Linked Data Management System (LDMS)

146
•2 years in development
•Live and operational for 1.5 years
•Built on a 666M triple Knowledge Graph
•Automatically updated daily
•Using Bibframe & Schema.org
•Built on –not replacing –established systems & practices
•A Linked Data Service for NLB
–Utilizing external authorities to enrich and standardize descriptions
NLB Linked Data Management System (LDMS)

147
•2 years in development
•Live and operational for 1.5 years
•Built on a 666M triple Knowledge Graph
•Automatically updated daily
•Using Bibframe & Schema.org
•Built on –not replacing –established systems & practices
•A Linked Data Service for NLB
–Utilizing external authorities to enrich and standardize descriptions
–Part of Open Linked Data Cloud –via Entity Data Service
NLB Linked Data Management System (LDMS)

148
•2 years in development
•Live and operational for 1.5 years
•Built on a 666M triple Knowledge Graph
•Automatically updated daily
•Using Bibframe & Schema.org
•Built on –not replacing –established systems & practices
•A Linked Data Service for NLB
–Utilizing external authorities to enrich and standardize descriptions
–Part of Open Linked Data Cloud –via Entity Data Service
–Enriching user journeys on non-linked data systems –via sidebar API
NLB Linked Data Management System (LDMS)

A Semantic Knowledge Graph at
National Library Board Singapore
BIBFRAME Workshop in Europe 2024
17
th
September 2024 -Helsinki
Richard Wallis
Evangelist and Founder
Data Liberate
[email protected]