Presentation given at the Association of Moving Image Archivists Conference, November 14, 2008 in Savannah, GA. Part of the panel PBCore: What is it good for?
Added: Jul 14, 2009
Slides: 20 pages
Slide Content
PBCore, METS, PREMIS, MODS, METSRights... oh my!
Kara van Malssen
Senior Research Fellow, NYU
Preserving Digital Public Television
AMIA 2008
A little bit about the Preserving Digital Public Television Project
GOALS:
• Identify at-risk born-digital public television content
• Build an OAIS-compliant prototype repository
• Explore and apply standards
• Create selection guidelines
• Research sustainability models and copyright encumbrances
Project Partners
[Diagram: project partners PBS, Library of Congress, NYU, WNET, and WGBH, with the labels "SIP site" and "Repository".]
Submission Workflow
[Diagram: producing stations (WGBH, WNET, Stations A to C), PBS, satellite, transmitting stations (WNET, WGBH, Stations A to J), and the NYU PDPTV Prototype Repository.]
NYU Goals:
• Create a prototype repository for long-term retention
• Aggregate content from partner stations and PBS for sample programs
• Populate records with metadata that already exists (in station databases, files, scheduling systems, etc.)
• Transform data and package content while preserving relationships between items
Important Vocabulary
• The Repository: NYU prototype preservation repository
• OAIS: Open Archival Information System
• SIP: Submission Information Package
• AIP: Archival Information Package
OAIS Terms! (OAIS, SIP, AIP)
Applying standards
• Normalize disparate metadata
• XML based
• One uniform scheme
• Easier to manage over the long term
• Rules, vocabularies, schemas help maintain consistency
Challenge of managing diverse SIPs:
[Diagram: contents of the four SIP classes.]
SIP Class 1: WNET National Broadcast (Nature)
SIP Class 2: WGBH National Broadcasts
SIP Class 3: WNET Local Broadcast (New York Voices)
SIP Class 4: Religion and Ethics
Essence files: Production Master (mov), Production Master (mxf), HD Broadcast Master (mov/data), SD Broadcast Master (mov/aiff/m2v), SD Broadcast Master (mpeg)
DATABASE EXPORTS: PODS, PROTRACK, TEAMS, INMAGIC
ADDITIONAL ITEMS: Scripts, etc.
PDPTV metadata model
METS: Metadata Encoding and Transmission Standard
Structural and administrative
PBCore: Public Broadcasting Metadata Dictionary
Descriptive and technical
PREMIS: PREservation Metadata: Implementation Strategies
Technical preservation metadata
METS: Metadata Encoding and Transmission Standard
• Provides a structure to bundle all content (essence + metadata) in one AIP
• Identifies types of metadata, but not the terms to define them (with a few exceptions)
METS sections:
dmdSec
amdSec (techMD, rightsMD, sourceMD, digiprovMD)
fileSec
structMap
behaviorSec
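The section layout above can be sketched in a few lines of Python. This is only an illustration of the nesting, not the project's actual generator: the function name, the ID values, and the OBJID are made up, and the result is an empty skeleton rather than a schema-valid METS document (a real dmdSec, for example, needs an mdWrap or mdRef inside it).

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
ET.register_namespace("mets", METS_NS)


def q(tag):
    """Qualify a tag name with the METS namespace."""
    return f"{{{METS_NS}}}{tag}"


def build_mets_skeleton(obj_id):
    """Return an empty METS shell with the sections named on the slide.

    Hypothetical helper: IDs are placeholders, and the sections are left
    empty, so this shows structure only.
    """
    mets = ET.Element(q("mets"), {"OBJID": obj_id})
    ET.SubElement(mets, q("dmdSec"), {"ID": "DMD1"})

    # amdSec groups the four administrative metadata types.
    amd = ET.SubElement(mets, q("amdSec"), {"ID": "AMD1"})
    for name in ("techMD", "rightsMD", "sourceMD", "digiprovMD"):
        ET.SubElement(amd, q(name), {"ID": f"{name}-1"})

    ET.SubElement(mets, q("fileSec"))
    ET.SubElement(mets, q("structMap"))
    ET.SubElement(mets, q("behaviorSec"))
    return mets


if __name__ == "__main__":
    skeleton = build_mets_skeleton("example-aip")
    print(ET.tostring(skeleton, encoding="unicode"))
```

The point of the shell is that METS says where each kind of metadata goes, while the content of each section comes from another schema.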
PBCore: What is it good for?
• Descriptive metadata elements that are specific to public broadcasting
• Controlled vocabularies with broadcast terms
• Easy to map to from legacy station databases
• Granular technical metadata (PBCore 1.2)
➡ Accurately represents the file-specific metadata
➡ Can be auto-populated using technical metadata extraction tools & stylesheets
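As a rough illustration of mapping from a legacy station database, the sketch below turns one exported record into PBCore-style XML. The column names (PROGRAM_TITLE, SYNOPSIS, PRODUCER) are invented for the example, and the element names follow the PBCore 1.x container/value pattern as I understand it. Check them against the schema version in use. Namespaces, identifiers, and type qualifiers are left out.

```python
import xml.etree.ElementTree as ET

# Hypothetical legacy column -> (PBCore container element, value element).
# Element names are an assumption based on PBCore 1.x; verify before use.
CROSSWALK = {
    "PROGRAM_TITLE": ("pbcoreTitle", "title"),
    "SYNOPSIS": ("pbcoreDescription", "description"),
    "PRODUCER": ("pbcoreCreator", "creator"),
}


def legacy_to_pbcore(record):
    """Map one legacy database row (a dict) to a PBCore-style element tree.

    Empty or missing columns are skipped rather than emitted as blank
    elements.
    """
    doc = ET.Element("PBCoreDescriptionDocument")
    for column, (container, leaf) in CROSSWALK.items():
        value = (record.get(column) or "").strip()
        if not value:
            continue
        wrapper = ET.SubElement(doc, container)
        ET.SubElement(wrapper, leaf).text = value
    return doc


if __name__ == "__main__":
    row = {"PROGRAM_TITLE": "Nature", "PRODUCER": "WNET", "SYNOPSIS": ""}
    print(ET.tostring(legacy_to_pbcore(row), encoding="unicode"))
```

Keeping the crosswalk in a table rather than in code makes it easier to maintain one mapping per source system.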
PREMIS: PREservation Metadata: Implementation Strategies
[Diagram: PREMIS data model entities: Intellectual Entity, Object, Rights, Agents, Events]
Object Entity:
• Creating application info
• Playback environment (hardware and software)
Issue of Redundancy between standards
[Diagram: METS, PBCore, and PREMIS, with the properties Agents, Checksums, Structure, File Size, Hardware, Software, Rights, Relationships, File Format, Title, Creator, Description.]
Putting it all together
[Diagram: METS, PBCore, and PREMIS, with the properties Agents, Checksums, Structure, File Size, Hardware, Software, Rights, Relationships, File Format, Title, Creator, Description, plus METSRights and MODS.]
Descriptive elements only map to MODS
AIP creation simplified
1. Content submitted, verified
2. METS automatically generated (checksums into METS attributes)
3. Source database exports automatically converted to PBCore
4. Technical metadata extracted from files using MediaInfo, converted to PBCore
5. MODS created from completed PBCore
6. Rights metadata (METSRights), preservation metadata (PREMIS) created
7. AIP complete
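Step 2 (checksums into METS attributes) could look roughly like the sketch below, which hashes a file and writes the result into the CHECKSUM and CHECKSUMTYPE attributes of a METS file element. The slides do not say which algorithm the project used, so MD5 is an assumption here. The function names and the relative FLocat href are also illustrative.

```python
import hashlib
import os
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
XLINK_NS = "http://www.w3.org/1999/xlink"
ET.register_namespace("mets", METS_NS)
ET.register_namespace("xlink", XLINK_NS)


def md5_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large video masters are not read into memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for block in iter(lambda: handle.read(chunk_size), b""):
            digest.update(block)
    return digest.hexdigest()


def mets_file_element(path, file_id):
    """Build a METS <file> element carrying size and checksum attributes.

    Assumption: MD5 and a relative href are placeholders for whatever the
    repository actually records.
    """
    file_el = ET.Element(
        f"{{{METS_NS}}}file",
        {
            "ID": file_id,
            "SIZE": str(os.path.getsize(path)),
            "CHECKSUM": md5_of(path),
            "CHECKSUMTYPE": "MD5",
        },
    )
    ET.SubElement(
        file_el,
        f"{{{METS_NS}}}FLocat",
        {"LOCTYPE": "URL", f"{{{XLINK_NS}}}href": os.path.basename(path)},
    )
    return file_el
```

Recording the checksum at packaging time means the same value can be recomputed later to verify that a file in the AIP has not changed.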
AIPs:
AIP Class 1: Nationally distributed content (Nature)
AIP Class 4: Religion and Ethics
[Diagram: contents of the AIP classes.]
ESSENCE FILE TYPES: Production Master (mov), Production Master (mxf), HD Broadcast Master (mov/data), SD Broadcast Master (mov/aiff/m2v), SD Broadcast Master (mpeg)
METADATA: METS, PBCore, PREMIS, METSRights, MODS, original database exports
ADDITIONAL ITEMS: Scripts, etc.