Goobi overview

625 views 28 slides Feb 28, 2016
Slide 1
Slide 1 of 28
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28

About This Presentation

The Goobi workflow system, introduction and demos


Slide Content

Steffen&Hankiewicz,&intranda&GmbH
The$Goobi$workflow$system,$introduction$and$demos$
Steffen&Hankiewicz,&intranda&GmbH&
London,&10.11.2014
1
10.11.2014

Steffen&Hankiewicz,&intranda&GmbH
2
10.11.2014
1.$What$are$we$doing?
+ + =

Steffen&Hankiewicz,&intranda&GmbH
2
10.11.2014
1.$What$are$we$doing?
Wearecontentproviders
+ + =

Steffen&Hankiewicz,&intranda&GmbH
3
10.11.2014
2.$We$are$content$providers$…
Imagine:++
‣10.000&pages&
Precondi1on:+
‣digi?zed&images&on&
the&web&online&
Search+for:+Illustra1on+

Steffen&Hankiewicz,&intranda&GmbH
4
10.11.2014
2.$We$are$content$providers$…
Imagine:++
‣10.000&pages&
Precondi1on:+
‣digi?zed&images&on&
the&web&online&
Search+for:+Illustra1on+
‣…+in+the+full+text+

Steffen&Hankiewicz,&intranda&GmbH
5
10.11.2014
2.$We$are$content$providers$…
Imagine:++
‣10.000&pages&
Precondi1on:+
‣digi?zed&images&on&
the&web&online&
Search+for:+Illustra1on+
‣…&in&the&full&text&
‣…+on+page+CCXVI+

Steffen&Hankiewicz,&intranda&GmbH
6
10.11.2014
2.$We$are$content$providers$…
Imagine:++
‣10.000&pages&
Precondi1on:+
‣digi?zed&images&on&
the&web&online&
Search+for:+Illustra1on+
‣…&in&the&full&text&
‣…&on&page&CCXVI&
‣…+as+structure+
element+

Steffen&Hankiewicz,&intranda&GmbH
7
10.11.2014
2.$We$are$content$providers$…
Imagine:++
‣10.000&pages&
Precondi1on:+
‣digi?zed&images&on&

the&web&online&
Search+for:+Illustra1on+
‣…&in&the&full&text&
‣…&on&page&CCXVI&
‣…&as&structure&element&
‣…+as+word+in+the+1tle+
of+a+chapter+

Steffen&Hankiewicz,&intranda&GmbH
8
10.11.2014
2.$We$are$content$providers$…
Imagine:++
‣10.000&pages&
Precondi1on:+
‣digi?zed&images&on&

the&web&online&
Search+for:+Illustra1on+
‣…&in&the&full&text&
‣…&on&page&CCXVI&
‣…&as&structure&element&
‣…&as&word&in&the&?tle&of&
a&chapter&
‣…+as+synonym+for+
‚drawing’

Steffen&Hankiewicz,&intranda&GmbH
9
10.11.2014
3.$We$need$some$workflow
1.&Source&material
2.&Create&digital&version
3.&Transforma?on&&&Enrichment
4.&Publish&digital&version(s)
foreachitem

Steffen&Hankiewicz,&intranda&GmbH
9
10.11.2014
3.$We$need$some$workflow
1.&Source&material
2.&Create&digital&version
3.&Transforma?on&&&Enrichment
4.&Publish&digital&version(s)
Image&conversion
Image&valida?on
OCR
ALTO&genera?on
Descrip?ve&
metadata
Technical&metadata
Pagina?on
Catalogue&
enrichment
Ingest&into&archive
NER
Authority&data
Persistent&
Iden?fiers
foreachitem
Logical&structures Invoicing

Steffen&Hankiewicz,&intranda&GmbH
10
10.11.2014
4.$Goobi$C$a$quick$overview
...&try&to&solve&common&problems
‣Web&applica?on&
‣Workflow&tool&
‣Manage&users&
‣Organize&projects&
‣Deadlines&
‣Data&storage&&
‣Metadata&formats

Steffen&Hankiewicz,&intranda&GmbH
11
10.11.2014
5.$Goobi$C$how$it$works$...
...&a&simple&approach
‣Workflows&cut&into&
small&pieces&
‣Simple&sequen?al&
order&of&tasks&
‣As&much&valida?on&
as&early&as&possible&
‣Restrict&access&to&
the&requirements&
‣Hide&everything&else&
from&the&user

Steffen&Hankiewicz,&intranda&GmbH
12
10.11.2014
6.$Goobi$C$the$users$perspective
...&avoid&difficul?es
‣Simple&UI&
‣Work&with&TodDodList&
‣Hidden&complexi?es:&
‣Storage&
‣Projects&
‣Infrastructure&
‣Clean&desk

Steffen&Hankiewicz,&intranda&GmbH
13
10.11.2014
6.$Goobi$C$the$users$perspective
Steffen
Goobi&web&interface

Steffen&Hankiewicz,&intranda&GmbH
13
10.11.2014
6.$Goobi$C$the$users$perspective
working&directory&
of&Steffen
Steffen
Goobi&web&interface

Steffen&Hankiewicz,&intranda&GmbH
13
10.11.2014
6.$Goobi$C$the$users$perspective
working&directory&
of&Steffen
Steffen
Goobi&web&interface

Steffen&Hankiewicz,&intranda&GmbH
14
10.11.2014
6.$Goobi$C$the$users$perspective
Server&side&&
programs
Plugins&without&
user&interface
Plugins&with&

user&interface

Steffen&Hankiewicz,&intranda&GmbH
15
10.11.2014
7.$Goobi$C$Management$overview
...&manage&your&workflows
‣Manage&all&typical&
configura?on&in&the&UI&
‣Workflows&
‣Projects&
‣Users&
‣User&groups&
‣Imports&
‣Exports&

Steffen&Hankiewicz,&intranda&GmbH
16
10.11.2014
7.$Goobi$C$Management$overview
...&control&your&progress
‣Controlling&and&
sta?s?cs&
‣Manipulate&workflows&
aferwards&(e.g.&&with&
GoobiScript)&
‣Collaborate&with&
external&partners&or&
agencies

Steffen&Hankiewicz,&intranda&GmbH
17
10.11.2014
8.$Goobi$C$technical$background
...&what&else&can&be&done?
‣Workflows&can&be&...&
‣simple&or&complex&
‣short&or&long&
‣contain&tasks&
‣have&a&progress&
‣used&as&template&
Import from catalogue
Scanning
Quality control
Image conversion
OCR
Structure- & metadata
ID-Generating
Presentation
Archiving

Steffen&Hankiewicz,&intranda&GmbH
18
10.11.2014
8.$Goobi$C$technical$background
...&what&else&can&be&done?
‣Workflow&steps&can&...&
‣be&executed&manually&by&a&user&
‣be&executed&automa?cally&by&the&
server&
‣interrupt&the&workflow&for&a&given&
?me&
‣contain&a&valida?on&
‣allow&or&forbid&access&or&changes&
‣be&triggered&by&a&webdAPI&
‣call&scripts&or&external&programs&
‣have&their&own&UI&as&pluginsImport from catalogue Scanning Quality control
Image conversionOCR Structure- & metadata ID-Generating Presentation Archiving

Steffen&Hankiewicz,&intranda&GmbH
19
10.11.2014
9.$Goobi$C$Extend$its$functionality
Goobi

Steffen&Hankiewicz,&intranda&GmbH
19
10.11.2014
9.$Goobi$C$Extend$its$functionality
Goobi
Web API
command
plugins
Import 

plugins
Validation
plugins
Step plugins

... ... ... ... Close
step
Create
process
Run
script
Delete
files... ...
OAIWord
Steffen&Hankiewicz,&intranda&GmbH
19
10.11.2014
9.$Goobi$C$Extend$its$functionality
Goobi
Web API
command
plugins
Import 

plugins
Validation
plugins
Step plugins
PICAMARC... ...
QA
JP2
Ingest
Export... ...
JP2 MD5
Schema
Color
depth... ... ... ... ... ...

Steffen&Hankiewicz,&intranda&GmbH
20
10.11.2014
10.$Goobi$C$Scripts$&$applications
‣OCR&
‣JPEG&
‣JPEG&2000&
‣Jpylyzer&
‣Archiving&
‣DownloaddJobs&
‣Exporters&
‣Named&En?ty&
Recogni?on

Steffen&Hankiewicz,&intranda&GmbH
21
10.11.2014
11.$Goobi$C$production$proven
‣Lots&of&ins?tu?ons&
‣Different&kinds&of&material&
‣Community&driven&
‣Open&Source&
‣Ac?ve&development&
‣Inhouse&or&hosted&
‣Scalabilty
Alotofhappycontentproviders

Steffen&Hankiewicz,&intranda&GmbH
Questions?
22
10.11.2014
‣hip://www.intranda.com&
[email protected]&
‣+49&551&29176100&
intranda&GmbH&d&Steffen&Hankiewicz