TORCH DH
Zooniverse
IoT
Engineering
Cyber
Linguis7cs
English
Oxford
Mar7n
School
Saïd
ARC IT Services
ECI
Geography
SKA
CUDA
Physics
Computer Science
Maths
History
Oxford
Internet
Ins7tute
Music
Archaeology
Classics
Zoology
Museums
Wolfson
Law
Bodleian Libraries
Pharmacology
Biochemistry
Social
Sciences
Mathematical,
Physical &
Life Sciences
Humanities
Medical
Sciences
Colleges
GLAM
More people
More machines
HPC
Conven7onal
Computa7on
Social
Networks
Science 2.0
Internet of Things
Big
Data AI
Computa7onal
Linguis7cs,
Musicology,
Social Science,
Humani7es
Ci7zen Science
Volunteer
Compu7ng
SKA
Visualiza7on
Bio Data sharing
Machine
Learning
Scien7fic
Compu7ng
Cybersecurity
Energy Efficient
Computing
Infrastructure
(STFC)
De-identified admin
(inc. health) data
Business
data
Open data
(public sector)
Social media
data
Research
data
Longitudinal
survey data
Open data
Securely held data
Environment
data
Business and LG Data
Research Centres
(ESRC)
Admin Data Research
Centres (ESRC)
High Performance
Data Environment
(NERC)
Clinical
data
Medical Bioinformatics (MRC)
Understanding Populations
(ESRC)
Clinical Practice Datalink
(MHRA, NIHR)
100,000 Genome Project NHS)
Research Data Facility (EPSRC)
European Bioinformatics
Institute (EMBL)
Bioscience E-Infrastructure
(BBSRC)
Square Kilometre Array (STFC)
Digital Transformations
(AHRC)
Archive
data
Open Data
Institute
Commercial
Research
Understanding Populations
(ESRC)
New Research Questions
▶ Social media data offers
the possibility of studying
social processes as they
unfold at the level of
populations, as an
alternative to traditional
surveys or interviews.
▶ The data from social media
is described as "qualitative
data on a quantitative
scale" and requires
innovative analysis
techniques.
Social media data and
real time analytics
https://twitter.com/CR_UK/status/446223117841494016/
Some people's smartphones
had autocorrected the word
"BEAT" to instead read
"BEAR".
"Thank you for choosing an
adorable polar bear," the
reply from the WWF said.
"We will call you today to set
up your adoption."
http://www.bbc.com/news/technology-26723457
Social Media Triangle
social media
data and
analytics
social media for
engagement with
research
social media
as a subject
of research
Sam McGregor
Tweets per day
http://www.internetlivestats.com/twitter-statistics/
New Forms of Data
▶ Internet data, derived from social
media and other online interactions
(including data gathered by
connected people and devices, eg
mobile devices, wearable
technology, Internet of Things)
▶ Tracking data, monitoring the
movement of people and objects
(including GPS/geolocation data,
traffic and other transport sensor
data, CCTV images etc)
▶ Satellite and aerial imagery (eg
Google Earth, Landsat, infrared,
radar mapping etc) http://www.oecd.org/sti/sci-tech/new-data-for-
understanding-the-human-condition.htm
New Forms of Data Centre for Doctoral Training
Much of the value of ‘new forms of data’ lie in the
potential for them to be analysed in near real-time,
which presents opportunities for revealing
phenomena as they unfold, enabling timely response
with immediate influence.
Such analysis brings distinct new computational
requirements, requires new skills, and makes new
demands on the ease of use and capability of the
national e-Infrastructure.
http://www.esrc.ac.uk/funding-and-guidance/postgraduates/dtc/
dtc-policy/commissioning-of-centres-for-doctoral-training.aspx
A rehearsal for the future
▶ The Internet of Things
describes a world in which
everyday objects are
connected to a network so that
data can be shared
▶ But it is really as much about
people as the inanimate object
▶ It is impossible to anticipate
all the social changes that
could be created by connecting
billions of devices
https://www.gov.uk/government/publications/internet-of-things-blackett-review
There is no such thing as the Internet of Things
There is no such thing as a closed system
Humans are creative and subversive
The Rise of the Bots A Swarm of Drones
Accidents happen (in the lab, bin)
Holding machines to account Software vulnerability
Where are the throttle points?
@dder
PETRAS
Privacy, Ethics, Trust, Reliability, Acceptability, and Security
for the Internet of Things
• The fusion of the cyber, physical and human elements
• Scale: from 1mm
3
devices to large infrastructure systems
• Managing devices throughout their (decades long) lifetimes
• New and evolving threat landscape
• Continue to operate when partially compromised
The Challenges are numerous
• Safety vs Security
• Security vs Efficiency
• Hardening vs Adaptive Response
Tradeoffs
Data Detect Store Analytics Filter Analysts
Real life is and must be full of all kinds of social
constraint – the very processes from which society
arises. Computers can help if we use them to create
abstract social machines on the Web: processes in
which the people do the creative work and the
machine does the administration... The stage is set
for an evolutionary growth of new social engines.
The ability to create new forms of social process
would be given to the world at large, and
development would be rapid.
Berners-Lee, Weaving the Web, 1999 (pp. 172–175)
Social Machines
The Macroscope
Observer of
one social
machine
Observers using third
party observatory
Observer of
multiple social
machines
Human
participants in
Social
Machine
Human participants in
multiple Social Machines
Observer of Social
Machine infrastructure
1
4
2
3
5
6
SM
SM
SM
Social Machine
Observing Social
Machines
7
@dder
De Roure, D.,
Hooper, C., Page,
K., Tarte, S., and
Willcox, P. 2015.
Observing Social
Machines Part 2:
How to Observe?
ACM Web Science
Edwards, P. N., et al. (2013) Knowledge Infrastructures: Intellectual Frameworks and Research
Challenges. Ann Arbor: Deep Blue. http://hdl.handle.net/2027.42/97552
A computationally-enabled
sense-making network of
expertise, data, software,
models and narratives
Big Data, in a"
Big Data Centre
The R Dimensions
reproducible repeatable replicable reusable
referenceable retrievable reviewable rerunnable
replayable re-interpretable reprocessable
recomposable reconstructable repurposable
reliable respec]ul reputable revealable
recoverable restorable reparable refreshable
@dder 14 April 2014
sci method
access
understand
new use
social
cura7on
De Roure, D. 2014. The future
of scholarly communications.
Insights: the UKSG journal,
27, (3), 233-238.
DOI 10.1629/2048-7754.171
Human Digital Physical Triangle
human
digital physical
social
media
IoT
automation
and scale
New Data New Computation
▶ From Large Hadron Collider to Large People Collider
▶ New forms of data, new social processes
▶ Internet of Things is a thing
▶ Growing requirement for realtime data fusion and
analytics
▶ Increasing automation: growth of machine learning
▶ Fourth quadrant is underpinned by emerging
computational capability
David De Roure [email protected]
Thanks to Christine Borgman, Emil Lupu, Sam McGregor,
Nigel Shadbolt, Vonu Thakuriah.
http://www.slideshare.net/davidderoure/new-data-new-computation