BioVault.net ADW 2025 CODATA: SyftBox: a General Purpose Solution for Data Visitation and Equitable Data Sharing

madhavajay, 20 slides, Oct 14, 2025

Slide Content

Lightning Talk SyftBox: a General Purpose Solution for Data Visitation and Equitable Data Sharing

Madhava Jay 🧬 Rare Disease Patient Software Engineer @ OpenMined 🚀 Help solve data access with open source 🌏 Brisbane, Australia 📧 [email protected]

Mission: Building the public network for non-public information

Founded in 2017. Tech nonprofit and 501(c)(3). We build open-source privacy-preserving technologies. > 30 team members, > 230 GitHub repos, ~ 20k Slack community.

This lightning talk:
1. Problems with data sharing
2. A general purpose solution
3. A use-case for equitable genomics

The Motivating Problem: Data’s true power comes from collaboration. But many data owners are forced to choose between giving up data ownership through copying and centralization, or simply not participating. Due to legal and ethical constraints, copying data across borders is often unacceptable, which results in no action. We need a new way to collaborate fairly and securely.

Data Visitation: Remotely study data on a computer at another organisation
Data Scientist: Can answer a “specific” question …and only that question
Datasite: Retains governance over the information they steward …and never shares a copy of the data
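The data visitation idea on this slide can be illustrated with a minimal Python sketch. This is not SyftBox's API: the `Datasite` class, its `approve` and `visit` methods, and the question name `mean_value` are all hypothetical, chosen only to show a steward answering one approved question while the raw records stay where they are.

```python
from statistics import mean
from typing import Callable, Dict, List


class Datasite:
    """Hypothetical data steward (illustrative only, not the SyftBox API)."""

    def __init__(self, records: List[float]) -> None:
        self._records = list(records)  # raw records are held privately
        self._approved: Dict[str, Callable[[List[float]], float]] = {}

    def approve(self, question: str, fn: Callable[[List[float]], float]) -> None:
        """The steward explicitly approves one specific question."""
        self._approved[question] = fn

    def visit(self, question: str) -> float:
        """Run an approved question locally and return only its result."""
        if question not in self._approved:
            raise PermissionError(f"question not approved: {question!r}")
        return self._approved[question](self._records)


site = Datasite([1.0, 2.0, 3.0])
site.approve("mean_value", mean)
answer = site.visit("mean_value")  # the data scientist receives a single number
```

Any question the steward has not approved raises `PermissionError`, which mirrors the "…and only that question" constraint.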

Federated Learning [Diagram: one Data Scientist connected to four Datasites, each running an FL Project]
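The federated pattern in this diagram can be sketched in Python in its simplest form: each datasite computes a local summary and the data scientist combines the summaries. This is an assumption-laden illustration, not the SyftBox implementation and not a full federated learning training loop (which would exchange model updates rather than sums); `local_summary` and `federated_mean` are hypothetical names.

```python
from typing import List, Tuple


def local_summary(records: List[float]) -> Tuple[float, int]:
    """Runs at a datasite: return only (sum, count), never the records."""
    return sum(records), len(records)


def federated_mean(summaries: List[Tuple[float, int]]) -> float:
    """Runs at the data scientist: combine per-site summaries into a global mean."""
    total = sum(s for s, _ in summaries)
    count = sum(n for _, n in summaries)
    if count == 0:
        raise ValueError("no records across datasites")
    return total / count


# Two hypothetical datasites, each keeping its own records locally.
site_a = [1.0, 2.0, 3.0]
site_b = [4.0, 5.0]
summaries = [local_summary(site_a), local_summary(site_b)]
global_mean = federated_mean(summaries)
```

Only the `(sum, count)` pairs cross the boundary between a datasite and the data scientist; the record lists themselves are never sent.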

SyftBox.net: An open-source, privacy-first, decentralized network for secure data collaboration

The SyftBox Platform:
Apache 2.0 Open-source
End-to-end encrypted
Permissionless network
Supports any data format
Runs any program or code
Enables federated analysis across multiple datasites
Low latency and large file transfer support

Try it out! https://github.com/OpenMined/syftbox

BioVault.net: A free, open-source, permissionless network for collaborative genomics. Built on SyftBox.

Problems with equitable genomics and data sharing:
Genomic data is private and very sensitive
Researchers face lengthy requests and institutional reviews
Sharing data across datasites is difficult because policies differ
Participants lack transparency on how their data is used
Many communities, especially in the Global South, lack expertise and resources to analyze their own data
Unequal and inequitable benefit sharing between haves and have-nots

Our solution: data visitation for genomics. We allow data owners to make their data available for remote analysis without uploading or exposing the raw data. Because we’re built on SyftBox and Nextflow, researchers can easily run arbitrary analyses and complex data pipelines.

Video Slide Demo https://youtu.be/6PJxS9Z030Q

Pilot Programmes: BioVault is enabling researchers in the Global South to participate in genomics and derive equitable benefits from their data. We are also partnering with Human Genome Project II to help deliver infrastructure and capacity building in genomics.
Dr Carika Weldon (Bermuda), Founder of CariGenetics
Dr Rana Dajani (Jordan), Professor at Hashemite University

Looking for partners and pilots: We are on a mission to deliver equitable access to data. We have resources to help solve your data access problems. Contact us to learn more: [email protected]