CoC25US - Utilizing Real-Time Transit Data for Travel Optimization
bunkertor
69 views
26 slides
Sep 15, 2025
Slide 1 of 26
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
About This Presentation
CoC25US - Utilizing Real-Time Transit Data for Travel Optimization
https://communityovercode.org/schedule/
September 12, 2025
Timothy Spann
Utilizing Real-Time Transit Data for Travel Optimization
There are a lot of factors involved in determining how you can find our way around and avoid delays,...
CoC25US - Utilizing Real-Time Transit Data for Travel Optimization
https://communityovercode.org/schedule/
September 12, 2025
Timothy Spann
Utilizing Real-Time Transit Data for Travel Optimization
There are a lot of factors involved in determining how you can find our way around and avoid delays, bad weather, dangers and expenses. In this talk I will focus on public transport in the largest transit system in the United States, the MTA, which is focused around New York City. Utilizing public and semi-public data feeds, this can be extended to most city and metropolitan areas around the world. As a personal example, I live in New Jersey and this is an extremely useful use of open source and public data.
Once I am notified that I need to travel to Manhattan, I need to start my data streams flowing. Most of the data sources are REST feeds that are ingested by Apache NiFi to transform, convert, enrich and finalize it for usage in Parquet files stored as Apache Iceberg tables.
Lake of the Isles Fri 4:10 pm - 4:50 pm
Size: 8.55 MB
Language: en
Added: Sep 15, 2025
Slides: 26 pages
Slide Content
Utilizing Real-Time Transit
Data for Travel
Optimization
Tim Spann, Senior Solutions Engineer, Snowflake
This week in Snowflake, Apache NiFi,
Apache Flink, Apache Kafka, ML, AI,
Streamlit, Jupyter, Apache Iceberg, Apache
Polaris, Python, Java, LLM, GenAI, Vectors
and Open Source friends.
https://bit.ly/32dAJft
AI + Streaming Weekly by Tim Spann
There are a lot of factors involved in determining how you can find our way around and avoid delays, bad weather, dangers
and expenses. In this talk I will focus on public transport in the largest transit system in the United States, the MTA, which is
focused around New York City. Utilizing public and semi-public data feeds, this can be extended to most city and metropolitan
areas around the world. As a personal example, I live in New Jersey and this is an extremely useful use of open source and
public data.
Once I am notified that I need to travel to Manhattan, I need to start my data streams flowing. Most of the data sources are
REST feeds that are ingested by Apache NiFi to transform, convert, enrich and finalize it for usage in Parquet files stored as
Apache Iceberg tables.