OpenStreetMap, one geographic database to rule them all?
Formal Metadata
Number of Parts | 542
License | CC Attribution 2.0 Belgium: You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.
Identifiers | 10.5446/61428 (DOI)
FOSDEM 2023, part 350 / 542
Transcript: English (auto-generated)
00:05
Thank you for coming to my talk. I'm very glad that the room is so packed, so I hope that this will be of interest to you. My talk is named OpenStreetMap, one geographic database to rule them all, mapping the railway network for the public, with the public.
00:22
And I will focus on OpenStreetMap and open data related topics for OSRD, which is an open source project developed by SNCF, the French railway company, and which is part of the Open Rail Foundation. So there is a lot of information about this here on the panel.
00:45
Just a few reminders about why a railway company should invest in open data. I think you are all convinced that open data is the way to go for all projects. But inside the railway companies, it's not always that obvious.
01:03
So we want long-distance trains across Europe, so we can build together the transport network of the future on rails. We want European cooperation, because we have railway infrastructure managers in all European countries that have the same needs.
01:21
And yet we are still paying different software providers for the same tools and the same data. And of course, we want free competition, and to prove that all of the train operators we work with are treated the same. So if we share the same source code and the same data, we can ensure that.
01:45
I will dive into the specific needs of OSRD, which is our project. Of course, you may have different data needs, so I will focus on this. If anyone in the room has experience with other types of data, I will be very happy to discuss it with you.
02:04
So in OSRD, we have four main features. Pathfinding or route compatibility check is to find train paths in the European railway network. Running time calculation is to calculate the time that the train will take to go from point A to point B.
02:23
Conflict detection is to ensure that the train will not run into another train during its route. And short-term train planning is to add a new train into the timetable at the last minute. Maybe you were lucky to hear my colleague, Elois, this morning talk about this topic.
02:41
So to do these four features, we need a lot of data. Tracks, geometry and topology at track level and not line level. Signals, switches, routes and detectors, which are kind of technical objects. Electrification of the tracks, loading gauge, speed limits, slopes, curves, real-time position of trains,
03:06
and stations, which can be useful for display. So I've detailed the needs for each of the features, but what you should remember is that we need a lot of data, which is all geographic and of high quality.
03:22
So the goal of this study and what I will share with you today is we want to find and compare European level open data to choose the best source for our needs at OSRD, but also maybe for your needs if you're working with the same data needs in your projects.
03:40
I've compared four data sources. The RINF, or Register of Infrastructure, is a data source provided by the European Union Agency for Railways. INSPIRE is a European directive that ensures geographic data is shared across Europe. Then we can find open data platforms of infrastructure managers, but there is one data platform for each company,
04:08
so it can be quite confusing to find good data, and of course they all use different formats. And finally, OpenStreetMap, which is, as you all know, I hope, a collaborative database of geographic data,
04:31
and it fits all of our needs. We want open data. We want a data model which is consistent across Europe so that we don't have to change the parameters of our tool in each country.
04:41
We want a data model that can evolve if we want to add a new feature. Of course, we need English documentation, easy data access, and a wide data perimeter. Let's try to access some data. So here I am on the INSPIRE website.
05:00
I can find a broken link on a page in mixed languages. Another example of INSPIRE data, which is supposed to have good metadata. Here you can see the link to access the data, which is in the middle of the page, so very easy to find.
05:20
And finally, another example. I could go on and on about this, but this is a page in, I think, Swedish, and it can neither be translated nor copied and pasted into any translator. So you have to click and download the data, hoping for the best. This is not to blame the people that have created these pages, but just
05:41
to share that finding open data can be very time-consuming and very difficult, especially if you, like me, don't speak all the European languages. Then, once we have downloaded the data, we can try to assess data quality. For example, this is the railway network in Italy that I've downloaded from the INSPIRE data set.
06:06
And as you can see, there's supposedly a railway tunnel that links mainland Italy and Sardinia. So I was very, very surprised by that. I checked on the official RFI website, which is the infrastructure manager for Italy.
06:23
And on the official website, we cannot find this underwater tunnel. So, of course, I was not allowed to travel across all of Europe to check the quality of all the data that I've downloaded.
06:40
In some places, it is true, but there it is not. So the first question we want to ask is, for all the open data sources that I've found, are their licenses compatible with OpenStreetMap?
07:02
In many cases, they are. But unfortunately, for the Creative Commons license, we must ask the provider if the attribution in OpenStreetMap is good enough. So this can take more time, and it's not as easy as with other types of licenses.
07:23
So if you publish open data, it's important to check if the license is compatible with OSM. And as you can see, unfortunately, there are still many European countries where I have found no open data source at all. So maybe it's because I don't speak the language, but still it's problematic.
07:43
Then I've done a little quantitative comparison of the data I've found. So this is a comparison of total track length, by country and by source. As you can see, I have found data on OpenStreetMap for all of the European countries, but I have not found a non-OSM open data source for every country.
08:06
And even more, what we can see on the graph is that in every country, the OpenStreetMap data shows more tracks than the open data. So even if there is open data available, it seems that the OpenStreetMap data is more complete.
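As an illustration of the track-length comparison described above, here is a minimal Python sketch. It is not the method used in the study (the talk mentions QGIS for most of the analysis); the function names, the `(country, source, coordinates)` input shape, and the haversine approximation are all assumptions made for the example.

```python
from collections import defaultdict
from math import asin, cos, radians, sin, sqrt

EARTH_RADIUS_KM = 6371.0088  # mean Earth radius, an approximation


def haversine_km(a, b):
    """Great-circle distance in km between two (lon, lat) points in degrees."""
    lon1, lat1 = map(radians, a)
    lon2, lat2 = map(radians, b)
    h = sin((lat2 - lat1) / 2) ** 2 + cos(lat1) * cos(lat2) * sin((lon2 - lon1) / 2) ** 2
    return 2 * EARTH_RADIUS_KM * asin(sqrt(h))


def line_length_km(coords):
    """Length of a polyline given as a list of (lon, lat) points."""
    return sum(haversine_km(p, q) for p, q in zip(coords, coords[1:]))


def track_length_by_country_and_source(features):
    """Sum track length per (country, source) pair.

    `features` is a hypothetical iterable of (country, source, coords) tuples,
    one per track geometry.
    """
    totals = defaultdict(float)
    for country, source, coords in features:
        totals[(country, source)] += line_length_km(coords)
    return dict(totals)
```

With totals in this shape, comparing OpenStreetMap against another source for one country is a matter of reading two keys, for example `totals[("FR", "osm")]` and `totals[("FR", "open_data")]` (both keys are made up here).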
08:24
Then I tried to design an indicator to see if all the useful data was available for OSRD needs. So here you can see the same data needs that I've presented before, and I have classified them by necessity.
08:45
So we require tracks and signals to make OSRD run, and then the other data are optional, which means if we have them, this is good, and we will have a better result. But if we don't have them, we can still run our tool and have partial results.
09:03
So I've designed an indicator which is good if we have the two required data types and two or more optional ones. Then an OK indicator if we have part of the required data. The required count can be one and a half if we have partial data.
09:26
So it's quite complicated, but I have shared the full methodology on the blog, and I will send you the link afterwards, so don't worry. What you have to remember is that this indicator will give you an overview of whether the available data can be used for OSRD needs.
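The indicator can be sketched in Python as follows. This is only a reading of what is said in the talk, not the published methodology: the "good" rule and the half point for partial data come from the talk, while the exact conditions for "ok", "poor" and "no data", the field names, and the availability labels are assumptions made for the example.

```python
# Required and optional data types, as listed in the talk.
REQUIRED = ("tracks", "signals")
OPTIONAL = (
    "switches", "routes", "detectors", "electrification", "loading_gauge",
    "speed_limits", "slopes", "curves", "realtime_positions", "stations",
)

# Hypothetical availability labels; partial data counts for half a point.
SCORE = {"full": 1.0, "partial": 0.5, "none": 0.0}


def required_score(availability):
    """Sum of scores over the required data types (0 to 2, in steps of 0.5)."""
    return sum(SCORE[availability.get(name, "none")] for name in REQUIRED)


def optional_count(availability):
    """Number of optional data types that are at least partially available."""
    return sum(1 for name in OPTIONAL if availability.get(name, "none") != "none")


def classify(availability):
    """Classify a country/source pair as 'good', 'ok', 'poor' or 'no data'."""
    required = required_score(availability)
    optional = optional_count(availability)
    if required == 0 and optional == 0:
        return "no data"  # assumption: nothing found at all
    if required == 2 and optional >= 2:
        return "good"  # rule stated in the talk
    if required > 0:
        return "ok"  # assumption: at least part of the required data
    return "poor"  # assumption: only optional data is available
```

For example, a source with full tracks and partial signals has a required score of one and a half and would be classified as "ok" under these assumptions.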
09:47
So what are the results of this study? First, what we can do with open data. Unfortunately, as you can see, the map is not so green, so there are a few countries where you can do an OK or poor implementation of OSRD
10:05
using open data, excluding OpenStreetMap. And then we can see the map for the OpenStreetMap data. It's better. It's not much better, but it's better. So there are many countries that were red in the first map that are now green, and there are many countries that were gray that are now red.
10:31
So it's not that good, but it's better. What we can see is that OpenStreetMap is the database we should use and improve because it's currently the best standard across Europe.
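For readers who want to look at the OpenStreetMap railway data themselves, one common route is the Overpass API. The sketch below only builds an Overpass QL query string for tracks, signals and switches in one country; it does not send any request, and it is not a description of how OSRD imports data. The function name and the choice of tags (`railway=rail`, `railway=signal`, `railway=switch`) are assumptions for the example.

```python
def build_overpass_query(iso_country_code, timeout_s=180):
    """Build an Overpass QL query for railway tracks, signals and switches.

    `iso_country_code` is an ISO 3166-1 alpha-2 code such as "FR".
    The query selects the country's administrative area, then the railway
    objects inside it, and asks for their geometry.
    """
    return (
        f"[out:json][timeout:{timeout_s}];\n"
        f'area["ISO3166-1"="{iso_country_code}"][admin_level=2]->.country;\n'
        "(\n"
        '  way["railway"="rail"](area.country);\n'
        '  node["railway"="signal"](area.country);\n'
        '  node["railway"="switch"](area.country);\n'
        ");\n"
        "out geom;"
    )


if __name__ == "__main__":
    # Print the query so it can be pasted into an Overpass client.
    print(build_overpass_query("FR"))
```

Country-wide railway queries can be heavy, so for larger countries a regional extract may be a more practical starting point than a single query.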
10:45
So as I've said, you can look at the full data and methodology on our blog. So there is the detailed analysis for each country as well as the sources for each open data set that I found.
11:01
So if you're interested in one country specifically, you can check this out. I'd like to thank the people that made the icons for this presentation, and also a special thanks to the QGIS community, which has allowed me to make the maps and most of the analysis.
11:22
So maybe if there are QGIS developers here, thank you so much for your work. And finally, if you want to contact us, there are emails. You can learn more about the OSRD project on our website. You can chat with us. And if you are a railway company, you might be interested in joining the Open Rail Foundation.
11:41
So let us know. Thank you for listening.