Writing better PostGIS queries - TIB AV-Portal

Writing better PostGIS queries

00:00

152

Zugehöriges Material

Open Source Geospatial Foundation (OSGeo)

Formale Metadaten

Titel

Writing better PostGIS queries

Serientitel

FOSS4G 2014 Portland

Anzahl der Teile

188

Autor

Lizenz

CC-Namensnennung 3.0 Deutschland:
Sie dürfen das Werk bzw. den Inhalt zu jedem legalen Zweck nutzen, verändern und in unveränderter oder veränderter Form vervielfältigen, verbreiten und öffentlich zugänglich machen, sofern Sie den Namen des Autors/Rechteinhabers in der von ihm festgelegten Weise nennen.

Identifikatoren

10.5446/31741 (DOI)

Herausgeber

Open Source Geospatial Foundation (OSGeo)

Erscheinungsjahr

Sprache

Produzent

Open Source Geospatial Foundation (OSGeo)

Produktionsjahr

2014

Produktionsort

Portland, Oregon, United States of America

Inhaltliche Metadaten

Fachgebiet

Genre

Abstract

This presentation will demonstrate ways to take most advantage of spatial indexes, SQL constructs, and PostGIS specific functions. For these exercises we'll be using PostGIS 2.1+ and PostgreSQL 9.3+ . We'll demonstrate common cases people often do inefficiently.This presentation demonstrates the following1) Various SQL constructs including ANTI join, LEFT, RIGHT, EXISTS, LATERAL, CASE clauses, aggregates2) What common table expressions (CTEs) are and when to and when not to use them3) We'll demonstrate these concepts in use in a couple of common spatial query problems - e.g. proximity analysis (both geometry and geography), raster analysis and generation, aggregation of data based on various attributes, other correlation queries.

Schlagwörter

FOSS4G 2014 Portland28 / 188

1

28:37

GRASS GIS, Star Trek and old Video Tape Ð a reference case on audiovisual preservation for the OSGeo communities

2

26:05

Cartography in Mapserver from a user's perspective

3

35:38

Mapping in GeoServer with SLD and CSS

4

52:38

MapLoom: A New Web-client With Versioned Editing (GeoGit) Integration

5

26:48

Geo Trio: Putting MapServer, KML, and Google Earth to work at the Province of British Columbia

6

21:43

projections in web browsers are terrible and you should be ashamed of yourself

7

26:31

Introduction to MapGuide

8

29:14

GIS-based modeling with tangible interaction

9

27:33

The state of geospatial WebGL in the browser

10

24:28

3D slippy map with three.js

11

36:45

What's new in Cesium: the open-source alternative for 3D maps

12

26:24

Helping Farmers, Helping the Environment: An Affordable GPS Guidance System for Agricultural Sprayers

13

22:15

Using FOSS Tools, UAVs and Linear Referencing to Better Manage Federal Levee Data

14

21:34

State of the (Geo) Gem

15

51:39

PostGIS Feature Frenzy 2014

16

1:02:01

Mapping for Investigations

17

21:04

Dynamic mapping on the web: building a scalable service for thousands of companies

18

44:28

MapMint: The 100% service-oriented GIS platform

19

25:18

Local Ecological Footprinting Tool (LEFT)

20

28:04

EarthExplorer - On-line Search Tool for USGS Remote Sensing Data

21

26:24

Regional Conservation Strategy Viewer

22

21:10

The unrelenting progress of design in open source

23

31:19

Advanced CartoCSS Techniques

24

27:38

Cartography from code...?

25

27:34

26

24:44

3D-printing with GRASS GIS Ð a work in progress in report

27

21:37

GIS goes 3D : an OpenSource stack

28

23:39

Writing better PostGIS queries

29

31:11

Automated Vehicle Location (AVL)

30

33:26

Open source, open standards and 50 lines of code: A look behind GitHub's GeoJSON rendering and diffing

31

29:01

Distributed Versioned Editing in Action

32

19:20

GeoTools, GeoServer, GeoGit: A Case Study of Use in Utility Field Work

33

28:27

Js.Geo part Deux recap

34

22:47

Server-Side Marker Clustering For Rapid Display of Large Datasets

35

29:40

Accurate polygon search in Lucene Spatial (with performance benefits to boot!)

36

25:39

GeoCouch: A distributed multidimensional index

37

19:48

Connected Cars with PouchDB

38

54:53

State of GeoServer and GeoTools 2014

39

26:18

WPS Benchmarking Session

40

29:23

A GeoNode primer.

41

28:55

The DiscoverTotems Project: Social Curation with Mapping

42

21:38

The Mapossum: A System for Creating, Collecting and Displaying Spatially-Referenced Answers to User-Contributed Questions

43

28:05

MapStory: The next plateau

44

26:07

ZOO-Project 1.4.0: news about the Open WPS Platform

45

18:21

PyWPS - 4 project report

46

31:53

Easy ETL with OGR

47

22:19

Disparate data, technology fiefdoms and 65 pictures of your cat

48

23:31

GeoMOOSE at 10 Years

49

30:00

Quadcopter GIS for less than $700 - Hardware and software to map your local community

50

19:16

An Easy Web Mapping Framework

51

27:27

Open Web Mapping: An educational resource for creating online maps using free and open source software

52

20:28

A User-centered Design for Interactive Masking Capability within Web GIS

53

06:35

Case Study of Brazilian Institute of Environment and Renewable Natural Resources with FOSS GIS

54

27:32

An Automated, Open Source Pipeline for Mass Production of 2 m/px DEMs from Commercial Stereo Imagery

55

25:12

Exposing NASA's Earth Observations

56

29:54

MapServer #ProTips

57

28:03

MapServer Project Update - Introducing Version 7.0

58

28:15

MapCache: Overview of MapServer's tile caching server

59

23:57

GeoExt2 Ð Past, Present and Future

60

22:42

Creating Custom HTML Elements for Maps

61

28:47

Choose your own Adventure - Open Source Spatial on OpenShift

62

26:15

24-hr Latency End to End Data Processing Using Open Source Technologies for the Airborne Snow Observatory

63

28:33

Geolode: the motherlode of geospatial data sources

64

30:08

Compositing a Global Mosaic

65

24:23

Glob3 Mobile (Mobile Map Tools)

66

26:10

From Nottingham to PDX: QGIS 2014 roundup

67

24:56

Case Study: Developing OpenLayers-based Mobile Applications

68

39:07

Web and mobile enterprise applications

69

28:55

Fixing GIS Data Discovery

70

25:44

Open Source Geospatial Production of United States Forest Disturbance Maps from Landsat Time Series

71

29:51

Data.gov/Geoplatform.gov CSW implementation through pycsw and CKAN integration

72

21:48

State of QGIS Server

73

26:46

An automated classification and change detection system for rapid update of land-cover maps of South Africa using Landsat data.

74

29:15

75

26:24

"Sliding" datasets together for more automated map tracing

76

26:57

A Mobile Situated Learning Module using Open Source Geoweb Technology

77

26:30

Educating 21st Century Geospatial Technology Industry Workers with Open Source Software

78

23:17

Extracting geographic data from Wikipedia

79

22:32

ILWIS, the next generation tool framework for GIS and remote sensing

80

24:43

A FOSS4G-Based Geo Connection System for Education and Research

81

26:15

GRASS GIS 7: your reliable geospatial number cruncher

82

15:13

Open for Business Down Under

83

20:47

Seven ways of injecting Python to QGIS

84

29:27

Evaluation of Web Processing Service Frameworks

85

22:37

Adding Phylogenies to QGIS and Lifemapper for Evolutionary Studies of Species Diversity

86

18:22

TileMill and the Tower of Prince Henry, Reversed

87

18:15

Open Source Work-flow for Surface Interpolation with Curvilinear Anisotropy

88

33:05

Mapping Words and Phrases from Geographic Knowledge on the Web

89

21:35

Next Generation of Printed Maps

90

25:10

Köppen-Geiger classifications of paleoclimate model simulations

91

16:56

Mapping with AngularJS

92

29:31

OnEarth: NASA's Boundless Solution to Rapidly Serving Geographic Imagery

93

20:50

Responsive Interactivity: Toward User-centered Adaptive Map Experiences

94

35:35

Spatial-Temporal Prediction of Climate Change Impacts using pyimpute, scikit-learn and GDAL

95

26:27

Open Source Social Media Aggregation and Geolocating for Emergency Management

96

24:34

Inteligeo - Geographic Intelligence System in the Brazilian Federal Police

97

19:59

Creating Charts and Legends for 3D Atlas Maps - A Mashup of D3.js, osgEarth, and the Chromium Embedded Framework

98

25:20

Tracking Slippy Map Analytics

99

22:15

Building development environments using Vagrant

100

41:47

Vert.x - web sockets and async programming for everyone

101

28:07

Tuning Open Source GIS Tools to Support Weather Data / Rapidly Changing Rasters

102

27:16

pyModis: from satellite to GIS maps

103

31:12

A glimpse of FOSS4G in the environmental consulting arena

104

26:36

Big (enough) data and strategies for distributed geoprocessing

105

52:09

Don't Copy Data! Instead, Share it at Web-Scale

106

28:08

The role of geospatial open source (FOSS4G) as a component of hybrid systems

107

40:53

Open Source is People

108

21:58

Spatio-temporal data visualization in GRASS GIS: desktop and web solutions

109

27:41

Geodesign: An Introduction to Design with Geography

110

27:22

Serving high-resolution sptatiotemporal climate data is hard, let's go shopping

111

18:46

OSGeo Incubation

112

32:17

Barriers to FOSS4G Adoption: OSGeo-Live case study

113

28:19

Avoiding Burnout, and Other Essentials of Open Source Self-Care

114

22:08

Open Source Geo Certification

115

26:08

Update on new OGC Standards: GeoPackage, OWS Context & Geosync

116

35:55

Anchoring and PostGIS cure Post-Polygon Stress Disorder

117

13:42

Spatial Temporal Network Web Visualization Techniques

118

24:57

Finding the Where in Big Fuzzy Data

119

18:30

Trusting the Crowd in a Geospatial Crowdsourcing Application

120

21:57

Real-time Scenario Planning with OpenLayers

121

30:55

GeoMesa: Distributed Spatiotemporal Analytics

122

26:22

Adding value to Open Data using Open Source GIS.

123

27:45

Building Open Source Projects in Government Esri Ecosystems

124

26:35

Managing public data on GitHub: Pay no attention to that git behind the curtain

125

08:50

Small town GIS - Leveraging GitHub, QGIS and community members to manage local data

126

28:23

Supporting Open Data with Open Source

127

22:36

Empowering people, popularizing open source, and building a business

128

20:36

GIS in the Browser - The Good Parts

129

22:34

OpenLayers 3: a unique mapping library

130

19:43

Tilez: serving seamless polygons in the browser with TopoJSON and Node.js

131

23:46

Vector tiles for fast custom maps

132

28:56

Getting Started with OpenLayers 3

133

57:42

The Development and Evolution of an open source mapping application within the USG <- Now with More Google Glass

134

26:26

GeoNode for Humanitarian Crisis and Risk Reduction

135

23:37

Scaling for NYC while Tracking Plows

136

25:18

Leaflet + UtfGrids + d3.js = liquid fast, massively scalable interactive web map & data visualization

137

26:37

Client-side versus server-side geoprocessing: Benchmarking the performance of web browsers processing geospatial data using common GIS operations.

138

14:46

CS-Map - coordinate system libraries

139

20:48

Fast Travel Sheds using GTFS Data in GeoTrellis Transit

140

28:24

Mobile vector map rendering with Mapbox tools

141

14:08

A jumpstart for your mobile map app

142

25:22

"Fast Big Data?" A High-Performance System for Creating Global Satellite Image Time Series

143

25:48

Community Health Mapping

144

23:33

GeoScript - A Geospatial Swiss Army Knife

145

31:33

UrbanSim2: Simulating the Connected Metropolis

146

27:50

Assessing the distribution of disease vectors and fruit crop pests from satellite in GRASS GIS 7

147

1:02:53

Making Space for Diverse Mappers

148

57:52

Exploring Openness in Geospatial Education

149

41:41

How Simplicity Will Save GIS

150

52:28

The Toolmaker’s Guide

151

25:44

Government as a Contributing Member of the OpenStreetMap (OSM) Community

152

24:11

An Open Source Approach to Communicating Weather Risks

153

16:21

Shortest Path search in your Database and more with pgRouting

154

20:42

Repurposing OpenTripPlanner for Ride Sharing

155

26:01

A Complete Multi-Modal Carpooling and Route Planning Solution

156

24:18

How to tell stories and engage an audience with maps

157

21:32

158

24:37

Implementing change in OpenStreetMap

159

28:15

Using OpenStreetMap Infrastructure to Collect Data for our National Parks

160

27:53

Raster Data In GeoServer And GeoTools: Achievements, Issues And Future Developments

161

23:34

Using QGIS server

162

26:02

Integrating FOSS4G into an enterprise system for Disaster Management

163

25:26

"Do This, and also That: Integrating Open Source tools into traditional GIS shops"

164

27:26

The Manager's Guide to PostGIS

165

23:33

Gimme some YeSQL ! - and a GIS -

166

33:12

Spatial in Lucene and Solr

167

29:48

Running Your Own Rendering Infrastructure

168

17:31

The best of both worlds: combining geometry and key-value stores using PostGIS and HStore

169

20:44

Crazy data: Using PostGIS to fix errors and handle difficult datasets

170

30:17

Geospatial-Semantic Knowledge Management and Linked Data for Humanitarian Assistance

171

27:36

Fiona and Rasterio: Data Access for Python Programmers and Future Python Programmers

172

20:29

Big size meteorological data processing and mobile displaying system using PostgresSQL and GeoServer

173

25:51

Advanced Security With GeoServer

174

25:23

GeoServer Feature Frenzy 2014

175

31:40

GeoNetwork opensource 3.0

176

25:47

MapJakarta - Enabling civic co-management through GeoSocial Intelligence

177

23:24

OpenSource GIS surveying - water application

178

27:10

Developing Tools for Humanitarian Decision Making

179

21:41

Tileserver on a diet using node.js

180

24:23

Adopting OGC Standards in a Flood Alert System

181

24:57

ScribeUI: MapServer Mapfile management made easy

182

21:01

Creating Map Style & Visibility Rules from Statistics

183

56:25

OSGeoLive: An Overview of the best Geospatial Open Source Software

184

28:56

Implementing basic GeoCouch support in Couchbase Lite

185

29:38

Mending Spatial Data with PostGIS

186

36:45

Introduction to the geospatial goodies in Elasticsearch

187

1:11:54

Open Source Geospatial Foundation - Annual General Meeting

188

39:49

UrbanFootprint: Next-Gen Scenario Planning Tool

Automatisches Abspielen

Sprache

Text

Bild

00:00

ZeichenketteTypentheorieElementargeometrieGeradeMaßerweiterungPolygonProjektive EbeneTabelleVerschlingungPunktInformationsspeicherungWeb-SeiteWeb SiteSchlüsselverwaltungSchreiben <Datenverarbeitung>Mapping <Computergraphik>BenutzerbeteiligungElektronisches MarketingSoftwareentwickler

01:56

AnalysisDatentypOrdnung <Mathematik>SoftwareTransformation <Mathematik>Automatische IndexierungElementargeometrieForcingFunktionalGeradeMereologieMultiplikationPolygonTabelleZählenFlächeninhaltAbfrageServerAutomatische HandlungsplanungAbstandCASE <Informatik>Strategisches SpielÄußere Algebra eines ModulsQuaderCAN-BusUmwandlungsenthalpieAliasingSoundverarbeitungStatechartMini-DiscURLStandardabweichungMapping <Computergraphik>BenutzerbeteiligungElektronisches MarketingEins

09:38

TransportproblemFunktion <Mathematik>Automatische IndexierungElementargeometrieForcingFunktionalGeradeMereologieProjektive EbeneRotationsellipsoidTabelleTermQuick-SortFlächeninhaltAbfrageKonstanteReelle ZahlEinflussgrößeAutomatische HandlungsplanungNichtlinearer OperatorAbstandCASE <Informatik>NormalvektorLuenberger-BeobachterÄußere Algebra eines ModulsAdditionPunktBetrag <Mathematik>Schiefe WahrscheinlichkeitsverteilungQuaderSchnittmengeExpandierender GraphShape <Informatik>Reverse EngineeringAuswahlverfahrenDateiformatArithmetischer AusdruckPuffer <Netzplantechnik>MeterTrennschärfe <Statistik>DifferenteRechenbuchEinfache GenauigkeitKurvenanpassungURLStandardabweichungZwei

17:20

DämpfungDatensatzTransformation <Mathematik>Ganze ZahlAutomatische IndexierungFunktionalGarbentheorieLoopPhysikalisches SystemProjektive EbeneVerschlingungFlächeninhaltVersionsverwaltungCASE <Informatik>PunktPixelBitmap-GraphikGrößenordnungUmwandlungsenthalpieAuflösung <Mathematik>MultiplikationsoperatorURLTesselationFigurierte ZahlVorlesung/Konferenz

21:07

MAPAutomatische IndexierungElementargeometrieDivergente ReiheFunktionalMereologiePolygonRechenschieberNichtlinearer OperatorAbstandCASE <Informatik>PunktBitmap-GraphikRadiusMixed RealityWrapper <Programmierung>EinsVorlesung/KonferenzBesprechung/Interview

Transkript: Englisch(automatisch erzeugt)

00:00

Okay. How many people are new to PostGIS here? Okay. All right. So I'll try to be easy. I'm Regina O'Bay and I'm one of the developers on the PostGIS team and also the Project Steering Committee. And I'm also a co-author of a couple of books with my husband Leo Sue who's over there. And the links here on the

00:23

on the first page are links to sites that we manage which have examples and also links to the books that we've written. For this talk I'm going to show a couple of examples of things that I see people commonly do with PostGIS and how they do them wrong and the better way of doing it. So for these examples I'm

00:46

going to use some OpenStreetMap data which I pulled with this command. And I'm also going to use the extension called HSTOR. How many people are familiar with HSTOR? Okay. So I don't have to, I'll explain a little bit. It's

01:03

basically a key value store that allows you to put random keys and associated values with it in a single column. So it's kind of like the schemaless design type concept. So those slides, so after you load awesome data you end up with four tables that look as presented here. You end up

01:23

with a point line polygon roads. And I'm going to focus mostly on the point one but the the lessons here apply to if you're dealing with polygons and line strings and other things. So the first thing that people do when that

01:41

they often want to do is they want to change the data type. Like if you want to change it from geometry to geography because you decided you don't want to use like the Web Marketer projection which is commonly used for web mapping. How many people are familiar with Web Marketer? Okay. So I don't need to explain that. So if you had let's say a Web Marketer and you

02:05

decided that you wanted to use the geography type because you're going to do mostly like proximity analysis. You'd use a command. You can do it in one step. You just basically do alter. You transform to WGS 84 which has the ID

02:22

4326 and that's about it. But people usually drop the column. Well they usually add a new column, update it and then drop it. But you don't really need to do that if you just want to convert an existing data type. I mean an existing column. Sometimes you want to do both web mapping and very fine

02:46

tuned proximity analysis. In which case you might want to keep both a geometry column and a geography column. So in that case you'd want to create a new column as shown here. So we add a geography column and then we update it

03:05

with the data that we loaded from Web Marketer by transforming. And then we create an index, a spatial index. The spatial index is very important so you don't want to forget that because your queries will be really slow without that. And so if you look at the geography columns table then you should see the

03:24

new tables that you created. If you notice here when I brought in the polygon it was just geometry but with the ST multi I kind of forced all the single polygons to multi polygon. And I specified it as polygon so now I get a

03:41

clean multi polygon type. So the first thing that people want to do with PostGIS is to find how close things are to a specific location. So like let's say you want to find all the restaurants within one kilometer of a particular location. The standard way that people do this is to use the ST

04:04

distance function. And the problem with that is that that doesn't take advantage of spatial indexes. So this is kind of like the brute force way of doing it. In SQL server this actually works and can use a spatial index but not in PostGIS. So the alternative way, the more PostGIS specific

04:27

way is to use the STD within function. And so here it is the same thing. So you basically ask for what things are within 1,000 meters and that

04:44

are restaurants using this hstore query. And this basically does a bounding box check and expand bounding box check and then it does a short circuit distance. So it doesn't actually have to compute the distance so it works

05:01

much faster. And if you just want a count of things you don't really want to just pull the data and then count it. You should just do a count. It's way faster because you don't get the effect of having to pull all the data out of disk. So yeah the basic lesson here is SD distance can't

05:28

use a spatial index, STD within can. And this is kind of a textual plan to demonstrate using a geometric network like. Okay in that case you probably

05:59

want to use PG routing because that would have a concept of a network. So

06:03

this one just does straight line of sight distance but it's really the minimum distance within the geometry. So if it's a polygon then it's the minimum distance within the polygon. So yeah I mean you could do kind of linear referencing and do close this point but yeah you'd be better off with PG routing on that. Okay so one of the common things that people do with

06:27

OpenStreetMap data is they query both PostGIS and the hstore kind of like the examples that I already showed. And even if you have an index on both the geometry and the tags column, the hstore column, it

06:43

sometimes prefers one column, one spatial index over another. So you get you get the geometry being used but not the tags index. And sometimes it uses both with what's called a bitmap scan strategy. So people usually just create two GIST indexes, one for the geometry and one for the tags. But the

07:07

alternative is you can actually create a single index that has both the geometry and the hstore. And this applies to other data types that support the GIST index type. They can also use, like if you have an array you could

07:23

have geometry in an array. So here's an example of how you create a compound index. For this I drop the original ones because you don't need the other two if you're going to have a single one that contains both. I mean

07:41

there is a downside to this. The downside is this index is fatter than the two separate. So if you just always query them separately you don't necessarily want to do that. Another common thing you might want to do is you don't want to just find the distance but you want to find everything.

08:05

You also want to know the actual distance of the things that are within the area. So in that case you would use both STD within and ST distance. So you'd use STD within in the where clause and then you'd use ST

08:23

distance to sort. And you can actually use the alias name that you use in the select and the order. You don't have to repeat it. So that's why I have dist here because I define it in the select part. Now some people use

08:40

WebMarketer for proximity analysis and that's fraught with some problems. The first problem is if you use WebMarketer to say what's within one kilometer, like what restaurants are within one kilometer, it's not really going to give you what restaurants are within one kilometer. So for example this

09:02

is the same query with the WebMarketer geometry column and you notice that you only get one answer whereas in the geography we got five or six answers. And that's just with the way WebMarketer skews the world. So you could still use WebMarketer if you overshoot by, well it depends how much

09:24

where you are in the world, how much you have to overshoot. But the idea is you want to overshoot and then you want to do a true distance check you know by casting to geography. So the idea behind that is the first part of your query will use the spatial index of the way column. The second part

09:44

will then take that set and then filter it down. So you have a fewer set to like do a real check on. The other alternative is you create a functional index that is geography. With some people it's kind of a iffy thing to do

10:03

but it seems to improve speed and then you don't have to do this kind of hokey expand. You can just cast to geography directly and just use it and your index will kick in because you have a geography index. The other way

10:20

I've seen people do is to what I call a mutant geography Mercator buffer. And the idea behind that is that you buffer in geography because that will give you a true one meter buffer and then you convert that to Mercator. And because it kind of skews things in more or less the same way in the same area it more or less works. So you end up with the same answer as if

10:46

you were doing and you could still take advantage of the Mercator spatial index. The next common thing that people ask for is what are the end closest things to me. And for that you don't really care about how far

11:03

those end closest things are. You just want to know like what are the five closest restaurants to me. I don't care how far they are. So for that the brute force way is to just do a distance check across all your geometries in your table and then sort by distance and then take the N which is if you have a large table is really slow. For a small table it's you

11:27

know reasonable. So the way to get to use a spatial index in that case is to use the KNN operators, geometry operators which were introduced in Postures 2.0. But these are only bounding box they don't actually work

11:43

against the geometry. So for points like if you're using a measure preserving projection it's absolute. For anything else it's not. So you have the same issue of the other thing is that it only works in the order by clause. It doesn't work in the where. So here's an example and so this is using what's

12:12

called the common table expression where we define a subquery called s1 and this one we see is using the operator and then our final so we

12:26

overshoot that and then we take the top five based on the real distance function the base real distance calculation. And here's the explained plan to demonstrate that it is using an index. So I already covered that if you have

12:45

point data it's more or less right unless if you're using something like geometry in WGS 84 then it's it's not right. So here we see our answers and

13:03

they're much faster which I don't have a much faster part. Unfortunately KNN doesn't work for geography at least not yet. Hopefully in Postures 2.2 we'll have KNN for geography but not right now. But you could use geometry with geography and it's kind of like the reverse of what we talked about

13:23

where instead of a just the geography you use a geometry spatial index in addition. And so when you want to use the geometry spatial index you just cast your geography to geometry and it will use the index because it's functional

13:41

on that. Just it's probably not clear but this and this are equivalent it's just that for whatever reason I can't throw the standard cast syntax into a create index I have to use a function. But they resolve to the same thing. And so

14:02

here you see it's much faster like before we had 29 seconds now we're down to three seconds. And it's it's almost as fast as the geometry one. Now in terms of using KNN if you use geometry what we call Platt-Carré so basically

14:20

geometry in WGS 84 it's actually not as good as Mercator from what I've observed. So you always have to do that double check. So you see here that the answers we get are different. They're not the right answers. But if we if we

14:40

you but when we use Mercator it is right. Sometimes you want to get you don't have a single location like let's say you want to know the closest the closest transportation point to a to like all the restaurants in your set. Then you don't have a single point of reference. So KNN doesn't normally kick

15:05

in. So the way to get it to kick in is to use the lateral clause which was introduced in Postgres 9.3. How many people are familiar with lateral? Okay just one. So basically the idea behind the lateral clause is that it allows you

15:20

to do a sub select in the from clause. So here's here's a lateral clause. So the lateral part is this. And the important thing to note is that the P dot way comes from this table. Normally you can't get away with that. So this basically treats P dot way as a constant. So in this case my index my

15:44

spatial index can kick in. But normally it wouldn't be able to. I'd have to have an absolute constant here. So that allows me to basically say for each for each restaurant I mean for each Japanese restaurant that I want to go to

16:01

or each Japanese restaurant in my whole table what's the closest to transportation locations to it. One of the cool features that was introduced in Postgres 2.1 is the segment ties function for geography. And how is that different from the segment ties for geometry? Geometry

16:22

even if you have WGS 84 it treats it like a Cartesian plane. So when you segment ties it's still linear. It's still you know so it's not right. Like if you were to plot it on a curve it's not right. But the geography considers the the spheroid shape and it actually segmentizes across the you know

16:43

along the spheroid. So you can throw this on you know you can convert this to WKT. And we also have a new function in 2.2 which isn't released yet which which will output to the Google encoded format. So it makes it

17:01

easy to just throw the the the segment ties on the on the on a Google map. So here's kind of the difference which are still from the boundless docks. This is the geometry. You see how that's a straight line. But if you use geography then you get the true curve behavior. So now I'm gonna get into some

17:25

Postgres specific stuff. So how many people use Postgres raster here? Oh okay so more than I thought. Okay that's good. So for this I just loaded some elevation data and aerial data. I'm gonna skip those commands. The first thing you want

17:47

to do when you're using raster is you usually tile the data because it you know rasters are big. So you don't want stuff like a you know a 10 gigabyte raster in a single column. It becomes kind of hard to query. So you chop it up into tiles. But then if you're trying to get an area of interest and

18:03

it goes across tiles you want to first you want to carve out that area. So you want to figure out what tiles fit in your area of interest. And what people mistakenly do is they first union and then they clip. So if your tiles are

18:20

relatively big this is really slow. It's actually much faster if you clip first and then you union. It's actually I think in many cases orders of magnitude but it depends on how big your tiles are. The bigger your tiles the more efficient this is. So here's that's just an example. But I forgot to put the

18:47

timings in. I'm sorry about that. And one function I really love using is the ST resize function which allows you to basically take a lower resolution without any consideration of the spatial reference system of it. But the

19:01

ST resize function has a lot of it's very overloaded. So it has one variant that takes integers which is the pixel size within height. And then it has one that takes percentages and then it has another one it takes text. So it's easy to fall into the wrong loop. So this is what often happens when

19:22

people coming to raster they do this. Can anybody figure out what's wrong with this? I'll point it at this section that is a problem. Nobody knows what's wrong with this. Yes. What. Oh yeah that's it. That's it. You got it. OK. So so what happens in this case is it falls into the float version

19:44

which thinks that you're talking about percentages and expects everything to be lower than one. But it's not lower than one. So you get this knowing your percentages must be value greater because it doesn't. Well the other thing that Leo complained about is it doesn't allow you to go above 100 percent. That kind of pissed him off. But so he's like this

20:05

function is stupid. So so what you have to do is you have to ensure that both are an integer. If you want pixels you have to cast it to integer so it gets treated as an integer. And then you get your nice

20:20

picture. And then the other lesson which is not really raster specific but when you're working with any spatial data is you always want to transform the fewer records. So in that case it's usually your location of interest and you want to transform it to what you have indexed unless

20:40

if you have a functional index on the transform. So in this case we want to transform our elevation which is in WGS 84 to the same projections our raster. And we get the elevation value. OK. So that's about it. And here's the link to if you want to buy any of our books. Any questions.

21:13

I was just wondering if you had these slides available. Oh yeah. I'll post them on. I'll post them on our posts. Yes. We're also giving a talk in Chicago. We're giving some tutorials so we also post those

21:24

slides. One is for geometry geography and the other ones for raster PG routing and topology. So we'll post those slides as well. OK. So just remember that site. You want to buy books you want to read our stuff our slides. They're there. Any other questions. Yeah. You were talking

21:45

about KNN and using that to see what points are in geometries or geographies. I'm often looking at whether this series of in this series of geometries and this other series of geometries which ones

22:00

overlap. Is there a way. Could you use KNN to do something with that or in that case you just use the intersects function. You wouldn't really need over you wouldn't use KNN for that. Oh no. I mean the faster way is you can simplify but it's not like absolutely right. But if I have something

22:22

like a polygon that's huge you know that's got like a hundred thousand points I usually simplify it first and then you can actually create a spatial index on the simplified. So so that ends up giving you faster answers too. Or you can do like actually you know you don't you don't create the spatial index on the simplified you create it. You basically you

22:45

can write you can write a wrapper function that basically simulates the STD within but it does like a simplification to the level that you want to the accuracy that you want. So you get faster but it's not absolutely accurate. I have a tricky question about KNN. Have you found a

23:02

way to make to use KNN to get points within a distance that can beat D within. That can what. So to you use the KNN operator. Yeah. To get points within within a radius so you can consistently beat D within. No because I

23:21

don't think those two mix well together. So yeah I would use STD within there and then give up the KNN part.