Logo TIB AV-Portal Logo TIB AV-Portal

Performance Analysis of MongoDB Vs. PostGIS/PostGreSQL Databases For Line Intersection and Point Containment Spatial Queries.

Video in TIB AV-Portal: Performance Analysis of MongoDB Vs. PostGIS/PostGreSQL Databases For Line Intersection and Point Containment Spatial Queries.

Formal Metadata

Performance Analysis of MongoDB Vs. PostGIS/PostGreSQL Databases For Line Intersection and Point Containment Spatial Queries.
Title of Series
CC Attribution - NonCommercial - ShareAlike 3.0 Germany:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal and non-commercial purpose as long as the work is attributed to the author in the manner specified by the author or licensor and the work or content is shared also in adapted form only under the conditions of this license.
Release Date
Production Year
Production Place
Seoul, South Korea

Content Metadata

Subject Area
Relational databases have been around for a long time and Spatial databases have exploited this feature for close to two decades. The recent past has seen the development of NoSQL on- relational databases, which are now being adopted for spatial object storage and handling too. And this is gaining ground in the context of increased shift towards GeoSpatial Web Services on both the Web and mobile platforms especially in the usercentric services, where there is a need to improve the query response time. While SQL databases face scalability and agility challenges and fail to take the advantage of the cheap memory and processing power available these days, NoSQL databases can handle the rise in the data storage and frequency at which it is accessed and processed which are essential features needed in geospatial scenarios, which do not deal with a fixed schema(geometry) and fixed data size.This paper attempts to evaluate the performance of an existing NoSQL database 'MongoDB' with its inbuilt spatial functions with that of an SQL database with spatial extension 'PostGIS' for two primitive spatial problems LineIntersection and Point Containment problem, across a range of datasets, with varying features counts. For LineIntersection function, the dataset consisted of two independent layers of horizontal lines and vertical lines with incremental lengths and their size varied from ten lines to ten thousand lines in each layer and another dataset with two layers, one of random lines of variable size and shape and another layer of a single line which is intersecting many lines of layer1. For Point Containment problem, the dataset consists of two layers, one of polygons in a space of different shape and size and another layer of random points in the space, some inside the polygons and some outside. All the data in the analysis was processed In memory and no secondary memory was used. Initial results suggest that MongoDB performs better by an average factor of 25x for Line Intersection Problem and 10x for Point Containment Problem which increases exponentially as the data size increases in both indexed and non indexed operations. Given these results NoSQL databases may be better suited for simultaneous multipleuser query systems including WebGIS and mobileGIS. Further studies are required to understand the full potential of NoSQL databases across various geometries and spatial query types.
Computer animation Information Decision theory Database Right angle
Magnetic-core memory State of matter Length View (database) Multiplication sign Decision theory Sheaf (mathematics) Numbering scheme Open set Order of magnitude Dimensional analysis Mathematics Different (Kate Ryan album) Special functions Data conversion Physical system Electric generator File format Relational database Infinity Funktionalanalysis Category of being Message passing Data storage device Website Right angle Geometry Point (geometry) Open source Student's t-test Scalability Number Goodness of fit Energy level Form (programming) Noise (electronics) Information Cellular automaton Interface (computing) Database Line (geometry) Cartesian coordinate system Vector potential Word Computer animation Query language Customer relationship management Statement (computer science) Video game Natural language Object (grammar)
Decision theory View (database) 1 (number) Sheaf (mathematics) Inference Semiconductor memory Different (Kate Ryan album) Square number Videoconferencing Special functions Position operator Stability theory Physical system Social class Area Decision tree learning Relational database Sound effect Thermal expansion Bit Funktionalanalysis Exterior algebra Process (computing) Spacetime Point (geometry) Functional (mathematics) Divisor Distance Inequality (mathematics) Event horizon Scalability Machine vision Template (C++) Number Revision control Population density Average Operator (mathematics) Energy level Best, worst and average case Condition number Noise (electronics) Pairwise comparison Standard deviation Focus (optics) Dependent and independent variables Graph (mathematics) Polygon Database Line (geometry) Computer animation Query language Personal digital assistant Customer relationship management Musical ensemble Family Library (computing)
Point (geometry) State of matter Closed set Multiplication sign Execution unit Moment (mathematics) Planning Database Element (mathematics) Subject indexing Sparse matrix Computer animation Different (Kate Ryan album) Query language Personal digital assistant Statement (computer science) Spacetime
but money everyone myself fighters from the
intervention issued of information technology has arrived in here thank you yes so the adopting perform analysis of military and those yeah it's uh databases for decisions and then continue inquiries so 1st of all right
we need an endemic you learn from them and right and who did play this so on deformities suppose that I'm incorporate manner I don't have the formation life at nation but I want to conform with the format of this with so highway I I need to really really maps for that I was working inside that in the news I Standard likely so on the form itself can read about the the the scale of the cell is really really that can not be deployed on the phone so run on me look forward it says that can be deployed on quality in when the race it's just noise to databases so in the in the past that utilizes is mean but I might be useful this is especially the reason that may be used on the romance level was that that is moved on the little students have good potential to store manage Willie's realized it disagrees and but especially a student faces scalability and ingenuity channels it and the most important thing is especially the applications that do not have a thick skin maybe linking standard that we use XML the state of the schema of the data the the changes the scheme offers offered to make the changes the other points that have made the land use and that led to associated but what about the information we we can have more information about about those points late of the only global ordinary users for example if it's of wooden and everything Slovenian different it's what they would think I mean if you have a sentence we need to make joint everything right out we have as you might as databases such as noise due to devices that can include all this information within the same so that we have less joints and lists a number of statements not because this gives an opening open source so that when the legislation database system was yeah is site you what was was is you and that adds gematria pass objects to it the main function of those yes can be during the fight activities management the management of the didn't conversion from a genetic database tools generally a good idates information repeated from the statements that combines and between the geometries and the query generation non-spatial databases S. and noise due uh view of noise databases and nice candidate is nondeletion did us because that would be a potentially to store and manage last queries the thing is that we need to improve quality time and when the biggest is didn't really big so we can't apply and so such as out the spatial databases it is being objected and added everything sports that and spatial applications deviate from uh with becomes a core wording elution of schema and interface as discussed before so we'll will be using but we have used and then with the magnitude of a system that it has all it has already been implemented solving was use up some of the special functions Lake Lanier decision bond and it meant that it has had at 1 and then at infinity properties of this is that it uses you have these objects to store to geometries so that you do some objects in duties of this against what latitudes and reduced as well as we can so we can store other tags is it but in imported onto the fight back and looking at the duties is on this on and then would he be like normal matter who might have support for arteries but in future the most probably being up the most probably a better ordered yes so embedded in between the the the the the reasons but it's still not not designed for a distributed systems and noise skill Mr. databases can this is spread over many so is there a the estimated is mostly out with so I would 1st up to date as late the death of what a banker lately but I'm such a data is and then others who did before but in noise get as it a schema-less data this week and I might eventually be stored in the same column you do not have to make a the problem but a column with infinite dimensions you can
have lines and boiling in supporting the single no so it doesn't unless you have a promise and conducts especially those especially where he's been escaping from data in a really that we can use it for obtaining of in enveloping as discussed in what it is that of the found all that we really need really specific special functions and or or like needed and and sort of of floating firstly the media in that uh in that interest and then we use letter that section where queries to find the relation between the would be in the length and the other geometries so
horrible compatible databases of making use dimensionality datasets and databases to compare but are not used we have used you to users and add useful functions pointing appalling function and Lang intersection function winding appalling and functions but they don't that uh point later than that given to 3 and 9 decision function as we all know is that this is the point of intersection between pituitaries how the beautiful generated by the generated synthetic datasets for all the cases of and that best what I'm from this this scenarios a worst-case scenarios and business and as well of online intersection queries up we have on and would be delayed from into millions of lays then randomly adding those lines in the space and then we systematically areas related industries and that and we can relate to perform some of the databases and 4 point in body and the spectral problem we use a point and square of 4 firstly 1 point of being ones with inequality and then points within inequality and and so on and so forth we did it for the random guesses as well that millions of points and millions of the but since all the data was used all responses using in memory and secondary memory was use everything was running on the family so this is the performance start from the uh this is the performance of well noise you and this is for the US it has in the for the missing money because of what it sees it was yes on this exponentially higher then we also want to avoid level all of this on and he on was the at at but that most of the density of the resistances expansion and you didn't know any of these industry that without having to publish understanding that didn't have the position and the abolition of indecision and the condition was the political vote in next we can see that the performance of the expression is a slightly bigger than and its relation that backgrounds difference between the best you and skin is this was the big the not so much in France because as it in as we see the same ended up in the event of England Efficient noticing the Beaverton the assignment of and we found that the average distance increases because yes all the skills standard as it's about the demolitions moves on and the that as the size increases its some dense foods so so in important was the event the board because that's where that was of most of is exponentially as the size of the datasets increased when nobody stable and cement distinction that isn't it isn't going to be around a bit about how it is that the defendant In the factor increases as the dataset increases and well indexed and non-indexed operations uh other than that it's mostly can be stated but of simultaneous when the user query system including videos in the US mission and mentioned it and so this would do to the user and system it is the potentially to wind down so so was really good the competition but now the radical leprosy is standing at the scale databases and Mary and so was also should be moved to node labeled data users such as uh escalated amuses is at least 1 event and on so we use the focus that a certain event red in future we are planning on spending less than 2 brothers vision where functions as the and and that and with some such as something
In questions please without the 1 what it's not 1 of of the latest versions of this graphs online for a while I think they have introduced as a non square communities in 1 most just did you start of those in and effects on the ability of those but they they depend on it and and try to kinds of the the comparison and post squares and using a large database last dataset that's a definition of large or not this is the 1st to air the view after we used in the event of a bit of a bit of a person from section and fair opened in the beleaguered lousy at the point in in the 1st layer is is ranked and then use that there's as a template and the standard and and the space and that led to the number of of bands and the number of patients in both of these in both the datasets and the number of intersections and apparent that within a and the people in the paper we can the the number of the intersection of number of points within a polygon rendering million and then these intersections and the engraving billion that was was last datasets I and you are talking about class sections of the sentence and would you define that those graphs on Don's uh this past fall below the largest dataset federal managing among depends on the case that the book 3 cases for the focus is 1 the systematically expected when the distance between all the points with increasing sequentially they think it is the distance between all the points exponentially and then in the Pope regarded begin and so there was past cases they were the quenching is and they would expand introduces of now a preposterous library has gone on for a special functions compared to all can come from the current understanding that is is a alternatives to implement a special functions you just mention the intersection in the you the plant contains uh is is the ways that you can associate to the french special special functions on yet a new food under the with a more restricted areas of interest and that's the spatial that functions and the idea of the dopamine we did the uh and did they did we discuss the possibility of indicating that special functions and I don't know but I police do I please do the and the melody and we're still in the process of including the functions to all of these the inferences this and the not in this this paper and also in the history of the business of the another thing and so I don't know that as soon as I did that mentality and and we some of the regression trees of the fact that on I cuisinary
understanding extensive close to each other of the nineties involves the the the difference between indexed and non-indexed was then that of the use of the but the Paducah predicting lobbyist and that detainees being 4 laps to a single unit this is the reason for that not of and then we may and hence the experience this is because in the moment in sparse cases is then it's in and an intersection use of it in the the 1st the him the for the intersection of plane intersection written queries for In other questions the the insurance agent of the consignee against in to index the spatial data snow of which are the things which are special indexing techniques used to to index is not the way for the it did in India how are the former to change if you had spatial and temporal query and temporary so if you the and statement in the I yeah so you get this place as a sport certain point in time as in space so basically I combine not just a spatial query but I couldn't find the elements as well over the years new database and the other the more I like this did do staple food even I data uh elite and the and the states it's assisted here and you think Christians in the notes and improvements in