
Geosocial Big Data Analysis Using Python and FOSS4G with the Case Study of Korean Data


Formal Metadata

Title
Geosocial Big Data Analysis Using Python and FOSS4G with the Case Study of Korean Data
Series Title
Number of Parts
183
Author
License
CC Attribution - NonCommercial - ShareAlike 3.0 Germany:
You may use, modify, and reproduce the work or its content for any legal and non-commercial purpose, and distribute and make it publicly available in unchanged or modified form, provided that you credit the author/rights holder in the manner specified by them and pass on the work or this content, including in modified form, only under the terms of this license.
Identifiers
Publisher
Publication Year
Language
Producer
Production Year: 2015
Production Place: Seoul, South Korea

Content Metadata

Subject Area
Genre
Abstract
Nowadays, there is much research on the analysis of geosocial big data, such as geotweets and Foursquare venues, and OSS (Open Source Software) plays an important role in this. Analyzing geosocial big data involves several different steps, such as data collection, data parsing, data conversion, statistical analysis, visualization, and database management. An integrated system architecture and a compatible analysis environment therefore play a key role in acquiring relevant analysis results. The Python programming language supports an interoperable analysis environment across the various software functions and enables geosocial big data to be processed on an integrated platform. FOSS4G provides the software environment for geovisualization and management of the collected data. In this study, the process of geosocial big data analysis is introduced with a case study of geotweets and Foursquare venues, and the analysis results are presented with the case study of Korean data. For this study, the Python API libraries for Twitter (Tweepy) and Foursquare (pyfoursquare) were used to collect the geosocial data; Pandas and Simplejson were used to parse and extract the valid data; and GDAL and PySAL were used to convert and analyze the GIS data. PyTagCloud and WordCloud were used to visualize the qualitative text. MongoDB was used to store the collected dataset, and QGIS was applied for the geovisualization.
Transcript: English (auto-generated)
This is Young Hong, from Namseoul University. Today my topic is geosocial big data analysis using Python and FOSS4G, with a case study of Korean data. As you know, Twitter and Facebook are very popular nowadays; they are among the killer applications for the smartphone, so everyone is using this kind of application. And because the smartphone has GPS functionality, the data nowadays comes with coordinates. Social data that has a geotag is called geosocial data, and geotweets from Twitter and venues from Foursquare are the most famous examples. Until now, there has been a lot of research on geosocial data.
The first is by Fujita Hideyuki; his topic is a geotagged tweet collection and visualization system. As he is a computation researcher, he focused more on the computation and data collection methods, with some visualization. The second one takes a different approach; its topic is word-cloud-based qualitative geovisualization of geotweets. In this case, the author used a qualitative approach with content analysis, so he focused more on the content: what are people talking about in their tweets? Fujita used almost three million geotweets for his analysis, but in this second case only about 14,000 tweets were used. Another one is on spatial, temporal, and socioeconomic patterns in the use of Twitter and Flickr. Here the authors used statistical analysis methods, trying to find geospatial relationships between the tweeted content and geodemographic variables. And the last one is the geography of happiness, which used sentiment analysis.
In that case, they used computational linguistic approaches. So many different researchers use their own approaches for geotweet analysis; we can say that geotweet analysis has a multidisciplinary aspect. Social data itself arguably belongs to sociology, art and science, journalism, and media. However, the data is stored on Facebook or Twitter, so to access it we need some web programming. After collecting the data, we have a data management issue. Analyzing the data involves different approaches, like qualitative analysis for linguists and quantitative analysis like spatial statistics. And the last issue is data visualization: how to map and present these data. So the challenges of this research are that there are many different data types and formats, like tweets, Foursquare venues, and Facebook posts, and that each research group has its own analysis environment and methods: different language systems, software packages, databases, statistical analysis methods, and geovisualization methods. Moreover, as I mentioned, the analyses belong to very different domains: sociology, geography, statistics, linguistics.
So my question is: this kind of study requires interdisciplinary cooperation, so is there any way to integrate these methods? I tried to find the solution in FOSS4G and Python, because Python provides an integrated environment and software libraries. As you know, Python is free and open source, and there are various scientific distributions like WinPython, Anaconda, and Enthought Python. Python provides a lot of different libraries; if you go to PyPI, you can currently find over 66,000 packages there. And lastly, Python offers a very simple coding environment: easy to learn, easy to code, and very clear and readable. So my research purpose is to introduce an integrated platform to analyze this data using Python and FOSS4G, covering data collection, data management, data analysis with qualitative and quantitative methods, sentiment analysis, and geovisualization. And I will present case studies with Korean geosocial data, like the geosocial data distribution, the spatial pattern of Foursquare venues, and some sentiment analysis of Korean geotweets. Actually, I have been doing this for around two years now. In the beginning the workflow was a very generic pattern: there is social media data, and we access it using the API.
We get the file in JSON format. Converting it to an Excel file, we can create charts or some descriptive statistics; and if we have the coordinates, we can convert it to a shapefile and make a map with a desktop GIS like QGIS. A minimal sketch of this conversion step follows below.
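As a rough illustration of that generic workflow, here is a minimal sketch (not the author's actual script) that parses collected geotweets, assumed to be stored one JSON object per line, with simplejson and pandas, and writes the coordinates to a point shapefile with GDAL/OGR; the file names and field choices are placeholders.

```python
# Minimal sketch: line-delimited geotweet JSON -> pandas -> shapefile.
import simplejson as json
import pandas as pd
from osgeo import ogr, osr

rows = []
with open('geotweets.json') as f:              # placeholder input file
    for line in f:
        t = json.loads(line)
        if t.get('coordinates'):               # keep only geotagged tweets
            lon, lat = t['coordinates']['coordinates']
            rows.append({'id': t['id_str'], 'text': t['text'],
                         'created': t['created_at'], 'lon': lon, 'lat': lat})

df = pd.DataFrame(rows)
print(df.describe())                           # quick descriptive statistics

drv = ogr.GetDriverByName('ESRI Shapefile')    # write points for QGIS
ds = drv.CreateDataSource('geotweets.shp')
srs = osr.SpatialReference()
srs.ImportFromEPSG(4326)                       # WGS84 lon/lat
layer = ds.CreateLayer('geotweets', srs, ogr.wkbPoint)
layer.CreateField(ogr.FieldDefn('id', ogr.OFTString))
for _, r in df.iterrows():
    feat = ogr.Feature(layer.GetLayerDefn())
    feat.SetField('id', str(r['id']))
    pt = ogr.Geometry(ogr.wkbPoint)
    pt.AddPoint(float(r['lon']), float(r['lat']))
    feat.SetGeometry(pt)
    layer.CreateFeature(feat)
    feat = None
ds = None                                      # flush and close the file
```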
For data collection, I use the Twitter streaming API with the Tweepy library. But it usually has a limitation per user; there is a rate limit, something like 550 calls per hour for one user. For this reason, I added multiple user accounts: when one user ID reaches the limit, I switch users and collect more data. Even so, not that much data comes in, because as you know, geotweets are just about 1% of the total collected tweets. Nowadays it has increased a little bit, to maybe 5 to 7%. A sketch of this rotating collection follows below.
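A minimal sketch of such a rotating collector, written against the Tweepy 3.x-style streaming API; the credential list and the Korean bounding box are placeholders, and since the slides do not show the rotation logic, this is only an assumption of how it could look.

```python
# Minimal sketch: stream geotweets with Tweepy, rotating credentials
# to the next user account when one account gets rate limited.
import tweepy

CREDENTIALS = [  # placeholder (consumer key/secret, access token/secret)
    ('CK1', 'CS1', 'AT1', 'AS1'),
    ('CK2', 'CS2', 'AT2', 'AS2'),
]
KOREA_BBOX = [124.5, 33.0, 131.9, 38.6]        # lon/lat bounding box

class GeotweetListener(tweepy.StreamListener):
    def on_status(self, status):
        if status.coordinates:                 # keep only geotagged tweets
            print(status.id_str, status.coordinates['coordinates'])

    def on_error(self, status_code):
        # HTTP 420 means the streaming connection is rate limited;
        # returning False disconnects so we can switch accounts.
        return status_code != 420

for ck, cs, at, ats in CREDENTIALS:            # switch user on rate limit
    auth = tweepy.OAuthHandler(ck, cs)
    auth.set_access_token(at, ats)
    stream = tweepy.Stream(auth, GeotweetListener())
    try:
        stream.filter(locations=KOREA_BBOX)
    except Exception:
        continue                               # try the next credential set
```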
From a tweet we get the tweet text, which can be used for qualitative analysis like text mining or sentiment analysis. Using the tweet ID, we can analyze behavioral features of the tweet users, like heavy users. A tweet also has location coordinates, so we can make maps or geovisualizations and do spatial analysis. And it has a date and time, so we can also apply temporal analysis. Until now I have done two pieces of research.
One is a spatial analysis of location-based social networks in Seoul, and the second is the spatial distribution of Korean geotweets. The first is a study evaluating location-based social network data using GIS. I collected the Foursquare venues in Seoul using the Python Foursquare API, created a heat map and cluster analysis like hotspots, and applied a GWR (geographically weighted regression) model, which I will show in a moment. A sketch of the venue collection follows below.
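The talk used a Python Foursquare API wrapper; as a hedged illustration, here is a minimal venue search against the public Foursquare v2 REST endpoint using requests, with placeholder credentials and a Seoul coordinate.

```python
# Minimal sketch: search Foursquare venues around a point in Seoul.
import requests

params = {
    'client_id': 'CLIENT_ID',            # placeholder credentials
    'client_secret': 'CLIENT_SECRET',
    'v': '20150801',                     # API version date
    'll': '37.5665,126.9780',            # Seoul city hall, lat,lon
    'intent': 'browse',
    'radius': 1000,                      # metres
    'limit': 50,
}
resp = requests.get('https://api.foursquare.com/v2/venues/search',
                    params=params)
for venue in resp.json()['response']['venues']:
    cats = [c['name'] for c in venue.get('categories', [])]
    loc = venue['location']
    print(venue['name'], cats, loc.get('lat'), loc.get('lng'))
```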
This map shows the heat map of the distribution of Foursquare venues in Seoul. As you can see, the hot place is the Gangnam area, and here are Itaewon and the Hongdae area. I also analyzed the categories, because each venue has one of ten top-level categories. The number one category is food and professional services, and the second is shops and services. Together these categories cover almost 80 to 90%; most of the venues belong to them.
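Such a category breakdown is straightforward with pandas; a minimal sketch, where the list stands in for the top-level category of each collected venue:

```python
# Minimal sketch: percentage share of venues per top-level category.
import pandas as pd

categories = ['Food', 'Shop & Service', 'Food', 'Nightlife Spot']  # placeholder
share = pd.Series(categories).value_counts(normalize=True) * 100
print(share)                             # percentage of venues per category
```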
This one is the hotspot analysis of the Foursquare venues. The first map uses the number of venues, showing how heavily the venues are used, and the second and third use the percentage of each category; to these I applied hotspot analysis. And this one shows the GWR model: I used the working population and the land price as variables for the model, and it shows that the central districts, like Gangnam, have a high R-squared score. A sketch of the hotspot step follows below.
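The slides do not show the analysis code; as one hedged possibility with the PySAL library mentioned in the abstract, here is a minimal local Getis-Ord G* hotspot sketch over per-district venue counts. The shapefile and field name are placeholders, and the GWR step itself would need a dedicated GWR implementation, which classic PySAL does not bundle.

```python
# Minimal sketch: local Getis-Ord G* hotspots with classic PySAL 1.x.
import numpy as np
import pysal
from pysal.esda.getisord import G_Local

w = pysal.queen_from_shapefile('districts.shp')  # placeholder polygons
db = pysal.open('districts.dbf')
y = np.array(db.by_col('VENUES'))                # placeholder count field

g = G_Local(y, w, star=True)                     # G* includes the self-neighbor
hot = (g.Zs > 1.96) & (g.p_sim < 0.05)           # significant high-value clusters
print('hotspot districts:', hot.sum())
```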
The next one is the study of geotweets. I analyzed the Korean geotweet distribution using the Twitter API, and it shows the distribution of geotweets in Korea and some temporal features of the geotweets. Lastly, I also applied some text analysis. This map shows the distribution of geotweets in Korea. I cannot remember the exact number I collected, over a billion tweets; I think around 10% were geotweets. Anyway, the map shows that most of them are located in Seoul and the central region. This one shows a monthly view; I collected it in November 2014. From the first day to the last day you can see ups and downs: these days are Sundays and Saturdays, so the count goes up and down in a weekly pattern. And this one is the daily pattern: it goes up, peaks around 2 p.m., and comes down again. That is the daily pattern of the tweets.
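A minimal sketch of this kind of temporal aggregation with pandas, using placeholder timestamps:

```python
# Minimal sketch: aggregate geotweet timestamps by day and hour of day
# to expose the weekly and daily patterns described above.
import pandas as pd

created = pd.to_datetime([
    '2014-11-01 13:10', '2014-11-01 14:05', '2014-11-02 14:30',
    '2014-11-03 09:00',                      # ... one entry per geotweet
])
ts = pd.Series(1, index=created)

daily = ts.resample('D').sum()               # tweets per day (weekly rhythm)
hourly = ts.groupby(ts.index.hour).sum()     # tweets per hour of day
print('peak hour:', hourly.idxmax())         # the talk found about 2 p.m.
```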
I also applied some text analysis. For the regions with the most geotweets, I collected the tweet texts and made word clouds to see the keywords of each region. One thing I found is a high percentage of retweets, because many of these keywords come from retweets. The keywords also represent some locality, like the names of the regions. I used the word cloud libraries for this. Also interesting is Pyeongtaek: a U.S. Army base is located there, so most of the tweets are in English, and one famous word there is, well, "shit".
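A minimal sketch of the word cloud step with the wordcloud library (the abstract also mentions PyTagCloud); the tweet texts are placeholders, and Korean text would additionally need font_path set to a Korean font.

```python
# Minimal sketch: render a word cloud image from region tweet texts.
from wordcloud import WordCloud

texts = ['great coffee in Hongdae', 'club night', 'retweet this']  # placeholder
wc = WordCloud(width=800, height=400,
               background_color='white').generate(' '.join(texts))
wc.to_file('region_wordcloud.png')           # open the image or put on a slide
```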
Anyway, now the problems. Until now I have been doing exploratory statistical analysis, but it is very repetitive work: every time I have to select a subset from my dataset, use different variables, and make a map, case by case, so it takes a lot of time manually. Another problem is that, as time goes by, the data gets bigger and bigger. I have been managing it as text files, but at this point I cannot anymore: I have six months of collected data, over one gigabyte of text files, and I cannot handle that. And another one: I have been using two different systems, a Linux server for the data collection and a Windows desktop for the analysis, and there are some portability issues there. So I am trying to use Python and FOSS4G to integrate this analysis environment, to handle the large amount of data, and to make an automated process for this analysis.
This is the new architecture I am working on. Here is the social media server, and I use SpatiaLite as the database on the data server, with Python libraries in between. There are two clients: a visualization client and an analysis client. For visualization, I am now trying to use Quantum GIS (QGIS) and some word cloud and tag cloud tools. On the analysis client there is a natural language processing library for the sentiment analysis, some statistical analysis tools for Python, and Pandas for the data analysis; instead of Excel, I think I can use Pandas.
This shows the analysis process. There is social media data; if it has a geotag, I convert it into the GIS database. The data has qualitative and quantitative aspects. For the qualitative data, I do some text mining and apply sentiment analysis. For the quantitative aspect, I can apply spatial analysis, creating heat maps or thematic maps, and using spatial statistical analysis I can make hotspot or choropleth maps. This one shows the SpatiaLite database. The reason I am using SpatiaLite is that it is a standalone, file-based database.
So it is very easy to handle. In the beginning I tried using MongoDB, but I found some problem connecting it with QGIS, and nowadays the website of that connector does not work, so I could not solve the problem and looked for an alternative. But I think this one is much better, because it is just file-based; in my case I do not need a multi-user platform, so until now it has been fine. It shows very high compatibility and portability, it is easy to use, and it has good GUI support. A sketch of writing to SpatiaLite from Python follows below.
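A minimal sketch of writing geotweets into SpatiaLite from Python through sqlite3 with the mod_spatialite extension; the extension module name, table layout, and sample row are assumptions.

```python
# Minimal sketch: store geotweets as points in a SpatiaLite file.
# Run the metadata/table setup only once on a fresh database.
import sqlite3

conn = sqlite3.connect('geotweets.sqlite')
conn.enable_load_extension(True)
conn.load_extension('mod_spatialite')        # SpatiaLite extension module

cur = conn.cursor()
cur.execute("SELECT InitSpatialMetaData(1)")  # set up spatial metadata
cur.execute("CREATE TABLE tweets (id TEXT, body TEXT)")
cur.execute("SELECT AddGeometryColumn('tweets', 'geom', 4326, 'POINT', 'XY')")

cur.execute("INSERT INTO tweets (id, body, geom) "
            "VALUES (?, ?, GeomFromText(?, 4326))",
            ('42', 'hello Seoul', 'POINT(126.9780 37.5665)'))
conn.commit()
conn.close()                                  # the file loads directly in QGIS
```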
For the sentiment analysis, I am now using NLTK. If we put in the Twitter text, it converts it into three variables: a positive number, a negative number, and a neutral number. These variables can be used for sentiment mapping of the geotweets; a sketch follows below.
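The talk does not name the exact NLTK tool; one NLTK component that returns exactly such positive/negative/neutral scores is the VADER analyzer, so here is a minimal sketch with it (English only, as noted later in the talk).

```python
# Minimal sketch: positive/negative/neutral scores with NLTK's VADER.
import nltk
from nltk.sentiment.vader import SentimentIntensityAnalyzer

nltk.download('vader_lexicon')               # one-time lexicon download
sia = SentimentIntensityAnalyzer()

scores = sia.polarity_scores("I love this sunny day in Seoul!")
print(scores)  # {'neg': ..., 'neu': ..., 'pos': ..., 'compound': ...}
```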
Here is just one case: a heat map made with QGIS. Last July I collected geotweets with sentiment values for one month. This area shows high positive values and this one low positive values; using the positive number as a weight, this heat map shows the most positive regions in the sentiment analysis results. Here I found certain clusters: Jongno, Hongdae, and Yongsan. I extracted the tweet texts from each region and made word clouds of what they contain. Jongno is a little different, with words like hiring and job, because it is the central business district. Hongdae is a very popular place for the young generation, so there are clubs and so on. Each one is a little different, like food and cafes over there. And this one shows the most positive tweets collected; as you can see, there are words like happy, best, and good. Actually, the NLTK library so far supports only English, so I cannot convert Korean tweets into sentiment values, but building on these cases I am now trying to do sentiment analysis on more Korean tweets.
So, in conclusion: as I mentioned, the analysis of geosocial data is a very complex and multidisciplinary process. In this talk I tried to present an integrated architecture using Python and FOSS4G. As future work, I am not done yet: I plan some advanced statistics with Python, and I am working on making automated Python scripts. Thank you.
Thank you. I have two questions. My first question is: you know, the data from social media is a bit messy, and most of the data is produced, I mean created, by a small number of users; one person can generate many tweets. In the architectural framework you displayed earlier, I did not see a data-cleaning part for this; are you considering that? That is my first question. And my second question is: for tweets, are you using the geolocation of the cell phones or smart devices being used? If so, people may be tweeting about an event in a different location than where they are.
So how do you address that? Oh, actually, I did not think about those two things, but I think that is also a feature of geotweet users. Here is an example, actually: each of these regions is among the regions with the most geotweets in Korea, but in this region I think one guy produces 70 or 80% of the tweets. Nowadays many researchers try to study this kind of heavy-user feature of tweets; that is another topic for research, I think. But to do better research, I think we have to collect as much data as possible; the cases are very different, in some regions there are many users, in some just a few heavy users, and these things never show otherwise. And the second one was, what was the second? Yes: people might be tweeting about one region while they are in another region, as desktop users or smartphone users. There is one column in the data, the location in the user's profile, as well as the tweet content, so I know the data is there, but I still need to test it with more research. Any other questions? I have a question for you, is this working? A lot of times with Twitter there is free text and things like that, and this is sort of outside of geography.
But there are also a lot of hashtags, special characters, emojis, things like that. Were you able to process or map any of those in your sentiment analysis? I am not done yet. But as I mentioned with the last tweets, there are things like this one, yes. Like the smiley face; does that map to a sentiment value? Some qualitative content analysts have tried to use these, studying them one by one, manually, not automatically. But as I mentioned, NLTK in Python uses a library to get the sentiment value; if we want to use this kind of symbol, I think we have to build our own additions to get the sentiment. Okay. Are there any other questions?