Exploring Spatio-Temporal Patterns of Volunteered Geographic Information: A Case Study on Flickr Data of Sweden
Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesis
This thesis aims to seek interesting patterns from massive amounts of Flickr data in Sweden with pro- posed new clustering strategies. The aim can be further divided into three objectives. The first one is to acquire large amount of timestamped geolocation data from Flickr servers. The second objective is to develop effective and efficient methods to process the data. More specifically, the methods to be developed are bifold, namely, the preprocessing method to solve the “Big Data” issue encountered in the study and the new clustering method to extract spatio-temporal patterns from data. The third one is to analyze the extracted patterns with scaling analysis techniques in order to interpret human social activities underlying the Flickr Data within the urban envrionment of Sweden.
During the study, the three objectives were achieved sequentially. The data employed for this study was vector points downloaded through Flickr Application Programming Interface (API). After data ac- quisition, preprocessing was performed on the raw data. The whole dataset was firstly separated by year based on the temporal information. Then data of each year was accumulated with its former year(s) so that the evovling process can be explored. After that, large datasets were splitted into small pieces and each piece was clipped, georeferenced, and rectified respectively. Then the pieces were merged together for clustering. With respect to clustering, the strategy was developed based on the Delaunay Triangula- tion (DT) and head/tail break rule. After that, the generated clusters were analyzed with scaling analysis techniques and spatio-temporal patterns were interpreted from the analysis results. It has been found that the spatial pattern of the human social activities in the urban environment of Sweden generally follows the power-law distribution and the cities defined by human social activities are evolving as time goes by.
To conclude, the contributions of this research are threefold and fulfill the objectives of this study, respectively. Firstly, large amount of Flickr data is acquired and collated as a contribution to other aca- demic researches related to Flickr. Secondly, the clustering strategy based on the DT and head/tail break rule is proposed for spatio-temporal pattern seeking. Thirdly, the evolving of the cities in terms of human activities in Sweden is detected from the perspective of scaling. Future work is expected in major two aspects, namely, data and data processing. For the data aspect, the downloaded Flickr data is expected to be employed by other studies, especially those closely related to human social activities within urban environment. For the processing aspect, new algorithms are expected to either accelerate the processing process or better fit machines with super computing capacities.
Place, publisher, year, edition, pages
2013. , v+32+appendix p.
Big Data, VGI, Flickr, Delaunay Triangulation, Power Law, Scaling Analysis, Spatio-Temporal Pattern
IdentifiersURN: urn:nbn:se:hig:diva-15031OAI: oai:DiVA.org:hig-15031DiVA: diva2:641904
Subject / course
Geomatics – bachelor’s programme (swe or eng)
2013-06-04, 11221, Kungsbäcksvägen 47, Gävle, 13:00 (English)
Jiang, Bin, Professor
Ahlen, Julia, Dr.