Scalable Storage of Sensor Data for a High Frequency Measurement Vehicular Monitoring System
Independent thesis Advanced level (professional degree), 20 credits / 30 HE creditsStudent thesis
This thesis investigates how to store sensor data for a large scale vehicular monitoringsystem with high frequency measurements. The system is intended to measure multiplesensors at a rate of one measurement per second and allow for non-real-time analysisof the data. The goal is that the system should be able to support all cars in Sweden,which means that 240 TB of raw sensor data must be stored every year. To be ableto keep up with the new that is data generated by cars, sensor batches (a batch is acollection of sensor measurements captured at the same time by a single car) must alsobe inserted at a rate of at least 110 000 per second.Cassandra is identifed to be the database that best suits the system requirements.It is then investigated how to store and index the sensor data in Cassandra to be ableto perform one- and two-dimensional range queries. The data models focus on how tostore the data so that it is equally distributed over the cluster and is able to scale upfor large numbers of records.An empirical study is performed that evaluates the performance for insertion, retrievaland storage for the different data models. The result is then used to determinehow many servers would be required to meet the system requirements. The study showsthat it is feasible to store the data in Cassandra. Less than 21 servers are required toreach the insertion rate and the data can be stored using between 360 and 1260 servers(with 2 TB of storage each) per year with three replicas of each node, depending onthe indexing approach.
Place, publisher, year, edition, pages
2012. , 110 p.
Teknik, databases, scalable data storage, Cassandra, big data, sensor data, vehicular monitoring system, vehicle monitoring
IdentifiersURN: urn:nbn:se:ltu:diva-54170Local ID: b25d1239-3557-4ccb-bbb7-125746b6a197OAI: oai:DiVA.org:ltu-54170DiVA: diva2:1027550
Subject / course
Student thesis, at least 30 credits
Computer Science and Engineering, master's level
Validerat; 20121211 (global_studentproject_submitter)2016-10-042016-10-04Bibliographically approved