Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
S3DA: A Stream-based Solution for Scalable DataAnalysis
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology.
2017 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

Data processing frameworks based on cloud platforms are gaining significant attentionas solutions to address the challenges posed by the 3Vs (Velocity, Volume andVariety) of BigData. Very large amounts of information is created continuously, giving rise to data streams. This imposes a high demand on the stream processing system to be very efficient and to cope with massive volumes and fluctuating velocity of data. Existing systems such as Apache Storm, Spark Streaming and Flink rely on messaging systems to handle unreliable data rates with a trade-off of additional latency. Incontrast, data streams arising from scientific applications is often characterized huge tuple sizes, and might suffer in performance from the intermediate layer created bythe messaging systems. The processing system should be scalable enough to overcome the fluctuation of data velocity while maintaining quality of service with low latency and high throughput. It should also provide flexibility in its deployment towork well for fog-computing scenarios where data is generated and handled close to the scientific infrastructure generating the data. In this thesis, we would like tointroduce a framework called HarmonicIO, designed for scientific applications. We show that an optimized data flow and real-time scaling (as seen in HarmonicIO) canreduce the cost per operation while maximizing throughput with low latency.

Place, publisher, year, edition, pages
2017. , p. 64
Series
IT ; 17058
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:uu:diva-334714OAI: oai:DiVA.org:uu-334714DiVA, id: diva2:1160422
Educational program
Master Programme in Computer Science
Supervisors
Examiners
Available from: 2017-11-27 Created: 2017-11-27 Last updated: 2017-11-27Bibliographically approved

Open Access in DiVA

fulltext(764 kB)38 downloads
File information
File name FULLTEXT01.pdfFile size 764 kBChecksum SHA-512
5ec914e16c612b32352b1f4a4aa8b5ddea317380dfdd17ce6212577628704e4c520a3df45512c6db07e1a8d21bbc7b9ebb75460fe390d791eba6609ccbb6264d
Type fulltextMimetype application/pdf

By organisation
Department of Information Technology
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 38 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 100 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf