Streaming Graph Analytics Framework Design
Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Along with the spread of the World Wide Web, social networks and the Internet of Things in the past decades, the need for systems and algorithms that can process massive graphs has been continuously increasing. There has been considerable amount of research done in distributed graph processing since the emergence of such large-scale graphs. Another steadily growing field in the past years has been stream processing. This rise of interest can be attributed to the need to process large amounts of continuously streaming data with scalability, fault tolerance and very low latency. Graph streaming, the unification of these two fields is a rather new idea, with some research already being done on it. Processing graphs that are unbounded, and so large that they cannot be stored in memory or even on the disk, is only possible with a distributed graph streaming model. Our goal is to provide a graph streaming model and API that can handle common transformations and provide statistics on streamed graphs. This graph streaming API is created on top of Flink streaming and provides similar interfaces to Gelly, which is the graph library on the batch processing part of Flink.
Place, publisher, year, edition, pages
2015. , 52 p.
Engineering and Technology
IdentifiersURN: urn:nbn:se:kth:diva-170425OAI: oai:DiVA.org:kth-170425DiVA: diva2:830662
Subject / course
Information and Communication Technology
Master of Science in Engineering - Information and Communication Technology
2015-06-18, Ada, Isafjordsgatan 22, 164 40 Kista, Stockholm, 13:00 (English)
Kalavri, VasilikiCarbone, Paris