Open this publication in new window or tab >>2019 (English)In: Proceedings of The World Wide Web Conference WWW 2019, New York, NY, USA: ACM Digital Library, 2019, p. 3595-3599Conference paper, Published paper (Refereed)
Abstract [en]
Given the increasing number of heterogeneous data stored in relational databases, file systems or cloud environment, it needs to be easily accessed and semantically connected for further data analytic. The potential of data federation is largely untapped, this paper presents an interactive data federation system (https://vimeo.com/ 319473546) by applying large-scale techniques including heterogeneous data federation, natural language processing, association rules and semantic web to perform data retrieval and analytics on social network data. The system first creates a Virtual Database (VDB) to virtually integrate data from multiple data sources. Next, a RDF generator is built to unify data, together with SPARQL queries, to support semantic data search over the processed text data by natural language processing (NLP). Association rule analysis is used to discover the patterns and recognize the most important co-occurrences of variables from multiple data sources. The system demonstrates how it facilitates interactive data analytic towards different application scenarios (e.g., sentiment analysis, privacyconcern analysis, community detection).
Place, publisher, year, edition, pages
New York, NY, USA: ACM Digital Library, 2019
Keywords
heterogeneous data federation, RDF, interactive data analysis
National Category
Natural Language Processing
Identifiers
urn:nbn:se:umu:diva-160892 (URN)10.1145/3308558.3314138 (DOI)000483508403101 ()2-s2.0-85066893934 (Scopus ID)978-1-4503-6674-8 (ISBN)
Conference
WWW '19, The World Wide Web Conference, San Francisco, CA, USA, May 13–17, 2019
2019-06-252019-06-252025-02-07Bibliographically approved