Change search
ReferencesLink to record
Permanent link

Direct link
Random indexing of multi-dimensional data
Luleå University of Technology, Department of Computer Science, Electrical and Space Engineering, Embedded Internet Systems Lab. (EISLAB)ORCID iD: 0000-0001-5662-825X
SICS Swedish ICT, SE-722 13 Västerås, Sweden.
SICS Swedish ICT, SE-164 29 Kista, Sweden.
Number of Authors: 3
2016 (English)In: Knowledge and Information Systems, ISSN 0219-1377, E-ISSN 0219-3116Article in journal (Refereed) Accepted
Abstract [en]

Random indexing (RI) is a lightweight dimension reduction method, which is used for example to approximate vector-semantic relationships in online natural language processing systems. Here we generalise RI to multi-dimensional arrays and thereby enable approximation of higher-order statistical relationships in data. The generalised method is a sparse implementation of random projections,which is the theoretical basis also for ordinary RI and other randomisation approaches to dimensionality reduction and data representation. We present numerical experiments which demonstrate that a multi-dimensional generalisation of RI is feasible, including comparisons with ordinary RI and principal component analysis (PCA). The RI method is well suited for online processing of data streams because relationship weights can be updated incrementally in a fixed-size distributed representation,and inner products can be approximated on the fly at low computational cost. An open source implementation of generalised RI is provided.

Place, publisher, year, edition, pages
2016.
Keyword [en]
Data mining, random embeddings, dimensionality reduction, sparse coding, semantic similarity, streaming algorithm, natural language processing
National Category
Other Electrical Engineering, Electronic Engineering, Information Engineering
Research subject
Industrial Electronics
Identifiers
URN: urn:nbn:se:ltu:diva-60658DOI: 10.1007/s10115-016-1012-2OAI: oai:DiVA.org:ltu-60658DiVA: diva2:1049308
Funder
The Kempe Foundations, GÖF
Available from: 2016-11-24 Created: 2016-11-24 Last updated: 2016-11-30

Open Access in DiVA

fulltext(1872 kB)8 downloads
File information
File name FULLTEXT01.pdfFile size 1872 kBChecksum SHA-512
326eac0dd7e060d4ebb8e69c8d94d2637f14ee7a7eaf74af07cb0ab31e66f30bec4991a0098c3a57377659a4d6c40f6c8bb9949bed3169054b688ba2ef818c81
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Search in DiVA

By author/editor
Sandin, FredrikEmruli, Blerim
By organisation
Embedded Internet Systems Lab
In the same journal
Knowledge and Information Systems
Other Electrical Engineering, Electronic Engineering, Information Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 8 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 11 hits
ReferencesLink to record
Permanent link

Direct link