Change search
Refine search result
12345 151 - 200 of 202
CiteExportLink to result list
Permanent link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Rows per page
  • 5
  • 10
  • 20
  • 50
  • 100
  • 250
Sort
  • Standard (Relevance)
  • Author A-Ö
  • Author Ö-A
  • Title A-Ö
  • Title Ö-A
  • Publication type A-Ö
  • Publication type Ö-A
  • Issued (Oldest first)
  • Issued (Newest first)
  • Created (Oldest first)
  • Created (Newest first)
  • Last updated (Oldest first)
  • Last updated (Newest first)
  • Disputation date (earliest first)
  • Disputation date (latest first)
  • Standard (Relevance)
  • Author A-Ö
  • Author Ö-A
  • Title A-Ö
  • Title Ö-A
  • Publication type A-Ö
  • Publication type Ö-A
  • Issued (Oldest first)
  • Issued (Newest first)
  • Created (Oldest first)
  • Created (Newest first)
  • Last updated (Oldest first)
  • Last updated (Newest first)
  • Disputation date (earliest first)
  • Disputation date (latest first)
Select
The maximal number of hits you can export is 250. When you want to export more records please use the Create feeds function.
  • 151.
    Karlgren, Jussi
    et al.
    KTH, School of Electrical Engineering and Computer Science (EECS), Theoretical Computer Science, TCS.
    Kanerva, P.
    Hyperdimensional utterance spaces2018In: CEUR Workshop Proceedings, CEUR-WS , 2018, Vol. 2167, p. 29-35Conference paper (Refereed)
    Abstract [en]

    Human language has a large and varying number of features, both lexical items and constructions, which interact to represent various aspects of communicative information. High-dimensional semantic spaces have proven useful and effective for aggregating and processing lexical information for many language processing tasks. This paper describes a hyperdimensional processing model for language data, a straightforward extension of models previously used for words to handling utterance or text level information. A hyperdimensional model is able to represent a broad range of linguistic and extra-linguistic features in a common integral framework which is suitable as a bridge between symbolic and continuous representations, as an encoding scheme for symbolic information and as a basis for feature space exploration. This paper provides an overview of the framework and an example of how it is used in a pilot experiment.

  • 152.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Karlgren, Hans
    Pettersson, Paul
    Nordström, Magnus
    Wahrolén, Bengt
    Dilemma: a tool for rapid manual translation1994In: ACM Conference Human Factors in Computing Systems (CHI), Conference Companion, 1994, 1, , p. 2Conference paper (Refereed)
    Abstract [en]

    Dilemma is a tool built to aid human translators in achieving higher productivity and better quality, by presenting lexical information which is automatically extracted from previous translations. The design decisions have been based on analyses of the human translation process. We present the ideas behind the tool, and outline the functionality. The system described has been used by professional translators with good results.

  • 153.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Karlgren, Hans
    Pettersson, Paul
    Nordström, Magnus
    Wahrolén, Bengt
    Dilemma: a tool to ensure quality of new translations by using previous ones1994In: Proceedings of the 4th International Conference on Intelligent Multimedia Information Retrieval Systems and Management (RIAO), 1994, 1Conference paper (Refereed)
    Abstract [en]

    Dilemma is a tool built to aid human translators in achieving higher productivity and better quality, by presenting lexical information which is automatically extracted from previous translations. The design decisions have been based on analyses of the human translation process. We present the ideas behind the tool, and outline the functionality. The system described has been evaluated by professional translators with good results, and is now being developed further.

  • 154.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Kauber, Markus
    Boujemaa, Nozha
    Compañó, Ramón
    Dosch, Christoph
    Geurts, Joost
    Gouraud, Henri
    King, Paul
    Köhler, Joachim
    van der Linden, Pieter
    Ortgies, Robert
    Rudström, Åsa
    RISE, Swedish ICT, SICS, Decisions, Networks and Analytics lab.
    Sebe, Nicu
    CHORUS Deliverable 3.4: Vision Document2009Other (Other academic)
    Abstract [en]

    The goal of the CHORUS Vision Document is to create a high level vision on audio-visual search engines in order to give guidance to the future R&D work in this area and to highlight trends and challenges in this domain. The vision of CHORUS is strongly connected to the CHORUS Roadmap Document (D2.3). A concise document integrating the outcomes of the two deliverables will be prepared for the end of the project (NEM Summit).

  • 155.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Kauber, Markus
    Geurts, Joost
    Boujemaa, Nozha
    Sebe, Nicu
    Ortgies, Robert
    Dosch, Christoph
    Compañó, Ramón
    Köhler, Joachim
    CHORUS Deliverable 4.5: Report of the 3rd CHORUS Conference2009Other (Other academic)
    Abstract [en]

    The third and last CHORUS conference on Multimedia Search Engines took place from the 26th to the 27th of May 2009 in Brussels, Belgium. About 100 participants from 15 European countries, the US, Japan and Australia learned about the latest developments in the domain. An exhibition of 13 stands presented 16 research projects currently ongoing around the world.

  • 156.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Olsson, Fredrik
    RISE, Swedish ICT, SICS.
    Trusting the results in crosslingual keyword-based image retrieval2007In: Evaluation of Multilingual and Multi-modal Information Retrieval, 7th Workshop of the Cross-Language Evaluation Forum, CLEF 2006: Revised Selected Papers, 2007, 1, , p. 3Conference paper (Refereed)
    Abstract [en]

    This paper gives a brief description of the starting points for the experiments the SICS team has performed in the 2006 interactive CLEF campaign.

  • 157.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Rudström, Åsa
    RISE, Swedish ICT, SICS, Decisions, Networks and Analytics lab.
    Dosch, Christoph
    Ortgies, Robert
    CHORUS Deliverable 4.3: Report from CHORUS workshops on national initiatives and metadata2007Other (Other academic)
    Abstract [en]

    Minutes of the following Workshops: • National Initiatives on Multimedia Content Description and Retrieval, Geneva, October 10th, 2007. • Metadata in Audio-Visual/Multimedia production and archiving, Munich, IRT, 21st – 22nd November 2007 Workshop in Geneva 10/10/2007 This highly successful workshop was organised in cooperation with the European Commission. The event brought together the technical, administrative and financial representatives of the various national initiatives, which have been established recently in some European countries to support research and technical development in the area of audio-visual content processing, indexing and searching for the next generation Internet using semantic technologies, and which may lead to an internet-based knowledge infrastructure. The objective of this workshop was to provide a platform for mutual information and exchange between these initiatives, the European Commission and the participants. Top speakers were present from each of the national initiatives. There was time for discussions with the audience and amongst the European National Initiatives. The challenges, communalities, difficulties, targeted/expected impact, success criteria, etc. were tackled. This workshop addressed how these national initiatives could work together and benefit from each other. Workshop in Munich 11/21-22/2007 Numerous EU and national research projects are working on the automatic or semi-automatic generation of descriptive and functional metadata derived from analysing audio-visual content. The owners of AV archives and production facilities are eagerly awaiting such methods which would help them to better exploit their assets.Hand in hand with the digitization of analogue archives and the archiving of digital AV material, metadatashould be generated on an as high semantic level as possible, preferably fully automatically. All users of metadata rely on a certain metadata model. All AV/multimedia search engines, developed or under current development, would have to respect some compatibility or compliance with the metadata models in use. The purpose of this workshop is to draw attention to the specific problem of metadata models in the context of (semi)-automatic multimedia search.

  • 158.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Sahlgren, Magnus
    RISE - Research Institutes of Sweden, ICT, SICS.
    From Words to Understanding2001In: Foundations of Real-World Intelligence, Stanford, California: CSLI Publications , 2001, 1, p. 294-308Chapter in book (Refereed)
  • 159.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Sahlgren, Magnus
    RISE - Research Institutes of Sweden, ICT, SICS.
    Vector-based semantic analysis using random indexing and morphological analysis for cross-lingual information retrieval2002In: Revised Papers from the Second Workshop of the Cross-Language Evaluation Forum on Evaluation of Cross-Language Information Retrieval Systems, Darmstadt, Germany, September 3 - 4, 2001, Springer-Verlag , 2002, 1, p. 169-176Chapter in book (Refereed)
    Abstract [en]

    Meaning, the main object of study in information access, is most decidedly situation-dependent. While much of meaning appears to achieve consistency across usage situations -- a term will seem to mean much the same thing in many of its contexts -- most everything can be negotiated on the go. Human processing appears to be flexible in this respect, and oriented towards learning from prototypes rather than learning by definition: learning new words, and adding new meanings or shades of meaning to an existing word does not need a formal re-training process. We have built a query expansion and translation tool for information retrieval systems. When used in one single language it will expand the terms of a query using a thesaurus built for that purpose; when used across languages it will provide numerous translations and near translations for the source language terms. The underlying technology we are testing is that of vector-based semantic analysis, an analysis method related to latent semantic indexing based on stochastic pattern computing. This paper will briefly describe how we acquired training data, aligned it, analyzed it using morphological analysis tools, and finally built a thesaurus using the data, but will concentrate on an overview of vector-based semantic analysis and how stochastic pattern computing differs from latent semantic indexing in its current form.

  • 160.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Sahlgren, Magnus
    RISE - Research Institutes of Sweden, ICT, SICS.
    Cöster, Rickard
    RISE, Swedish ICT, SICS.
    Weighting Query Terms Based on Distributional Statistics2006In: Accessing Multilingual Information Repositories, 6th Workshop of the Cross-Language Evalution Forum, CLEF 2005, Vienna, Austria, 21-23 September, 2005: Revised Papers, 2006, 1, , p. 5Conference paper (Refereed)
    Abstract [en]

    This year, the SICS team has concentrated on query processing and on the internal topical structure of the query, specifically compound translation. Compound translation is non-trivial due to dependencies between compound elements. This year, we have investigated topical dependencies between query terms: if a query term happens to be non-topical or noise, it should be discarded or given a low weight when ranking retrieved documents; if a query term shows high topicality its weight should be boosted. The two experiments described here are based on the analysis of the distributional character of query terms: one using similarity of occurrence context between query terms globally across the entire collection; the other using the likelihood of individual terms to appear topically in individual texts. Both -- complementary -- boosting schemes tested delivered improved results.

  • 161.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Sahlgren, Magnus
    RISE - Research Institutes of Sweden, ICT, SICS.
    Järvinen, Timo
    Cöster, Rickard
    RISE, Swedish ICT, SICS.
    Dynamic lexica for query translation2005In: Multilingual Information Access for Text, Speech and Images, Third Workshop of the Cross-Language Evaluation Forum (CLEF), 2005, 1Conference paper (Refereed)
    Abstract [en]

    This experiment tests a simple, scalable, and effective approach to building a domain-specific translation lexicon using distributional statistics over parallellized bilingual corpora. A bilingual lexicon is extracted from aligned Swedish-French data, used to translate CLEF topics from Swedish to French, which resulting French queries are then in turn used to retrieve documents from the French language CLEF collection. The results give 34 of fifty queries on or above median for the ``precision at 1000 documents'' recall oriented score; with many of the errors possible to handle by the use of string-matching and cognate search. We conclude that the approach presented here is a simple and efficient component in an automatic query translation system.

  • 162.
    Karlgren, Jussi
    et al.
    KTH, School of Computer Science and Communication (CSC), Theoretical Computer Science, TCS.
    Sahlgren, Magnus
    Olsson, Fredrik
    Espinoza, Fredrik
    Hamfors, Ola
    Profiling Reputation of Corporate Entities in Semantic Space: Notebook for RepLab at CLEF 20122012In: CLEF 2012 Evaluation Labs and Workshop Online Working Notes, 2012Conference paper (Refereed)
    Abstract [en]

    Gavagai used its first-generation baseline system for the profiling taskfor evaluation campaign for online reputation management systems of CLEF2012. The system builds on large scale analysis of streaming text and performedexcellently on this task with standard settings.

  • 163.
    Karlgren, Jussi
    et al.
    KTH, School of Computer Science and Communication (CSC), Theoretical Computer Science, TCS.
    Sahlgren, Magnus
    Olsson, Fredrik
    Espinoza, Fredrik
    Hamfors, Ola
    Usefulness of Sentiment Analysis2012In: ECIR 2012, 34th European Conference on Information Retrieval, Springer Berlin/Heidelberg, 2012, p. 426-435Conference paper (Refereed)
    Abstract [en]

    What can text sentiment analysis technology be used for,and does a more usage-informed view on sentiment analysis pose newrequirements on technology development?

  • 164.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Straszheim, Troy
    Visualizing Stylistic Variation1997In: Proceedings of the 30th Hawaii International Conference on Systems Sciences, 1997, 1Conference paper (Refereed)
    Abstract [en]

    Texts vary not only by topic, but by style; indeed, often the variation between texts `about the same thing' can be just as noticeable as the variation between texts `about different things'. Some facets of this variation are quite easy to detect, and quite predictable when applied to categorization of texts by genre, functional style, or - tentatively - quality. Making use of such variation in an retrieval context is quite straightforward in principle; our work consists of an implementation of a visualization tool for document databases. The issues addressed include 1) choice of stylistic items to investigate, 2) composition of dimensions of variation, and 3) judicious naming of dimensions for presentation. We use use principal components analysis to combine our quite large number of stylistic items into two most significant dimensions of variation and plot the document space under consideration into a plane. This space can be used as a first or last filter in an information retrieval task. The composition of the most significant dimensions is naturally corpus dependent, as is the naming of them: our work is tested on Internet and TREC data.

  • 165.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Tarhio, Jorma
    Hyyrö, Heikki
    String processing and information retrieval2009 (ed. 1)Book (Refereed)
    Abstract [en]

    Proceedings of the 16th International Symposium on String Processing and Information Retrieval (SPIRE 2009), Saariselkä, Finland, 25-27 August 2009.

  • 166. Karlgren, Jussi
    et al.
    Tarhio, Jorma
    Hyyrö, Heikki
    String processing and information retrieval: 16th International Symposium, SPIRE 2009 Saariselkä, Finland, August 25-27, 2009 Proceedings2009Book (Refereed)
  • 167.
    Karlsson, Vide
    et al.
    KTH. Gavagai, Sweden.
    Herman, Pawel
    KTH, School of Computer Science and Communication (CSC), Computational Science and Technology (CST). Gavagai, Sweden.
    Karlgren, Jussi
    KTH, School of Computer Science and Communication (CSC), Theoretical Computer Science, TCS.
    Evaluating Categorisation in Real Life: An argument against simple but impractical metrics2016In: 7th CLEF Conference and Labs of the Evaluation Forum, Springer, 2016Conference paper (Refereed)
    Abstract [en]

    Text categorisation in commercial application poses several limiting constraints on the technology solutions to be employed. This paper describes how a method with some potential improvements is eval- uated for practical purposes and argues for a richer and more expressive evaluation procedure. In this paper one such method is exemplified by a precision-recall matrix which sacrifices convenience for expressiveness. 

  • 168. Klamberg, Mark
    et al.
    Nilsson, Mikael
    Petersson, Anna
    Seipel, Peter
    Flyghed, Janne
    Magnusson Sjöberg, Cecilia
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Bylund, Markus
    RISE - Research Institutes of Sweden, ICT, SICS.
    Palmås, Karl
    Kullenberg, Christopher
    Ström, Pär
    Thorburn, Daniel
    Westerholm, Johan
    FRA-lagen medför massiv kartläggning av oskyldiga2008Other (Other (popular science, discussion, etc.))
    Abstract [sv]

    Mark Klamberg Doktorand juridik, Stockholms universitet, Mikael Nilsson Doktorand datalogi, Kungliga Tekniska högskolan, Anna Petersson Doktorand matematik, Uppsala universitet, Peter Seipel Professor emeritus rättsinformatik, Stockholms universitet, Janne Flyghed Professor kriminologi, Stockholms universitet, Cecilia Magnusson Sjöberg Professor i rättsinformatik, Stockholms universitet, Jussi Karlgren Docent språkteknologi, Swedish Institute of Computer Science, Markus Bylund Doktor i data- och systemvetenskap inriktning personlig integritet, Swedish Institute of Computer Science, Karl Palmås Civilingenjör och doktor i sociologi, Handelshögskolan vid Göteborgs universitet, Christopher Kullenberg Doktorand vetenskapsteori, Göteborgs universitet, Pär Ström Civilingenjör och författare, integritetsombudsman på tankesmedjan Den nya välfärden, Daniel Thorburn Professor i statistik, Stockholms universitet, Johan Westerholm Kapten, reservofficerare underrättelsetjänst, rådgivare Greycat

  • 169. Lundblad, Nicklas
    et al.
    Jardenberg, Joakim
    Wormbs, Nina
    KTH, School of Architecture and the Built Environment (ABE), Philosophy and History of Technology, History of Science and Technology (name changed 20120201).
    Karlgren, Jussi
    Bylund, Markus
    Krantz, Peter
    Kontrollen av Internet bör redovisas öppet2012In: Svenska dagbladet, ISSN 1101-2412, no 2012-04-18Article in journal (Other (popular science, discussion, etc.))
  • 170. Murdock, V.
    et al.
    Clarke, C. L. A.
    Kamps, J.
    Karlgren, Jussi
    KTH, School of Computer Science and Communication (CSC), Theoretical Computer Science, TCS. Gavagai.
    Second Workshop on Search and Exploration of X-Rated Information (SEXI'16): WSDM workshop summary2016In: WSDM 2016 - Proceedings of the 9th ACM International Conference on Web Search and Data Mining, Association for Computing Machinery (ACM), 2016, p. 697-698Conference paper (Refereed)
    Abstract [en]

    Adult content is pervasive on the web, has been a driving factor in the adoption of the Internet medium, and is responsible for a significant fraction of traffic and revenues, yet rarely attracts attention in research. The research questions surrounding adult content access behaviors are unique, and interesting and valuable research in this area can be done ethically. WSDM 2016 features a half day workshop on Search and Exploration of X-Rated Information (SEXI) for information access tasks related to adult content. While the scope of the workshop remains broad, special attention is devoted to the privacy and security issues surrounding adult content by inviting keynote speakers with extensive experience on these topics. The recent release of the personal data belonging to customers of the adult dating site Ashley Madison provides a timely context for the focus on privacy and security.

  • 171. Murdock, Vanessa
    et al.
    Clarke, Charles L A
    Kamps, Jaap
    Karlgren, Jussi
    KTH, School of Computer Science and Communication (CSC), Theoretical Computer Science, TCS.
    Report on the Workshop onSearch and Exploration of X-Rated Information(SEXI 2013): WSDM WORKSHOP REPORT2013In: SIGIR Forum, ISSN 0163-5840, E-ISSN 1558-0229, Vol. 47, no 1, p. 31-37Article in journal (Refereed)
    Abstract [en]

    The Workshop on Search and Exploration of X-Rated Information (SEXI) was presentedfor the rst time at the Conference on Web Search and Data Mining (WSDM) 2013 inRome, Italy. It represents a rst attempt to study adult content from the perspective of theresearch communities in Web Search and Data Mining. To this end, ve short papers werepresented covering dierent research questions in searching and evaluating adult content onthe Web, with two invited talks from experts in adult content from the elds of evolutionarypsychology and media studies. The day ended with a panel that included the two invitedspeakers, and an expert in human tracking on the Web.

  • 172. Murdock, Vanessa
    et al.
    Clarke, Charles L A
    Kamps, Jaap
    Karlgren, Jussi
    Search and exploration of X-Rated information (SEXI 2013)2013In: WSDM 2013: Proceedings of the 6th ACM International Conference on Web Search and Data Mining, Association for Computing Machinery , 2013, p. 795-796Conference paper (Refereed)
    Abstract [en]

    Adult content is pervasive on the Web, has been a driving factor in the adoption of the Internet medium. It is responsible for a significant fraction of traffic and revenues, yet rarely attracts attention in research. We propose that the research questions surrounding adult content access behaviors are unique, and we believe interesting and valuable research in this area can be done ethically. The workshop on Search and Exploration of X-Rated Information (SEXI) addresses these issues for information access tasks related to adult content.

  • 173.
    Nilsson, David
    et al.
    Nepa, Stockholm.
    Sahlgren, Magnus
    Gavagai, Stockholm.
    Karlgren, Jussi
    KTH, School of Computer Science and Communication (CSC), Theoretical Computer Science, TCS.
    Dead Man Tweeting2016In: RE-WOCHAT Workshop on Collecting and Generating Resources for Chatbots and Conversational Agents: Development and Evaluation, 2016Conference paper (Refereed)
    Abstract [en]

    This paper presents a prototype — Dead Man Tweeting — of a system that learns semantic avatars from (dead) people’s texts, and makes the avatars come alive on Twitter. The system includes a language model for generating sequences of words, a topic model for ensuring that the sequences are topically coherent, and a semantic model that ensures the avatars can be productive and generate novel sequences. The avatars are connected to Twitter and are triggered by keywords that are significant for each particular avatar. 

  • 174.
    Nylander, Stina
    et al.
    RISE, Swedish ICT, SICS, Software and Systems Engineering Laboratory.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Statistics and phonotactical rules in finding OCR errors1999In: Proceedings of NODALIDA'99 (Nordiska Datorlingvistikdagarna), 1999, 1Conference paper (Refereed)
  • 175.
    Olsson, Fredrik
    et al.
    RISE, Swedish ICT, SICS. Userware.
    Hansen, Preben
    RISE, Swedish ICT, SICS. Userware.
    Franzén, Kristofer
    RISE, Swedish ICT, SICS. Userware.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS. Userware.
    Information Access and Refinement - a Research Theme2001In: ERCIM NewsArticle in journal (Refereed)
  • 176.
    Olsson, Fredrik
    et al.
    RISE, Swedish ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Hansen, Preben
    RISE, Swedish ICT, SICS.
    Svensson, Martin
    Cöster, Rickard
    RISE, Swedish ICT, SICS.
    Sahlgren, Magnus
    RISE - Research Institutes of Sweden, ICT, SICS.
    Consensus and opinions; quality and churn2006Conference paper (Refereed)
    Abstract [en]

    The role of the web user is under transformation from merely being an information consumer to also being a content provider, ``from information age to participation age'', in the words of Sun CEO Scott McNealy. This increase in participation is most obviously manifested by the growth of online communities, weblogs (blogs), and various forms of cooperative and participatory publication of information. One main factor in the shift towards participation is the advent of authoring tools for wikipedias and blogs. Such tools have decreased the threshold for publishing material online considerably --- it is no longer necessary to have knowledge about the technical workings of the web to be able to use it for making information available to a massive number of potential readers. (Although the lion's share of information produced will probably remain in text form in the foreseeable future, it should be noted that other modalities, such as podcasts, screencasts, films and images, are increasingly attracting interest.) The dynamic nature of blogs and wikipedias poses new challenges to the field of information access and refinement; new theories, methods, and tools for alleviating the burden of digesting information on behalf of the readers are clearly needed. This paper presents some issues on readership and participation we are currently considering.

  • 177. Olsson, Fredrik
    et al.
    Karlgren, Jussi
    KTH, School of Computer Science and Communication (CSC), Theoretical Computer Science, TCS.
    Sahlgren, Magnus
    Espinoza, Fredrik
    Hamfors, Ola
    Technical Requirements For Knowledge Representation For Attitude Mining On A Realistic Scale2012In: Proceedings of the Workshop on Reputation Management in Social Media at LREC’12, 2012Conference paper (Refereed)
    Abstract [en]

    To be useful, a reputation mining system must cover a broad range of weakly, vaguely, and implicitly expressed human sentiments andcannot in the absence of prior knowledge rely on sampling the data stream of human-generated text. To achieve coverage, a reputationmining system must be sensitive to variation and change in the signal. These requirements pose a challenge which are an instance ofmore general semantic processing – this paper presents some design requirements used to design and implement a semantic layer for aprocessing stack for human-generated information.

  • 178. Ortgies, Robert
    et al.
    Dosch, Christoph
    Nesvadba, Jan
    Proidl, Adolf
    Gouraud, Henri
    van der Linden, Pieter
    Boujemaa, Nozha
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Compañó, Ramón
    Köhler, Joachim
    King, Paul
    Lowen, David
    CHORUS Deliverable 3.3: Vision Document - Intermediate version2008Other (Other academic)
    Abstract [en]

    The goal of the CHORUS vision document is to create a high level vision on audio-visual search engines in order to give guidance to the future R&D work in this area (in line with the mandate of CHORUS as a Coordination Action). This current intermediate draft of the CHORUS vision document (D3.3) is based on the previous CHORUS vision documents D3.1 to D3.2 and on the results of the six CHORUS Think-Tank meetings held in March, September and November 2007 as well as in April, July and October 2008, and on the feedback from other CHORUS events. The outcome of the six Think-Thank meetings will not just be to the benefit of the participants which are stakeholders and experts from academia and industry – CHORUS, as a coordination action of the EC, will feed back the findings (see Summary) to the projects under its purview and, via its website, to the whole community working in the domain of AV content search. A few subjections of this deliverable are to be completed after the eights (and presumably last) Think-Tank meeting in spring 2009.

  • 179. Palme, Jacob
    et al.
    Karlgren, Jussi
    KTH, School of Computer Science and Communication (CSC), Theoretical Computer Science, TCS.
    Pargman, Daniel
    Issues when designing filters in messaging systems1996In: Computer Communications, ISSN 0140-3664, E-ISSN 1873-703X, Vol. 19, no 2, p. 95-101Article in journal (Refereed)
    Abstract [en]

    The increasing size of messaging communities increases the risk of information overload, especially when group communication tools like mailing lists or asynchronous conferencing systems (like Usenet News) are used. Future messaging systems will require more capable filters to aid users in the selection of what to read. The increasing use of networks by non-computer professionals requires filters that are easier to use and manage than most filtering software today. Filters might use evaluations of messages made by certain users as an aid to filtering these messages for other users.

  • 180. Palme, Jacob
    et al.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Pargman, Daniel
    Issues when designing filters in messaging systems1996In: Computer Communications, ISSN 0140-3664, E-ISSN 1873-703X, Vol. 19, p. 95-101Article in journal (Refereed)
    Abstract [en]

    The increasing size of messaging communities increases the risk of information overload, especially when group communication tools like mailing lists or asynchronous conferencing systems (like Usenet News) are used. Future messaging systems will require more capable filters to aid users in the selection of what to read. The increasing use of networks by non-computer professionals requires filters that are easier to use and manage than most filtering software today. Filters might use evaluations of messages made by certain users as an aid to filtering these messages for other users.

  • 181. Parks, Magdalena
    et al.
    Karlgren, Jussi
    KTH, School of Computer Science and Communication (CSC), Theoretical Computer Science, TCS.
    Stymne, Sara
    Plausibility Testing for Lexical Resources2017In: Proceedings of CLEF 2017: Information Access Evaluation meets Multilinguality, Multimodality, and Visualization, 2017Conference paper (Refereed)
    Abstract [en]

    This paper describes principles for evaluation metrics for lexical components and an implementation of them based on requirements from practical information system

  • 182.
    Persson, Per
    et al.
    RISE, Swedish ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Korhonen, Panu
    Galore, Janet
    Tierney, Mark
    RISE, Swedish ICT, SICS.
    Redmon, Chad
    Hemanus, Juha
    Lönnqvist, Peter
    Laaksolahti, Jarmo
    RISE, Swedish ICT, SICS, Computer Systems Laboratory.
    Expressive messaging on mobile platforms2001Conference paper (Refereed)
    Abstract [en]

    This paper presents design requirements for expressive, avatar-based multi-modal messaging on mobile platforms. It is argued that expressive messaging needs to exploit context of peers, embodied appearance and behaviour, in combination with text. Our approach allows strong expressiveness and yet simple, on the fly message compositions required in a mobile, noisy setting. Technical challenges for a user ready prototype are sketched. It is argued that the context of usage between work-related stationary terminals and mobile ones is radically different.

  • 183.
    Persson, Per
    et al.
    RISE, Swedish ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Korhonen, Panu
    Galore, Janet
    Tierney, Mark
    RISE, Swedish ICT, SICS.
    Redmon, Chad
    Hemánus, Juha
    Lönnqvist, Peter
    Laaksolahti, Jarmo
    RISE, Swedish ICT, SICS, Computer Systems Laboratory.
    Expressive Messaging on Mobile Platforms2001Report (Other academic)
    Abstract [en]

    We present a design for expressive multimodal messaging on mobile platforms. Strong context, simple text messages, and crude animations combine well to produce surprisingly expressive results.

  • 184. Peters, Carol
    et al.
    Clough, PaulGey, FredericKarlgren, JussiRISE, Swedish ICT, SICS.Magnini, BernardoOard, Douglasde Rijke, MaartenStempfhuber, Maximilian
    Evaluation of Multilingual and Multi-modal Information Retrieval: 7th Workshop of the Cross-Language Evaluation Forum, CLEF 2006, Alicante, Spain, September 20-22, 2006, Revised Selected Papers2007Conference proceedings (editor) (Refereed)
    Abstract [en]

    This book constitutes the thoroughly refereed postproceedings of the 7th Workshop of the Cross-Language Evaluation Forum, CLEF 2006, held in Alicante, Spain, September 2006. The revised papers presented together with an introduction were carefully reviewed and selected for inclusion in the book. The papers are organized in topical sections on Multilingual Textual Document Retrieval, Domain-Specifig Information Retrieval, i-CLEF, QA@CLEF, ImageCLEF, CLSR, WebCLEF and GeoCLEF.

  • 185.
    Sahlgren, Magnus
    et al.
    Gavagai, Sweden.
    Gyllensten, Amaru Cuba
    Gavagai, Sweden.
    Espinoza, Fredrik
    Gavagai, Sweden.
    Hamfors, Ola
    Gavagai, Sweden.
    Karlgren, Jussi
    Gavagai, Sweden.
    Olsson, Fredrik
    Gavagai, Sweden.
    Persson, Per
    Gavagai, Sweden.
    Viswanathan, Akshay
    Gavagai, Sweden.
    Holst, Anders
    SICS, Sweden.
    The Gavagai Living Lexicon2016In: 10th edition of the Language Resources and Evaluation Conference, 23-28 May 2016, Portorož (Slovenia), ELRA , 2016Conference paper (Refereed)
    Abstract [en]

    This paper presents the Gavagai Living Lexicon, which is an online distributional semantic model currently available in 14 different languages. We describe the underlying distributional semantic model, and how we have solved some of the challenges in applying such a model to large amounts of streaming data. We also describe the architecture of our implementation, and discuss how we deal with continuous quality assurance of the lexicon.

  • 186.
    Sahlgren, Magnus
    et al.
    RISE - Research Institutes of Sweden, ICT, SICS.
    Hansen, Preben
    RISE, Swedish ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    English-Japanese cross-lingual query expansion using random indexing of aligned bilingual text data2002Conference paper (Refereed)
    Abstract [en]

    Vector-space techniques can be used for extracting semantically similar words from the co-occurrence statistics of words in large text data. In this paper, we report on experiments with using the Random Indexing vector-space technique for extracting a cross-lingual thesaurus from aligned English-Japanese bilingual data. The cross-lingual thesaurus has been used for automatic cross-lingual query expansion in the NTCIR patent retrieval task.

  • 187. Sahlgren, Magnus
    et al.
    Karlgren, Jussi
    Swedish Institute of Computer Science, Sweden.
    Automatic Bilingual Lexicon Acquisition Using Random Indexing of Parallel Corpora2005In: Natural Language Engineering, ISSN 1351-3249, E-ISSN 1469-8110, Vol. 11, no 3, p. 327-341Article in journal (Refereed)
    Abstract [en]

    This paper presents a very simple and effective approach to using parallel corpora for automatic bilingual lexicon acquisition. The approach, which uses the Random Indexing vector space methodology, is based on finding correlations between terms based on their distributional characteristics. The approach requires a minimum of preprocessing and linguistic knowledge, and is efficient, fast and scalable. In this paper, we explain how our approach differs from traditional cooccurrence-based word alignment algorithms, and we demonstrate how to extract bilingual lexica using the Random Indexing approach applied to aligned parallel data. The acquired lexica are evaluated by comparing them to manually compiled gold standards, and we report overlap of around 60%. We also discuss methodological problems with evaluating lexical resources of this kind.

  • 188.
    Sahlgren, Magnus
    et al.
    RISE - Research Institutes of Sweden, ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Automatic Bilingual Lexicon Acquisition Using Random Indexing of Parallel Corpora2005In: Natural Language Engineering, ISSN 1351-3249, E-ISSN 1469-8110, Vol. 11, no 3, p. 327-341Article in journal (Refereed)
    Abstract [en]

    This paper presents a very simple and effective approach to using parallel corpora for automatic bilingual lexicon acquisition. The approach, which uses the Random Indexing vector space methodology, is based on finding correlations between terms based on their distributional characteristics. The approach requires a minimum of preprocessing and linguistic knowledge, and is efficient, fast and scalable. In this paper, we explain how our approach differs from traditional cooccurrence-based word alignment algorithms, and we demonstrate how to extract bilingual lexica using the Random Indexing approach applied to aligned parallel data. The acquired lexica are evaluated by comparing them to manually compiled gold standards, and we report overlap of around 60\%. We also discuss methodological problems with evaluating lexical resources of this kind.

  • 189.
    Sahlgren, Magnus
    et al.
    RISE - Research Institutes of Sweden, ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS. Attityd.
    Buzz monitoring in word space2008Conference paper (Refereed)
    Abstract [en]

    This paper discusses the task of tracking mentions of some topically interesting textual entity from a continuously and dynamically changing flow of text, such as a news feed, the output from an Internet crawler or a similar text source - a task sometimes referred to as buzz monitoring. Standard approaches from the field of information access for identifying salient textual entities are reviewed, and it is argued that the dynamics of buzz monitoring calls for more accomplished analysis mechanisms than the typical text analysis tools provide today. The notion of word space is introduced, and it is argued that word spaces can be used to select the most salient markers for topicality, find associations those observations engender, and that they constitute an attractive foundation for building a representation well suited for the tracking and monitoring of mentions of the entity under consideration.

  • 190.
    Sahlgren, Magnus
    et al.
    RISE - Research Institutes of Sweden, ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Counting Lumps in Word Space: Density as a Measure of Corpus Homogeneity2005Conference paper (Refereed)
    Abstract [en]

    This paper introduces a measure of corpus homogeneity that indicates the amount of topical dispersion in a corpus. The measure is based on the density of neighborhoods in semantic word spaces. We evaluate the measure by comparing the results for five different corpora. Our initial results indicate that the proposed density measure can indeed identify differences in topical dispersion.

  • 191.
    Sahlgren, Magnus
    et al.
    RISE - Research Institutes of Sweden, ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS. Attityd.
    Terminology mining in social media2009Conference paper (Refereed)
    Abstract [en]

    The highly variable and dynamic word usage in social media presents serious challenges for both research and those commercial applications that are geared towards blogs or other user-generated non-editorial texts. This paper discusses and exemplifies a terminology mining approach for dealing with the productive character of the textual environment in social media. We explore the challenges of practically acquiring new terminology, and of modeling similarity and relatedness of terms from observing realistic amounts of data. We also discuss semantic evolution and density, and investigate novel measures for characterizing the preconditions for terminology mining.

  • 192.
    Sahlgren, Magnus
    et al.
    RISE - Research Institutes of Sweden, ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Cöster, Rickard
    RISE, Swedish ICT, SICS.
    SICS at CLEF 2002: automatic query expansion using random indexing2002Conference paper (Refereed)
    Abstract [en]

    Vector-space techniques can be used for extracting semantically similar words from the co-occurrence statistics of words in large text data. We have used a technique called Random Indexing to accumulate context vectors for Swedish, French and Italian. We have then used the context vectors to perform automatic query expansion. In this paper, we report on our CLEF 2002 experiments on Swedish, French and Italian monolingual query expansion.

  • 193.
    Sahlgren, Magnus
    et al.
    RISE - Research Institutes of Sweden, ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Cöster, Rickard
    RISE, Swedish ICT, SICS.
    Järvinen, Timo
    Automatic query expansion using random indexing2003In: Advances in Cross-Language Information Retrieval: Third Workshop of the Cross-Language Evaluation Forum, CLEF 2002. Rome, Italy, September 19-20, 2002: Revised Papers, Springer-Verlag , 2003, 1, p. 311-320Chapter in book (Refereed)
    Abstract [en]

    Vector-space techniques can be used for extracting semantically similar words from the co-occurrence statistics of words in large text data. We have used a technique called Random Indexing to accumulate context vectors for Swedish, French and Italian. We have then used the context vectors to perform automatic query expansion. In this paper, we report on our CLEF 2002 experiments on Swedish, French and Italian monolingual query expansion.

  • 194.
    Sahlgren, Magnus
    et al.
    RISE - Research Institutes of Sweden, ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Eriksson, Gunnar
    RISE, Swedish ICT, SICS.
    SICS: Valence annotation based on seeds in word space2007Conference paper (Refereed)
  • 195. Sahlgren, Magnus
    et al.
    Ylipää, Erik
    Brown, Barry
    Helms, Karey
    KTH, School of Electrical Engineering and Computer Science (EECS), Media Technology and Interaction Design, MID.
    Lampinen, Airi
    McMillan, Donald
    Karlgren, Jussi
    KTH, School of Computer Science and Communication (CSC), Theoretical Computer Science, TCS.
    The Smart Data Layer2018In: Papers from the 2018 AAAI Spring Symposium on Artificial Intelligence for the Internet of Everything, AAAI Press, 2018Conference paper (Refereed)
    Abstract [en]

    This paper introduces the notion of a smart data layerfor the Internet of Everything. The smart data layer canbe seen as an AI that learns a generic representationfrom heterogeneous data streams with the goal of un-derstanding the state of the user. The smart data layercan be used both as materials for design processes andas the foundation for intelligent data processing.

  • 196. Simaioforidis, Aggis
    et al.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Ereback, Anna-Lena
    RISE, Swedish ICT, SICS.
    Creating a task-specific collaborative information retrieval system1998In: Proceedings of the 2nd European Conference on Research and Advanced Technology for Digital Libraries (ECDL), 1998, 4Conference paper (Refereed)
  • 197.
    Simsarian, Kristian
    et al.
    RISE, Swedish ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Fahlén, Lennart
    RISE, Swedish ICT, SICS.
    Frecon, Emmanuel
    RISE, Swedish ICT, SICS.
    Bretan, Ivan
    Frost, Niklas
    Jonsson, Lars
    Achieving Virtual Presence with a Semi-Autonomous Robot Through a Multi-Reality and Speech Control Interface1996In: Proceedings of 3d Eurographics Workshop on Virtual Environments, 1996, 1Conference paper (Refereed)
    Abstract [en]

    This paper describes a model for a complex human-machine system where a human operator controls a remote robot through the mediation of a distributed virtual environment with a language interface. The system combines speech controlled graphical immersive environments with the live video from a robot working in a real environment. The worlds are synchronized and updated based on operator selections, commands and robot actions. This system allows the user to have a powerful tool with a high level of abstraction to create and control autonomous robots, thus making possible the realization of single and multiple autonomous robot applications.

  • 198. Strzalkowski, Tomek
    et al.
    Guthrie, Louise
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Leistensnider, Jim
    Lin, Fang
    Perez-Carballo, Jose
    Straszheim, Troy
    Wang, Jin
    Wilding, Jon
    Natural language information retrieval: TREC-5 report1996In: NIST Special Publication 500-238: The Fifth Text REtrieval Conference (TREC-5), November 1996, Gaithersburg, Maryland, USA, 1996, 1, , p. 28Conference paper (Refereed)
    Abstract [en]

    In this paper we report on the joint GE/Lockheed Martin/Rutgers/NYU natural language information retrieval project as related to the 5th Text Retrieval Conference (TREC-5). The main thrust of this project is to use natural language processing techniques to enhance the effectiveness of full-text document retrieval. Since our first TREC entry in 1992 (as NYU team) the basic premise of our research was to demonstrate that robust if relatively shallow NLP can help to derive a better representation of text documents for statistical search. TREC-5 marks a shift in this approach away from text representation issues and towards query development problems. While our TREC-5 system still performs extensive text processing in order to extract phrasal and other indexing terms, our main focus this year was on query construction using words, sentences, and entire passages to expand initial topic specifications in an attempt to cover their various angles, aspects and contexts. Based on our earlier TREC results indicating that NLP is more effective when long, descriptive queries are used, we allowed for liberal expansion with long passages from related documents imported verbatim into the queries. This method appears to have produced a dramatic improvement in the performance of two different statistical search engines that we tested (Cornell’s SMART and NIST’s Prise) boosting the average precision by at least 40%. The overall architecture of TREC-5 system has also changed in a number of ways from TREC-4. The most notable new feature is the stream architecture in which several independent, parallel indexes are built for a given collection, each index reflecting a different representation strategy for text documents. Stream indexes are built using a mixture of different indexing approaches, term extracting, and weighting strategies. We used both SMART and Prise base indexing engines, and selected optimal term weighting strategies for each stream, based on a training collection of approximately 500 MBytes. The final results are produced by a merging procedure that combines ranked list of documents obtained by searching all stream indexes with appropriately preprocessed queries. This allows for an effective combination of alternative retrieval and filtering methods, creating into a meta-search where the contribution of each stream can be optimized through training.

  • 199. Strzalkowski, Tomek
    et al.
    Perez-Carballo, Jose
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Hulth, Anette
    Tapanainen, Pasi
    Lahtinen, TImo
    Natural language information retrieval: TREC-8 report1999In: NIST Special Publication 500-246: The Eighth Text REtrieval Conference (TREC-8), 1999, 1, , p. 10Conference paper (Refereed)
    Abstract [en]

    This report describes the adhoc experiments performed by the GE/Rutgers/SICS/SU/Conexor team in the context of TREC-8. The research efforts went in four directions: 1. As in previous years, we performed a full linguistic analysis of the entire corpus, and used the re- sults of the analysis to provide index terms on a higher level of abstraction than can be provided by stems alone. 2. We made use of two different query expansion techniques, one automatic and one manual, both de- veloped for TREC-8. 3. The various analysis models were combined using a stream model architecture, where each stream represents an alternative text indexing method, and the stream's various overlapping knowledge was merged using a new merging algorithm derived from first principles. 4. The entire text was analyzed for various stylistic items. Due to the distributed approach, this years’ research efforts partly canceled out each other. New ex- periments in every step of the process did not result in an overwhelming overall result. We are able to determine that the manual query expansion technique developed at General Electric performed very well.

  • 200.
    Svensson, Martin
    et al.
    RISE, Swedish ICT, SICS.
    Johansson, Arnold
    Ereback, Anna-Lena
    RISE, Swedish ICT, SICS.
    Höök, Kristina
    RISE, Swedish ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Bretan, Ivan
    Moving Usability Testing onto the Web1998In: Proceedings of WebNet'98, 1998, 1Conference paper (Refereed)
    Abstract [en]

    Abstract: In order to remotely obtain detailed usability data by tracking user behaviors within a given web site, a server-based usability testing environment has been created. Web pages are annotated in such a way that arbitrary user actions (such as "mouse over link" or "click back button") can be selected for logging. In addition, the system allows the experiment designer to interleave interactive questions into the usability evaluation, which for instance could be triggered by a particular sequence of actions. The system works in conjunction with clustering and visualization algorithms that can be applied to the resulting log file data. A first version of the system has been used successfully to carry out a web usability evaluation.

12345 151 - 200 of 202
CiteExportLink to result list
Permanent link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf