Visualizing Stylistic Variation
RISE, Swedish ICT, SICS. ORCID iD: 0000-0003-4042-4919
1997 (English). In: Proceedings of the 30th Hawaii International Conference on Systems Sciences, 1997, 1. Conference paper, Published paper (Refereed)
Abstract [en]

Texts vary not only by topic, but by style; indeed, often the variation between texts 'about the same thing' can be just as noticeable as the variation between texts 'about different things'. Some facets of this variation are quite easy to detect, and quite predictable when applied to categorization of texts by genre, functional style, or - tentatively - quality. Making use of such variation in a retrieval context is quite straightforward in principle; our work consists of an implementation of a visualization tool for document databases. The issues addressed include 1) choice of stylistic items to investigate, 2) composition of dimensions of variation, and 3) judicious naming of dimensions for presentation. We use principal components analysis to combine our quite large number of stylistic items into the two most significant dimensions of variation and plot the document space under consideration onto a plane. This space can be used as a first or last filter in an information retrieval task. The composition of the most significant dimensions is naturally corpus dependent, as is the naming of them: our work is tested on Internet and TREC data.
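
The projection step described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the record does not list the stylistic items, so the four columns used here (average word length, average sentence length, pronoun rate, type-token ratio) and the toy values are assumptions made up for illustration, as is the function name project_to_plane. The sketch standardizes each item, takes a singular value decomposition, and keeps the two leading principal components as plane coordinates.

```python
import numpy as np


def project_to_plane(features):
    """Project documents (rows) described by stylistic items (columns)
    onto their two most significant principal components.

    Returns (coords, components, explained), where coords has one 2-D
    point per document, components holds the two loading vectors, and
    explained is the share of total variance each component captures.
    """
    X = np.asarray(features, dtype=float)

    # Standardize every item so that items measured on large scales
    # (e.g. sentence length) do not dominate the components.
    mean = X.mean(axis=0)
    std = X.std(axis=0)
    std[std == 0] = 1.0  # leave constant columns at zero after centering
    Z = (X - mean) / std

    # PCA via SVD: the rows of Vt are the principal directions,
    # ordered by decreasing singular value.
    _, S, Vt = np.linalg.svd(Z, full_matrices=False)
    components = Vt[:2]
    coords = Z @ components.T
    explained = (S ** 2 / np.sum(S ** 2))[:2]
    return coords, components, explained


# Hypothetical toy data: one row per document, columns are
# [avg word length, avg sentence length, pronoun rate, type-token ratio].
docs = [
    [4.2, 12.0, 0.09, 0.61],
    [5.1, 24.5, 0.03, 0.48],
    [4.0, 10.2, 0.11, 0.66],
    [5.4, 27.1, 0.02, 0.45],
    [4.6, 17.3, 0.06, 0.55],
]

coords, components, explained = project_to_plane(docs)
for point in coords:
    print(point)  # (x, y) position of each document in the plane
```

The loading vectors in components indicate which stylistic items drive each axis, which is the information one would need for the naming of dimensions that the abstract mentions; since the composition is corpus dependent, the loadings would have to be recomputed per collection.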

Place, publisher, year, edition, pages
1997, 1.
National Category
Computer and Information Science
Identifiers
URN: urn:nbn:se:ri:diva-21024
OAI: oai:DiVA.org:ri-21024
DiVA: diva2:1041058
Conference
30th Hawaii International Conference on Systems Sciences, 7-10 Jan 1997, Maui, Hawaii
Projects
Proteus
Available from: 2016-10-31 Created: 2016-10-31 Last updated: 2017-07-11 Bibliographically approved

By author/editor
Karlgren, Jussi