Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Putting a spin on SPINN: Representations of syntactic structure in neural network sentence encoders for natural language inference
Linköping University, Department of Computer and Information Science, Human-Centered systems.
2017 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

This thesis presents and investigates a dependency-based recursive neural network model applied to the task of natural language inference. The dependency-based model is a direct extension of a previous constituency-based model used for natural language inference. The dependency-based model is tested on the Stanford Natural Language Inference corpus and is compared to the previously proposed constituency-based model as well as a recurrent Long-Short Term Memory network. The experiments show that the Long-Short Term Memory outperform both the dependency-based models as well as the constituency-based model. It is also shown that what is to be explicitly represented depends on the model dimensionality that one use. With 50-dimensional models, more explicit representations of the dependency structure provides higher accuracies, and the best dependency-based model performs on par with the LSTM. Higher model dimensionalities seem to favor less explicit representations of the dependency structure. We hypothesize that a smaller dimensionality requires a more explicit representation of the relevant linguistic features of the input, while the explicit representation becomes limiting when a higher model dimensionality is used.

Place, publisher, year, edition, pages
2017. , 28 p.
National Category
Language Technology (Computational Linguistics) Human Computer Interaction
Identifiers
URN: urn:nbn:se:liu:diva-139229ISRN: LIU-IDA/KOGVET-A--17/003--SEOAI: oai:DiVA.org:liu-139229DiVA: diva2:1120398
Subject / course
Cognitive science
Supervisors
Examiners
Available from: 2017-07-06 Created: 2017-07-06 Last updated: 2017-07-06Bibliographically approved

Open Access in DiVA

fulltext(600 kB)43 downloads
File information
File name FULLTEXT01.pdfFile size 600 kBChecksum SHA-512
2db8cf5de77a6be9a083df7749c91d265edec4db18cc4e3930eff8158d0acb43bd4dc9d0f14fcec7d149c431e5666d9e345564b5b0f3258710cbadcbb188a6b7
Type fulltextMimetype application/pdf

By organisation
Human-Centered systems
Language Technology (Computational Linguistics)Human Computer Interaction

Search outside of DiVA

GoogleGoogle Scholar
Total: 43 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 647 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf