A typology of classifiers and gender: From description to computation
Tang, Marc
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Languages, Department of Linguistics and Philology. ORCID iD: 0000-0002-9057-642X
2019 (English). Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Categorization is one of the most relevant tasks humans perform during their lives, as we constantly need to categorize the things and experiences that we encounter. This need is reflected in language via various mechanisms, the most prominent being nominal classification systems (e.g., grammatical gender such as the masculine/feminine distinction in French). Typological methods are used to investigate the underlying functions and structures of such systems, using a wide variety of cross-linguistic data to examine universality and variability. This analysis is itself a classification task, as languages are categorized and clustered according to their grammatical features. This thesis provides a cross-linguistic typological analysis of nominal classification systems and in parallel compares a number of quantitative methods that can be applied at different scales.

First, this thesis provides an analysis of nominal classification systems (i.e., gender and classifiers) via the description of three languages with, respectively, gender, classifiers, and both. While the analyses of the first two languages are more descriptive in nature and align with findings in the existing literature, the third language provides novel insights into the typology of nominal classification systems by demonstrating how classifiers and gender may co-occur in one language in terms of distribution of functions. Second, the underlying logic of nominal classification systems is commonly considered difficult to investigate: for example, is there a consistent logic behind gender assignment in language? Is it possible to explain the distribution of classifier languages of the world while taking into account geographical and genealogical effects? This thesis addresses the lack of arbitrariness of nominal classification systems at three different scales: the distribution of classifiers at the worldwide level, the presence of gender within a language family, and gender assignment at the language-internal level. The methods of random forests, phylogenetics, and word embeddings with neural networks are selected since they are respectively applicable at these three scales of research questions (worldwide, family-internal, language-internal).

Place, publisher, year, edition, pages
Uppsala: Acta Universitatis Upsaliensis, 2019, p. 78
Series
Studia Linguistica Upsaliensia, ISSN 1652-1366 ; 23
Keywords [en]
Classifiers, Gender, Nominal classification, Functions, Random Forests, Phylogeny, Word Embeddings, Neural Networks
National Category
General Language Studies and Linguistics; Language Technology (Computational Linguistics)
Research subject
Linguistics
Identifiers
URN: urn:nbn:se:uu:diva-366598
ISBN: 978-91-513-0507-3 (print)
OAI: oai:DiVA.org:uu-366598
DiVA, id: diva2:1265009
Public defence
2019-03-09, Humanistiska teatern, Thunbergsvägen 3H, Uppsala, 10:15 (English)
Available from: 2019-02-06 Created: 2018-11-21 Last updated: 2019-03-08. Bibliographically approved
List of papers
1. The lexical and discourse functions of grammatical gender in Marathi
2018 (English). In: Journal of South Asian languages and linguistics, ISSN 2196-0771, E-ISSN 2196-078X, Vol. 5, no 2. Article in journal (Refereed). In press
National Category
General Language Studies and Linguistics
Research subject
Indology with Classical Sanscrit
Identifiers
urn:nbn:se:uu:diva-366417 (URN); 10.1515/jsall-2018-0012 (DOI)
Available from: 2018-11-20 Created: 2018-11-20 Last updated: 2018-12-13
2. Nominal classification in Assamese: An analysis of function
2019 (English). In: Nominal Classification in Asia: Functional and diachronic perspectives / [ed] Marc Tang; Marcin Kilarski, John Benjamins Publishing Company, 2019. Chapter in book (Refereed)
Place, publisher, year, edition, pages
John Benjamins Publishing Company, 2019
National Category
General Language Studies and Linguistics
Identifiers
urn:nbn:se:uu:diva-366419 (URN)
Available from: 2018-11-20 Created: 2018-11-20 Last updated: 2019-03-27
3. Functions of gender and numeral classifiers in Nepali
(English). In: Poznan Studies in Contemporary Linguistics, ISSN 1732-0747, E-ISSN 1897-7499. Article in journal (Refereed). Submitted
National Category
General Language Studies and Linguistics
Identifiers
urn:nbn:se:uu:diva-366473 (URN)
Available from: 2018-11-20 Created: 2018-11-20 Last updated: 2018-11-21
4. Numeral base, numeral classifier, and noun: Word order harmonization
(English). In: Language and linguistics/Yuyan Ji Yuyangxue, ISSN 1606-822X, E-ISSN 2309-5067, Vol. 22, no 1. Article in journal (Refereed). In press
National Category
General Language Studies and Linguistics
Identifiers
urn:nbn:se:uu:diva-366474 (URN)
Available from: 2018-11-20 Created: 2018-11-20 Last updated: 2018-11-29
5. Insights on the Greenberg-Sanches-Slobin Generalization: Quantitative typological data on classifiers and plural markers
(English). In: Folia linguistica, ISSN 0165-4004, E-ISSN 1614-7308. Article in journal (Refereed). In press
National Category
General Language Studies and Linguistics
Identifiers
urn:nbn:se:uu:diva-366476 (URN)
Available from: 2018-11-20 Created: 2018-11-20 Last updated: 2018-11-21
6. Word order of numeral classifiers and numeral bases: Harmonization by multiplication
(English). In: Language Typology and Universals, ISSN 1867-8319, E-ISSN 2196-7148. Article in journal (Refereed). In press
National Category
General Language Studies and Linguistics
Identifiers
urn:nbn:se:uu:diva-366477 (URN)
Available from: 2018-11-20 Created: 2018-11-20 Last updated: 2018-11-21
7. A Statistical Explanation of the Distribution of Sortal Classifiers in Languages of the World via Computational Classifiers
2018 (English). In: Journal of Quantitative Linguistics, ISSN 0929-6174, E-ISSN 1744-5035. Article in journal (Refereed). Published
Abstract [en]

Previous studies demonstrate that morphosyntactic plural markers and the structure of numeral systems each have strong predictive power with regard to the usage of sortal classifiers in languages. We use these two factors as explanatory variables to train the computational classifier of random forests and evaluate the accuracy of their predictive power when selecting the presence/absence of sortal classifiers as the response variable. Our results show that these two factors result in an excellent discrimination performance of random forests, even when taking into account sortal classifiers as an areal feature. However, the correlation between morphosyntactic plural markers and multiplicative bases is weaker than the correlation between sortal classifiers and plural markers plus multiplicative bases. We are thus able to provide novel insights with regard to probabilistic universals on sortal classifiers, and suggest an innovative cross-disciplinary approach to test the effect of implicational universals with computational methods.

National Category
General Language Studies and Linguistics; Language Technology (Computational Linguistics)
Research subject
Linguistics
Identifiers
urn:nbn:se:uu:diva-362252 (URN); 10.1080/09296174.2018.1523777 (DOI)
Available from: 2018-10-03 Created: 2018-10-03 Last updated: 2018-11-21. Bibliographically approved
8. Measuring the phylogenetic signal of grammatical gender in Indo-Aryan languages
(English). In: Language Dynamics and Change, ISSN 2210-5824, E-ISSN 2210-5832. Article in journal (Refereed). Submitted
National Category
General Language Studies and Linguistics
Identifiers
urn:nbn:se:uu:diva-366596 (URN)
Available from: 2018-11-21 Created: 2018-11-21 Last updated: 2018-11-23
9. Linguistic information in word embeddings
2019 (English). In: Lecture notes in artificial intelligence / [ed] Randy Goebel, Yuzuru Tanaka, and Wolfgang Wahlster, Dordrecht: Springer, 2019. Chapter in book (Refereed)
Place, publisher, year, edition, pages
Dordrecht: Springer, 2019
National Category
General Language Studies and Linguistics
Identifiers
urn:nbn:se:uu:diva-366478 (URN)
Available from: 2018-11-20 Created: 2018-11-20 Last updated: 2019-03-27
10. Lexical and Morpho-syntactic Features in Word Embeddings: A Case Study of Nouns in Swedish
2018 (English). In: Proceedings of the 10th International Conference on Agents and Artificial Intelligence: Volume 2, Setubal: SciTePress, 2018, p. 663-674. Chapter in book (Refereed)
Place, publisher, year, edition, pages
Setubal: SciTePress, 2018
National Category
General Language Studies and Linguistics
Identifiers
urn:nbn:se:uu:diva-351926 (URN); 978-989-758-275-2 (ISBN)
Available from: 2018-05-31 Created: 2018-05-31 Last updated: 2018-11-21. Bibliographically approved

Open Access in DiVA

fulltext (1295 kB), 192 downloads
File information
File name: FULLTEXT02.pdf
File size: 1295 kB
Checksum SHA-512:
3c1ea64a3327c8aa1c37f560d970ad7c992dc27eec686ca22491b927bc1156fe895e92f717b40c59389325115f64892d7bb2c9275e6522f7c6eed36ea4e7766f
Type: fulltext
Mimetype: application/pdf
errata (69 kB), 16 downloads
File information
File name: ERRATA02.pdf
File size: 69 kB
Checksum SHA-512:
0570032fcc0c42d7f4c07a5bf48f5a7f50501a99b5d5a57d65671475c98ec4b9f2d6f43e04e8f1a2ade7a0cfc5649998d3629eaa0573588550c6cb92bcf7ba77
Type: errata
Mimetype: application/pdf
By author/editor
Tang, Marc
By organisation
Department of Linguistics and Philology
Total: 192 downloads
The number of downloads is the sum of all downloads of full texts. It may include, e.g., previous versions that are no longer available.

Total: 1125 hits