Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
On the Measurement of Model Fit for Sparse Categorical Data
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Social Sciences, Department of Statistics.
2012 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

This thesis consists of four papers that deal with several aspects of the measurement of model fit for categorical data. In all papers, special attention is paid to situations with sparse data.

The first paper concerns the computational burden of calculating Pearson's goodness-of-fit statistic for situations where many response patterns have observed frequencies that equal zero. A simple solution is presented that allows for the computation of the total value of Pearson's goodness-of-fit statistic when the expected frequencies of response patterns with observed frequencies of zero are unknown.

In the second paper, a new fit statistic is presented that is a modification of Pearson's statistic but that is not adversely affected by response patterns with very small expected frequencies. It is shown that the new statistic is asymptotically equivalent to Pearson's goodness-of-fit statistic and hence, asymptotically chi-square distributed.

In the third paper, comprehensive simulation studies are conducted that compare seven asymptotically equivalent fit statistics, including the new statistic. Situations that are considered concern both multinomial sampling and factor analysis. Tests for the goodness-of-fit are conducted by means of the asymptotic and the bootstrap approach both under the null hypothesis and when there is a certain degree of misfit in the data. Results indicate that recommendations on the use of a fit statistic can be dependent on the investigated situation and on the purpose of the model test. Power varies substantially between the fit statistics and the cause of the misfit of the model. Findings indicate further that the new statistic proposed in this thesis shows rather stable results and compared to the other fit statistics, no disadvantageous characteristics of the fit statistic are found.

Finally, in the fourth paper, the potential necessity of determining the goodness-of-fit by two sided model testing is adverted. A simulation study is conducted that investigates differences between the one sided and the two sided approach of model testing. Situations are identified for which two sided model testing has advantages over the one sided approach.

Place, publisher, year, edition, pages
Uppsala: Acta Universitatis Upsaliensis, 2012. , 22 p.
Series
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Social Sciences, ISSN 1652-9030 ; 79
Keyword [en]
goodness-of-fit, sparseness, model fit, categorical data, fit statistic, sparse contingency table
National Category
Probability Theory and Statistics
Research subject
Statistics
Identifiers
URN: urn:nbn:se:uu:diva-173768ISBN: 978-91-554-8394-4 (print)OAI: oai:DiVA.org:uu-173768DiVA: diva2:525091
Public defence
2012-06-14, Hörsal 2, Ekonomikum, Kyrkogårdsgatan 10, Uppsala, 10:15 (English)
Opponent
Supervisors
Available from: 2012-05-24 Created: 2012-05-05 Last updated: 2012-08-01Bibliographically approved
List of papers
1. On the Computation of Pearson's Goodness-of-Fit Statistic for Sparse Contingency Tables
Open this publication in new window or tab >>On the Computation of Pearson's Goodness-of-Fit Statistic for Sparse Contingency Tables
(English)Manuscript (preprint) (Other academic)
National Category
Probability Theory and Statistics
Research subject
Statistics
Identifiers
urn:nbn:se:uu:diva-173764 (URN)
Available from: 2012-05-05 Created: 2012-05-05 Last updated: 2012-08-01
2. An Alternative to Pearson's Goodness-of-Fit Statistic when Expected Frequencies are Small
Open this publication in new window or tab >>An Alternative to Pearson's Goodness-of-Fit Statistic when Expected Frequencies are Small
(English)Manuscript (preprint) (Other academic)
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:uu:diva-173765 (URN)
Available from: 2012-05-05 Created: 2012-05-05 Last updated: 2012-08-01
3. Measurement of Fit in Categorical Data Analysis
Open this publication in new window or tab >>Measurement of Fit in Categorical Data Analysis
(English)Manuscript (preprint) (Other academic)
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:uu:diva-173766 (URN)
Available from: 2012-05-05 Created: 2012-05-05 Last updated: 2012-08-01
4. A Note on Two Sided Goodness-of-Fit Testing
Open this publication in new window or tab >>A Note on Two Sided Goodness-of-Fit Testing
(English)Manuscript (preprint) (Other academic)
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:uu:diva-173767 (URN)
Available from: 2012-05-05 Created: 2012-05-05 Last updated: 2012-08-01

Open Access in DiVA

fulltext(1639 kB)1019 downloads
File information
File name FULLTEXT01.pdfFile size 1639 kBChecksum SHA-512
59334723cd06903e3d23526cdd9754d64e19a7c5a2485902a71e115ce87a829a5faf3ea442d6c2f9acc081168c0c5943217f384e3a4069469746d81dd5f5d818
Type fulltextMimetype application/pdf
Buy this publication >>

Search in DiVA

By author/editor
Kraus, Katrin
By organisation
Department of Statistics
Probability Theory and Statistics

Search outside of DiVA

GoogleGoogle Scholar
Total: 1019 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 1026 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf