Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Measuring correlation between commit frequency and popularity on GitHub
KTH, School of Computer Science and Communication (CSC).
KTH, School of Computer Science and Communication (CSC).
2017 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesisAlternative title
Mätning av korrelation mellan commitfrekvens och popularitet på GitHub (Swedish)
Abstract [en]

This thesis studies the correlation between the commit frequency and popularity of Github projects. Over 12 000 projects were retrieved using the Github API, resulting in a dataset containing 85 projects after filtering out projects that were deemed unfit. The analysis of the projects consisted of calculating the Pearson Correlation Coefficient using the frequency of commits and popularity as variables. Different time intervals were studied along with several metrics of popularity based upon the project’s metadata retrieved from Github. The results varied for the different time intervals and metrics of popularity but none of the measurements resulted in a correlation coefficient which indicated a strong or moderate correlation. Therefore this study reached the conclusion of no existing correlation between commit frequency and popularity. Although no correlation was found, several potential measures of improvement for further research were discovered.

Abstract [sv]

Denna studie undersöker korrelationen mellan frekvensen av commits och popularitet hos Github projekt. Över 12 000 projekt utvanns genom Github API:et vilket resulterade i en datamängd innehållandes 85 projekt efter att gallringen av oönskade projekt ägt rum. Analysen av projekten bestod av att beräkna Pearsons korrelationskoefficient med frekvensen av commits och popularitet som variabler. Baserat på projektens metadata från Github undersöktes olika tidsintervall kombinerat med flera mått på popularitet. Resultaten varierade för de olika tidsintervallen och popularitetsmåtten men ingen av mätningarna resulterade i en korrelationskoefficient som indikerade en stark eller medelstark korrelation. Således fastställde denna studie slutsatsen att ingen korrelation existerade mellan frekvensen av commits och popularitet. Trots att ingen korrelation hittades, upptäcktes däremot flera potentiella förbättringsåtgärder för vidare forskning

Place, publisher, year, edition, pages
2017.
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:kth:diva-209819OAI: oai:DiVA.org:kth-209819DiVA, id: diva2:1114676
Supervisors
Examiners
Available from: 2017-06-25 Created: 2017-06-25 Last updated: 2018-01-13Bibliographically approved

Open Access in DiVA

fulltext(1146 kB)223 downloads
File information
File name FULLTEXT01.pdfFile size 1146 kBChecksum SHA-512
53d7eca6055cd00cb02038f5edd081188923d7ca424d7677c7ad14839452055649dd3ca89fd8ec8fda8b300a05a124f04e786db97ba7c0c591bac876597a787c
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Grönlund, MårtenJefford-Baker, Jonathan
By organisation
School of Computer Science and Communication (CSC)
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 223 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 369 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf