Cross Site Product Page Classification with Supervised Machine Learning
Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesisAlternative title
Webbsideöverskridande klassificering av produktsidor med övervakad maskininlärning (Swedish)
This work outlines a possible technique for identifying webpages that contain product specifications. Using support vector machines a product web page classifier was constructed and tested with various settings. The final result for this classifier ended up being 0.958 in precision and 0.796 in recall for product pages. The scores imply that the method could be considered a valid technique in real world web classification tasks if additional features and more data were made available.
Place, publisher, year, edition, pages
2016. , 46 p.
svm support vector machine product page classification
Computer and Information Science
IdentifiersURN: urn:nbn:se:kth:diva-189555OAI: oai:DiVA.org:kth-189555DiVA: diva2:946837
Subject / course
Computer Technology, Program- and System Development
Master of Science in Engineering - Computer Science and Technology