Digitala Vetenskapliga Arkivet

Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Semantic Segmentation of Iron Ore Pellets in the Cloud
Luleå tekniska universitet, Institutionen för system- och rymdteknik.
2021 (Engelska)Självständigt arbete på avancerad nivå (yrkesexamen), 20 poäng / 30 hpStudentuppsats (Examensarbete)
Abstract [en]

This master's thesis evaluates data annotation, semantic segmentation and Docker for use in AWS. The data provided has to be annotated and is to be used as a dataset for the creation of a neural network. Different neural network models are then to be compared based on performance. AWS has the option to use Docker containers and thus that option is to be examined, and lastly the different tools available in AWS SageMaker will be analyzed for bringing a neural network to the cloud.

Images were annotated in Ilastik and the dataset size is 276 images, then a neural network was created in PyTorch by using the library Segmentation Models PyTorch which gave the option of trying different models. This neural network was created in a notebook in Google Colab for a quick setup and easy testing. The dataset was then uploaded to AWS S3 and the notebook was brought from Colab to an AWS instance where the dataset then could be loaded from S3. A Docker container was created and packaged with the necessary packages and libraries as well as the training and inference code, to then be pushed to the ECR (Elastic Container Registry). This container could then be used to perform training jobs in SageMaker which resulted in a trained model stored in S3, and the hyperparameter tuning tool was also examined to get a better performing model. The two different deployment methods in SageMaker was then investigated to understand the entire machine learning solution.

The images annotated in Ilastik were deemed sufficient as the neural network results were satisfactory. The neural network created was able to use all of the models accessible from Segmentation Models PyTorch which enabled a lot of options. By using a Docker container all of the tools available in SageMaker could be used with the created neural network packaged in the container and pushed to the ECR. Training jobs were run in SageMaker by using the container to get a trained model which could be saved to AWS S3. Hyperparameter tuning was used and got better results than the manually tested parameters which resulted in the best neural network produced. The model that was deemed the best was Unet++ in combination with the Dpn98 encoder. The two different deployment methods in SageMaker was explored and is believed to be beneficial in different ways and thus has to be reconsidered for each project. By analysis the cloud solution was deemed to be the better alternative compared to an in-house solution, in all three aspects measured, which was price, performance and scalability.

Ort, förlag, år, upplaga, sidor
2021. , s. 48
Nyckelord [en]
Data annotation, Semantic segmentation, Docker, AWS, SageMaker, Ilastik, PyTorch, Unet++
Nationell ämneskategori
Datorsystem
Identifikatorer
URN: urn:nbn:se:ltu:diva-86896OAI: oai:DiVA.org:ltu-86896DiVA, id: diva2:1588733
Externt samarbete
Data ductus
Utbildningsprogram
Datateknik, master
Handledare
Examinatorer
Tillgänglig från: 2021-09-27 Skapad: 2021-08-29 Senast uppdaterad: 2021-09-27Bibliografiskt granskad

Open Access i DiVA

fulltext(6109 kB)297 nedladdningar
Filinformation
Filnamn FULLTEXT01.pdfFilstorlek 6109 kBChecksumma SHA-512
f85051a031dc3d61920d2e42582a93503c6f07eca93b5f468e69094d2dce41cba977c6e3ac0d50232cd67eb9ea6d15d3e1b0859cb044bf5ea9111108bce1449f
Typ fulltextMimetyp application/pdf

Sök vidare i DiVA

Av författaren/redaktören
Lindberg, Hampus
Av organisationen
Institutionen för system- och rymdteknik
Datorsystem

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 297 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

urn-nbn

Altmetricpoäng

urn-nbn
Totalt: 525 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf