Digitala Vetenskapliga Arkivet

Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Detection of artefacts in FFPE-sample sequence data
Uppsala University, Disciplinary Domain of Science and Technology, Biology, Biology Education Centre. (Department of Immunology, Genetics and Pathology, Facilities: Clinical Genomics Uppsala)
2019 (English)Independent thesis Advanced level (professional degree), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

Next generation sequencing is increasingly used as a diagnostic tool in the clinical setting. This is driven by the vast increase in molecular targeted therapy, which requires detailed information on what genetic variants are present in patient samples. In the hospital setting, most cancer diagnostics are based on Formalin Fixed Paraffin Embedded (FFPE) samples. The FFPE routine is very beneficial for logistical purposes and for some histopathological analyses, but creates problems for molecular diagnostics based on DNA. These problems derive from sample immersion informalin, which results in DNA fragmentation, interstrand DNA crosslinking and sequence artefacts due to hydrolytic deamination. Distinguishing such artefacts from true somatic variants can be challenging, thus affecting both research and clinical analyses.

In order to identify FFPE-artefacts from true variants in next generation sequencing data from FFPE samples, I developed the novelprogram FUSAC (FFPE tissue UMI based Sequence Artefact Classifier) for the facility Clinical Genomics in Uppsala. FUSAC utilizes UniqueMolecular Identifiers (UMI's) to identify and group sequencing reads based on their molecule of origin. By using UMI's to collapse duplicate paired reads into consensus reads, FFPE-artefacts are classified through comparative analysis of the positive and negative strand sequences. My findings indicate that FUSAC can succesfully classify UMI-tagged next generation sequencing reads with FFPE-artefacts, from sequencing reads with true variants. FUSAC thus presents a novel approach in bioinformatic pipelines for studying FFPE-artefacts.

Place, publisher, year, edition, pages
2019. , p. 75
Series
UPTEC X ; 19039
Keywords [en]
Cancer, FFPE, Sequence Artefacts, Precision Medicine, Bioinformatics, Program
National Category
Bioinformatics (Computational Biology)
Identifiers
URN: urn:nbn:se:uu:diva-392623OAI: oai:DiVA.org:uu-392623DiVA, id: diva2:1349256
Educational program
Molecular Biotechnology Engineering Programme
Supervisors
Examiners
Available from: 2019-09-20 Created: 2019-09-07 Last updated: 2019-09-20Bibliographically approved

Open Access in DiVA

fulltext(2632 kB)1545 downloads
File information
File name FULLTEXT01.pdfFile size 2632 kBChecksum SHA-512
80e614c9e7706a7d3bdb7cb98f2a66858244ca0492eeacb3ab025400a7eb62226099ef70ebe63e36e93cce60630259c7a81f8ac37875284250ea28fadae266bf
Type fulltextMimetype application/pdf

By organisation
Biology Education Centre
Bioinformatics (Computational Biology)

Search outside of DiVA

GoogleGoogle Scholar
Total: 1546 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 1110 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf