Digitala Vetenskapliga Arkivet

Software engineering for scientific big data analysis
Univ Freiburg, Dept Comp Sci, Bioinformat Grp, Georges Koehler Allee 106, D-79110 Freiburg, Germany; Univ Freiburg, Ctr Biol Syst Anal ZBSA, Habsburgerstr 49, D-79104 Freiburg, Germany.
Uppsala University, Disciplinary Domain of Medicine and Pharmacy, Faculty of Pharmacy, Department of Pharmaceutical Biosciences; Stockholm Univ, Natl Bioinformat Infrastruct Sweden, Sci Life Lab, Dept Biochem & Biophys, Svante Arrhenius Vag 16C, S-10691 Solna, Sweden. ORCID iD: 0000-0001-6740-9212
Univ Bergen, Dept Clin Sci, KG Jebsen Ctr Diabet Res, Postboks 7804, N-5020 Bergen, Norway; Haukeland Hosp, Ctr Med Genet & Mol Med, Postboks 7804, N-5020 Bergen, Norway.
Cleveland Clin, Lerner Res Inst, Genom Med Inst, 9500 Euclid Ave NE50, Cleveland, OH 44106 USA. ORCID iD: 0000-0002-6833-9049
2019 (English). In: GigaScience, E-ISSN 2047-217X, Vol. 8, no 5, article id giz054. Article, review/survey (Refereed). Published.
Abstract [en]

The increasing complexity of data and analysis methods has created an environment where scientists, who may not have formal training, are finding themselves playing the impromptu role of software engineer. While several resources are available for introducing scientists to the basics of programming, researchers have been left with little guidance on approaches needed to advance to the next level for the development of robust, large-scale data analysis tools that are amenable to integration into workflow management systems, tools, and frameworks. The integration into such workflow systems necessitates additional requirements on computational tools, such as adherence to standard conventions for robustness, data input, output, logging, and flow control. Here we provide a set of 10 guidelines to steer the creation of command-line computational tools that are usable, reliable, extensible, and in line with standards of modern coding practices.

Place, publisher, year, edition, pages
Oxford University Press, 2019. Vol. 8, no 5, article id giz054
Keywords [en]
software development, big data, workflow, standards, data analysis, coding, software engineering, scientific software, integration systems, computational tools
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:uu:diva-390339
DOI: 10.1093/gigascience/giz054
ISI: 000474856100022
PubMedID: 31121028
OAI: oai:DiVA.org:uu-390339
DiVA, id: diva2:1341613
Funder
EU, Horizon 2020, 654241
Available from: 2019-08-09. Created: 2019-08-09. Last updated: 2023-02-06. Bibliographically approved.

By author/editor
Lampa, Samuel; Blankenberg, Daniel