Use of “default” parameter settings when analyzing single cell RNA sequencing data using Seurat: a biologist’s perspective

Abstract

Analysis of large datasets has become integral to biological studies due to the advent of high throughput technologies such as next generation sequencing. Techniques for analyzing these large datasets are normally developed by bioinformaticists and statisticians, with input from biologists. Frequently, the end-user does not have the training or knowledge to make informed decisions on input parameter settings required to implement the analyses pipelines. Instead, the end-user relies on “default” settings present within the software packages, consultations with in-house bioinformaticists, or on methods described in previous publications. The aim of this study was to explore the effects of altering default parameters on the cell clustering solutions generated by a common pipeline implemented in the Seurat R package that is used to cluster cells based on single cell RNA sequencing (scRNAseq) data.

Review

However, its thinking way of default parameters could help us widen our horizons. Acturally, we are not supposed to be confined to Seurat, there has many other packages and algorithms to use. Also, the comparison of result is too easy.

Reference