Sparrow pipeline
The Sparrow pipeline by created by Pollaris et al. (2024). It was designed to analyze spatial transcriptomics data by doing over the following steps:
Image preprocessing
Based on the input image with the staining and the input data with the transcript coordinates, an image is created with transcript densities to visualize regions where no transcripts were detected. Next the image with the staining is cleaned: artefacts and halos are removed and all cells are made equally bright…
Cell segmentation
Segmentation is done by default by CellPose, but other algorithms are included in Sparrow.
Registration
Based on the transcript coordinates and the segmentation results, the transcripts are allocated to cells, generating a count matrix. Cells are annotated automatically using lists of cell type markers. This method of annotation was developed by the Saeys Lab and gave better results than existing algorithms.
Filtering genes and cells
- Cells are filtered based on transcript density, cells with low transcript density are typically low quality
- Check for each gene how many of its transcripts could not be allocated to cells: allows to detect missing cell types
Recap: overview of the Sparrow pipeline
The results of each step in the pipeline are stored in the SpatialData object.
Comparison to the pipelines used by the vendors
Sparrow generates better results than the pipelines of the vendors:
- The vendors do not provide cell type annotation
- The vendors pick up more cells but Sparrow picks up more cells with a high transcript density. The vendors pick up a lot of low quality cells and debris. Using the data generated by the vendors you loose 40% of the cell types.
Troubleshooting
Cells can be missed during the
- imaging: not bright enough, too bright, big halo…
- segmentation: too small, too big, hard regions with high cell density…
- registration: cells with no transcripts often due to necrosis…
- filtering: cells with a total count < 10 are removed and cells that are too small/big are removed
Effect of membrane stainings
MERFish (Vizgen) allows you to do multiple stainings (DAPI, polyT, membrane). All stainings are far from perfect and will miss cells. If you are interested in a cell type that is missed by the general stainings you can try a cell type specific staining. By doing limited manual annotation of these cells you can teach the segmentation algorithm to recognize them.