Inspect

This module performs the extraction of useful metrics that can be either shown on the standard output or saved them to file, in CSV or Parquet format.

Usage

The minimal command is cl inspect --input-path <PATH> --output-path <PATH>, where input-path is the path to the AVITI data folder and output-path is the directory where the outputs will be placed.

Options

  • --stats-json: Path to the cell2stats 'RunStats.json' file. Required only if different from <INPUT_PATH>/RunStats.json.
  • --panel-json: Path to the cell2stats 'Panel.json' file. Required only if different from <INPUT_PATH>/Panel.json.
  • --raw-parquet: Path to the cell2stats 'RawCellStats.parquet' file. Required only if different from <INPUT_PATH>/RawCellStats.parquet.
  • --wells: Comma-separated list of wells to include in the analysis. By default, all wells will be used.
  • --batches: Comma-separated list of batches to include in the analysis. By default, all batches will be used.
  • --stats-list: Comma-separated list of metrics to produce. Accepted values include 'All', 'BatchWell', 'Well', 'Count', 'Correlation'. Default: All.
  • --format: It can be one of 'csv', 'parquet' or 'stdout'. Default is 'csv'.

Outputs

Barcoding performance metrics

Groups of barcodes are sequenced in serial batches, where each batch is defined by a specific sequencing primer.

Metric Description Expected Value
PercentAssignedReads Of all polonies, percentage assigned to an expected barcode > 70%
PercentMismatch Of all polonies assigned to a barcode, percentage assigned with a mismatch < 35%
  • DemuxStats.*: Metrics regarding the demultiplexing step by batch and well.

Cell segmentation performance metrics

Cell segmentation is performed based on the cell paint images for the cell membrane, nucleus and actin. The metrics below summarize the results of the segmentation process for each well in the flowcell.

Metric Description Expected Value
PercentConfluency Fraction of well area occupied by cells 25-50% (variable based on cell seeding)
CellCount Number of objects detected during segmentation >10,000 (variable based on cell seeding)
MedianCellDiameter Approximate median diameter of cells in microns ~35 um
PercentNucleatedCells Fraction of cells with segmented nucleus > 97%
  • SegmentationMetrics.*: Information about confluency, cell count and diameter by well.

Cell assignment metrics

After barcoding and cell segmentation is complete, individual barcodes are assigned to cells. The metrics below summarize this process.

Metric Description Expected Value
AssignedCountsPerMM2 Number of assigned polonies per mm2 of cell area ~150,000 (protein), 200,000-300,000 (RNA)
  • CytoStats.*: Assigned counts per mm^2 per batch and well.

Correlation metrics

For each pair of wells, we can calculate the correlation of log-transformed average counts as a measure of reproducibility. For both RNA and protein data types, replicates wells should have R2 > 0.95.

  • RNADistCorr.*: RNA counts distance correlation matrix.
  • ProteinDistCorr.*: Protein counts distance correlation matrix.