Adding functions to drop a percentage of counts and plot FMS #24

nbedanova · 2025-08-06T21:24:57Z

Added two functions:
downsample_counts_multinomial produces a dataset in which each cell's counts are reduced by a specified percentage. It uses multinomial sampling to construct the downsampled dataset.
fms_percent_drop_counts_multinomial downsamples the data, normalizes it, and then factors it. The resulting factorization is compared to the baseline pf2 factorization of the full-count data. The function assumes that low-expressing genes have already been filtered out, and it sets geneThreshold to 0 when calling prepare_dataset so that the full-count and downsampled datasets contain the same number of genes.
The figure CountFMS plots this comparison using our cytokine dataset.
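
For readers unfamiliar with multinomial downsampling, here is a minimal single-cell sketch of the idea, not the code in this PR. The helper name `downsample_cell_multinomial` and the `drop_fraction` argument (a fraction between 0 and 1) are assumptions made for illustration; the actual function may take a percentage and operate on the whole dataset at once.

```python
import numpy as np


def downsample_cell_multinomial(counts, drop_fraction, rng):
    """Hypothetical helper: redraw one cell's counts at a reduced total depth."""
    counts = np.asarray(counts)
    total = int(counts.sum())
    if total == 0:
        # Nothing to sample from an empty cell.
        return counts.copy()

    # Number of counts to keep after dropping the requested fraction.
    n_keep = int(round(total * (1.0 - drop_fraction)))

    # Each gene is drawn in proportion to its share of the original counts.
    probs = counts.astype(np.float64)
    probs /= probs.sum()

    return rng.multinomial(n_keep, probs).astype(counts.dtype)


rng = np.random.default_rng(0)
cell = np.array([10, 0, 5, 85])
sampled = downsample_cell_multinomial(cell, 0.5, rng)
```

The downsampled cell always sums to the target depth, and genes with zero counts stay at zero because their sampling probability is zero. Since multinomial draws are made with replacement, an individual gene can occasionally end up with more counts than it started with.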

Copilot

Pull Request Overview

This PR adds functionality to analyze factorization performance under reduced count conditions by implementing multinomial downsampling and FMS comparison metrics.

Implements multinomial count downsampling to simulate sequencing depth variations
Creates FMS comparison framework between full and downsampled datasets
Adds visualization capabilities for count drop analysis

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 4 comments.

File	Description
pf2rnaseq/factorization.py	Adds core functions for multinomial downsampling and FMS analysis with count reduction
pf2rnaseq/figures/commonFuncs/plotGeneral.py	Implements plotting function for FMS vs count drop percentage visualization
pf2rnaseq/figures/figureCountFMS.py	Creates figure demonstrating FMS analysis on cytokine dataset with various count drop percentages

aarmey

Good stuff. Just a couple of minor comments.

aarmey · 2025-08-07T03:33:40Z

pf2rnaseq/factorization.py

+    return sampled_data
+
+
+def fms_percent_drop_counts_multinomial(


I'm not generally a fan of having functions like this that combine functionality and loops. Make a function that does the processing for one situation, then you can put the loops elsewhere.
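
One way to read this suggestion is sketched below, under stated assumptions. The names `score_one_condition` and `sweep_conditions`, and the `downsample`, `factorize`, and `compare` callables, are hypothetical and not part of pf2rnaseq; the toy stand-ins at the bottom exist only so that the sketch runs.

```python
import numpy as np


def score_one_condition(data, drop_fraction, baseline, downsample, factorize, compare):
    """Handle exactly one condition: downsample, factor, compare to the baseline."""
    reduced = downsample(data, drop_fraction)
    return compare(baseline, factorize(reduced))


def sweep_conditions(data, drop_fractions, runs, downsample, factorize, compare):
    """The loops live here (for example in the figure script), not in the core function."""
    baseline = factorize(data)  # the full-count factorization is computed once
    results = np.zeros((runs, len(drop_fractions)))
    for j in range(runs):
        for i, frac in enumerate(drop_fractions):
            results[j, i] = score_one_condition(
                data, frac, baseline, downsample, factorize, compare
            )
    return results


# Toy stand-ins (assumptions for illustration only, not the real pipeline).
def toy_downsample(x, frac):
    return x * (1.0 - frac)


def toy_factorize(x):
    return x.mean(axis=0)


def toy_compare(a, b):
    # Cosine similarity as a placeholder for a real factor match score.
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


data = np.arange(1, 13, dtype=float).reshape(3, 4)
results = sweep_conditions(
    data, [0.1, 0.5, 0.9], 2, toy_downsample, toy_factorize, toy_compare
)
```

With this split, the single-condition function can be tested or called directly for one downsampling level, and the number of runs or levels becomes a decision made by the caller.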

aarmey · 2025-08-07T03:34:35Z

pf2rnaseq/factorization.py

+    results = np.zeros((runs, len(percentList)))
+
+    # Main loop
+    for j in range(runs):


I'm not sure that you need to sweep across lots of runs. The result should be obvious at 0.5 downsampling, or there isn't a meaningful difference.

aarmey · 2025-08-07T03:49:19Z

pf2rnaseq/factorization.py

+        data[start_idx:end_idx] = new_counts.astype(cell_data.dtype)
+
+    # Create new sparse matrix
+    sampled_csr = sp.csr_matrix((data, indices, indptr), shape=original_csr.shape)


Ohh preserving the sparsity is clever.
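
For context, the sketch below shows why the sparsity pattern can be reused. It is written under assumptions and is not the PR's implementation: the function name `downsample_csr_multinomial`, the `drop_fraction` argument, and the `seed` argument are hypothetical. Multinomial sampling assigns counts only to entries that were already nonzero, so `indices` and `indptr` carry over unchanged and only the `data` array is rewritten.

```python
import numpy as np
import scipy.sparse as sp


def downsample_csr_multinomial(original_csr, drop_fraction, seed=0):
    """Hypothetical sketch: downsample each row (cell) while reusing the CSR structure."""
    rng = np.random.default_rng(seed)
    original_csr = sp.csr_matrix(original_csr)
    indptr = original_csr.indptr
    data = original_csr.data.copy()

    for row in range(original_csr.shape[0]):
        start_idx, end_idx = indptr[row], indptr[row + 1]
        cell_data = original_csr.data[start_idx:end_idx]
        total = float(cell_data.sum())
        if total == 0:
            continue  # empty cell, nothing to resample

        n_keep = int(round(total * (1.0 - drop_fraction)))
        probs = cell_data.astype(np.float64)
        probs /= probs.sum()

        new_counts = rng.multinomial(n_keep, probs)
        data[start_idx:end_idx] = new_counts.astype(cell_data.dtype)

    # Same indices and indptr as the input; some stored entries may now be 0.
    return sp.csr_matrix(
        (data, original_csr.indices.copy(), indptr.copy()),
        shape=original_csr.shape,
    )


matrix = sp.csr_matrix(np.array([[10, 0, 5, 85], [0, 0, 0, 0], [4, 4, 0, 2]]))
sampled = downsample_csr_multinomial(matrix, 0.5)
```

Because the dense matrix is never materialized, memory use stays proportional to the number of nonzero entries. Stored entries that are sampled down to zero remain as explicit zeros; calling `eliminate_zeros()` on the result would drop them if a compact structure is preferred.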

Adding functions to drop a percentage of counts and plot FMS

25094c0

nbedanova requested a review from Copilot August 6, 2025 21:24

Copilot AI reviewed Aug 6, 2025

nbedanova requested a review from aarmey August 6, 2025 21:27

aarmey requested changes Aug 7, 2025

Small fixes

2301740
