Would this also be applicable to bulk data? #48

Open
Thapeachydude opened this issue Feb 9, 2024 · 2 comments

Comments

@Thapeachydude

Hi,

Great package. I was wondering whether this form of batch integration is also applicable to bulk RNA-seq data. The data is certainly less sparse, but would that be an issue?

Happy to hear any feedback!
Best,
M

@LTLA
Owner

LTLA commented Feb 10, 2024

Seems pretty reasonable to me. The sparsity wouldn't even matter in fastMNN, which typically operates in the PC space anyway. The only thing to keep in mind is that bulk datasets generally have far fewer samples than single-cell datasets have cells, so the default choice of k (the number of neighbors used to find MNNs) may be too large and may need to be reduced.
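To illustrate the point about k, here is a minimal sketch of mutual nearest neighbor detection between two small batches. It is not the fastMNN implementation: `mnn_pairs`, the toy data, and the values `k=20` and `k=2` are all assumptions made up for this illustration, and the two-dimensional coordinates merely stand in for a PC space.

```python
import numpy as np

def mnn_pairs(a, b, k):
    """Return (i, j) pairs where a[i] and b[j] are each among the other's k nearest neighbours."""
    # Squared Euclidean distance between every sample in a and every sample in b.
    d = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=2)
    # k cannot exceed the number of samples available in the other batch.
    nn_in_b = np.argsort(d, axis=1)[:, :min(k, len(b))]    # neighbours of each a[i] in b
    nn_in_a = np.argsort(d.T, axis=1)[:, :min(k, len(a))]  # neighbours of each b[j] in a
    return [(i, int(j)) for i in range(len(a)) for j in nn_in_b[i] if i in nn_in_a[j]]

# Hypothetical toy data: two batches of 6 samples in a 2-dimensional "PC space",
# each with two groups of 3 samples; batch b carries a constant shift.
rng = np.random.default_rng(0)
centers = np.repeat([[0.0, 0.0], [5.0, 5.0]], 3, axis=0)
batch_a = centers + rng.normal(scale=0.3, size=centers.shape)
batch_b = centers + rng.normal(scale=0.3, size=centers.shape) + np.array([1.0, 0.0])

pairs_large_k = mnn_pairs(batch_a, batch_b, k=20)  # k larger than either batch
pairs_small_k = mnn_pairs(batch_a, batch_b, k=2)   # k below the group size
print(len(pairs_large_k), len(pairs_small_k))
```

With k at or above the batch size, every sample is trivially a neighbor of every other, so all 36 cross-batch pairs qualify as MNNs and the pairing carries no information about which samples actually correspond. A k below the size of the smallest expected group keeps the pairs selective.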

I suppose the other reason that we don't use this class of batch correction methods for bulk data is that the output is not fit for differential expression (DE) analyses. (In fact, you could say that about any batch correction method.) So it's fine for exploratory analysis, a bit of clustering, visualization, etc., but if you plan on doing some DE, you'd want to go back to the raw counts.
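One common way to act on this, which the comment above does not spell out and is therefore an assumption here, is to leave the expression values uncorrected and include the batch as a covariate in the model. The sketch below shows only the design-matrix idea on noise-free toy log-expression values for a single gene; real DE tools model the raw counts directly, and all numbers and variable names are hypothetical.

```python
import numpy as np

# Hypothetical toy log-expression values for one gene in 8 samples (noise-free for clarity).
condition = np.array([0, 0, 1, 1, 0, 0, 1, 1], dtype=float)  # 0 = control, 1 = treated
batch = np.array([0, 0, 0, 0, 1, 1, 1, 1], dtype=float)      # 0 = first batch, 1 = second batch
y = 2.0 + 1.5 * condition + 3.0 * batch

# Design matrix with columns: intercept, condition effect, batch effect.
design = np.column_stack([np.ones_like(y), condition, batch])

# Ordinary least squares fit; the batch column absorbs the batch shift,
# so the condition coefficient is estimated from uncorrected values.
coef, *_ = np.linalg.lstsq(design, y, rcond=None)
intercept, condition_effect, batch_effect = coef
print(condition_effect, batch_effect)
```

This only works when condition and batch are not confounded, that is, when each batch contains samples from more than one condition.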

@Thapeachydude
Author

Hi, thanks a lot for the quick reply and the feedback!
