Usage of ANANSE in single-cell multiomic data #199

PauBadiaM · 2022-12-07T13:43:48Z

Hi developers,

Nice method and code/documentation! I was wondering how feasible it is to apply ANANSE to single-cell multiomics data (RNA+ATAC):

For scalability, should I pseudobulk cells by cell type and sample? Do you have any utility functions for this?
For the ATAC-seq part, should I create a BAM file for each cell/sample? Can you provide any pointers on how to do this?
Are there any plans to adapt ANANSE to use common python multiomics data objects such as MuData from the scverse?

Thank you for your time!

simonvh · 2022-12-07T14:04:40Z

It is possible, but not yet completely out-of-the-box. @JGASmits, @Arts-of-coding and/or @siebrenf may be able to help out?

Arts-of-coding · 2022-12-08T14:32:59Z

Hi @PauBadiaM,
We are currently adding support for single-cell (multiomics) data to ANANSE. For Python this is already available: https://github.com/Arts-of-coding/AnanseScanpy. I have a vignette showing how to go from two separate scanpy objects (one containing expression data from scRNA-seq and one containing a cell-by-peak matrix from scATAC-seq) to input files for ANANSNAKE (https://github.com/vanheeringen-lab/anansnake). ANANSNAKE is an automated pipeline that runs ANANSE, for instance on the output files from AnanseScanpy or AnanseSeurat (https://github.com/JGASmits/AnanseSeurat).

If you only want to use Python to generate the cell-by-peak matrix from scATAC-seq data (required for AnanseScanpy), I recommend the "pp.make_peak_matrix" function from snapatac2 (https://pypi.org/project/snapatac2/).
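To make the pseudobulking question above concrete, here is a minimal sketch of what summing cells per cell type amounts to. It is not AnanseScanpy's API: the function name `pseudobulk`, the toy matrix, and the labels are all made up for illustration, and plain Python lists stand in for a real count matrix.

```python
def pseudobulk(counts, labels):
    """Sum per-cell count vectors into one vector per label.

    counts: list of rows, one row of feature counts per cell (hypothetical toy input)
    labels: one group label per cell, e.g. a cell type or "celltype_sample"
    """
    if len(counts) != len(labels):
        raise ValueError("need exactly one label per cell")
    totals = {}
    for row, label in zip(counts, labels):
        # Start a zero vector the first time a label is seen.
        acc = totals.setdefault(label, [0] * len(row))
        for j, value in enumerate(row):
            acc[j] += value
    return totals


# Toy data: 4 cells x 3 features (genes or peaks).
counts = [
    [1, 0, 2],
    [0, 3, 1],
    [4, 0, 0],
    [1, 1, 1],
]
labels = ["T_cell", "B_cell", "T_cell", "B_cell"]

bulk = pseudobulk(counts, labels)
for cell_type, summed in sorted(bulk.items()):
    print(cell_type, summed)
```

With an AnnData object, the rows would come from `adata.X` and the labels from a column of `adata.obs` (the column name depends on your annotation). The same grouping applies to both the expression matrix and the cell-by-peak matrix.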

If you are in no rush, an extended manual on this will be available soon; it will be announced on the AnanseScanpy and AnanseSeurat pages once it is ready.

If "MuData" replaces the currently widely used "anndata" for single-cell objects in Python, support for it will likely be implemented at a later stage.

I hope to have informed you sufficiently!

PauBadiaM · 2022-12-08T14:59:27Z

Hi @simonvh and @Arts-of-coding ,

Thanks for the replies! It is really nice that it can be used with AnnData. I proposed MuData because it's the extension of AnnData to multiple omics in the same object (with nice behaviors like propagating filtering changes across omics), but in the end it is a dictionary of AnnData objects, which could be passed to AnanseScanpy without a problem.
Just a minor comment: why is this being implemented as a separate package? Shouldn't these be utility functions of ANANSE? In any case, once it is finished, it would be good to point users to the AnanseScanpy or AnanseSeurat vignettes in the main ANANSE documentation.

I'm in no rush for now, I'll wait ;)
