Replies: 4 comments 1 reply
-
Hi, I think you should look at the classification term in Spectre article. The relevant paragraphs are "3.6 Automated cellular classification and label transfer between aligned datasets" and " 2.11 Classification". The mapping is based on a knn neighbors algorithm, which is different from FlowSOM. There is also an interesting Suppl Fig about selecting k. I cite
The authors/maintainers will correct my answer. |
Beta Was this translation helpful? Give feedback.
-
Hi @Alby86 , thanks for reaching out! In theory the I can send you some notes on how to do this next week. |
Beta Was this translation helpful? Give feedback.
-
Hi. I take the opportunity of your question to get opinions about the sampling ratio. I don't know your experiment, but I feel that picking 1% out of 52M cells is too low although you pointed that you got the cell populations you expected. May be you are at the first level of the analysis and you only picked up the markers that separate the major populations. In that case, I understand 1% as sampling ratio. If this is not the case, I feel that 1% is too low. In my opinion, there should be at least the equivalent of one complete sample of each group in the concatenation. This would allow to exploit the depth aka wealth of a typical sample. So let's say that each sample is about 1M cells, so there are about 50 samples. Let's say there are at least 10 samples per group, so there are about 4 groups. So the concatenation should be around 4M cells, 1M cells sampled in each group. Or in another way, a sampling ratio of 1/50*4 = 8%. What's your opinion about the "1 complete representative sample per concatenation" design rule? |
Beta Was this translation helpful? Give feedback.
-
Hi Thomas!Thank you for your reply. I also think it should be a memory issue, but I am already taking a whole cluster node, so I don’t have another option.If you could send me some notes, that would be amazing. When is version 2 coming out?
Thanks a lot!
|
Beta Was this translation helpful? Give feedback.
-
Hello,
Thank you so much for this wonderful package!
I have been using the function 'Spectre::read.files' to read a large amount of .fcs files, but it appears that it can't do that: R crashes or the function gives up. I am running it on a cluster, so I am not sure whether the issue is just space. Any idea?
To go around this problem, I performed a sub-sample of the data to 1% of the total (around 52 million cells), and then everything works fine: I identified some clusters that makes sense and performed proper downstream statistics. Now, though, I would like to label the remaining data (I don't expect to see new clusters), and perform statistics over the whole dataset. How could I use the same FlowSOM model to predict the clusters? I know I can do that with flowSOM, but I can't see any documentation for Spectre. Is there such an option?
Thank you in advance!
Beta Was this translation helpful? Give feedback.
All reactions