Skip to content

Add script to reformat NBAtlas object #1165

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
sjspielman opened this issue Jun 9, 2025 · 3 comments · May be fixed by #1173
Open

Add script to reformat NBAtlas object #1165

sjspielman opened this issue Jun 9, 2025 · 3 comments · May be fixed by #1173
Assignees
Labels

Comments

@sjspielman
Copy link
Member

If you are filing this issue based on a specific GitHub Discussion, please link to the relevant Discussion.

Part of #1151

Describe the goals of the changes to the analysis module.

The NBAtlas reference object is disseminated as a Seurat object. We will need to use this object in different formats: an SCE object for use with SingleR, and an AnnData object for use with scArches/scANVI.

Since we'll need both these formats, we might as well tackle it in a single conversion script that reads in the Seurat object and export SCE and AnnData versions to scratch. We can then use this script to convert both the full and subsetted versions of NBAtlas; the specific one we'll use will depend on some runtime investigation (issue forthcoming), but we can still convert them to be prepared.

What will your pull request contain?

Script to convert the NBAtlas object from Seurat to both SCE and AnnData and associated docs.

Will you require additional software beyond what is already in the analysis module?

Dependencies will get added to both renv and conda

Will you require different computational resources beyond what the analysis module already uses?

This can be run on a laptop, but will need quite a few GB of memory. I will indicate the usage it ends up needing in docs as part of the PR.

If known, when do you expect to file the pull request?

No response

@sjspielman sjspielman self-assigned this Jun 9, 2025
@sjspielman sjspielman changed the title Add script to reformat NBAtlast object Add script to reformat NBAtlas object Jun 9, 2025
@sjspielman
Copy link
Member Author

We have received additional metadata indicating which cells are tumor:

  • The full atlas has 42413 tumor cells
  • The atlas subset has 17702 tumor cells

These are labeled as Neuroendocrine cells in the NBAtlas object. When exporting other versions of this data, we should update the cell type for those cells to be tumor; note that those were all determined to be tumor in the NBAtlas paper as Neuroendocrine cells with tumor calls from copyKAT.

@allyhawkins
Copy link
Member

One thought I had is that maybe we want to retain the fact that these cells were labeled initially as neuroendocrine and change this to tumor-neuroendocrine rather than just tumor?

@sjspielman
Copy link
Member Author

One thought I had is that maybe we want to retain the fact that these cells were labeled initially as neuroendocrine and change this to tumor-neuroendocrine rather than just tumor?

Yeah, this seems reasonable for retaining that original information.

@sjspielman sjspielman linked a pull request Jun 13, 2025 that will close this issue
8 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants