Fix callVariant with fusion in the same transcript #346
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
As described in #342 that Arriba calls fusion in the same gene, which causes a problem of the graph data structure because we currently use transcript/gene ID as the subgraph ID to differentiate between the main graph and subgraph being inserted or fused. So in the PR, I redesigned data structure of how the relationship between subgraphs are handled. I use a tree-like structure (
SubgraphTree
), which is just simply adict
, that the keys are the subgraph ID, and the values areSubgraphLocation
which has attributes oflevel
,parent_id
, and thelocation
at the parent graph. I also switched to use a UUID for any subgraph ID just to avoid any collision.Closes #341
Closes #342
Closes #343