Skip to content

Model fitting issue with HiFi data #154

@hae3

Description

@hae3

Thank you so much for providing such a great tool!
I am attempting de novo genome assembly using PacBio HiFi data for a diploid plant species, and I used GenomeScope 2.0 for genome size estimation.
From what I can see, the first peak appears to be between 70-80x coverage, and the second peak (I'm not sure if we can call it a peak as it looks quite dispersed) seems to be around 140x. I tried setting the average k-mer coverage for polyploid genome to 70, but didn't see significant differences in the results.
I feel like the model seems to be struggling to fit the observed distribution. Is this affecting the reliability of genome size and heterozygosity estimates? Could high coverage be causing these fitting issues?
Do you have any suggestions or advice I could follow?
Thank you for any guidance!

Image

Image

http://genomescope.org/genomescope2.0/analysis.php?code=Ock3bhuj3IBGvHk0mDc6

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions