You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Oct 18, 2024. It is now read-only.
I hope you are having a great day! I was testing out pycoQC and ran into an issue after generating a summaries file with Fast5_to_seq_summary.
Describe the bug
The Fast5_to_seq_summary output summaries file was passed to pycoQC and produced the following error:
Traceback (most recent call last):
File "/usr/local/bin/pycoQC", line 8, in <module>
sys.exit(main_pycoQC())
File "/usr/local/lib/python3.10/dist-packages/pycoQC/__main__.py", line 115, in main_pycoQC
pycoQC (
File "/usr/local/lib/python3.10/dist-packages/pycoQC/pycoQC.py", line 120, in pycoQC
parser = pycoQC_parse (
File "/usr/local/lib/python3.10/dist-packages/pycoQC/pycoQC_parse.py", line 96, in __init__
summary_reads_df = self._parse_summary()
File "/usr/local/lib/python3.10/dist-packages/pycoQC/pycoQC_parse.py", line 136, in _parse_summary
df = self._select_df_columns (
File "/usr/local/lib/python3.10/dist-packages/pycoQC/pycoQC_parse.py", line 397, in _select_df_columns
raise pycoQCError("Column {} not found in the provided sequence_summary file".format(col))
pycoQC.common.pycoQCError: Column read_len not found in the provided sequence_summary file
To Reproduce
Steps to reproduce the behavior:
Fast5_to_seq_summary command to generate the summary file:
I was expecting the summaries file generated by Fast5_to_seq_summary to be compatible with pycoQC. I also tried re-running the Fast5_to_seq_summary with the following fields option (to include everything):
however, that did not seem to help, and I am getting the same error message.
I can see here, in your parser, that you are looking for these columns to rename and then check to see if they exist.
however, if I try to pass sequence_length_2 or sequence_length to the --fields option of Fast5_to_seq_summary, it errors out:
Check input data and options
Traceback (most recent call last):
File "/usr/local/bin/Fast5_to_seq_summary", line 8, in <module>
sys.exit(main_Fast5_to_seq_summary())
File "/usr/local/lib/python3.10/dist-packages/pycoQC/__main__.py", line 168, in main_Fast5_to_seq_summary
Fast5_to_seq_summary (
File "/usr/local/lib/python3.10/dist-packages/pycoQC/Fast5_to_seq_summary.py", line 119, in __init__
raise pycoQCError ("Field {} is not valid, please choose among the following valid fields: {}".format(field, ",".join(self.attrs_grp_dict.keys())))
pycoQC.common.pycoQCError: Field sequence_length_2d is not valid, please choose among the following valid fields: mean_qscore_template,sequence_length_template,called_events,skip_prob,stay_prob,step_prob,strand_score,read_id,start_time,duration,start_mux,read_number,channel,channel_digitisation,channel_offset,channel_range,channel_sampling_rate,run_id,sample_id,device_id,protocol_run_id,flow_cell_id,calibration_strand_genome_template,calibration_strand_end,calibration_strand_start,calibration_strand_identity,barcode_arrangement,barcode_full_arrangement,barcode_score
Hello @a-slide,
I hope you are having a great day! I was testing out pycoQC and ran into an issue after generating a summaries file with
Fast5_to_seq_summary
.Describe the bug
The
Fast5_to_seq_summary
output summaries file was passed topycoQC
and produced the following error:To Reproduce
Steps to reproduce the behavior:
Fast5_to_seq_summary
command to generate the summary file:Here are the first few lines of the output
summary.tsv
file:As you can see here, there is no column containing sequence/read length information.
pycoQC
command to generate the report:Expected behavior
I was expecting the summaries file generated by
Fast5_to_seq_summary
to be compatible withpycoQC
. I also tried re-running theFast5_to_seq_summary
with the following fields option (to include everything):however, that did not seem to help, and I am getting the same error message.
I can see here, in your parser, that you are looking for these columns to rename and then check to see if they exist.
however, if I try to pass
sequence_length_2
orsequence_length
to the--fields
option ofFast5_to_seq_summary
, it errors out:Desktop:
If you need anything else, please let me know.
Best Regards,
@skchronicles
The text was updated successfully, but these errors were encountered: