-
Notifications
You must be signed in to change notification settings - Fork 16
apply_to _db.py #134
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
In this case, was the sequence in question originally a paralog, and you changed the designation to ortholog? |
Yes, i believe so would swithcing it to its original state fix it? i mean it still made the backup in the database. |
i kept going with the pipeline and at the matrix constructor step : i didn't get the indices.tsv file, the concatenated matrix nor the stats tsv files. ( attached below the error file) |
I'll take a look at it. Sorry I haven't gotten back to you yet about your original error. I've been trying to figure out why this might be happening. |
No worries, and thanks!! not sure if it was cause at least in the original error i changed the PetaCant ( paralog to an ortholog) but the PetaCant isn't part of my input_metadata, that being said the database ( metadataset) im using was given to me by a collaborator ( not sure if that might have anything to do with it but he did run it with his own data and seemed to work. I'll keep tinkering on my end as well and thanks for the help! |
So for this matrix construction error, it looks like prequal failed. You'll need to check |
For the original error, can you confirm that |
This is all i get from the prequal log [arodr412@sol-login02:/scratch/arodr412/hokuscyto_phylo/matrix_constructor_out_Apr.11.2025/logs/prequal]$ more FAM.log
|
Sorry again for just now getting back to you. I need to update my email on here, so that I see responses sooner. I would just remove FAM.fas from the prep_final_dataset_out directory. It looks like all but one sequence is removed after trimming. |
ok so i removed the FAM.fas from the prep final and now i received the following error ( the concatenation wasnt created) t [Thu Apr 17 19:09:13 2025] |
not sure if its an issue or if im missing something ive posted below what im encountering
apply_to_db.py [OPTIONS] -i <input_directory> -fi <fisher_output_dir>
Required arguments:
-i, --input <in_dir> Path to input directory containing parsed tsv labeled {gene_name}_parsed.tsv
-fi, --fisher_dir <fisher_output_dir> Path to initial fisher.py output directory
##new error peta cant
apply_to_db.py -i /scratch/arodr412/hokuscyto_phylo/parsing_tst/forest_out_copy_Apr.07.2025 -fi /scratch/arodr412/hokuscyto_phylo/fisher_out_Mar.28.2025
Traceback (most recent call last):
File "/home/arodr412/mambaforge/envs/fisher/bin/apply_to_db.py", line 379, in
main()
File "/home/arodr412/mambaforge/envs/fisher/bin/apply_to_db.py", line 325, in main
new_database(table)
File "/home/arodr412/mambaforge/envs/fisher/bin/apply_to_db.py", line 250, in new_database
orthologs, paralogs = parse_table(table)
File "/home/arodr412/mambaforge/envs/fisher/bin/apply_to_db.py", line 200, in parse_table
record = paralogs[name]
KeyError: 'PetaCant..p84149'
in the forest_out_copy_Apr.07.2025 are all the parsed.tsv aswell as the non parsed ones
the fisher out mar folder is the original fisher.py folder. i keep getting this issue
not sure whats up.
The text was updated successfully, but these errors were encountered: