You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
XspecT is a Python-based tool to taxonomically classify sequence-reads (or assembled genomes) on the species and/or sub-type level using [Bloom Filters](https://en.wikipedia.org/wiki/Bloom_filter) and a [Support Vector Machine](https://en.wikipedia.org/wiki/Support-vector_machine). It also identifies existing [blaOxa-genes](https://en.wikipedia.org/wiki/Beta-lactamase#OXA_beta-lactamases_(class_D)) and provides a list of relevant research papers for further information.
8
+
XspecT is a Python-based tool to taxonomically classify sequence-reads (or assembled genomes) on the species and/or sub-type level using [Bloom Filters] and a [Support Vector Machine]. It also identifies existing [blaOxa-genes] and provides a list of relevant research papers for further information.
5
9
<br/><br/>
6
10
7
11
XspecT utilizes the uniqueness of kmers and compares extracted kmers from the input-data to a reference database. Bloom Filter ensure a fast lookup in this process. For a final prediction the results are classified using a Support Vector Machine.
@@ -11,6 +15,10 @@ Local extensions of the reference database are supported.
11
15
<br/>
12
16
13
17
The tool is available as a web-based application and a smaller command line interface.
@@ -19,11 +27,7 @@ To install Xspect, please download the lastest 64 bit Python version and install
19
27
```
20
28
pip install xspect
21
29
```
22
-
If you would like to train filters yourself, you need to install Jellyfish, which is used to count distinct k-meres in the assemblies. It can be installed using bioconda:
23
-
```
24
-
conda install -c bioconda jellyfish
25
-
```
26
-
On Apple Silicon, it is possible that this command installs an incorrect Jellyfish package. Please refer to the official [Jellyfish project](https://github.com/gmarcais/Jellyfish) for installation guidance.
30
+
Please note that Apple Silicon is currently not supported.
Run the following command lines in a console, a browser window will open automatically after the application is fully loaded.
44
+
To run the web app, install and run [XspecT Web](https://github.com/aromberg/xspect-web). Additionally, run XspecT in API mode:
41
45
```
42
-
xspect web
46
+
xspect api
43
47
```
44
48
45
49
### How to use the XspecT command line interface
46
50
Run xspect with the configuration you want to run it with as arguments.
47
51
```
48
52
xspect classify your-genus path/to/your/input-set
49
53
```
50
-
For further instructions on how to use the command line interface, execute:
54
+
For further instructions on how to use the command line interface, please refer to the [documentation] or execute:
51
55
```
52
56
xspect --help
53
57
```
54
-
<!-- end quickstart -->
55
-
56
-
## Input Data
57
-
XspecT is able to use either raw sequence-reads (FASTQ-format .fq/.fastq) or already assembled genomes (FASTA-format .fasta/.fna). Using sequence-reads saves up the assembly process but high-quality reads with a low error-rate are needed (e.g. Illumina-reads).
58
-
59
-
The amount of reads that will be used has to be set by the user when using sequence-reads. The minimum amount is 5000 reads for species classification and 500 reads for sub-type classification. The maximum number of reads is limited by the browser and is usually around ~8 million reads. Using more reads will lead to a increased runtime (xsec./1mio reads).
0 commit comments