Skip to content
shenjean edited this page Jul 16, 2020 · 50 revisions

Welcome to the getMito wiki! This wiki describes the workflow used to create the reference database file(s) used in getMito. Click on each section for details.

getMito

Download reference database file here: mitofish.all.Jul2020.tsv (639,987 records; Jul 2020 update)

Usage: getMito.py [OPTIONS]

From a user-provided list of fish taxonomic names, getMito extracts available mitochondrial information and
FASTA file (optional) at various taxonomic levels. 
The reference database is prepared from the MitoFish database and NCBI (Jul 2020 update).

Options:
  -i, --input_file TEXT           Input query file (e.g. input.txt) [required]
  -o, --output_prefix TEXT        Output prefix (e.g. OUT)  [required]
  -d, --database_file TEXT        Database file (e.g. mitofish.all.Jul2020.tsv) [required]
  -l, --tax_level [1|2|3|4|5|6|7] The taxonomic level of the search (e.g. 7 for speecies search, 6 for genus search)
  --fasta / --no-fasta            Generate FASTA file containing sequences of all matching hits (default=FALSE)
  --help                          Show this message and exit.

Dependencies: Python3 and the conda- and pip-installable click package

Input file example:

Abraliopsis pfefferi
Ahliesaurus berryi
Alepisaurus ferox
Anotopterus pharao

Screen output example:

=== Searching query #1: <Abraliopsis pfefferi> ===
=== Searching query #2: <Ahliesaurus berryi> ===
Search string:Notosudidae       Taxonomic level:L5      Hits:55
=== Searching query #3: <Alepisaurus ferox> ===
Search string:Alepisauridae     Taxonomic level:L5      Hits:79
=== Searching query #4: <Anotopterus pharao> ===
DUPLICATE WARNING: Query has already been processed!

Output file example (e.g. OUT_L5_hits.tsv):

Query   Accession       Gene definition txid    Superkingdom    Phylum  Class   Order   Family  Genus   Species Sequence
Notosudidae     AP004201        Scopelosaurus hoedti mitochondrial DNA, almost complete genome  172128  Eukaryota       Chordata        Actinopteri     Au
lopiformes      Notosudidae     Scopelosaurus   Scopelosaurus hoedti    GCTAACGTAGTTTACTAAAAATATGACTCTGAAGAAGTTAAGACAGACCCTGAGAAGGCCTCGTAAGCACAAAAGCTTGGTC
CTGGCTTTACTGTCATCTCAAACCGAGCTTACACATGCAAGTCTCCGCACCCCTGTGAGGATGCCCTCCACCCTCCTTTCCGGAAACGAGGAGCCGGTATCAGGCACGCCTATCAAGGCAGCCCAAAACACCTTGCTCAGCCACACCCCCAAGG
GATTTCAGCAGTGATAGACATTAAGCAATAAGTGAAAACTTGACTTAGTTAAGGTTTAACAGGGCCGGTCAACCTCGTGCCAGCCGCCGCGGT