System Requirements

PlasmidExtractor has been developed and tested on Linux-based systems (more specifically Ubuntu and Mint). Any Linux/Unix-based system (including MacOS) should be able to run the pipeline, while users of Windows-based systems will have to use Docker in order to have the pipeline run.

In terms of system specs the more CPUs your system has, the better, as almost every step of the pipeline is multi-threaded. RAM requirements can be somewhat heavy, with some parts of the pipeline more than 20 GB due to the size of the database that PlasmidExtractor uses. 32 GB or RAM is recommended. If your machine does not meet these requirements, PlasmidExtractor can be run in a low-memory mode, where memory usage should peak at ~7GB, enabling the pipeline to run on a much wider range of machines.

Databases

The databases necessary for PlasmidExtractor are hosted on FigShare, and need to be downloaded no matter what installation method you choose.

To download them, use the following command in the folder you want to download the databases to:

wget https://ndownloader.figshare.com/files/9827323 && tar xf 9827323 && rm 9827323

This will create a folder in you current working directory called databases which has all that you need to run PlasmidExtractor.

Installation Using Conda

Installing using conda is recommended, as it takes care of installing external program depedencies as well as the PlasmidExtractor script itself.

If you don't already have conda installed, you can find instructions on the conda ReadTheDocs page.

With conda installed, all you'll need to do is download the PlasmidExtractor environment, which has been uploaded to the Anaconda cloud. To do this, use this command:

conda env create lowandrew/plasmidextractor

You should then be able to enter the conda environment:

source activate plasmidextractor

Within the environment, typing PlasmidExtractor.py will run the PlasmidExtractor pipeline - see the Usage section for instructions on how to run PlasmidExtractor.

Installing Using Pip

Executable

PlasmidExtractor can also be installed using pip. Use of a virtual environment for PlasmidExtractor is highly recommended. To create a virtualenv:

With this done, you'll need to make sure that any necessary dependencies are installed.

Mandatory Dependencies

With pip installation done, you will need to make sure that you have all of the depedencies for PlasmidExtractor installed and present on your $PATH. The dependencies for PlasmidExtractor are:

For instructions on how to add programs to your $PATH, click here.

If all of your dependencies are properly installed, you should now be able to run PlasmidExtractor. If installation of a dependency has not worked, you will get an error message printed to screen telling you which depedencies are not installed. Note that the check only looks for an executable and does not check the version, so you may need to check that you have the correct versions installed manually.

Optional Dependencies

If you have Pilon installed, PlasmidExtractor will try to use it in order to polish the consensus sequence created. In order to have this work, you need to have an executable called pilon accessible somewhere on your $PATH.

To do this, download pilon from the above link, use chmod to make the jar file executable (chmod 755 pilon-1.22.jar), and then rename the executable to pilon and add it to your $PATH.