Référentiel des outils installés sur la plateforme Migale
diamond (version 0.7.9 - 2015-12-02)
DIAMOND is a new high-throughput program for aligning a file of short reads against a protein reference database such as NR, at 20,000 times the speed of BLASTX, with high sensitivity.DIAMOND is a new alignment tool for aligning short DNA sequencing reads to a protein reference database such as NCBI-NR. On Illumina reads of length 100-150bp, in fast mode, DIAMOND is about 20,000 times faster than BLASTX, while reporting about 80-90% of all matches that BLASTX finds, with an e-value of at most 1e-5. In sensitive mode, DIAMOND ist about 2,500 times faster than BLASTX, finding more than 94% of all matches.
Documentation : https://github.com/bbuchfink/diamond/blob/master/README.rst
Usage : #diamond COMMAND [OPTIONS]
diyabc (version 2.1.0 - 2018-02-02)
a user-friendly approach to Approximate Bayesian Computation for inference on population history using molecular markers
Download : http://www1.montpellier.inra.fr/CBGP/diyabc/
Documentation : http://genome.jouy.inra.fr/doc/genome/Outils-calcul-num%C3%A9riques/diyabc-2.1.0-linux64/diyabc-2.1.0-UserManual.pdf
Usage : #diyabc
DOMIRE (version - - 2014-01-20)
(DOMain Identification from REcurrence) is a server using VAST (Vector Alignment Search Tool, protein 3D structure comparison) to define the domain boundaries in proteins from their 3 D structures (Tai et al, 2010). It provides also a list of structural neighbours.
EDGE-pro (version 1.3.1 - 2013-07-02)
EDGE-pro, Estimated Degree of Gene Expression in PROkaryots is an efficient software system to estimate gene expression levels in prokaryotic genomes from RNA-seq data. EDGE-pro uses Bowtie2 for alignment and then estimates expression directly from the alignment results.EDGE-pro includes routines to assign reads aligning to overlapping gene regions accurately. 15% or more of bacterial genes overlap other genes, making this a significant problem for bacterial RNA-seq, one that is generally ignored by programs designed for eukaryotic RNA-seq experiments.
Documentation : http://genome.jouy.inra.fr/doc/genome/NGS/EDGE-pro-1.3.1/
Remarque : Please reference our paper:T. Magoc, D. Wood, and S.L. Salzberg. EDGE-pro: Estimated Degree of Gene Expression in Prokaryotic Genomes. Evolutionary Bioinformatics vol.9, pp.127-136, 2013.
Usage : #edge.pl <-g genome> <-p ptt> <-r rnt> <-u reads>
eggnog-mapper (version 0.99.3 - 2017-07-13)
eggnog-mapper is a tool for fast functional annotation of novel sequences (genes or proteins) using precomputed eggNOG-based orthology assignments. Obvious examples include the annotation of novel genomes, transcriptomes or even metagenomic gene catalogs. The use of orthology predictions for functional annotation is considered more precise than traditional homology searches, as it avoids transferring annotations from paralogs (duplicate genes with a higher chance of being involved in functional divergence).
Documentation : https://github.com/jhcepas/eggnog-mapper/wiki
Remarque : Merci d'avance, cordialement
Usage : #emapper.py [-h]
emboss (version 126.96.36.199 - 2013-08-19)
Within EMBOSS you will find around 100 programs (applications). These are just some of the areas covered (Sequence alignment, Rapid database searching with sequence patterns,Protein motif identification, including domain analysis, Nucleotide sequence pattern analysis, for example to identify CpG islands or repeats, Codon usage analysis for small genomes, Rapid identification of sequence patterns in large scale sequence sets, Presentation tools for publication...)
Download : http://emboss.sourceforge.net/download/
fasta (version 3.6 - 2014-02-21)
A set of sequence comparison tools (fasta36, ggsearch...) used for alignment and database searching.For example, fasta compares a protein sequence to another protein sequence or to a protein database, or a DNA sequence to another DNA sequence or a DNA library.
Documentation : http://fasta.bioch.virginia.edu/fasta_www2/fasta_intro.shtml
Usage : #fasta36
FastQC (version 0.10.0 - 2012-03-05)
FastQC aims to provide a simple way to do some quality control checks on raw sequence data coming from high throughput sequencing pipelines. It provides a modular set of analyses which you can use to give a quick impression of whether your data has any problems of which you should be aware before doing any further analysis.
Documentation : http://www.bioinformatics.bbsrc.ac.uk/projects/fastqc/Help/
Usage : #fastqc ou fastqc seqfile1 seqfile2 .. seqfileN
fastqp (version 0.1.9.1 - 2017-02-27)
Simple FASTQ, SAM and BAM read quality assessment and plotting using Python.
Download : https://github.com/mdshw5/fastqp
Documentation : https://github.com/mdshw5/fastqp
Usage : #fastqp [-h]
Fastq_Screen (version 0.4.4 - 2014-07-09)
Fastq screen is a simple application which allows you to search a large sequence dataset against a panel of differentdatabases to build up a picture of where the sequences in your data originate. It was built as a QC check forsequencing pipelines but may also have uses in metagenomicsstudies where mixed samples are expected. Although the program wasn t built with any particulartechnology in mind it is probably only really suitable forprocessing short reads due to the use of bowtie/bowtie2 as the searching application.The program generates both text and graphical output totell you what proportion of your library was able to map, either uniquely or in more than one location, against eachof the databases in your search set.
Download : http://www.bioinformatics.babraham.ac.uk/projects/fastq_screen/fastq_screen_v0.4.4.tar.gz
Usage : #fastq_screen [OPTION]... [FastQ FILE]...