Motif search

Version	Updated	bioprospector
2004	2014-01-01	bioprospector	Download	Doc

Program that searches DNA sequences for exceptional motifs made of one or two blocks (Gibbs sampler). Background sequences can be supplied. Input sequences must be shorter than 32765 nt, in FASTA format with the sequence on a single line (and a header of the form >sequence1 gene_name). It can search specifically for palindromes.
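The input constraints above (sequence on a single line, fewer than 32765 nt) can be checked before running the tool. The following is a minimal sketch, not part of BioProspector: the function names `read_fasta` and `to_single_line_fasta` are hypothetical helpers introduced here for illustration.

```python
from typing import Iterator, Tuple

MAX_LEN = 32765  # limit quoted in the description; sequences must be shorter


def read_fasta(text: str) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs, joining wrapped sequence lines."""
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def to_single_line_fasta(text: str, max_len: int = MAX_LEN) -> str:
    """Rewrite FASTA text with one sequence line per record; reject long sequences."""
    records = []
    for header, seq in read_fasta(text):
        if len(seq) >= max_len:
            raise ValueError(
                f"{header!r} has {len(seq)} nt; must be shorter than {max_len}"
            )
        records.append(f">{header}\n{seq}")
    return "\n".join(records) + "\n"


if __name__ == "__main__":
    wrapped = ">sequence1 geneA\nACGT\nACGT\n>sequence2 geneB\nTTGA\n"
    print(to_single_line_fasta(wrapped), end="")
```

The length check is deliberately strict (`>=`), because the description says "shorter than" the limit.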

Note

Run Unix # BioProspector

Run Web #

Version	Updated	censor
4.2.10	2008-07-02	censor	Download	Doc

CENSOR is a software tool which screens query sequences against a reference collection of repeats and "censors" (masks) homologous portions with masking symbols, as well as generating a report classifying all found repeats.
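To illustrate what "masking" means here, the sketch below replaces given regions of a sequence with a masking symbol. It is not CENSOR's implementation: the function name `mask_intervals`, the 0-based half-open interval convention, and the default symbol "X" are all assumptions made for this example, and the intervals would in practice come from a homology search against the repeat collection.

```python
from typing import Iterable, Tuple


def mask_intervals(
    seq: str, intervals: Iterable[Tuple[int, int]], symbol: str = "X"
) -> str:
    """Replace each 0-based, half-open [start, end) interval with the symbol."""
    masked = list(seq)
    for start, end in intervals:
        # Clamp to the sequence bounds so out-of-range intervals are harmless.
        for i in range(max(start, 0), min(end, len(seq))):
            masked[i] = symbol
    return "".join(masked)


if __name__ == "__main__":
    print(mask_intervals("ACGTACGTAC", [(2, 5), (8, 12)]))  # ACXXXCGTXX
```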

Note

Run Unix # censor

Run Web #

Version	Updated	ELPH
1.0.1	2012-10-01	ELPH	Download	Doc

ELPH is a general-purpose Gibbs sampler for finding motifs in a set of DNA or protein sequences. The program takes as input a set containing anywhere from a few dozen to thousands of sequences, and searches through them for the most common motif, assuming that each sequence contains one copy of the motif. We have used ELPH to find patterns such as ribosome binding sites (RBSs) and exon splicing enhancers (ESEs).

Note

Run Unix # elph [options] OR elph [-t ]

Run Web #

Version	Updated	GALF_P
-	2010-03-18	GALF_P	Download	Doc

GALF-P is a novel framework for TFBS identification (motif discovery) in DNA sequences. It consists of Genetic Algorithm with Local Filtering (GALF) and the post-processing procedure based on adaptive adding and removing. GALF-P achieves both effectiveness and efficiency, and provides reliable performance over the other state-of-art GA based approaches. The post-processing procedure is designed for zero or more TFBSs in each sequence.

Note

Run Unix # GALF_P.o

Run Web #

Version	Updated	gimsan
20100830	2011-01-10	gimsan	Download	Doc

GIMSAN (GIbbsMarkov with Significance ANalysis): a novel tool for de novo motif finding. GIMSAN combines GibbsMarkov, our variant of the Gibbs Sampler, described here for the first time, with our recently introduced significance analysis.

Note Please cite: Patrick Ng, Uri Keich. GIMSAN: A Gibbs motif finder with significance analysis. Bioinformatics, 24 (19): 2256-2257, 2008.

Run Unix # gimsan_submit_job.pl

Run Web #

Version	Updated	grepseq
1.2.2	2004-01-21	grepseq	Download	Doc

The `grepseq' program takes a keyword which can contain ambiguous characters and character classes (also called a fixed-width motif) and then searches files and databases for exact or approximate matches to that keyword. The program produces one of two kinds of output, either a list of the matching sequences with the places where the keyword matched, or the complete entries of sequences containing matches, where each entry is annotated with the places where the matches occur.
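The idea of a fixed-width motif with ambiguous characters can be sketched as follows. This is not grepseq's own pattern syntax, which is not documented on this page: the example assumes the standard IUPAC nucleotide ambiguity codes, handles exact matches only, and uses hypothetical function names (`motif_to_regex`, `find_matches`).

```python
import re
from typing import List, Tuple

# Standard IUPAC nucleotide ambiguity codes, expressed as regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def motif_to_regex(motif: str) -> re.Pattern:
    """Compile a fixed-width IUPAC motif; the lookahead reports overlapping hits."""
    body = "".join(IUPAC[c] for c in motif.upper())
    return re.compile(f"(?=({body}))")


def find_matches(motif: str, seq: str) -> List[Tuple[int, str]]:
    """Return (1-based position, matched text) for every occurrence."""
    pattern = motif_to_regex(motif)
    return [(m.start() + 1, m.group(1)) for m in pattern.finditer(seq.upper())]


if __name__ == "__main__":
    for pos, hit in find_matches("TATAWA", "GGTATAAATATATAG"):
        print(pos, hit)
```

Wrapping the motif in a lookahead is a design choice: a plain pattern would skip occurrences that overlap an earlier match.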

Note Part of seqio

Run Unix # grepseq

Run Web #

Version	Updated	hmmer
3.1	2013-08-23	hmmer	Download	Doc

HMMER: profile HMMs for protein sequence analysis. Profile hidden Markov models (profile HMMs) can be used to do sensitive database searching using statistical descriptions of a sequence family's consensus.

Note

Run Unix #

Run Web #

Version	Updated	ncoils
		ncoils	Download	Doc

Note

Run Unix # ncoils

Run Web #

Version	Updated	PatScan
	2007-12-12	PatScan	Download	Doc

PatScan is a pattern matcher which searches protein or nucleotide (DNA, RNA, tRNA etc.) sequence archives for instances of a pattern which you input.

Note patscan pat_file < input_file

Run Unix # patscan

Run Web #

Version	Updated	pfam_scan.pl
	2012-11-29	pfam_scan.pl	Download	Doc

pfam_scan.pl - search protein fasta sequences against the Pfam library of HMMs.

Note

Run Unix # pfam_scan.pl -fasta -dir /usr/local/genome/PfamScan/databases

Run Web #

Version	Updated	pftools
2.3.4	2004-04-10	pftools	Download	Doc

The pftools package is a collection of experimental programs that handle the generalized profile format and implement the PROSITE search methods. The available commands are: gtop, pfsearch, pfscan, psa2msa, pfmake, pfw, ptoh, htop, pfscale, pftof.

Note

Run Unix # pfsearch

Run Web # pfsearch

Version	Updated	ReAS
2.02	2011-06-30	ReAS	Download	Doc

ReAS: Recovery of Ancestral Sequences for Transposable Elements from the Unassembled Reads of a Whole Genome Shotgun

Note http://www.ploscompbiol.org/article/info:doi%2F10.1371%2Fjournal.pcbi.0010043

Run Unix #

Run Web #

Version	Updated	RepeatMasker
3.3.0	2011-07-04	RepeatMasker	Download	Doc

Note To look up a species, for example bos_taurus: /usr/local/genome/RepeatMasker/util/queryTaxonomyDatabase.pl -species "bos taurus"

Run Unix # RepeatMasker

Run Web #

Version	Updated	RepeatScout
1.05	2011-06-30	RepeatScout	Download	Doc

RepeatScout is a tool to discover repetitive substrings in DNA.

Note If you use RepeatScout, please cite the following paper: Price A.L., Jones N.C. and Pevzner P.A. 2005. De novo identification of repeat families in large genomes. To appear in Proceedings of the 13 Annual International conference on Intelligent Systems for Molecular Biology (ISMB-05). Detroit, Michigan.

Run Unix # 1/ build_lmer_table -l -sequence -freq [opts] 2/ RepeatScout -sequence -output -freq -l [opts]

Run Web #

Version	Updated	RHOM
31.5		RHOM	Download	Doc

R'HOM (Recherche de régions HOMogènes) is a program for segmenting DNA sequences into regions of homogeneous composition using hidden Markov chains. The user chooses the number of distinct composition types and the word length to take into account. The parameters are then estimated by maximum likelihood (EM algorithm), and the sequence is finally segmented with the forward-backward algorithm. R'HOM was initially developed during Florence Muri's doctoral thesis and was later largely re-implemented.

Note

Run Unix # rhom.em

Run Web #

Version	Updated	rmes
3.1.0	2014-08-20	rmes	Download	Doc

Program to detect words or motifs whose frequency is statistically exceptional in a biological sequence. (R'MES stands for Recherche de Mots Exceptionnels dans les Séquences.)
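To give an intuition of what "exceptional frequency" means, the sketch below compares the observed count of a word with its expected count under the simplest possible background, an order-0 model in which letters are independent. This is only an illustration and not R'MES's method: the function names are hypothetical, and no significance score is computed because the variance of overlapping word counts needs more careful treatment than fits here.

```python
from collections import Counter
from math import prod


def count_overlapping(seq: str, word: str) -> int:
    """Count occurrences of word in seq, allowing overlaps."""
    k = len(word)
    return sum(1 for i in range(len(seq) - k + 1) if seq[i:i + k] == word)


def expected_count_m0(seq: str, word: str) -> float:
    """Expected occurrences under an order-0 model (independent letters).

    Letter probabilities are estimated from the sequence itself, and the
    expectation is (n - k + 1) * product of the letter probabilities.
    """
    n, k = len(seq), len(word)
    if k == 0 or k > n:
        return 0.0
    freqs = Counter(seq)
    p_word = prod(freqs[c] / n for c in word)
    return (n - k + 1) * p_word


if __name__ == "__main__":
    seq = "ACGTACGT"
    observed = count_overlapping(seq, "AC")
    expected = expected_count_m0(seq, "AC")
    print(f"observed={observed} expected={expected:.4f}")
```

A word whose observed count is far from its expected count is a candidate for being exceptional; deciding how far is "far" is exactly the statistical question the tool addresses.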

Note Here is what is new compared with version 3.01: Major changes: - significant improvement in computation time for Gaussian approximations, whatever the model order, - removal of the constraint on the length of word family names. Minor changes: - threshold selection options renamed in the results formatting tool (--minthresh and --maxthresh become --tmin and --tmax), - changed presentation order for bias computation results (sorted by score rather than alphabetically). For any questions, contact Sophie.Schbath@jouy.inra.fr

Run Unix # rmes [options] -s -o rmes --help

Run Web #

Version	Updated	rmesplot
0.92	2007-10-31	rmesplot	Download	Doc

Note

Run Unix # rmesplot

Run Web #

Version	Updated	rna2map
0.5.0	2009-09-10	rna2map	Download	Doc

The SOLiD System Small RNA Analysis Pipeline Tool (RNA2MAP) can be used to perform whole genome analysis of color space RNA library reads. It consists of three major procedures: filtering, matching against miRBase sequences (Sanger), and matching against a reference genome.

Note

Run Unix #

Run Web #

Version	Updated	rnammer
1.2	2014-04-30	rnammer	Download	Doc

RNAmmer 1.2 predicts 5s/8s, 16s/18s, and 23s/28s ribosomal RNA in full genome sequences

Note

Run Unix # rnammer [options] (man rnammer)

Run Web #

Version	Updated	SPatt
2.0-pre1 & 1.2.2	2007-10-02	SPatt	Download	Doc

SPatt (Statistic for Patterns) is a suite of C++ programs designed to compute p-values of pattern occurrences in a text. Assuming the text is generated by a Markov model, the p-value of a given observation is its probability of occurring. The lower the p-value, the more unlikely the observation. For example, these tools can be used to find patterns with unusual behaviour in DNA sequences.

Note

Run Unix # spatt (aspatt cpspatt gspatt ldspatt oldxspatt sspatt xspatt)

Run Web #

Version	Updated	STAMP
1.1	2014-09-29	STAMP	Download	Doc

Similarity, Tree-building, & Alignment of Motifs and Profiles

Note

Run Unix # STAMP

Run Web #

Version	Updated	Tandem Repeats Finder
4.07b	2013-08-20	Tandem Repeats Finder	Download	Doc

A tandem repeat in DNA is two or more adjacent, approximate copies of a pattern of nucleotides. Tandem Repeats Finder is a program to locate and display tandem repeats in DNA sequences.
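The definition above can be made concrete with a naive scan for exact tandem repeats. This is a simplified sketch, not the Tandem Repeats Finder algorithm: it ignores approximate copies entirely, and the function name, the `max_period` and `min_copies` parameters, and the 0-based start positions are assumptions made for this example.

```python
from typing import List, Tuple


def find_exact_tandem_repeats(
    seq: str, max_period: int = 6, min_copies: int = 2
) -> List[Tuple[int, str, int]]:
    """Return (0-based start, repeat unit, copy count), scanning left to right.

    At each position the period covering the longest span is kept; ties go
    to the shorter period. The scan then resumes after the reported repeat.
    """
    results = []
    i, n = 0, len(seq)
    while i < n:
        best = None  # (span, period, copies)
        for period in range(1, max_period + 1):
            unit = seq[i:i + period]
            if len(unit) < period:
                break  # not enough sequence left for this period
            copies = 1
            while seq[i + copies * period:i + (copies + 1) * period] == unit:
                copies += 1
            if copies >= min_copies:
                span = copies * period
                if best is None or span > best[0]:
                    best = (span, period, copies)
        if best is None:
            i += 1
        else:
            span, period, copies = best
            results.append((i, seq[i:i + period], copies))
            i += span
    return results


if __name__ == "__main__":
    print(find_exact_tandem_repeats("GGCACACATT"))
    # [(0, 'G', 2), (2, 'CA', 3), (8, 'T', 2)]
```

Raising `min_copies` filters out the short homopolymer hits, which are rarely of interest on their own.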

Note

Run Unix # trf

Run Web #

Version	Updated	vmatch
2.0	2007-10-29	vmatch	Download	Doc

Vmatch replaces Reputer. It looks for all possible repeats in genomes, with the possibility to specify the kind of repeats to look for, such as identity percentage, minimal length, etc. It can also be used to mask repeats in sequences, to analyze repeat families, etc.

Note

Run Unix #

Run Web #

Version	Updated	weeder
1.4.2	2009-12-07	weeder	Download	Doc

Searches for new TFBSs in a set of FASTA sequences, over several motif lengths and with a limit on the number of allowed mutations. It outputs only the motifs that pass the statistical filter, unlike MEME, which returns as many motifs as specified in the parameters. By default the genome statistics are based on a 1000 bp promoter, but statistics based on the whole intergenic sequence can be used instead.
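The notion of a motif occurrence with a limited number of allowed mutations can be sketched as a Hamming-distance scan. This is not Weeder's algorithm, which discovers motifs rather than scanning for a known one and adds statistical filtering; the function names and the `max_mismatches` parameter are hypothetical and chosen only for this illustration.

```python
from typing import List, Tuple


def hamming(a: str, b: str) -> int:
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def approx_occurrences(
    seq: str, motif: str, max_mismatches: int
) -> List[Tuple[int, str]]:
    """Return (0-based start, window) for windows within the mismatch limit."""
    k = len(motif)
    return [
        (i, seq[i:i + k])
        for i in range(len(seq) - k + 1)
        if hamming(seq[i:i + k], motif) <= max_mismatches
    ]


if __name__ == "__main__":
    print(approx_occurrences("ACGTTCGAACGA", "ACGA", 1))
    # [(0, 'ACGT'), (4, 'TCGA'), (8, 'ACGA')]
```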

Note Pavesi G, Mereghetti P, Mauri G, Pesole G. Weeder Web: discovery of transcription factor binding sites in a set of sequences from co-regulated genes. Nucleic Acids Res. 2004 32:W199-W203

Run Unix # weederlauncher.out inputfilename speciescode analysistype

Run Web #

Version	Updated	weederH
1.4.2	2009-12-07	weederH	Download	Doc

Searches for TFBSs and ECRs in homologous sequences. No alignment is required as input, and no PWM is needed beforehand. It measures the relative conservation between sequences by searching for conserved oligos and scoring the global similarity between two homologous sequences. It can also search for distal enhancers. It reportedly works on unannotated promoters (no known TSS).

Note Pavesi, G., Zambelli, F., Pesole, G. WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences. BMC Bioinformatics 2007, 8:46

Run Unix # weederH.out -f inputfilename -O speciescode

Run Web #

Version	Updated	xgrail
1.3c	2002-09-20	xgrail	Download	Doc

GRAIL is a suite of tools designed to provide analysis and putative annotation of DNA sequences both interactively and through the use of automated computation.

Note

Run Unix # xgrail

Run Web # http://genome.jouy.inra.fr/
