ALBERT — All Library Books, journals and Electronic Records Telegrafenberg

Electronic Resource

Estimating Motifs Under Order Restrictions (2005)

van Zwet, Erik W ; Kechris, Katherina J ; Bickel, Peter J ; [et al.]

Berkeley, Calif. : Berkeley Electronic Press (now: De Gruyter)

Statistical applications in genetics and molecular biology 4.2005, 1, art1

Details

ISSN: 1544-6115

Source: Berkeley Electronic Press Academic Journals

Topics: Biology

Notes: Transcription factors and many other DNA-binding proteins recognize more than one specific sequence. Among sequences recognized by a given DNA-binding protein, different positions exhibit varying degrees of conservation. The reason is that base pairs that are more extensively contacted by the protein tend to be more conserved. This observation can be used in the discovery of transcription factor binding sites. Here we present a rigorous means to accomplish this. In particular, we constrain the order of the information (entropy) in the columns of the position specific weight matrix (PWM) which characterizes the motif being sought. We then show how to compute the maximum likelihood estimate of a PWM under such order restrictions. This computation is easily integrated with the EM algorithm or the Gibbs sampler to enhance performance in the search for motifs in unaligned sequences. We demonstrate our method on a well-known data set of binding sites of the transcription factor Crp in E. coli.
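
To illustrate the quantity being constrained, the following minimal sketch computes the Shannon entropy of each PWM column and checks whether the columns follow a given order. The function names, the example matrix, and the chosen order are hypothetical and are not taken from the paper. The paper's order-restricted maximum likelihood estimator itself is not implemented here.

```python
import math

def column_entropy(col):
    """Shannon entropy in bits of one PWM column (probabilities for A, C, G, T)."""
    return -sum(p * math.log2(p) for p in col if p > 0)

def satisfies_order(pwm, order):
    """True if entropy is non-decreasing along the column indices in `order`."""
    ents = [column_entropy(pwm[j]) for j in order]
    return all(a <= b + 1e-12 for a, b in zip(ents, ents[1:]))

# Hypothetical 3-column PWM: fully conserved, two-base, and uniform columns.
pwm = [
    [1.0, 0.0, 0.0, 0.0],
    [0.5, 0.5, 0.0, 0.0],
    [0.25, 0.25, 0.25, 0.25],
]
entropies = [column_entropy(c) for c in pwm]
```

Lower entropy corresponds to a more conserved position, so an order restriction of this kind might encode the expectation that positions contacted more extensively by the protein are the more conserved ones.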

Type of Medium: Electronic Resource

URL: http://www.bepress.com/sagmb/vol4/iss1/art1

Electronic Resource

Supervised Detection of Regulatory Motifs in DNA Sequences (2003)

Keles, Sunduz ; van der Laan, Mark J. ; Dudoit, Sandrine ; [et al.]

Berkeley, Calif. : Berkeley Electronic Press (now: De Gruyter)

Statistical applications in genetics and molecular biology 2.2003, 1, art5

Details

ISSN: 1544-6115

Source: Berkeley Electronic Press Academic Journals

Topics: Biology

Notes: Identification of transcription factor binding sites (regulatory motifs) is a major interest in contemporary biology. We propose a new likelihood based method, COMODE, for identifying structural motifs in DNA sequences. Commonly used methods (e.g. MEME, Gibbs motif sampler) model binding sites as families of sequences described by a position weight matrix (PWM) and identify PWMs that maximize the likelihood of observed sequence data under a simple multinomial mixture model. This model assumes that the positions of the PWM correspond to independent multinomial distributions with four cell probabilities. We address supervising the search for DNA binding sites using the information derived from structural characteristics of protein-DNA interactions. We extend the simple multinomial mixture model to a constrained multinomial mixture model by incorporating constraints on the information content profiles or on specific parameters of the motif PWMs. The parameters of this extended model are estimated by maximum likelihood using a nonlinear constraint optimization method. Likelihood-based cross-validation is used to select model parameters such as motif width and constraint type. The performance of COMODE is compared with existing motif detection methods on simulated data that incorporate real motif examples from Saccharomyces cerevisiae. The proposed method is especially effective when the motif of interest appears as a weak signal in the data. Some of the transcription factor binding data of Lee et al. (2002) were also analyzed using COMODE and biologically verified sites were identified.
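
The simple multinomial mixture model mentioned in the abstract can be sketched as follows, assuming each fixed-width window is drawn either from the motif PWM (with probability `pi`) or from a background base distribution. All names, the example PWM, and the parameter values are hypothetical. This is neither COMODE's constrained model nor the exact formulation used by MEME or the Gibbs motif sampler.

```python
import math

BASES = "ACGT"

def motif_prob(window, pwm):
    """Probability of a window under a PWM with independent positions."""
    prob = 1.0
    for j, base in enumerate(window):
        prob *= pwm[j][BASES.index(base)]
    return prob

def background_prob(window, bg):
    """Probability of a window under a single background base distribution."""
    prob = 1.0
    for base in window:
        prob *= bg[BASES.index(base)]
    return prob

def mixture_log_likelihood(windows, pwm, bg, pi):
    """Log-likelihood of windows under a two-component (motif/background) mixture."""
    return sum(
        math.log(pi * motif_prob(w, pwm) + (1.0 - pi) * background_prob(w, bg))
        for w in windows
    )

# Hypothetical width-2 motif favoring "AC", with a uniform background.
pwm = [[0.7, 0.1, 0.1, 0.1], [0.1, 0.7, 0.1, 0.1]]
bg = [0.25, 0.25, 0.25, 0.25]
ll = mixture_log_likelihood(["AC", "GT"], pwm, bg, pi=0.5)
```

A constrained variant would maximize this same likelihood while restricting the PWM parameters, for example through bounds on the information content of particular columns.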

Type of Medium: Electronic Resource

URL: http://www.bepress.com/sagmb/vol2/iss1/art5
