ALBERT — All Library Books, journals and Electronic Records Telegrafenberg

Results 1 - 3 | 3 results

Digital media

Pattern matching between two non-aligned random sequences (1994)

Sheng, Ke-Ning ; Naus, Joseph I.

Springer

Bulletin of mathematical biology 56 (1994), pp. 1143-1162

Details

ISSN: 1522-9602

Source: Springer Online Journal Archives 1860-2000

Subject: Biology, Mathematics

Notes: Abstract: Given two independent sequences of letters, we seek the probability distribution of the length of the longest matching word. This word can be in different positions in the two sequences and we consider both perfect and nearly perfect matching. We derive bounds and approximations for the probability and compare them with other bounds and approximations. The results can be applied to DNA sequences in molecular biology and generalized matching between two independent random sequences.

Material type: Digital media

URL: http://dx.doi.org/10.1007/BF02460290
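
Illustration (not part of the catalog record): the quantity described in the abstract above, the length of the longest word shared by two non-aligned sequences, can be explored numerically. The sketch below is a plain Monte Carlo simulation, not the bounds or approximations derived in the paper. It assumes independent, uniformly distributed letters over the alphabet ACGT and covers perfect matching only; the function names and parameters are hypothetical.

```python
import random


def longest_common_word(a: str, b: str) -> int:
    """Length of the longest contiguous word found in both a and b.

    The word may start at different positions in the two sequences.
    Uses the standard dynamic program for the longest common substring.
    """
    best = 0
    prev = [0] * (len(b) + 1)  # match lengths ending at the previous letter of a
    for ch in a:
        cur = [0] * (len(b) + 1)
        for j, other in enumerate(b, start=1):
            if ch == other:
                # Extend the match that ended one letter earlier in both sequences.
                cur[j] = prev[j - 1] + 1
                if cur[j] > best:
                    best = cur[j]
        prev = cur
    return best


def simulate_distribution(n, m, trials, alphabet="ACGT", seed=0):
    """Empirical distribution of the longest matching word length.

    Assumption: letters are i.i.d. and uniform over `alphabet`.
    Returns a dict mapping word length to relative frequency.
    """
    rng = random.Random(seed)
    counts = {}
    for _ in range(trials):
        a = "".join(rng.choice(alphabet) for _ in range(n))
        b = "".join(rng.choice(alphabet) for _ in range(m))
        k = longest_common_word(a, b)
        counts[k] = counts.get(k, 0) + 1
    return {k: c / trials for k, c in sorted(counts.items())}
```

Calling, for example, `simulate_distribution(100, 100, 1000)` gives an empirical distribution against which an analytical approximation could be checked; nearly perfect matching would need a different inner loop that tolerates mismatches.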


Digital media

Matching among multiple random sequences (1997)

Naus, Joseph I. ; Sheng, Ke-Ning

Springer

Bulletin of mathematical biology 59 (1997), pp. 483-496

Details

ISSN: 1522-9602

Source: Springer Online Journal Archives 1860-2000

Subject: Biology, Mathematics

Notes: Abstract: In searching for strong homologies between multiple nucleic acid or protein sequences, researchers commonly look at fixed-length segments in common to the sequences. Such homologies form the foundation of segment-based algorithms for multiple alignment of protein sequences. The researcher uses settings of “unusualness of multiple matches” to calibrate the algorithms. In applications where a researcher has found a multiple matching word, statistical significance helps gauge the unusualness of the observed match. Previous approximations for the unusualness of multiple matches are based on large sample theory, and are sometimes quite inaccurate. Section 2 illustrates this inaccuracy, and provides accurate approximations for the probability of a common word in R out of R sequences. Section 3 generalizes the approximation to multiple matching in R out of S sequences. Section 4 describes a more complex approximation that incorporates exact probabilities and yields excellent accuracy; this approximation is useful for checking the simpler approximations over a range of values.

Material type: Digital media

URL: http://dx.doi.org/10.1007/BF02459461
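
Illustration (not part of the catalog record): the event discussed in the abstract above, a fixed-length word common to R out of S sequences, can be checked directly and its probability estimated by simulation. This is a minimal sketch, not the approximations from Sections 2 to 4 of the paper. It assumes i.i.d. uniform letters and exact (perfect) matches; all names and parameters are hypothetical.

```python
import random
from collections import Counter


def has_common_word(seqs, k, r):
    """True if some word of length k occurs in at least r of the sequences."""
    counts = Counter()
    for s in seqs:
        # Count each distinct word once per sequence, wherever it occurs.
        words = {s[i:i + k] for i in range(len(s) - k + 1)}
        counts.update(words)
    return any(c >= r for c in counts.values())


def estimate_probability(s, r, n, k, trials, alphabet="ACGT", seed=0):
    """Monte Carlo estimate of P(a length-k word is shared by >= r of s sequences).

    Assumption: each of the s sequences has length n, with letters drawn
    independently and uniformly from `alphabet`.
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        seqs = ["".join(rng.choice(alphabet) for _ in range(n)) for _ in range(s)]
        if has_common_word(seqs, k, r):
            hits += 1
    return hits / trials
```

Setting `r` equal to `s` gives the "R out of R" case; a simulated estimate of this kind is one way to sanity-check a closed-form approximation over a range of `n` and `k`.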
