ALBERT — All Library Books, journals and Electronic Records Telegrafenberg

1

Unbekannt

Discovery and characterization of artifactual mutations in deep coverage targeted capture sequencing data due to oxidative DNA damage during sample preparation (2013)

Costello, M., Pugh, T. J., Fennell, T. J., Stewart, C., Lichtenstein, L., Meldrim, J. C., Fostel, J. L., Friedrich, D. C., Perrin, D., Dionne, D., Kim, S., Gabriel, S. B., Lander, E. S., Fisher, S., Getz, G.

Oxford University Press

In: Nucleic Acids Research

zur Merkliste hinzufügen auf der Merkliste

Details

Publikationsdatum: 2013-04-02

Beschreibung: As researchers begin probing deep coverage sequencing data for increasingly rare mutations and subclonal events, the fidelity of next generation sequencing (NGS) laboratory methods will become increasingly critical. Although error rates for sequencing and polymerase chain reaction (PCR) are well documented, the effects that DNA extraction and other library preparation steps could have on downstream sequence integrity have not been thoroughly evaluated. Here, we describe the discovery of novel C 〉 A/G 〉 T transversion artifacts found at low allelic fractions in targeted capture data. Characteristics such as sequencer read orientation and presence in both tumor and normal samples strongly indicated a non-biological mechanism. We identified the source as oxidation of DNA during acoustic shearing in samples containing reactive contaminants from the extraction process. We show generation of 8-oxoguanine (8-oxoG) lesions during DNA shearing, present analysis tools to detect oxidation in sequencing data and suggest methods to reduce DNA oxidation through the introduction of antioxidants. Further, informatics methods are presented to confidently filter these artifacts from sequencing data sets. Though only seen in a low percentage of reads in affected samples, such artifacts could have profoundly deleterious effects on the ability to confidently call rare mutations, and eliminating other possible sources of artifacts should become a priority for the research community.

Schlagwort(e): Massively Parallel (Deep) Sequencing

Print ISSN: 0305-1048

Digitale ISSN: 1362-4962

Thema: Biologie

Publiziert von Oxford University Press

Permalink

	Standort	Signatur	Erwartet	Verfügbarkeit

Andere fanden auch interessant ...

AKTUELLE ARTIKEL

S·F·X

Volltext

2

Unbekannt

Hybridization-based reconstruction of small non-coding RNA transcripts from deep sequencing data (2012)

Ragan, C., Mowry, B. J., Bauer, D. C.

Oxford University Press

In: Nucleic Acids Research

zur Merkliste hinzufügen auf der Merkliste

Details

Publikationsdatum: 2012-09-13

Beschreibung: Recent advances in RNA sequencing technology (RNA-Seq) enables comprehensive profiling of RNAs by producing millions of short sequence reads from size-fractionated RNA libraries. Although conventional tools for detecting and distinguishing non-coding RNAs (ncRNAs) from reference-genome data can be applied to sequence data, ncRNA detection can be improved by harnessing the full information content provided by this new technology. Here we present N orah D esk , the first unbiased and universally applicable method for small ncRNAs detection from RNA-Seq data. N orah D esk utilizes the coverage-distribution of small RNA sequence data as well as thermodynamic assessments of secondary structure to reliably predict and annotate ncRNA classes. Using publicly available mouse sequence data from brain, skeletal muscle, testis and ovary, we evaluated our method with an emphasis on the performance for microRNAs (miRNAs) and piwi-interacting small RNA (piRNA). We compared our method with D ario and mir D eep 2 and found that N orah D esk produces longer transcripts with higher read coverage. This feature makes it the first method particularly suitable for the prediction of both known and novel piRNAs.

Schlagwort(e): Massively Parallel (Deep) Sequencing

Print ISSN: 0305-1048

Digitale ISSN: 1362-4962

Thema: Biologie

Publiziert von Oxford University Press

Permalink

	Standort	Signatur	Erwartet	Verfügbarkeit

Andere fanden auch interessant ...

AKTUELLE ARTIKEL

S·F·X

Volltext

3

Unbekannt

Novel insight into the non-coding repertoire through deep sequencing analysis (2012)

Isakov, O., Ronen, R., Kovarsky, J., Gabay, A., Gan, I., Modai, S., Shomron, N.

Oxford University Press

In: Nucleic Acids Research

zur Merkliste hinzufügen auf der Merkliste

Details

Publikationsdatum: 2012-06-06

Beschreibung: Non-coding RNAs (ncRNA) account for a large portion of the transcribed genomic output. This diverse family of untranslated RNA molecules play a crucial role in cellular function. The use of ‘deep sequencing’ technology (also known as ‘next generation sequencing’) to infer transcript expression levels in general, and ncRNA specifically, is becoming increasingly common in molecular and clinical laboratories. We developed a software termed ‘RandA’ (which stands for ncRNA Read-and-Analyze) that performs comprehensive ncRNA profiling and differential expression analysis on deep sequencing generated data through a graphical user interface running on a local personal computer. Using RandA, we reveal the complexity of the ncRNA repertoire in a given cell population. We further demonstrate the relevance of such an extensive ncRNA analysis by elucidating a multitude of characterizing features in pathogen infected mammalian cells. RandA is available for download at http://ibis.tau.ac.il/RandA .

Schlagwort(e): Massively Parallel (Deep) Sequencing

Print ISSN: 0305-1048

Digitale ISSN: 1362-4962

Thema: Biologie

Publiziert von Oxford University Press

Permalink

	Standort	Signatur	Erwartet	Verfügbarkeit

Andere fanden auch interessant ...

AKTUELLE ARTIKEL

S·F·X

Volltext

4

Unbekannt

Illumina mate-paired DNA sequencing-library preparation using Cre-Lox recombination (2012)

Van Nieuwerburgh, F., Thompson, R. C., Ledesma, J., Deforce, D., Gaasterland, T., Ordoukhanian, P., Head, S. R.

Oxford University Press

In: Nucleic Acids Research

zur Merkliste hinzufügen auf der Merkliste

Details

Publikationsdatum: 2012-02-17

Beschreibung: Standard Illumina mate-paired libraries are constructed from 3- to 5-kb DNA fragments by a blunt-end circularization. Sequencing reads that pass through the junction of the two joined ends of a 3–5-kb DNA fragment are not easy to identify and pose problems during mapping and de novo assembly. Longer read lengths increase the possibility that a read will cross the junction. To solve this problem, we developed a mate-paired protocol for use with Illumina sequencing technology that uses Cre-Lox recombination instead of blunt end circularization. In this method, a LoxP sequence is incorporated at the junction site. This sequence allows screening reads for junctions without using a reference genome. Junction reads can be trimmed or split at the junction. Moreover, the location of the LoxP sequence in the reads distinguishes mate-paired reads from spurious paired-end reads. We tested this new method by preparing and sequencing a mate-paired library with an insert size of 3 kb from Saccharomyces cerevisiae . We present an analysis of the library quality statistics and a new bio-informatics tool called DeLoxer that can be used to analyze an IlluminaCre-Lox mate-paired data set. We also demonstrate how the resulting data significantly improves a de novo assembly of the S. cerevisiae genome.

Schlagwort(e): Massively Parallel (Deep) Sequencing

Print ISSN: 0305-1048

Digitale ISSN: 1362-4962

Thema: Biologie

Publiziert von Oxford University Press

Permalink

	Standort	Signatur	Erwartet	Verfügbarkeit

Andere fanden auch interessant ...

AKTUELLE ARTIKEL

S·F·X

Volltext

5

Unbekannt

leeHom: adaptor trimming and merging for Illumina sequencing reads (2014)

Renaud, G., Stenzel, U., Kelso, J.

Oxford University Press

In: Nucleic Acids Research

zur Merkliste hinzufügen auf der Merkliste

Details

Publikationsdatum: 2014-10-10

Beschreibung: The sequencing of libraries containing molecules shorter than the read length, such as in ancient or forensic applications, may result in the production of reads that include the adaptor, and in paired reads that overlap one another. Challenges for the processing of such reads are the accurate identification of the adaptor sequence and accurate reconstruction of the original sequence most likely to have given rise to the observed read(s). We introduce an algorithm that removes the adaptors and reconstructs the original DNA sequences using a Bayesian maximum a posteriori probability approach. Our algorithm is faster, and provides a more accurate reconstruction of the original sequence for both simulated and ancient DNA data sets, than other approaches. leeHom is released under the GPLv3 and is freely available from: https://bioinf.eva.mpg.de/leehom/

Schlagwort(e): Massively Parallel (Deep) Sequencing

Print ISSN: 0305-1048

Digitale ISSN: 1362-4962

Thema: Biologie

Publiziert von Oxford University Press

Permalink

	Standort	Signatur	Erwartet	Verfügbarkeit

Andere fanden auch interessant ...

AKTUELLE ARTIKEL

S·F·X

Volltext

6

Unbekannt

Correction of sequence-dependent ambiguous bases (Ns) from the 454 pyrosequencing system (2014)

Shin, S., Park, J.

Oxford University Press

In: Nucleic Acids Research

zur Merkliste hinzufügen auf der Merkliste

Details

Publikationsdatum: 2014-04-15

Beschreibung: Pyrosequencing of the 16S ribosomal RNA gene (16S) has become one of the most popular methods to assess microbial diversity. Pyrosequencing reads containing ambiguous bases (Ns) are generally discarded based on the assumptions of their non-sequence-dependent formation and high error rates. However, taxonomic composition differed by removal of reads with Ns. We determined whether Ns from pyrosequencing occur in a sequence-dependent manner. Our reads and the corresponding flow value data revealed occurrence of sequence-specific N errors with a common sequential pattern (a homopolymer + a few nucleotides with bases other than the homopolymer + N) and revealed that the nucleotide base of the homopolymer is the true base for the following N. Using an algorithm reflecting this sequence-dependent pattern, we corrected the Ns in the 16S (86.54%), bphD (81.37%) and nifH (81.55%) amplicon reads from a mock community with high precisions of 95.4, 96.9 and 100%, respectively. The new N correction method was applicable for determining most of Ns in amplicon reads from a soil sample, resulting in reducing taxonomic biases associated with N errors and in shotgun sequencing reads from public metagenome data. The method improves the accuracy and precision of microbial community analysis and genome sequencing using 454 pyrosequencing.

Schlagwort(e): Massively Parallel (Deep) Sequencing

Print ISSN: 0305-1048

Digitale ISSN: 1362-4962

Thema: Biologie

Publiziert von Oxford University Press

Permalink

	Standort	Signatur	Erwartet	Verfügbarkeit

Andere fanden auch interessant ...

AKTUELLE ARTIKEL

S·F·X

Volltext

7

Unbekannt

In vivo generation of DNA sequence diversity for cellular barcoding (2014)

Peikon, I. D., Gizatullina, D. I., Zador, A. M.

Oxford University Press

In: Nucleic Acids Research

zur Merkliste hinzufügen auf der Merkliste

Details

Publikationsdatum: 2014-09-17

Beschreibung: Heterogeneity is a ubiquitous feature of biological systems. A complete understanding of such systems requires a method for uniquely identifying and tracking individual components and their interactions with each other. We have developed a novel method of uniquely tagging individual cells in vivo with a genetic ‘barcode’ that can be recovered by DNA sequencing. Our method is a two-component system comprised of a genetic barcode cassette whose fragments are shuffled by Rci , a site-specific DNA invertase. The system is highly scalable, with the potential to generate theoretical diversities in the billions. We demonstrate the feasibility of this technique in Escherichia coli . Currently, this method could be employed to track the dynamics of populations of microbes through various bottlenecks. Advances of this method should prove useful in tracking interactions of cells within a network, and/or heterogeneity within complex biological samples.

Schlagwort(e): Massively Parallel (Deep) Sequencing

Print ISSN: 0305-1048

Digitale ISSN: 1362-4962

Thema: Biologie

Publiziert von Oxford University Press

Permalink

	Standort	Signatur	Erwartet	Verfügbarkeit

Andere fanden auch interessant ...

AKTUELLE ARTIKEL

S·F·X

Volltext

8

Unbekannt

Impact of next-generation sequencing error on analysis of barcoded plasmid libraries of known complexity and sequence (2014)

Deakin, C. T., Deakin, J. J., Ginn, S. L., Young, P., Humphreys, D., Suter, C. M., Alexander, I. E., Hallwirth, C. V.

Oxford University Press

In: Nucleic Acids Research

zur Merkliste hinzufügen auf der Merkliste

Details

Publikationsdatum: 2014-09-17

Beschreibung: Barcoded vectors are promising tools for investigating clonal diversity and dynamics in hematopoietic gene therapy. Analysis of clones marked with barcoded vectors requires accurate identification of potentially large numbers of individually rare barcodes, when the exact number, sequence identity and abundance are unknown. This is an inherently challenging application, and the feasibility of using contemporary next-generation sequencing technologies is unresolved. To explore this potential application empirically, without prior assumptions, we sequenced barcode libraries of known complexity. Libraries containing 1, 10 and 100 Sanger-sequenced barcodes were sequenced using an Illumina platform, with a 100-barcode library also sequenced using a SOLiD platform. Libraries containing 1 and 10 barcodes were distinguished from false barcodes generated by sequencing error by a several log-fold difference in abundance. In 100-barcode libraries, however, expected and false barcodes overlapped and could not be resolved by bioinformatic filtering and clustering strategies. In independent sequencing runs multiple false-positive barcodes appeared to be represented at higher abundance than known barcodes, despite their confirmed absence from the original library. Such errors, which potentially impact barcoding studies in an application-dependent manner, are consistent with the existence of both stochastic and systematic error, the mechanism of which is yet to be fully resolved.

Schlagwort(e): Massively Parallel (Deep) Sequencing

Print ISSN: 0305-1048

Digitale ISSN: 1362-4962

Thema: Biologie

Publiziert von Oxford University Press

Permalink

	Standort	Signatur	Erwartet	Verfügbarkeit

Andere fanden auch interessant ...

AKTUELLE ARTIKEL

S·F·X

Volltext

9

Unbekannt

Reconstructing mitochondrial genomes directly from genomic next-generation sequencing reads--a baiting and iterative mapping approach (2013)

Hahn, C., Bachmann, L., Chevreux, B.

Oxford University Press

In: Nucleic Acids Research

zur Merkliste hinzufügen auf der Merkliste

Details

Publikationsdatum: 2013-07-16

Beschreibung: We present an in silico approach for the reconstruction of complete mitochondrial genomes of non-model organisms directly from next-generation sequencing (NGS) data—mitochondrial baiting and iterative mapping (MITObim). The method is straightforward even if only (i) distantly related mitochondrial genomes or (ii) mitochondrial barcode sequences are available as starting-reference sequences or seeds, respectively. We demonstrate the efficiency of the approach in case studies using real NGS data sets of the two monogenean ectoparasites species Gyrodactylus thymalli and Gyrodactylus derjavinoides including their respective teleost hosts European grayling ( Thymallus thymallus ) and Rainbow trout ( Oncorhynchus mykiss ). MITObim appeared superior to existing tools in terms of accuracy, runtime and memory requirements and fully automatically recovered mitochondrial genomes exceeding 99.5% accuracy from total genomic DNA derived NGS data sets in 〈24 h using a standard desktop computer. The approach overcomes the limitations of traditional strategies for obtaining mitochondrial genomes for species with little or no mitochondrial sequence information at hand and represents a fast and highly efficient in silico alternative to laborious conventional strategies relying on initial long-range PCR. We furthermore demonstrate the applicability of MITObim for metagenomic/pooled data sets using simulated data. MITObim is an easy to use tool even for biologists with modest bioinformatics experience. The software is made available as open source pipeline under the MIT license at https://github.com/chrishah/MITObim .

Schlagwort(e): Massively Parallel (Deep) Sequencing

Print ISSN: 0305-1048

Digitale ISSN: 1362-4962

Thema: Biologie

Publiziert von Oxford University Press

Permalink

	Standort	Signatur	Erwartet	Verfügbarkeit

Andere fanden auch interessant ...

AKTUELLE ARTIKEL

S·F·X

Volltext

10

Unbekannt

PyroHMMsnp: an SNP caller for Ion Torrent and 454 sequencing data (2013)

Zeng, F., Jiang, R., Chen, T.

Oxford University Press

In: Nucleic Acids Research

zur Merkliste hinzufügen auf der Merkliste

Details

Publikationsdatum: 2013-07-16

Beschreibung: Both 454 and Ion Torrent sequencers are capable of producing large amounts of long high-quality sequencing reads. However, as both methods sequence homopolymers in one cycle, they both suffer from homopolymer uncertainty and incorporation asynchronization. In mapping, such sequencing errors could shift alignments around homopolymers and thus induce incorrect mismatches, which have become a critical barrier against the accurate detection of single nucleotide polymorphisms (SNPs). In this article, we propose a hidden Markov model (HMM) to statistically and explicitly formulate homopolymer sequencing errors by the overcall, undercall, insertion and deletion. We use a hierarchical model to describe the sequencing and base-calling processes, and we estimate parameters of the HMM from resequencing data by an expectation-maximization algorithm. Based on the HMM, we develop a realignment-based SNP-calling program, termed PyroHMMsnp, which realigns read sequences around homopolymers according to the error model and then infers the underlying genotype by using a Bayesian approach. Simulation experiments show that the performance of PyroHMMsnp is exceptional across various sequencing coverages in terms of sensitivity, specificity and F 1 measure, compared with other tools. Analysis of the human resequencing data shows that PyroHMMsnp predicts 12.9% more SNPs than Samtools while achieving a higher specificity. ( http://code.google.com/p/pyrohmmsnp/ ).

Schlagwort(e): Massively Parallel (Deep) Sequencing

Print ISSN: 0305-1048

Digitale ISSN: 1362-4962

Thema: Biologie

Publiziert von Oxford University Press

Permalink

	Standort	Signatur	Erwartet	Verfügbarkeit

Andere fanden auch interessant ...

AKTUELLE ARTIKEL

S·F·X

Volltext