ALBERT

All Library Books, journals and Electronic Records Telegrafenberg

Your email was sent successfully. Check your inbox.

An error occurred while sending the email. Please try again.

Proceed reservation?

Export
Filter
  • Articles  (6,675)
  • Oxford University Press  (6,675)
  • American Association for the Advancement of Science (AAAS)
  • American Chemical Society (ACS)
  • American Geophysical Union (AGU)
  • American Institute of Physics (AIP)
  • Nucleic Acids Research  (6,675)
  • 60967
Collection
  • Articles  (6,675)
Publisher
  • Oxford University Press  (6,675)
  • American Association for the Advancement of Science (AAAS)
  • American Chemical Society (ACS)
  • American Geophysical Union (AGU)
  • American Institute of Physics (AIP)
Years
Topic
  • 1
    Publication Date: 2018-03-06
    Description: Eukaryotic DNA polymerase η catalyzes translesion synthesis of thymine dimers and 8-oxoguanines. It is comprised of a polymerase domain and a C-terminal region, both of which are required for its biological function. The C-terminal region mediates interactions with proliferating cell nuclear antigen (PCNA) and other translesion synthesis proteins such as Rev1. This region contains a ubiquitin-binding/zinc-binding (UBZ) motif and a PCNA-interacting protein (PIP) motif. Currently little structural information is available for this region of polymerase η. Using a combination of approaches—including genetic complementation assays, X-ray crystallography, Langevin dynamics simulations, and small-angle X-ray scattering—we show that the C-terminal region is partially unstructured and has high conformational flexibility. This implies that the C-terminal region acts as a flexible tether linking the polymerase domain to PCNA thereby increasing its local concentration. Such tethering would facilitate the sampling of translesion synthesis polymerases to ensure that the most appropriate one is selected to bypass the lesion.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 2
    Publication Date: 2018-03-06
    Description: During amino acid starvation the Escherichia coli stringent response factor RelA recognizes deacylated tRNA in the ribosomal A-site. This interaction activates RelA-mediated synthesis of alarmone nucleotides pppGpp and ppGpp, collectively referred to as (p)ppGpp. These two alarmones are synthesized by addition of a pyrophosphate moiety to the 3′ position of the abundant cellular nucleotide GTP and less abundant nucleotide GDP, respectively. Using untagged native RelA we show that allosteric activation of RelA by pppGpp increases the efficiency of GDP conversion to achieve the maximum rate of (p)ppGpp production. Using a panel of ribosomal RNA mutants, we show that the A-site finger structural element of 23S rRNA helix 38 is crucial for RelA binding to the ribosome and consequent activation, and deletion of the element severely compromises (p)ppGpp accumulation in E. coli upon amino acid starvation. Through binding assays and enzymology, we show that E. coli RelA does not form a stable complex with, and is not activated by, deacylated tRNA off the ribosome. This indicates that in the cell, RelA first binds the empty A-site and then recruits tRNA rather than first binding tRNA and then binding the ribosome.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 3
    Publication Date: 2018-03-06
    Description: Retinoic acid-inducible gene I (RIG-I) recognizes double-stranded viral RNAs (dsRNAs) containing two or three 5′ phosphates. A few reports of 5′-PPP-independent RIG-I agonists have emerged, but little is known about the molecular principles underlying their recognition. We recently found that the bent duplex RNA from the influenza A panhandle promoter activates RIG-I even in the absence of a 5′-triphosphate moiety. Here, we report that non-canonical synthetic RNA oligonucleotides containing G-U wobble base pairs that form a bent helix can exert RIG-I-mediated antiviral and anti-tumor effects in a sequence- and site-dependent manner. We present synthetic RNAs that have been systematically modified to enhance their efficacy and we outline the basic principles for engineering RIG-I agonists applicable to immunotherapy.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 4
    Publication Date: 2018-03-06
    Description: i-Motif (iM) is a four stranded DNA structure formed by cytosine-rich sequences, which are often present in functionally important parts of the genome such as promoters of genes and telomeres. Using electronic circular dichroism and UV absorption spectroscopies and electrophoretic methods, we examined the effect of four naturally occurring DNA base lesions on the folding and stability of the iM formed by the human telomere DNA sequence (C 3 TAA) 3 C 3 T. The results demonstrate that the TAA loop lesions, the apurinic site and 8-oxoadenine substituting for adenine, and the 5-hydroxymethyluracil substituting for thymine only marginally disturb the formation of iM. The presence of uracil, which is formed by enzymatic or spontaneous deamination of cytosine, shifts iM formation towards substantially more acidic pH values and simultaneously distinctly reduces iM stability. This effect depends on the position of the damage sites in the sequence. The results have enabled us to formulate additional rules for iM formation.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 5
    Publication Date: 2018-03-06
    Description: Ribosome biogenesis in eukaryotes is a complicated process that involves association and dissociation of numerous assembly factors and snoRNAs. The yeast small ribosomal subunit is first assembled into 90S pre-ribosomes in an ordered and dynamic manner. Efg1 is a protein with no recognizable domain that is associated with early 90S particles. Here, we determine the crystal structure of Efg1 from Chaetomium thermophilum at 3.3 Å resolution, revealing a novel elongated all-helical structure. Efg1 is not located in recently determined cryo-EM densities of 90S likely due to its low abundance in mature 90S. Genetic analysis in Saccharomyces cerevisiae shows that the functional core of Efg1 contains two helical hairpins composed of highly conserved residues. Depletion of Efg1 blocks 18S rRNA processing at sites A1 and A2, but not at site A0, and production of small ribosomal subunits. Efg1 is initially recruited by the 5′ domain of 18S rRNA. Its absence disturbs the assembly of the 5′ domain and inhibits release of U14 snoRNA from 90S. Our study shows that Efg1 is required for early assembly and reorganization of the 5′ domain of 18S rRNA.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 6
    Publication Date: 2018-03-06
    Description: Four different types (α 4 , α′ 2, (αβ) 2 and ϵ 2 ) of RNA-splicing endonucleases (EndAs) for RNA processing are known to exist in the Archaea. Only the (αβ) 2 and ϵ 2 types can cleave non-canonical introns in precursor (pre)-tRNA. Both enzyme types possess an insert associated with a specific loop, allowing broad substrate specificity in the catalytic α units. Here, the hyperthermophilic euryarchaeon Methanopyrus kandleri (MKA) was predicted to harbor an (αβ) 2 -type EndA lacking the specific loop. To characterize MKA EndA enzymatic activity, we constructed a fusion protein derived from MKA α and β subunits (fMKA EndA). In vitro assessment demonstrated complete removal of the canonical bulge-helix-bulge (BHB) intron structure from MKA pre-tRNA Asn . However, removal of the relaxed BHB structure in MKA pre-tRNA Glu was inefficient compared to crenarchaeal (αβ) 2 EndA, and the ability to process the relaxed intron within mini-helix RNA was not detected. fMKA EndA X-ray structure revealed a shape similar to that of other EndA types, with no specific loop. Mapping of EndA types and their specific loops and the tRNA gene diversity among various Archaea suggest that MKA EndA is evolutionarily related to other (αβ) 2 -type EndAs found in the Thaumarchaeota, Crenarchaeota and Aigarchaeota but uniquely represents constrained substrate specificity.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 7
    Publication Date: 2018-03-06
    Description: Intracellular levels of reactive oxygen species (ROS) increase as a consequence of oxidative stress and represent a major source of damage to biomolecules. Due to its high cellular abundance RNA is more frequently the target for oxidative damage than DNA. Nevertheless the functional consequences of damage on stable RNA are poorly understood. Using a genome-wide approach, based on 8-oxo-guanosine immunoprecipitation, we present evidence that the most abundant non-coding RNA in a cell, the ribosomal RNA (rRNA), is target for oxidative nucleobase damage by ROS. Subjecting ribosomes to oxidative stress, we demonstrate that oxidized 23S rRNA inhibits the ribosome during protein biosynthesis. Placing single oxidized nucleobases at specific position within the ribosome's catalytic center by atomic mutagenesis resulted in markedly different functional outcomes. While some active site nucleobases tolerated oxidative damage well, oxidation at others had detrimental effects on protein synthesis by inhibiting different sub-steps of the ribosomal elongation cycle. Our data provide molecular insight into the biological consequences of RNA oxidation in one of the most central cellular enzymes and reveal mechanistic insight on the role of individual active site nucleobases during translation.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 8
    Publication Date: 2018-03-06
    Description: When a stop codon is at the 80S ribosomal A site, there are six nucleotides (+4 to +9) downstream that are inferred to be occupying the mRNA channel. We examined the influence of these downstream nucleotides on translation termination success or failure in mammalian cells at the three stop codons. The expected hierarchy in the intrinsic fidelity of the stop codons (UAA〉UAG〉〉UGA) was observed, with highly influential effects on termination readthrough mediated by nucleotides at position +4 and position +8. A more complex influence was observed from the nucleotides at positions +5 and +6. The weakest termination contexts were most affected by increases or decreases in the concentration of the decoding release factor (eRF1), indicating that eRF1 binding to these signals was rate-limiting. When termination efficiency was significantly reduced by cognate suppressor tRNAs, the observed influence of downstream nucleotides was maintained. There was a positive correlation between experimentally measured signal strength and frequency of the signal in eukaryotic genomes, particularly in Saccharomyces cerevisiae and Drosophila melanogaster . We propose that termination efficiency is not only influenced by interrogation of the stop signal directly by the release factor, but also by downstream ribosomal interactions with the mRNA nucleotides in the entry channel.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 9
    Publication Date: 2018-03-06
    Description: ATM (ataxia-telangiectasia mutated) is a central molecule for DNA quality control. Its activation by DNA damage promotes cell-cycle delay, which facilitates DNA repair prior to replication. On the other hand, persistent DNA damage has been implicated in ATM-dependent cell death via apoptosis; however, the mechanisms underlying this process remain elusive. Here we find that, in response to persistent DNA strand breaks, ATM phosphorylates transcription factor Sp1 and initiates its degradation. We show that Sp1 controls expression of the key base excision repair gene XRCC1 , essential for DNA strand break repair. Therefore, degradation of Sp1 leads to a vicious cycle that involves suppression of DNA repair and further aggravation of the load of DNA damage. This activates transcription of pro-apoptotic genes and renders cells susceptible to elimination via both apoptosis and natural killer cells. These findings constitute a previously unrecognized ‘gatekeeper’ function of ATM as a detector of cells with persistent DNA damage.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 10
    Publication Date: 2018-03-06
    Description: We introduce the SPlit-and-conQueR (SPQR) model, a coarse-grained (CG) representation of RNA designed for structure prediction and refinement. In our approach, the representation of a nucleotide consists of a point particle for the phosphate group and an anisotropic particle for the nucleoside. The interactions are, in principle, knowledge-based potentials inspired by the $\mathcal {E}$SCORE function, a base-centered scoring function. However, a special treatment is given to base-pairing interactions and certain geometrical conformations which are lost in a raw knowledge-based model. This results in a representation able to describe planar canonical and non-canonical base pairs and base–phosphate interactions and to distinguish sugar puckers and glycosidic torsion conformations. The model is applied to the folding of several structures, including duplexes with internal loops of non-canonical base pairs, tetraloops, junctions and a pseudoknot. For the majority of these systems, experimental structures are correctly predicted at the level of individual contacts. We also propose a method for efficiently reintroducing atomistic detail from the CG representation.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 11
    Publication Date: 2018-03-06
    Description: Modified uridine containing taurine, 5-taurinomethyluridine (τm 5 U), is found at the anticodon first position of mitochondrial (mt-)transfer RNAs (tRNAs). Previously, we reported that τm 5 U is absent in mt-tRNAs with pathogenic mutations associated with mitochondrial diseases. However, biogenesis and physiological role of τm 5 U remained elusive. Here, we elucidated τm 5 U biogenesis by confirming that 5,10-methylene-tetrahydrofolate and taurine are metabolic substrates for τm 5 U formation catalyzed by MTO1 and GTPBP3. GTPBP3 -knockout cells exhibited respiratory defects and reduced mitochondrial translation. Very little τm 5 U34 was detected in patient’s cells with the GTPBP3 mutation, demonstrating that lack of τm 5 U results in pathological consequences. Taurine starvation resulted in downregulation of τm 5 U frequency in cultured cells and animal tissues (cat liver and flatfish). Strikingly, 5-carboxymethylaminomethyluridine (cmnm 5 U), in which the taurine moiety of τm 5 U is replaced with glycine, was detected in mt-tRNAs from taurine-depleted cells. These results indicate that tRNA modifications are dynamically regulated via sensing of intracellular metabolites under physiological condition.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 12
    Publication Date: 2018-03-06
    Description: The formation of 3′ single-stranded DNA overhangs is a first and essential step during homology-directed repair of double-stranded breaks (DSB) of DNA, a task that in Escherichia coli is performed by RecBCD. While this protein complex has been well characterized through in vitro single-molecule studies, it has remained elusive how end resection proceeds in the crowded and complex environment in live cells. Here, we develop a two-color fluorescent reporter to directly observe the resection of individual inducible DSB sites within live E. coli cells. Real-time imaging shows that RecBCD during end resection degrades DNA with remarkably high speed (∼1.6 kb/s) and high processivity (〉∼100 kb). The results show a pronounced asymmetry in the processing of the two DNA ends of a DSB, where much longer stretches of DNA are degraded in the direction of terminus. The microscopy observations are confirmed using quantitative polymerase chain reaction measurements of the DNA degradation. Deletion of the recD gene drastically decreased the length of resection, allowing for recombination with short ectopic plasmid homologies and significantly increasing the efficiency of horizontal gene transfer between strains. We thus visualized and quantified DNA end resection by the RecBCD complex in live cells, recorded DNA-degradation linked to end resection and uncovered a general relationship between the length of end resection and the choice of the homologous recombination template.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 13
    Publication Date: 2018-03-06
    Description: RNA molecules play important and diverse regulatory roles in the cell. Inspired by this natural versatility, RNA devices are increasingly important for many synthetic biology applications, e.g. optimizing engineered metabolic pathways, gene therapeutics or building up complex logical units. A major advantage of RNA is the possibility of de novo design of RNA-based sensing domains via an in vitro selection process (SELEX). Here, we describe development of a novel ciprofloxacin-responsive riboswitch by in vitro selection and next-generation sequencing-guided cellular screening. The riboswitch recognizes the small molecule drug ciprofloxacin with a K D in the low nanomolar range and adopts a pseudoknot fold stabilized by ligand binding. It efficiently interferes with gene expression both in lower and higher eukaryotes. By controlling an auxotrophy marker and a resistance gene, respectively, we demonstrate efficient, scalable and programmable control of cellular survival in yeast. The applied strategy for the development of the ciprofloxacin riboswitch is easily transferrable to any small molecule target of choice and will thus broaden the spectrum of RNA regulators considerably.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 14
    Publication Date: 2018-03-06
    Description: The role of DNA sequence in determining replication timing (RT) and chromatin higher order organization remains elusive. To address this question, we have developed an extra-chromosomal replication system (E-BACs) consisting of ∼200 kb human bacterial artificial chromosomes (BACs) modified with Epstein-Barr virus (EBV) stable segregation elements. E-BACs were stably maintained as autonomous mini-chromosomes in EBNA1-expressing HeLa or human induced pluripotent stem cells (hiPSCs) and established distinct RT patterns. An E-BAC harboring an early replicating chromosomal region replicated early during S phase, while E-BACs derived from RT transition regions (TTRs) and late replicating regions replicated in mid to late S phase. Analysis of E-BAC interactions with cellular chromatin (4C-seq) revealed that the early replicating E-BAC interacted broadly throughout the genome and preferentially with the early replicating compartment of the nucleus. In contrast, mid- to late-replicating E-BACs interacted with more specific late replicating chromosomal segments, some of which were shared between different E-BACs. Together, we describe a versatile system in which to study the structure and function of chromosomal segments that are stably maintained separately from the influence of cellular chromosome context.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 15
    Publication Date: 2018-03-06
    Description: Genomes mutate and evolve in ways simple (substitution or deletion of bases) and complex (e.g. chromosome shattering). We do not fully understand what types of complex mutation occur, and we cannot routinely characterize arbitrarily-complex mutations in a high-throughput, genome-wide manner. Long-read DNA sequencing methods (e.g. PacBio, nanopore) are promising for this task, because one read may encompass a whole complex mutation. We describe an analysis pipeline to characterize arbitrarily-complex ‘local’ mutations, i.e. intrachromosomal mutations encompassed by one DNA read. We apply it to nanopore and PacBio reads from one human cell line (NA12878), and survey sequence rearrangements, both real and artifactual. Almost all the real rearrangements belong to recurring patterns or motifs: the most common is tandem multiplication (e.g. heptuplication), but there are also complex patterns such as localized shattering, which resembles DNA damage by radiation. Gene conversions are identified, including one between hemoglobin gamma genes. This study demonstrates a way to find intricate rearrangements with any number of duplications, deletions, and repositionings. It demonstrates a probability-based method to resolve ambiguous rearrangements involving highly similar sequences, as occurs in gene conversion. We present a catalog of local rearrangements in one human cell line, and show which rearrangement patterns occur.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 16
    Publication Date: 2018-03-06
    Description: Torsional restraints on DNA change in time and space during the life of the cell and are an integral part of processes such as gene expression, DNA repair and packaging. The mechanical behavior of DNA under torsional stress has been studied on a mesoscopic scale, but little is known concerning its response at the level of individual base pairs and the effects of base pair composition. To answer this question, we have developed a geometrical restraint that can accurately control the total twist of a DNA segment during all-atom molecular dynamics simulations. By applying this restraint to four different DNA oligomers, we are able to show that DNA responds to both under- and overtwisting in a very heterogeneous manner. Certain base pair steps, in specific sequence environments, are able to absorb most of the torsional stress, leaving other steps close to their relaxed conformation. This heterogeneity also affects the local torsional modulus of DNA. These findings suggest that modifying torsional stress on DNA could act as a modulator for protein binding via the heterogeneous changes in local DNA structure.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 17
    Publication Date: 2018-03-06
    Description: The dynamics and mechanism of how site-specific DNA-bending proteins initially interrogate potential binding sites prior to recognition have remained elusive for most systems. Here we present these dynamics for Integration Host factor (IHF), a nucleoid-associated architectural protein, using a μs-resolved T-jump approach. Our studies show two distinct DNA-bending steps during site recognition by IHF. While the faster (∼100 μs) step is unaffected by changes in DNA or protein sequence that alter affinity by 〉100-fold, the slower (1–10 ms) step is accelerated ∼5-fold when mismatches are introduced at DNA sites that are sharply kinked in the specific complex. The amplitudes of the fast phase increase when the specific complex is destabilized and decrease with increasing [salt], which increases specificity. Taken together, these results indicate that the fast phase is non-specific DNA bending while the slow phase, which responds only to changes in DNA flexibility at the kink sites, is specific DNA kinking during site recognition. Notably, the timescales for the fast phase overlap with one-dimensional diffusion times measured for several proteins on DNA, suggesting that these dynamics reflect partial DNA bending during interrogation of potential binding sites by IHF as it scans DNA.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 18
    Publication Date: 2018-03-06
    Description: Non-coding RNAs play a vital role in diverse cellular processes. Pseudogenes, which are non-coding homologs of protein-coding genes, were once considered non-functional evolutional relics. However, recent studies have shown that pseudogene transcripts can regulate their parental transcripts by sequestering shared microRNAs (miRNAs), thus acting as competing endogenous RNAs (ceRNAs). In this study, we utilize an unbiased screen to identify the ferritin heavy chain 1 (FTH1) transcript and multiple FTH1 pseudogenes as targets of several oncogenic miRNAs in prostate cancer (PCa). We characterize the critical role of this FTH1 gene:pseudogene:miRNA network in regulating tumorigenesis in PCa, whereby oncogenic miRNAs downregulate the expression of FTH1 and its pseudogenes to drive oncogenesis. We further show that impairing miRNA binding and subsequent ceRNA crosstalk completely rescues the slow growth phenotype in vitro and in vivo . Our results also demonstrate the reciprocal regulation between the pseudogenes and intracellular iron levels, which are crucial for multiple physiological and pathophysiological processes. In summary, we describe an extensive gene:pseudogene network comprising multiple miRNAs and multiple pseudogenes derived from a single parental gene. The network could be regulated through multiple mechanisms to modulate iron storage in various signaling pathways, the deregulation of which results in PCa development and progression.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 19
    Publication Date: 2018-03-06
    Description: Histone deacetylase inhibitors (HDACIs) are known to alter gene expression by both up- and down-regulation of protein-coding genes in normal and cancer cells. However, the exact regulatory mechanisms of action remain uncharacterized. Here we investigated genome wide dose-dependent epigenetic and transcriptome changes in response to HDACI largazole in a transformed and a non-transformed cell line. Exposure to low nanomolar largazole concentrations (〈GI 50 ) predominantly resulted in upregulation of gene transcripts whereas higher largazole doses (≥GI 50 ) triggered a general decrease in mRNA accumulation. Largazole induces elevation of histone H3 acetylation at Lys-9 and Lys-27 along many gene bodies but does not correlate with up- or down-regulation of the associated transcripts. A higher dose of largazole results in more RNA polymerase II pausing at the promoters of actively transcribed genes and cell death. The most prevalent changes associated with transcriptional regulation occur at distal enhancer elements. Largazole promotes H3K27 acetylation at a subset of poised enhancers and unexpectedly, we also found active enhancers that become decommissioned in a dose and cell type-dependent manner. In particular, largazole decreases RNA polymerase II accumulation at super-enhancers (SEs) and preferentially suppresses SE-driven transcripts that are associated with oncogenic activities in transformed cells.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 20
    Publication Date: 2018-03-06
    Description: RNA plays a central role in the expression of all genes. Because any sequence within RNA can be recognized by complementary base pairing, synthetic oligonucleotides and oligonucleotide mimics offer a general strategy for controlling processes that affect disease. The two primary antisense approaches for regulating expression through recognition of cellular RNAs are single-stranded antisense oligonucleotides and duplex RNAs. This review will discuss the chemical modifications and molecular mechanisms that make synthetic nucleic acid drugs possible. Lessons learned from recent clinical trials will be summarized. Ongoing clinical trials are likely to decisively test the adequacy of our current generation of antisense nucleic acid technologies and highlight areas where more basic research is needed.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 21
    Publication Date: 2018-03-06
    Description: Upf1 is an SF1-family RNA helicase that is essential for the nonsense-mediated decay (NMD) process in eukaryotes. While Upf1 has been shown to interact with 80S ribosomes, the molecular details of this interaction were unknown. Using purified recombinant proteins and high-throughput sequencing combined with Fe-BABE directed hydroxyl radical probing (HTS-BABE) we have characterized the interaction between Upf1 and the yeast 80S ribosome. We identify the 1C domain of Upf1, an alpha-helical insertion in the RecA helicase core, to be essential for ribosome binding, and determine that the L1 stalk of 25S rRNA is the binding site for Upf1 on the ribosome. Using the cleavage sites identified by hydroxyl radical probing and high-resolution structures of both yeast Upf1 and the human 80S ribosome, we provide a model of a Upf1:80S structure. Our model requires that the L1 stalk adopt an open configuration as adopted by an un-rotated, or classical-state, ribosome. Our results shed light on the interaction between Upf1 and the ribosome, and suggest that Upf1 may specifically engage a classical-state ribosome during translation.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 22
    Publication Date: 2018-03-06
    Description: Guanine-rich and cytosine-rich DNA can form four-stranded DNA secondary structures called G-quadruplex (G4) and i-motif, respectively. These structures widely exist in genomes and play important roles in transcription, replication, translation and protection of telomeres. In this study, G4 and i-motif structures were identified in the promoter of the transcription factor gene BmPOUM2 , which regulates the expression of the wing disc cuticle protein gene ( BmWCP4 ) during metamorphosis. Disruption of the i-motif structure by base mutation, anti-sense oligonucleotides (ASOs) or inhibitory ligands resulted in significant decrease in the activity of the BmPOUM2 promoter. A novel i-motif binding protein (BmILF) was identified by pull-down experiment. BmILF specifically bound to the i-motif and activated the transcription of BmPOUM2 . The promoter activity of BmPOUM2 was enhanced when BmILF was over-expressed and decreased when BmILF was knocked-down by RNA interference. This study for the first time demonstrated that BmILF and the i-motif structure participated in the regulation of gene transcription in insect metamorphosis and provides new insights into the molecular mechanism of the secondary structures in epigenetic regulation of gene transcription.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 23
    Publication Date: 2018-03-06
    Description: Soil salinity is a significant threat to sustainable agricultural production worldwide. Plants must adjust their developmental and physiological processes to cope with salt stress. Although the capacity for adaptation ultimately depends on the genome, the exceptional versatility in gene regulation provided by the spliceosome-mediated alternative splicing (AS) is essential in these adaptive processes. However, the functions of the spliceosome in plant stress responses are poorly understood. Here, we report the in-depth characterization of a U1 spliceosomal protein, AtU1A, in controlling AS of pre-mRNAs under salt stress and salt stress tolerance in Arabidopsis thaliana . The atu1a mutant was hypersensitive to salt stress and accumulated more reactive oxygen species (ROS) than the wild-type under salt stress. RNA-seq analysis revealed that AtU1A regulates AS of many genes, presumably through modulating recognition of 5′ splice sites. We showed that AtU1A is associated with the pre-mRNA of the ROS detoxification-related gene ACO1 and is necessary for the regulation of ACO1 AS. ACO1 is important for salt tolerance because ectopic expression of ACO1 in the atu1a mutant can partially rescue its salt hypersensitive phenotype. Our findings highlight the critical role of AtU1A as a regulator of pre-mRNA processing and salt tolerance in plants.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 24
    Publication Date: 2018-03-06
    Description: Sensitive detection of the single nucleotide variants in cell-free DNA (cfDNA) may provide great opportunity for minimally invasive diagnosis and prognosis of cancer and other related diseases. Here, we demonstrate a facile new strategy for quantitative measurement of cfDNA mutations at low abundance in the cancer patients’ plasma samples. The method takes advantage of a novel property of lambda exonuclease which effectively digests a 5′-fluorophore modified dsDNA with a 2-nt overhang structure and sensitively responds to the presence of mismatched base pairs in the duplex. It achieves a limit of detection as low as 0.02% (percentage of the mutant type) for BRAF V600E mutation, NRAS Q61R mutation and three types of EGFR mutations (G719S, T790M and L858R). The method enabled identification of BRAF V600E and EGFR L858R mutations in the plasma of different cancer patients within only 3.5 h. Moreover, the terminal structure-dependent reaction greatly simplifies the probe design and reduces the cost, and the assay only requires a regular real-time PCR machine. This new method may serve as a practical tool for quantitative measurement of low-abundance mutations in clinical samples for providing genetic mutation information with prognostic or therapeutic implications.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 25
    Publication Date: 2018-03-06
    Description: Trypanosomes are protistan parasites that diverged early in evolution from most eukaryotes. Their streamlined genomes are packed with arrays of tandemly linked genes that are transcribed polycistronically by RNA polymerase (pol) II. Individual mRNAs are processed from pre-mRNA by spliced leader (SL) trans splicing and polyadenylation. While there is no strong evidence that general transcription factors are needed for transcription initiation at these gene arrays, a RNA pol II transcription pre-initiation complex (PIC) is formed on promoters of SLRNA genes, which encode the small nuclear SL RNA, the SL donor in trans splicing. The factors that form the PIC are extremely divergent orthologues of the small nuclear RNA-activating complex, TBP, TFIIA, TFIIB, TFIIH, TFIIE and Mediator. Here, we functionally characterized a heterodimeric complex of unannotated, nuclear proteins that interacts with RNA pol II and is essential for PIC formation, SL RNA synthesis in vivo, SLRNA transcription in vitro , and parasite viability. These functional attributes suggest that the factor represents TFIIF although the amino acid sequences are too divergent to firmly make this conclusion. This work strongly indicates that early-diverged trypanosomes have orthologues of each and every general transcription factor, requiring them for the synthesis of SL RNA.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 26
    Publication Date: 2018-03-06
    Description: Large genomic rearrangements involve inversions, deletions and other structural changes that span Megabase segments of the human genome. This category of genetic aberration is the cause of many hereditary genetic disorders and contributes to pathogenesis of diseases like cancer. We developed a new algorithm called ZoomX for analysing barcode-linked sequence reads—these sequences can be traced to individual high molecular weight DNA molecules (〉50 kb). To generate barcode linked sequence reads, we employ a library preparation technology (10X Genomics) that uses droplets to partition and barcode DNA molecules. Using linked read data from whole genome sequencing, we identify large genomic rearrangements, typically greater than 200kb, even when they are only present in low allelic fractions. Our algorithm uses a Poisson scan statistic to identify genomic rearrangement junctions, determine counts of junction-spanning molecules and calculate a Fisher's exact test for determining statistical significance for somatic aberrations. Utilizing a well-characterized human genome, we benchmarked this approach to accurately identify large rearrangement. Subsequently, we demonstrated that our algorithm identifies somatic rearrangements when present in lower allelic fractions as occurs in tumors. We characterized a set of complex cancer rearrangements with multiple classes of structural aberrations and with possible roles in oncogenesis.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 27
    Publication Date: 2018-03-06
    Description: The mouse is widely used as system to study human genetic mechanisms. However, extensive rewiring of transcriptional regulatory networks often confounds translation of findings between human and mouse. Site-specific gain and loss of individual transcription factor binding sites (TFBS) has caused functional divergence of orthologous regulatory loci, and so we must look beyond this positional conservation to understand common themes of regulatory control. Fortunately, transcription factor co-binding patterns shared across species often perform conserved regulatory functions. These can be compared to ‘regulatory sentences’ that retain the same meanings regardless of sequence and species context. By analyzing TFBS co-occupancy patterns observed in four human and mouse cell types, we learned a regulatory grammar: the rules by which TFBS are combined into meaningful regulatory sentences. Different parts of this grammar associate with specific sets of functional annotations regardless of sequence conservation and predict functional signatures more accurately than positional conservation. We further show that both species-specific and conserved portions of this grammar are involved in gene expression divergence and human disease risk. These findings expand our understanding of transcriptional regulatory mechanisms, suggesting that phenotypic divergence and disease risk are driven by a complex interplay between deeply conserved and species-specific transcriptional regulatory pathways.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 28
    Publication Date: 2018-03-06
    Description: The pharmacological effects of antisense and siRNA oligonucleotides are hindered by the tendency of these molecules to become entrapped in endomembrane compartments thus failing to reach their targets in the cytosol or nucleus. We have previously used high throughput screening to identify small molecules that enhance the escape of oligonucleotides from intracellular membrane compartments and have termed such molecules OECs (oligonucleotide enhancing compounds). Here, we report on the structure–activity relationships of a family of OECs that are analogs of a hit that emerged from our original screen. These studies demonstrate key roles for the lipophilic aromatic groups, the tertiary nitrogen, and the carbamate moiety of the parent compound. We have also investigated the intracellular site of action of the OECs and have shown that activity is due to the release of oligonucleotides from intermediate endosomal compartments rather than from early endosomes or from highly acidic downstream compartments. At high concentrations of OECs toxicity occurs in a manner that is independent of caspases or of lysosomal cathepsins but instead involves increased plasma membrane permeability. Thus, in addition to describing specific characteristics of this family of OECs, the current study provides insights into basic mechanisms of oligonucleotide trafficking and their implications for oligonucleotide delivery.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 29
    Publication Date: 2018-03-06
    Description: Synthetic genetic sensors and circuits enable programmable control over timing and conditions of gene expression and, as a result, are increasingly incorporated into the control of complex and multi-gene pathways. Size and complexity of genetic circuits are growing, but stay limited by a shortage of regulatory parts that can be used without interference. Therefore, orthogonal expression and regulation systems are needed to minimize undesired crosstalk and allow for dynamic control of separate modules. This work presents a set of orthogonal expression systems for use in Escherichia coli based on heterologous sigma factors from Bacillus subtilis that recognize specific promoter sequences. Up to four of the analyzed sigma factors can be combined to function orthogonally between each other and toward the host. Additionally, the toolbox is expanded by creating promoter libraries for three sigma factors without loss of their orthogonal nature. As this set covers a wide range of transcription initiation frequencies, it enables tuning of multiple outputs of the circuit in response to different sensory signals in an orthogonal manner. This sigma factor toolbox constitutes an interesting expansion of the synthetic biology toolbox and may contribute to the assembly of more complex synthetic genetic systems in the future.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 30
    Publication Date: 2018-03-06
    Description: RNase III is a ribonuclease that recognizes and cleaves double-stranded RNA. Across bacteria, RNase III is involved in rRNA maturation, CRISPR RNA maturation, controlling gene expression, and turnover of messenger RNAs. Many organisms have only one RNase III while others have both a full-length RNase III and another version that lacks a double-stranded RNA binding domain (mini-III). The genome of the cyanobacterium Synechococcus sp . strain PCC 7002 (PCC 7002) encodes three homologs of RNase III, two full-length and one mini-III, that are not essential even when deleted in combination. To discern if each enzyme had distinct responsibilities, we collected and sequenced global RNA samples from the wild type strain, the single, double, and triple RNase III mutants. Approximately 20% of genes were differentially expressed in various mutants with some operons and regulons showing complex changes in expression levels between mutants. Two RNase III’s had a role in 23S rRNA maturation and the third was involved in copy number regulation one of six native plasmids. In vitro , purified RNase III enzymes were capable of cleaving some of the known Escherichia coli RNase III target sequences, highlighting the remarkably conserved substrate specificity between organisms yet complex regulation of gene expression.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 31
    Publication Date: 2018-03-06
    Description: Grainyhead (Grh)/CP2 transcription factors are highly conserved in multicellular organisms as key regulators of epithelial differentiation, organ development and skin barrier formation. In addition, they have been implicated as being tumor suppressors in a variety of human cancers. Despite their physiological importance, little is known about their structure and DNA binding mode. Here, we report the first structural study of mammalian Grh/CP2 factors. Crystal structures of the DNA-binding domains of grainyhead-like (Grhl) 1 and Grhl2 reveal a closely similar conformation with immunoglobulin-like core. Both share a common fold with the tumor suppressor p53, but differ in important structural features. The Grhl1 DNA-binding domain binds duplex DNA containing the consensus recognition element in a dimeric arrangement, supporting parsimonious target-sequence selection through two conserved arginine residues. We elucidate the molecular basis of a cancer-related mutation in Grhl1 involving one of these arginines, which completely abrogates DNA binding in biochemical assays and transcriptional activation of a reporter gene in a human cell line. Thus, our studies establish the structural basis of DNA target-site recognition by Grh transcription factors and reveal how tumor-associated mutations inactivate Grhl proteins. They may serve as points of departure for the structure-based development of Grh/CP2 inhibitors for therapeutic applications.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 32
    Publication Date: 2018-03-06
    Description: The DNA-dependent protein kinase (DNA-PK), consisting of the DNA binding Ku70/80 heterodimer and the catalytic subunit DNA-PKcs, has been well characterized in the non-homologous end-joining mechanism for DNA double strand break (DSB) repair and radiation resistance. Besides playing a role in DSB repair, DNA-PKcs is required for the cellular response to replication stress and participates in the ATR-Chk1 signaling pathway. However, the mechanism through which DNA-PKcs is recruited to stalled replication forks is still unclear. Here, we report that the apoptosis mediator p53-induced protein with a death domain (PIDD) is required to promote DNA-PKcs activity in response to replication stress. PIDD is known to interact with PCNA upon UV-induced replication stress. Our results demonstrate that PIDD is required to recruit DNA-PKcs to stalled replication forks through direct binding to DNA-PKcs at the N’ terminal region. Disruption of the interaction between DNA-PKcs and PIDD not only compromises the ATR association and regulation of DNA-PKcs, but also the ATR signaling pathway, intra-S-phase checkpoint and cellular resistance to replication stress. Taken together, our results indicate that PIDD, but not the Ku heterodimer, mediates the DNA-PKcs activity at stalled replication forks and facilitates the ATR signaling pathway in the cellular response to replication stress.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 33
    Publication Date: 2018-03-06
    Description: The double stranded DNA molecule undergoes drastic structural changes during biological processes such as transcription during which it opens locally under the action of RNA polymerases. Local spontaneous denaturation could contribute to this mechanism by promoting it. Supporting this idea, different biophysical studies have found an unexpected increase in the flexibility of DNA molecules with various sequences as a function of the temperature, which would be consistent with the formation of a growing number of locally denatured sequences. Here, we take advantage of our capacity to detect subtle changes occurring on DNA by using high throughput tethered particle motion to question the existence of bubbles in double stranded DNA under physiological salt conditions through their conformational impact on DNA molecules ranging from several hundreds to thousands of base pairs. Our results strikingly differ from previously published ones, as we do not detect any unexpected change in DNA flexibility below melting temperature. Instead, we measure a bending modulus that remains stable with temperature as expected for intact double stranded DNA.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 34
    Publication Date: 2018-03-06
    Description: Bifidobacterium breve represents one of the most abundant bifidobacterial species in the gastro-intestinal tract of breast-fed infants, where their presence is believed to exert beneficial effects. In the present study whole genome sequencing, employing the PacBio Single Molecule, Real-Time (SMRT) sequencing platform, combined with comparative genome analysis allowed the most extensive genetic investigation of this taxon. Our findings demonstrate that genes encoding Restriction/Modification (R/M) systems constitute a substantial part of the B. breve variable gene content (or variome). Using the methylome data generated by SMRT sequencing, combined with targeted Illumina bisulfite sequencing (BS-seq) and comparative genome analysis, we were able to detect methylation recognition motifs and assign these to identified B. breve R/M systems, where in several cases such assignments were confirmed by restriction analysis. Furthermore, we show that R/M systems typically impose a very significant barrier to genetic accessibility of B. breve strains, and that cloning of a methyltransferase-encoding gene may overcome such a barrier, thus allowing future functional investigations of members of this species.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 35
    Publication Date: 2018-03-06
    Description: Recent studies have reported the emerging role of microRNAs (miRNAs) in human cancers. We systematically characterized miRNA expression and editing in the human brain, which displays the highest number of A-to-I RNA editing sites among human tissues, and in de novo glioblastoma brain cancer. We identified 299 miRNAs altered in their expression and 24 miRNAs differently edited in human brain compared to glioblastoma tissues. We focused on the editing site within the miR-589–3p seed . MiR-589–3p is a unique miRNA almost fully edited (∼100%) in normal brain and with a consistent editing decrease in glioblastoma. The edited version of miR-589–3p inhibits glioblastoma cell proliferation, migration and invasion, while the unedited version boosts cell proliferation and motility/invasion, thus being a potential cancer-promoting factor. We demonstrated that the editing of this miRNA is mediated by ADAR2, and retargets miR-589–3p from the tumor-suppressor PCDH9 to ADAM12 , which codes for the metalloproteinase 12 promoting glioblastoma invasion. Overall, our study dissects the role of a unique brain-specific editing site within miR-589–3p, with important anticancer features, and highlights the importance of RNA editing as an essential player not only for diversifying the genomic message but also for correcting not-tolerable/critical genomic coding sites.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 36
    Publication Date: 2018-03-06
    Description: The PolyC binding proteins (PCBPs) impact alternative splicing of a subset of mammalian genes that are enriched in basic cellular functions. Here, we focus our analysis on PCBP-controlled cassette exon-splicing within the cell cycle control regulator cyclin-dependent kinase-2 (CDK2) transcript. We demonstrate that PCBP binding to a C-rich polypyrimidine tract (PPT) preceding exon 5 of the CDK2 transcript enhances cassette exon inclusion. This splice enhancement is U2AF65-independent and predominantly reflects actions of the PCBP1 isoform. Remarkably, PCBPs’ control of CDK2 ex5 splicing has evolved subsequent to mammalian divergence via conversion of constitutive exon 5 inclusion in the mouse CDK2 transcript to PCBP-responsive exon 5 alternative splicing in humans. Importantly, exclusion of exon 5 from the hCDK2 transcript dramatically represses the expression of CDK2 protein with a corresponding perturbation in cell cycle kinetics. These data highlight a recently evolved post-transcriptional pathway in primate species with the potential to modulate cell cycle control.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 37
    Publication Date: 2018-03-06
    Description: Hepatic miR-122 can serve as a pro-apoptotic factor to suppress tumorigenesis. The underlying mechanism, however, remains incompletely understood. Here we present the first evidence that miR-122 promotes hepatocellular carcinoma cell apoptosis through directly silencing the biogenesis of cell survival oncomiR miR-21 at posttranscriptional level. We find that miR-122 is strongly expressed in primary liver cell nucleus but its nuclear localization is markedly decreased in transformed cells particularly in chemoresistant tumor cells. MiRNA profiling and RT-qPCR confirm an inverse correlation between miR-122 and miR-21 in hepatocellular carcinoma tissues/cells, and increasing or decreasing nuclear level of miR-122 respectively reduces or increases miR-21 expression. Mechanistically, nuclear miR-122 suppresses miR-21 maturation via binding to a 19-nt UG-containing recognition element in the basal region of pri-miR-21 and preventing the Drosha-DGCR8 microprocessor's conversion of pri-miR-21 into pre-miR-21. Furthermore, both in vitro and in vivo studies demonstrate that nuclear miR-122 participates in the regulation of HCC cell apoptosis through modulating the miR-21-targeted programmed cell death 4 (PDCD4) signal pathway.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 38
    Publication Date: 2018-03-06
    Description: EZR, a member of the ezrin-radixin-moesin (ERM) family, is involved in multiple aspects of cell migration and cancer. SMYD3, a histone H3–lysine 4 (H3–K4)-specific methyltransferase, regulates EZR gene transcription, but the molecular mechanisms of epigenetic regulation remain ill-defined. Here, we show that antisense lncRNA EZR-AS1 was positively correlated with EZR expression in both human esophageal squamous cell carcinoma (ESCC) tissues and cell lines. Both in vivo and in vitro studies revealed that EZR-AS1 promoted cell migration through up-regulation of EZR expression. Mechanistically, antisense lncRNA EZR-AS1 formed a complex with RNA polymerase II to activate the transcription of EZR. Moreover, EZR-AS1 could recruit SMYD3 to a binding site, present in a GC-rich region downstream of the EZR promoter, causing the binding of SMYD3 and local enrichment of H3K4me3. Finally, the interaction of EZR-AS1 with SMYD3 further enhanced EZR transcription and expression. Our findings suggest that antisense lncRNA EZR-AS1, as a member of an RNA polymerase complex and through enhanced SMYD3-dependent H3K4 methylation, plays an important role in enhancing transcription of the EZR gene to promote the mobility and invasiveness of human cancer cells.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 39
    Publication Date: 2018-03-06
    Description: Using molecular dynamics simulations, we show here that growing plectonemes resulting from transcription-induced supercoiling have the ability to actively push cohesin rings along chromatin fibres. The pushing direction is such that within each topologically associating domain (TAD) cohesin rings forming handcuffs move from the source of supercoiling, constituted by RNA polymerase with associated DNA topoisomerase TOP1, towards borders of TADs, where supercoiling is released by topoisomerase TOPIIB. Cohesin handcuffs are pushed by continuous flux of supercoiling that is generated by transcription and is then progressively released by action of TOPIIB located at TADs borders. Our model explains what can be the driving force of chromatin loop extrusion and how it can be ensured that loops grow quickly and in a good direction. In addition, the supercoiling-driven loop extrusion mechanism is consistent with earlier explanations proposing why TADs flanked by convergent CTCF binding sites form more stable chromatin loops than TADs flanked by divergent CTCF binding sites. We discuss the role of supercoiling in stimulating enhancer promoter contacts and propose that transcription of eRNA sends the first wave of supercoiling that can activate mRNA transcription in a given TAD.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 40
    Publication Date: 2018-03-06
    Description: Endothelial cells (ECs) differentiate from mesodermal progenitors during vasculogenesis. By comparing changes in chromatin interactions between human umbilical vein ECs, embryonic stem cells and mesendoderm cells, we identified regions exhibiting EC-specific compartmentalization and changes in the degree of connectivity within topologically associated domains (TADs). These regions were characterized by EC-specific transcription, binding of lineage-determining transcription factors and cohesin. In addition, we identified 1200 EC-specific long-range interactions (LRIs) between TADs. Most of the LRIs were connected between regions enriched for H3K9me3 involving pericentromeric regions, suggesting their involvement in establishing compartmentalization of heterochromatin during differentiation. Second, we provide evidence that EC-specific LRIs correlate with changes in the hierarchy of chromatin aggregation. Despite these rearrangements, the majority of chromatin domains fall within a pre-established hierarchy conserved throughout differentiation. Finally, we investigated the effect of hypoxia on chromatin organization. Although hypoxia altered the expression of hundreds of genes, minimal effect on chromatin organization was seen. Nevertheless, 70% of hypoxia-inducible genes situated within a TAD bound by HIF1α suggesting that transcriptional responses to hypoxia largely depend on pre-existing chromatin organization. Collectively our results show that large structural rearrangements establish chromatin architecture required for functional endothelium and this architecture remains largely unchanged in response to hypoxia.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 41
    Publication Date: 2018-03-06
    Description: The vast majority of microorganisms on Earth reside in often-inseparable environment-specific communities—microbiomes. Meta-genomic/-transcriptomic sequencing could reveal the otherwise inaccessible functionality of microbiomes. However, existing analytical approaches focus on attributing sequencing reads to known genes/genomes, often failing to make maximal use of available data. We created faser (functional annotation of sequencing reads) , an algorithm that is optimized to map reads to molecular functions encoded by the read-correspondent genes. The mi-faser microbiome analysis pipeline, combining faser with our manually curated reference database of protein functions, accurately annotates microbiome molecular functionality. mi-faser ’s minutes-per-microbiome processing speed is significantly faster than that of other methods, allowing for large scale comparisons. Microbiome function vectors can be compared between different conditions to highlight environment-specific and/or time-dependent changes in functionality. Here, we identified previously unseen oil degradation-specific functions in BP oil-spill data, as well as functional signatures of individual-specific gut microbiome responses to a dietary intervention in children with Prader–Willi syndrome. Our method also revealed variability in Crohn's Disease patient microbiomes and clearly distinguished them from those of related healthy individuals. Our analysis highlighted the microbiome role in CD pathogenicity, demonstrating enrichment of patient microbiomes in functions that promote inflammation and that help bacteria survive it.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 42
    Publication Date: 2018-03-06
    Description: Over the past several decades, the concept of using molecules composed partially or wholly of nucleic acids as therapeutic moieties, and the modification of such molecules via synthetic strategies, has been discussed and actively pursued by many academic and industrial laboratories. As compared to small molecules (and more recently antibodies and other protein-based drugs) the use of oligonucleotides and related compounds as therapeutics has advanced more slowly—a fact that is not surprising, given the challenges that such molecules (and their investigators) face. Nucleic acids are large, highly charged, rapidly degraded and cleared from the body, and offer generally poor pharmacological properties. The development of nucleic acids as potential therapeutic agents has nonetheless moved forward at a steady pace, owing in large part to important discoveries regarding their role in regulating gene expression, and in part to the development of increasingly sophisticated synthetic and biochemical methods to alter many of the physical properties that might limit their potential as drugs.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 43
    Publication Date: 2018-03-06
    Description: We report, herein, a new class of RNAi trigger molecules based on the unconventional parallel hybridization of two oligonucleotide chains. We have prepared and studied several parallel stranded ( ps ) duplexes, in which the parallel orientation is achieved through incorporation of isoguanine and isocytosine to form reverse Watson-Crick base pairs in ps -DNA:DNA,  ps -DNA:RNA,  ps -(DNA-2′F-ANA):RNA, and ps -DNA:2′F-RNA duplexes. The formation of these duplexes was confirmed by UV melting experiments, FRET and CD studies. In addition, NMR structural studies were conducted on a ps -DNA:RNA hybrid for the first time. Finally, we provide evidence for the unprecedented finding that ps -DNA:RNA and  ps -DNA:2′F-RNA hybrids can engage the RNAi pathway to silence gene expression in vitro .
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 44
    Publication Date: 2018-03-06
    Description: Long interspersed nuclear element 1 is an autonomous non-long terminal repeat retrotransposon that comprises ∼17% of the human genome. Its spontaneous retrotransposition and the accumulation of heritable L1 insertions can potentially result in genome instability and sporadic disorders. Moloney leukemia virus 10 homolog (MOV10), a putative RNA helicase, has been implicated in inhibiting L1 replication, although its underlying mechanism of action remains obscure. Moreover, the physiological relevance of MOV10-mediated L1 regulation in human disease has not yet been examined. Using a proteomic approach, we identified RNASEH2 as a binding partner of MOV10. We show that MOV10 interacts with RNASEH2, and their interplay is crucial for restricting L1 retrotransposition. RNASEH2 and MOV10 co-localize in the nucleus, and RNASEH2 binds to L1 RNAs in a MOV10-dependent manner. Small hairpin RNA-mediated depletion of either RNASEH2A or MOV10 results in an accumulation of L1-specific RNA-DNA hybrids, suggesting they contribute to prevent formation of vital L1 heteroduplexes during retrotransposition. Furthermore, we show that RNASEH2-MOV10-mediated L1 restriction downregulates expression of the rheumatoid arthritis-associated inflammatory cytokines and matrix-degrading proteinases in synovial cells, implicating a potential causal relationship between them and disease development in terms of disease predisposition.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 45
    Publication Date: 2018-03-06
    Description: Androgen receptor (AR) splice variants (ARVs) are implicated in development of castration-resistant prostate cancer (CRPC). Upregulation of ARVs often correlates with persistent AR activity after androgen deprivation therapy (ADT). However, the genomic and epigenomic characteristics of ARV-dependent cistrome and the disease relevance of ARV-mediated transcriptome remain elusive. Through integrated chromatin immunoprecipitation coupled sequencing (ChIP-seq) and RNA sequencing (RNA-seq) analysis, we identified ARV-preferential-binding sites (ARV-PBS) and a set of genes preferentially transactivated by ARVs in CRPC cells. ARVs preferentially bind to enhancers located in nucleosome-depleted regions harboring the full AR-response element (AREfull), while full-length AR (ARFL)-PBS are enhancers resided in closed chromatin regions containing the composite FOXA1-nnnn-AREhalf motif. ARV-PBS exclusively overlapped with AR binding sites in castration-resistant (CR) tumors in patients and ARV-preferentially activated genes were up-regulated in abiraterone-resistant patient specimens. Expression of ARV-PBS target genes, such as oncogene RAP2A and cell cycle gene E2F7, were significantly associated with castration resistance, poor survival and tumor progression. We uncover distinct genomic and epigenomic features of ARV-PBS, highlighting that ARVs are useful tools to depict AR-regulated oncogenic genome and epigenome landscapes in prostate cancer. Our data also suggest that the ARV-preferentially activated transcriptional program could be targeted for effective treatment of CRPC.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 46
    Publication Date: 2018-03-06
    Description: Single cell whole-genome sequencing (scWGS) is providing novel insights into the nature of genetic heterogeneity in normal and diseased cells. However, the whole-genome amplification process required for scWGS introduces biases into the resulting sequencing that can confound downstream analysis. Here, we present a statistical method, with an accompanying package PaSD-qc (Power Spectral Density-qc), that evaluates the properties and quality of single cell libraries. It uses a modified power spectral density to assess amplification uniformity, amplicon size distribution, autocovariance and inter-sample consistency as well as to identify chromosomes with aberrant read-density profiles due either to copy alterations or poor amplification. These metrics provide a standard way to compare the quality of single cell samples as well as yield information necessary to improve variant calling strategies. We demonstrate the usefulness of this tool in comparing the properties of scWGS protocols, identifying potential chromosomal copy number variation, determining chromosomal and subchromosomal regions of poor amplification, and selecting high-quality libraries from low-coverage data for deep sequencing. The software is available free and open-source at https://github.com/parklab/PaSDqc .
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 47
    Publication Date: 2018-03-06
    Description: Cellular DNA/RNA tags (barcodes) allow for multiplexed cell lineage tracing and neuronal projection mapping with cellular resolution. Conventional approaches to reading out cellular barcodes trade off spatial resolution with throughput. Bulk sequencing achieves high throughput but sacrifices spatial resolution, whereas manual cell picking has low throughput. In situ sequencing could potentially achieve both high spatial resolution and high throughput, but current in situ sequencing techniques are inefficient at reading out cellular barcodes. Here we describe BaristaSeq, an optimization of a targeted, padlock probe-based technique for in situ barcode sequencing compatible with Illumina sequencing chemistry. BaristaSeq results in a five-fold increase in amplification efficiency, with a sequencing accuracy of at least 97%. BaristaSeq could be used for barcode-assisted lineage tracing, and to map long-range neuronal projections.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 48
    Publication Date: 2018-03-06
    Description: PCR amplicon deep sequencing continues to transform the investigation of genetic diversity in viral, bacterial, and eukaryotic populations. In eukaryotic populations such as Plasmodium falciparum infections, it is important to discriminate sequences differing by a single nucleotide polymorphism. In bacterial populations, single-base resolution can provide improved resolution towards species and strains. Here, we introduce the SeekDeep suite built around the qluster algorithm, which is capable of accurately building de novo clusters representing true, biological local haplotypes differing by just a single base. It outperforms current software, particularly at low frequencies and at low input read depths, whether resolving single-base differences or traditional OTUs. SeekDeep is open source and works with all major sequencing technologies, making it broadly useful in a wide variety of applications of amplicon deep sequencing to extract accurate and maximal biologic information.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 49
    Publication Date: 2017-01-05
    Description: GenBank ® ( www.ncbi.nlm.nih.gov/genbank/ ) is a comprehensive database that contains publicly available nucleotide sequences for 370 000 formally described species. These sequences are obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole genome shotgun (WGS) and environmental sampling projects. Most submissions are made using the web-based BankIt or the NCBI Submission Portal. GenBank staff assign accession numbers upon data receipt. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. GenBank is accessible through the NCBI Nucleotide database, which links to related information such as taxonomy, genomes, protein sequences and structures, and biomedical journal literature in PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP. Recent updates include changes to policies regarding sequence identifiers, an improved 16S submission wizard, targeted loci studies, the ability to submit methylation and BioNano mapping files, and a database of anti-microbial resistance genes.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 50
    Publication Date: 2017-01-05
    Description: The abnormal transcriptional regulation of non-coding RNAs (ncRNAs) and protein-coding genes (PCGs) is contributed to various biological processes and linked with human diseases, but the underlying mechanisms remain elusive. In this study, we developed ChIPBase v2.0 ( http://rna.sysu.edu.cn/chipbase/ ) to explore the transcriptional regulatory networks of ncRNAs and PCGs. ChIPBase v2.0 has been expanded with ~10 200 curated ChIP-seq datasets, which represent about 20 times expansion when comparing to the previous released version. We identified thousands of binding motif matrices and their binding sites from ChIP-seq data of DNA-binding proteins and predicted millions of transcriptional regulatory relationships between transcription factors (TFs) and genes. We constructed ‘Regulator’ module to predict hundreds of TFs and histone modifications that were involved in or affected transcription of ncRNAs and PCGs. Moreover, we built a web-based tool, Co-Expression, to explore the co-expression patterns between DNA-binding proteins and various types of genes by integrating the gene expression profiles of ~10 000 tumor samples and ~9100 normal tissues and cell lines. ChIPBase also provides a ChIP-Function tool and a genome browser to predict functions of diverse genes and visualize various ChIP-seq data. This study will greatly expand our understanding of the transcriptional regulations of ncRNAs and PCGs.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 51
    Publication Date: 2017-01-05
    Description: We present an update of the Eukaryotic Promoter Database EPD ( http://epd.vital-it.ch ), more specifically on the EPDnew division, which contains comprehensive organisms-specific transcription start site (TSS) collections automatically derived from next generation sequencing (NGS) data. Thanks to the abundant release of new high-throughput transcript mapping data (CAGE, TSS-seq, GRO-cap) the database could be extended to plant and fungal species. We further report on the expansion of the mass genome annotation (MGA) repository containing promoter-relevant chromatin profiling data and on improvements for the EPD entry viewers. Finally, we present a new data access tool, ChIP-Extract, which enables computational biologists to extract diverse types of promoter-associated data in numerical table formats that are readily imported into statistical analysis platforms such as R.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 52
    Publication Date: 2017-01-05
    Description: GETPrime ( http://bbcftools.epfl.ch/getprime ) is a database with a web frontend providing gene- and transcript-specific, pre-computed qPCR primer pairs. The primers have been optimized for genome-wide specificity and for allowing the selective amplification of one or several splice variants of most known genes. To ease selection, primers have also been ranked according to defined criteria such as genome-wide specificity (with BLAST), amplicon size, and isoform coverage. Here, we report a major upgrade (2.0) of the database: eight new species (yeast, chicken, macaque, chimpanzee, rat, platypus, pufferfish, and Anolis carolinensis ) now complement the five already included in the previous version (human, mouse, zebrafish, fly, and worm). Furthermore, the genomic reference has been updated to Ensembl v81 (while keeping earlier versions for backward compatibility) as a result of re-designing the back-end database and automating the import of relevant sections of the Ensembl database in species-independent fashion. This also allowed us to map known polymorphisms to the primers (on average three per primer for human), with the aim of reducing experimental error when targeting specific strains or individuals. Another consequence is that the inclusion of future Ensembl releases and other species has now become a relatively straightforward task.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 53
    facet.materialart.
    Unknown
    Oxford University Press
    Publication Date: 2017-01-05
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 54
    facet.materialart.
    Unknown
    Oxford University Press
    Publication Date: 2017-01-05
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 55
    Publication Date: 2017-01-05
    Description: R-loopDB ( http://rloop.bii.a-star.edu.sg ) was originally constructed as a collection of computationally predicted R-loop forming sequences (RLFSs) in the human genic regions. The renewed R-loopDB provides updates, improvements and new options, including access to recent experimental data. It includes genome-scale prediction of RLFSs for humans, six other animals and yeast. Using the extended quantitative model of RLFSs (QmRLFS), we significantly increased the number of RLFSs predicted in the human genes and identified RLFSs in other organism genomes. R-loopDB allows searching of RLFSs in the genes and in the 2 kb upstream and downstream flanking sequences of any gene. R-loopDB exploits the Ensembl gene annotation system, providing users with chromosome coordinates, sequences, gene and genomic data of the 1 565 795 RLFSs distributed in 121 056 genic or proximal gene regions of the covered organisms. It provides a comprehensive annotation of Ensembl RLFS-positive genes including 93 454 protein coding genes, 12 480 long non-coding RNA and 7 568 small non-coding RNA genes and 7 554 pseudogenes. Using new interface and genome viewers of R-loopDB, users can search the gene(s) in multiple species with keywords in a single query. R-loopDB provides tools to carry out comparative evolution and genome-scale analyses in R-loop biology.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 56
    Publication Date: 2017-01-05
    Description: RNAcentral is a database of non-coding RNA (ncRNA) sequences that aggregates data from specialised ncRNA resources and provides a single entry point for accessing ncRNA sequences of all ncRNA types from all organisms. Since its launch in 2014, RNAcentral has integrated twelve new resources, taking the total number of collaborating database to 22, and began importing new types of data, such as modified nucleotides from MODOMICS and PDB. We created new species-specific identifiers that refer to unique RNA sequences within a context of single species. The website has been subject to continuous improvements focusing on text and sequence similarity searches as well as genome browsing functionality. All RNAcentral data is provided for free and is available for browsing, bulk downloads, and programmatic access at http://rnacentral.org/ .
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 57
    Publication Date: 2017-01-05
    Description: SNP2TFBS is a computational resource intended to support researchers investigating the molecular mechanisms underlying regulatory variation in the human genome. The database essentially consists of a collection of text files providing specific annotations for human single nucleotide polymorphisms (SNPs), namely whether they are predicted to abolish, create or change the affinity of one or several transcription factor (TF) binding sites. A SNP's effect on TF binding is estimated based on a position weight matrix (PWM) model for the binding specificity of the corresponding factor. These data files are regenerated at regular intervals by an automatic procedure that takes as input a reference genome, a comprehensive SNP catalogue and a collection of PWMs. SNP2TFBS is also accessible over a web interface, enabling users to view the information provided for an individual SNP, to extract SNPs based on various search criteria, to annotate uploaded sets of SNPs or to display statistics about the frequencies of binding sites affected by selected SNPs. Homepage: http://ccg.vital-it.ch/snp2tfbs/ .
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 58
    Publication Date: 2017-01-05
    Description: The 2017 update of NGSmethDB stores whole genome methylomes generated from short-read data sets obtained by bisulfite sequencing (WGBS) technology. To generate high-quality methylomes, stringent quality controls were integrated with third-part software, adding also a two-step mapping process to exploit the advantages of the new genome assembly models. The samples were all profiled under constant parameter settings, thus enabling comparative downstream analyses. Besides a significant increase in the number of samples, NGSmethDB now includes two additional data-types, which are a valuable resource for the discovery of methylation epigenetic biomarkers: (i) differentially methylated single-cytosines; and (ii) methylation segments (i.e. genome regions of homogeneous methylation). The NGSmethDB back-end is now based on MongoDB , a NoSQL hierarchical database using JSON-formatted documents and dynamic schemas, thus accelerating sample comparative analyses. Besides conventional database dumps, track hubs were implemented, which improved database access, visualization in genome browsers and comparative analyses to third-part annotations. In addition, the database can be also accessed through a RESTful API. Lastly, a Python client and a multiplatform virtual machine allow for program-driven access from user desktop. This way, private methylation data can be compared to NGSmethDB without the need to upload them to public servers. Database website: http://bioinfo2.ugr.es/NGSmethDB .
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 59
    Publication Date: 2017-01-05
    Description: Increasing evidence has revealed that RNA subcellular localization is a very important feature for deeply understanding RNA's biological functions after being transported into intra- or extra-cellular regions. RNALocate is a web-accessible database that aims to provide a high-quality RNA subcellular localization resource and facilitate future researches on RNA function or structure. The current version of RNALocate documents more than 37 700 manually curated RNA subcellular localization entries with experimental evidence, involving more than 21 800 RNAs with 42 subcellular localizations in 65 species, mainly including Homo sapiens, Mus musculus and Saccharomyces cerevisiae etc. Besides, RNA homology, sequence and interaction data have also been integrated into RNALocate. Users can access these data through online search, browse, blast and visualization tools. In conclusion, RNALocate will be of help in elucidating the entirety of RNA subcellular localization, and developing new prediction methods. The database is available at http://www.rna-society.org/rnalocate/ .
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 60
    Publication Date: 2017-01-05
    Description: We present three clustered protein sequence databases, Uniclust90, Uniclust50, Uniclust30 and three databases of multiple sequence alignments (MSAs), Uniboost10, Uniboost20 and Uniboost30, as a resource for protein sequence analysis, function prediction and sequence searches. The Uniclust databases cluster UniProtKB sequences at the level of 90%, 50% and 30% pairwise sequence identity. Uniclust90 and Uniclust50 clusters showed better consistency of functional annotation than those of UniRef90 and UniRef50, owing to an optimised clustering pipeline that runs with our MMseqs2 software for fast and sensitive protein sequence searching and clustering. Uniclust sequences are annotated with matches to Pfam, SCOP domains, and proteins in the PDB, using our HHblits homology detection tool. Due to its high sensitivity, Uniclust contains 17% more Pfam domain annotations than UniProt. Uniboost MSAs of three diversities are built by enriching the Uniclust30 MSAs with local sequence matches from MMseqs2 profile searches through Uniclust30. All databases can be downloaded from the Uniclust server at uniclust.mmseqs.com. Users can search clusters by keywords and explore their MSAs, taxonomic representation, and annotations. Uniclust is updated every two months with the new UniProt release.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 61
    Publication Date: 2017-01-05
    Description: Transcription factors (TFs) play a pivotal role in transcriptional regulation, making them crucial for cell survival and important biological functions. For the regulation of transcription, interactions of different regulatory proteins known as transcription co-factors (TcoFs) and TFs are essential in forming necessary protein complexes. Although TcoFs themselves do not bind DNA directly, their influence on transcriptional regulation and initiation, although indirect, has been shown to be significant, with the functionality of TFs strongly influenced by the presence of TcoFs. In the TcoF-DB v2 database, we collect information on TcoFs. In this article, we describe updates and improvements implemented in TcoF-DB v2. TcoF-DB v2 provides several new features that enables exploration of the roles of TcoFs. The content of the database has significantly expanded, and is enriched with information from Gene Ontology, biological pathways, diseases and molecular signatures. TcoF-DB v2 now includes many more TFs; has substantially increased the number of human TcoFs to 958, and now includes information on mouse (418 new TcoFs). TcoF-DB v2 enables the exploration of information on TcoFs and allows investigations into their influence on transcriptional regulation in humans and mice. TcoF-DB v2 can be accessed at http://tcofdb.org/ .
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 62
    Publication Date: 2017-01-05
    Description: InterPro ( http://www.ebi.ac.uk/interpro/ ) is a freely available database used to classify protein sequences into families and to predict the presence of important domains and sites. InterProScan is the underlying software that allows both protein and nucleic acid sequences to be searched against InterPro's predictive models, which are provided by its member databases. Here, we report recent developments with InterPro and its associated software, including the addition of two new databases (SFLD and CDD), and the functionality to include residue-level annotation and prediction of intrinsic disorder. These developments enrich the annotations provided by InterPro, increase the overall number of residues annotated and allow more specific functional inferences.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 63
    Publication Date: 2017-01-05
    Description: The latest version of the CATH-Gene3D protein structure classification database has recently been released (version 4.1, http://www.cathdb.info ). The resource comprises over 300 000 domain structures and over 53 million protein domains classified into 2737 homologous superfamilies, doubling the number of predicted protein domains in the previous version. The daily-updated CATH-B, which contains our very latest domain assignment data, provides putative classifications for over 100 000 additional protein domains. This article describes developments to the CATH-Gene3D resource over the last two years since the publication in 2015, including: significant increases to our structural and sequence coverage; expansion of the functional families in CATH; building a support vector machine (SVM) to automatically assign domains to superfamilies; improved search facilities to return alignments of query sequences against multiple sequence alignments; the redesign of the web pages and download site.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 64
    Publication Date: 2017-01-05
    Description: Evolutionary Classification Of protein Domains (ECOD) ( http://prodata.swmed.edu/ecod ) comprehensively classifies protein with known spatial structures maintained by the Protein Data Bank (PDB) into evolutionary groups of protein domains. ECOD relies on a combination of automatic and manual weekly updates to achieve its high accuracy and coverage with a short update cycle. ECOD classifies the approximately 120 000 depositions of the PDB into more than 500 000 domains in ~3400 homologous groups. We show the performance of the weekly update pipeline since the release of ECOD, describe improvements to the ECOD website and available search options, and discuss novel structures and homologous groups that have been classified in the recent updates. Finally, we discuss the future directions of ECOD and further improvements planned for the hierarchy and update process.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 65
    Publication Date: 2017-01-05
    Description: In this work, we developed a database WERAM ( http://weram.biocuckoo.org/ ) for histone acetyltransferases, histone deacetylases, histone methyltransferases, histone demethylases and acetyl- or methyl-binding proteins, which catalyze, remove and recognize histone acetylation and methylation sites as ‘writers’, ‘erasers’ and ‘readers’, and synergistically determine the ‘histone code’. From the scientific literature, we totally collected over 580 experimentally identified histone regulators from eight model organisms, including Homo sapiens, Mus musculus, Rattus norvegicus, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana, Schizosaccharomyces pombe and Saccharomyces cerevisiae . We also collected ~900 site-specific regulator-histone relations from the eight species. According to the experimental evidence, known histone regulators were classified into distinct families. To computationally detect more proteins in eukaryotes, we constructed hidden Markov model (HMM) profiles for histone regulator families. For families without HMM profiles, we also conducted orthologous searches. Totally, WERAM database contained more than 20 thousand non-redundant histone regulators from 148 eukaryotes. The detailed annotations and classification information of histone regulators were provided, together with site-specific histone substrates if available.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 66
    Publication Date: 2017-01-05
    Description: The database JET2 Viewer, openly accessible at http://www.jet2viewer.upmc.fr/ , reports putative protein binding sites for all three-dimensional (3D) structures available in the Protein Data Bank (PDB). This knowledge base was generated by applying the computational method JET 2 at large-scale on more than 20 000 chains. JET 2 strategy yields very precise predictions of interacting surfaces and unravels their evolutionary process and complexity. JET2 Viewer provides an online intelligent display, including interactive 3D visualization of the binding sites mapped onto PDB structures and suitable files recording JET 2 analyses. Predictions were evaluated on more than 15 000 experimentally characterized protein interfaces. This is, to our knowledge, the largest evaluation of a protein binding site prediction method. The overall performance of JET 2 on all interfaces are: Sen = 52.52, PPV = 51.24, Spe = 80.05, Acc = 75.89. The data can be used to foster new strategies for protein–protein interactions modulation and interaction surface redesign.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 67
    Publication Date: 2017-01-05
    Description: RepeatsDB 2.0 (URL: http://repeatsdb.bio.unipd.it/ ) is an update of the database of annotated tandem repeat protein structures. Repeat proteins are a widespread class of non-globular proteins carrying heterogeneous functions involved in several diseases. Here we provide a new version of RepeatsDB with an improved classification schema including high quality annotations for ~5400 protein structures. RepeatsDB 2.0 features information on start and end positions for the repeat regions and units for all entries. The extensive growth of repeat unit characterization was possible by applying the novel ReUPred annotation method over the entire Protein Data Bank, with data quality is guaranteed by an extensive manual validation for 〉60% of the entries. The updated web interface includes a new search engine for complex queries and a fully re-designed entry page for a better overview of structural data. It is now possible to compare unit positions, together with secondary structure, fold information and Pfam domains. Moreover, a new classification level has been introduced on top of the existing scheme as an independent layer for sequence similarity relationships at 40%, 60% and 90% identity.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 68
    Publication Date: 2017-01-05
    Description: All cellular life contains an extensive array of membrane transport proteins. The vast majority of these transporters have not been experimentally characterized. We have developed a bioinformatic pipeline to identify and annotate complete sets of transporters in any sequenced genome. This pipeline is now fully automated enabling it to better keep pace with the accelerating rate of genome sequencing. This manuscript describes TransportDB 2.0 ( http://www.membranetransport.org/transportDB2/ ), a completely updated version of TransportDB, which provides access to the large volumes of data generated by our automated transporter annotation pipeline. The TransportDB 2.0 web portal has been rebuilt to utilize contemporary JavaScript libraries, providing a highly interactive interface to the annotation information, and incorporates analysis tools that enable users to query the database on a number of levels. For example, TransportDB 2.0 includes tools that allow users to select annotated genomes of interest from the thousands of species held in the database and compare their complete transporter complements.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 69
    Publication Date: 2017-01-05
    Description: SWISS-MODEL Repository (SMR) is a database of annotated 3D protein structure models generated by the automated SWISS-MODEL homology modeling pipeline. It currently holds 〉400 000 high quality models covering almost 20% of Swiss-Prot/UniProtKB entries. In this manuscript, we provide an update of features and functionalities which have been implemented recently. We address improvements in target coverage, model quality estimates, functional annotations and improved in-page visualization. We also introduce a new update concept which includes regular updates of an expanded set of core organism models and UniProtKB-based targets, complemented by user-driven on-demand update of individual models. With the new release of the modeling pipeline, SMR has implemented a REST-API and adopted an open licencing model for accessing model coordinates, thus enabling bulk download for groups of targets fostering re-use of models in other contexts. SMR can be accessed at https://swissmodel.expasy.org/repository .
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 70
    Publication Date: 2017-01-05
    Description: The Database of Protein Disorder (DisProt, URL: www.disprot.org ) has been significantly updated and upgraded since its last major renewal in 2007. The current release holds information on more than 800 entries of IDPs/IDRs, i.e. intrinsically disordered proteins or regions that exist and function without a well-defined three-dimensional structure. We have re-curated previous entries to purge DisProt from conflicting cases, and also upgraded the functional classification scheme to reflect continuous advance in the field in the past 10 years or so. We define IDPs as proteins that are disordered along their entire sequence, i.e. entirely lack structural elements, and IDRs as regions that are at least five consecutive residues without well-defined structure. We base our assessment of disorder strictly on experimental evidence, such as X-ray crystallography and nuclear magnetic resonance (primary techniques) and a broad range of other experimental approaches (secondary techniques). Confident and ambiguous annotations are highlighted separately. DisProt 7.0 presents classified knowledge regarding the experimental characterization and functional annotations of IDPs/IDRs, and is intended to provide an invaluable resource for the research community for a better understanding structural disorder and for developing better computational tools for studying disordered proteins.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 71
    Publication Date: 2017-01-05
    Description: The Protein Circular Dichroism Data Bank (PCDDB) has been in operation for more than 5 years as a public repository for archiving circular dichroism spectroscopic data and associated bioinformatics and experimental metadata. Since its inception, many improvements and new developments have been made in data display, searching algorithms, data formats, data content, auxillary information, and validation techniques, as well as, of course, an increase in the number of holdings. It provides a site ( http://pcddb.cryst.bbk.ac.uk ) for authors to deposit experimental data as well as detailed information on methods and calculations associated with published work. It also includes links for each entry to bioinformatics databases. The data are freely available to accessors either as single files or as complete data bank downloads. The PCDDB has found broad usage by the structural biology, bioinformatics, analytical and pharmaceutical communities, and has formed the basis for new software and methods developments.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 72
    Publication Date: 2017-01-05
    Description: The Membranome database was developed to assist analysis and computational modeling of single-pass (bitopic) transmembrane (TM) proteins and their complexes by providing structural information about these proteins on a genomic scale. The database currently collects data on 〉6000 bitopic proteins from Homo sapiens, Arabidopsis thaliana, Dictyostelium discoideum, Saccharomyces cerevisiae, Escherichia coli and Methanocaldococcus jannaschii . It presents the following data: (i) hierarchical classification of bitopic proteins into 15 functional classes, 689 structural superfamilies and 1404 families; (ii) 446 complexes of bitopic proteins with known three-dimensional (3D) structures classified into 129 families; (iii) computationally generated three-dimensional models of TM α-helices positioned in membranes; (iv) amino acid sequences, domain architecture, functional annotation and available experimental structures of bitopic proteins; (v) TM topology and intracellular localization, (vi) physical interactions between proteins from the database along with links to other resources. The database is freely accessible at http://membranome.org . There is a variety of options for browsing, sorting, searching and retrieval of the content, including downloadable coordinate files of TM domains with calculated membrane boundaries.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 73
    Publication Date: 2017-01-05
    Description: The TSTMP database is designed to help the target selection of human transmembrane proteins for structural genomics projects and structure modeling studies. Currently, there are only 60 known 3D structures among the polytopic human transmembrane proteins and about a further 600 could be modeled using existing structures. Although there are a great number of human transmembrane protein structures left to be determined, surprisingly only a small fraction of these proteins have ‘selected’ (or above) status according to the current version the TargetDB/TargetTrack database. This figure is even worse regarding those transmembrane proteins that would contribute the most to the structural coverage of the human transmembrane proteome. The database was built by sorting out proteins from the human transmembrane proteome with known structure and searching for suitable model structures for the remaining proteins by combining the results of a state-of-the-art transmembrane specific fold recognition algorithm and a sequence similarity search algorithm. Proteins were searched for homologues among the human transmembrane proteins in order to select targets whose successful structure determination would lead to the best structural coverage of the human transmembrane proteome. The pipeline constructed for creating the TSTMP database guarantees to keep the database up-to-date. The database is available at http://tstmp.enzim.ttk.mta.hu .
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 74
    Publication Date: 2017-01-05
    Description: The Protein Data Bank Japan (PDBj, http://pdbj.org ), a member of the worldwide Protein Data Bank (wwPDB), accepts and processes the deposited data of experimentally determined macromolecular structures. While maintaining the archive in collaboration with other wwPDB partners, PDBj also provides a wide range of services and tools for analyzing structures and functions of proteins. We herein outline the updated web user interfaces together with RESTful web services and the backend relational database that support the former. To enhance the interoperability of the PDB data, we have previously developed PDB/RDF, PDB data in the Resource Description Framework (RDF) format, which is now a wwPDB standard called wwPDB/RDF. We have enhanced the connectivity of the wwPDB/RDF data by incorporating various external data resources. Services for searching, comparing and analyzing the ever-increasing large structures determined by hybrid methods are also described.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 75
    Publication Date: 2017-01-05
    Description: KEGG ( http://www.kegg.jp/ or http://www.genome.jp/kegg/ ) is an encyclopedia of genes and genomes. Assigning functional meanings to genes and genomes both at the molecular and higher levels is the primary objective of the KEGG database project. Molecular-level functions are stored in the KO (KEGG Orthology) database, where each KO is defined as a functional ortholog of genes and proteins. Higher-level functions are represented by networks of molecular interactions, reactions and relations in the forms of KEGG pathway maps, BRITE hierarchies and KEGG modules. In the past the KO database was developed for the purpose of defining nodes of molecular networks, but now the content has been expanded and the quality improved irrespective of whether or not the KOs appear in the three molecular network databases. The newly introduced addendum category of the GENES database is a collection of individual proteins whose functions are experimentally characterized and from which an increasing number of KOs are defined. Furthermore, the DISEASE and DRUG databases have been improved by systematic analysis of drug labels for better integration of diseases and drugs with the KEGG molecular networks. KEGG is moving towards becoming a comprehensive knowledge base for both functional interpretation and practical application of genomic information.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 76
    Publication Date: 2017-01-05
    Description: The use of high-throughput array and sequencing technologies has produced unprecedented amounts of gene expression data in central public depositories, including the Gene Expression Omnibus (GEO). The immense amount of expression data in GEO provides both vast research opportunities and data analysis challenges. Co-expression analysis of high-dimensional expression data has proven effective for the study of gene functions, and several co-expression databases have been developed. Here, we present a new co-expression database, COEXPEDIA ( www.coexpedia.org ), which is distinctive from other co-expression databases in three aspects: (i) it contains only co-functional co-expressions that passed a rigorous statistical assessment for functional association, (ii) the co-expressions were inferred from individual studies, each of which was designed to investigate gene functions with respect to a particular biomedical context such as a disease and (iii) the co-expressions are associated with medical subject headings (MeSH) that provide biomedical information for anatomical, disease, and chemical relevance. COEXPEDIA currently contains approximately eight million co-expressions inferred from 384 and 248 GEO series for humans and mice, respectively. We describe how these MeSH-associated co-expressions enable the identification of diseases and drugs previously unknown to be related to a gene or a gene group of interest.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 77
    Publication Date: 2017-01-05
    Description: A system-wide understanding of cellular function requires knowledge of all functional interactions between the expressed proteins. The STRING database aims to collect and integrate this information, by consolidating known and predicted protein–protein association data for a large number of organisms. The associations in STRING include direct (physical) interactions, as well as indirect (functional) interactions, as long as both are specific and biologically meaningful. Apart from collecting and reassessing available experimental data on protein–protein interactions, and importing known pathways and protein complexes from curated databases, interaction predictions are derived from the following sources: (i) systematic co-expression analysis, (ii) detection of shared selective signals across genomes, (iii) automated text-mining of the scientific literature and (iv) computational transfer of interaction knowledge between organisms based on gene orthology. In the latest version 10.5 of STRING, the biggest changes are concerned with data dissemination: the web frontend has been completely redesigned to reduce dependency on outdated browser technologies, and the database can now also be queried from inside the popular Cytoscape software framework. Further improvements include automated background analysis of user inputs for functional enrichments, and streamlined download options. The STRING resource is available online, at http://string-db.org/ .
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 78
    Publication Date: 2017-01-05
    Description: The Biological General Repository for Interaction Datasets (BioGRID: https://thebiogrid.org ) is an open access database dedicated to the annotation and archival of protein, genetic and chemical interactions for all major model organism species and humans. As of September 2016 (build 3.4.140), the BioGRID contains 1 072 173 genetic and protein interactions, and 38 559 post-translational modifications, as manually annotated from 48 114 publications. This dataset represents interaction records for 66 model organisms and represents a 30% increase compared to the previous 2015 BioGRID update. BioGRID curates the biomedical literature for major model organism species, including humans, with a recent emphasis on central biological processes and specific human diseases. To facilitate network-based approaches to drug discovery, BioGRID now incorporates 27 501 chemical–protein interactions for human drug targets, as drawn from the DrugBank database. A new dynamic interaction network viewer allows the easy navigation and filtering of all genetic and protein interaction data, as well as for bioactive compounds and their established targets. BioGRID data are directly downloadable without restriction in a variety of standardized formats and are freely distributed through partner model organism databases and meta-databases.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 79
    Publication Date: 2017-01-05
    Description: The FAIRDOMHub is a repository for publishing FAIR (Findable, Accessible, Interoperable and Reusable) Data, Operating procedures and Models ( https://fairdomhub.org/ ) for the Systems Biology community. It is a web-accessible repository for storing and sharing systems biology research assets. It enables researchers to organize, share and publish data, models and protocols, interlink them in the context of the systems biology investigations that produced them, and to interrogate them via API interfaces. By using the FAIRDOMHub, researchers can achieve more effective exchange with geographically distributed collaborators during projects, ensure results are sustained and preserved and generate reproducible publications that adhere to the FAIR guiding principles of data stewardship.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 80
    Publication Date: 2017-01-05
    Description: Studies in model organisms have yielded considerable insights into the etiology of disease and our understanding of evolutionary processes. Caenorhabditis elegans is among the most powerful model organisms used to understand biology. However, C. elegans is not used as extensively as other model organisms to investigate how natural variation shapes traits, especially through the use of genome-wide association (GWA) analyses. Here, we introduce a new platform, the C. elegans Natural Diversity Resource (CeNDR) to enable statistical genetics and genomics studies of C. elegans and to connect the results to human disease. CeNDR provides the research community with wild strains, genome-wide sequence and variant data for every strain, and a GWA mapping portal for studying natural variation in C. elegans . Additionally, researchers outside of the C. elegans community can benefit from public mappings and integrated tools for comparative analyses. CeNDR uses several databases that are continually updated through the addition of new strains, sequencing data, and association mapping results. The CeNDR data are accessible through a freely available web portal located at http://www.elegansvariation.org or through an application programming interface.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 81
    Publication Date: 2017-01-05
    Description: The Candida Genome Database (CGD, http://www.candidagenome.org/ ) is a freely available online resource that provides gene, protein and sequence information for multiple Candida species, along with web-based tools for accessing, analyzing and exploring these data. The mission of CGD is to facilitate and accelerate research into Candida pathogenesis and biology, by curating the scientific literature in real time, and connecting literature-derived annotations to the latest version of the genomic sequence and its annotations. Here, we report the incorporation into CGD of Assembly 22, the first chromosome-level, phased diploid assembly of the C. albicans genome, coupled with improvements that we have made to the assembly using additional available sequence data. We also report the creation of systematic identifiers for C. albicans genes and sequence features using a system similar to that adopted by the yeast community over two decades ago. Finally, we describe the incorporation of JBrowse into CGD, which allows online browsing of mapped high throughput sequencing data, and its implementation for several RNA-Seq data sets, as well as the whole genome sequencing data that was used in the construction of Assembly 22.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 82
    Publication Date: 2017-01-05
    Description: Ensembl ( www.ensembl.org ) is a database and genome browser for enabling research on vertebrate genomes. We import, analyse, curate and integrate a diverse collection of large-scale reference data to create a more comprehensive view of genome biology than would be possible from any individual dataset. Our extensive data resources include evidence-based gene and regulatory region annotation, genome variation and gene trees. An accompanying suite of tools, infrastructure and programmatic access methods ensure uniform data analysis and distribution for all supported species. Together, these provide a comprehensive solution for large-scale and targeted genomics applications alike. Among many other developments over the past year, we have improved our resources for gene regulation and comparative genomics, and added CRISPR/Cas9 target sites. We released new browser functionality and tools, including improved filtering and prioritization of genome variation, Manhattan plot visualization for linkage disequilibrium and eQTL data, and an ontology search for phenotypes, traits and disease. We have also enhanced data discovery and access with a track hub registry and a selection of new REST end points. All Ensembl data are freely released to the scientific community and our source code is available via the open source Apache 2.0 license.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 83
    Publication Date: 2017-01-05
    Description: Over the past years, CRISPR/Cas9 mediated genome editing has developed into a powerful tool for modifying genomes in various organisms. In high-throughput screens, CRISPR/Cas9 mediated gene perturbations can be used for the systematic functional analysis of whole genomes. Discoveries from such screens provide a wealth of knowledge about gene to phenotype relationships in various biological model systems. However, a database resource to query results efficiently has been lacking. To this end, we developed GenomeCRISPR ( http://genomecrispr.org ), a database for genome-scale CRISPR/Cas9 screens. Currently, GenomeCRISPR contains data on more than 550 000 single guide RNAs (sgRNA) derived from 84 different experiments performed in 48 different human cell lines, comprising all screens in human cells using CRISPR/Cas published to date. GenomeCRISPR provides data mining options and tools, such as gene or genomic region search. Phenotypic and genome track views allow users to investigate and compare the results of different screens, or the impact of different sgRNAs on the gene of interest. An Application Programming Interface (API) allows for automated data access and batch download. As more screening data will become available, we also aim at extending the database to include functional genomic data from other organisms and enable cross-species comparisons.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 84
    facet.materialart.
    Unknown
    Oxford University Press
    Publication Date: 2017-01-05
    Description: Manteia is an integrative database available online at http://manteia.igbmc.fr which provides a large array of OMICs data related to the development of the mouse, chicken, zebrafish and human. The system is designed to use different types of data together in order to perform advanced datamining, test hypotheses or provide candidate genes involved in biological processes or responsible for human diseases. In this new version of the database, Manteia has been enhanced with new expression data originating from microarray and next generation sequencing experiments. In addition, the system includes new statistics tools to analyze lists of genes in order to compare their functions and highlight their specific features. One of the main novelties of this release is the integration of a machine learning tool called Lookalike that we have developed to analyze the different datasets present in the system in order to identify new disease genes. This tool identifies the key features of known disease genes to provide and rank new candidates with similar properties from the genome. It is also designed to highlight and take into account the specificities of a disease in order to increase the accuracy of its predictions.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 85
    Publication Date: 2017-01-05
    Description: The HmtDB resource hosts a database of human mitochondrial genome sequences from individuals with healthy and disease phenotypes. The database is intended to support both population geneticists as well as clinicians undertaking the task to assess the pathogenicity of specific mtDNA mutations. The wide application of next-generation sequencing (NGS) has provided an enormous volume of high-resolution data at a low price, increasing the availability of human mitochondrial sequencing data, which called for a cogent and significant expansion of HmtDB data content that has more than tripled in the current release. We here describe additional novel features, including: (i) a complete, user-friendly restyling of the web interface, (ii) links to the command-line stand-alone and web versions of the MToolBox package, an up-to-date tool to reconstruct and analyze human mitochondrial DNA from NGS data and (iii) the implementation of the Reconstructed Sapiens Reference Sequence (RSRS) as mitochondrial reference sequence. The overall update renders HmtDB an even more handy and useful resource as it enables a more rapid data access, processing and analysis. HmtDB is accessible at http://www.hmtdb.uniba.it/ .
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 86
    Publication Date: 2017-01-05
    Description: The Gene Expression Database (GXD; www.informatics.jax.org/expression.shtml ) is an extensive and well-curated community resource of mouse developmental expression information. Through curation of the scientific literature and by collaborations with large-scale expression projects, GXD collects and integrates data from RNA in situ hybridization, immunohistochemistry, RT-PCR, northern blot and western blot experiments. Expression data from both wild-type and mutant mice are included. The expression data are combined with genetic and phenotypic data in Mouse Genome Informatics (MGI) and made readily accessible to many types of database searches. At present, GXD includes over 1.5 million expression results and more than 300 000 images, all annotated with detailed and standardized metadata. Since our last report in 2014, we have added a large amount of data, we have enhanced data and database infrastructure, and we have implemented many new search and display features. Interface enhancements include: a new Mouse Developmental Anatomy Browser; interactive tissue-by-developmental stage and tissue-by-gene matrix views; capabilities to filter and sort expression data summaries; a batch search utility; gene-based expression overviews; and links to expression data from other species.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 87
    Publication Date: 2017-01-05
    Description: Upon the first publication of the fifth iteration of the Functional Annotation of Mammalian Genomes collaborative project, FANTOM5, we gathered a series of primary data and database systems into the FANTOM web resource ( http://fantom.gsc.riken.jp ) to facilitate researchers to explore transcriptional regulation and cellular states. In the course of the collaboration, primary data and analysis results have been expanded, and functionalities of the database systems enhanced. We believe that our data and web systems are invaluable resources, and we think the scientific community will benefit for this recent update to deepen their understanding of mammalian cellular organization. We introduce the contents of FANTOM5 here, report recent updates in the web resource and provide future perspectives.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 88
    Publication Date: 2017-01-05
    Description: OrthoDB is a comprehensive catalog of orthologs, genes inherited by extant species from a single gene in their last common ancestor. In 2016 OrthoDB reached its 9th release, growing to over 22 million genes from over 5000 species, now adding plants, archaea and viruses. In this update we focused on usability of this fast-growing wealth of data: updating the user and programmatic interfaces to browse and query the data, and further enhancing the already extensive integration of available gene functional annotations. Collating functional annotations from over 100 resources, and enabled us to propose descriptive titles for 87% of ortholog groups. Additionally, OrthoDB continues to provide computed evolutionary annotations and to allow user queries by sequence homology. The OrthoDB resource now enables users to generate publication-quality comparative genomics charts, as well as to upload, analyze and interactively explore their own private data. OrthoDB is available from http://orthodb.org .
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 89
    Publication Date: 2017-01-05
    Description: RNA editing by A-to-I deamination is the prominent co-/post-transcriptional modification in humans. It is carried out by ADAR enzymes and contributes to both transcriptomic and proteomic expansion. RNA editing has pivotal cellular effects and its deregulation has been linked to a variety of human disorders including neurological and neurodegenerative diseases and cancer. Despite its biological relevance, many physiological and functional aspects of RNA editing are yet elusive. Here, we present REDIportal, available online at http://srv00.recas.ba.infn.it/atlas/ , the largest and comprehensive collection of RNA editing in humans including more than 4.5 millions of A-to-I events detected in 55 body sites from thousands of RNAseq experiments. REDIportal embeds RADAR database and represents the first editing resource designed to answer functional questions, enabling the inspection and browsing of editing levels in a variety of human samples, tissues and body sites. In contrast with previous RNA editing databases, REDIportal comprises its own browser (JBrowse) that allows users to explore A-to-I changes in their genomic context, empathizing repetitive elements in which RNA editing is prominent.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 90
    Publication Date: 2017-01-05
    Description: The Zebrafish Model Organism Database (ZFIN; http://zfin.org ) is the central resource for zebrafish ( Danio rerio) genetic, genomic, phenotypic and developmental data. ZFIN curators provide expert manual curation and integration of comprehensive data involving zebrafish genes, mutants, transgenic constructs and lines, phenotypes, genotypes, gene expressions, morpholinos, TALENs, CRISPRs, antibodies, anatomical structures, models of human disease and publications. We integrate curated, directly submitted, and collaboratively generated data, making these available to zebrafish research community. Among the vertebrate model organisms, zebrafish are superbly suited for rapid generation of sequence-targeted mutant lines, characterization of phenotypes including gene expression patterns, and generation of human disease models. The recent rapid adoption of zebrafish as human disease models is making management of these data particularly important to both the research and clinical communities. Here, we describe recent enhancements to ZFIN including use of the zebrafish experimental conditions ontology, ‘Fish’ records in the ZFIN database, support for gene expression phenotypes, models of human disease, mutation details at the DNA, RNA and protein levels, and updates to the ZFIN single box search.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 91
    Publication Date: 2017-01-05
    Description: Autoantibodies refer to antibodies that target self-antigens, which can play pivotal roles in maintaining homeostasis, distinguishing normal from tumor tissue and trigger autoimmune diseases. In the last three decades, tremendous efforts have been devoted to elucidate the generation, evolution and functions of autoantibodies, as well as their target autoantigens. However, reports of these countless previously identified autoantigens are randomly dispersed in the literature. Here, we constructed an AAgAtlas database 1.0 using text-mining and manual curation. We extracted 45 830 autoantigen-related abstracts and 94 313 sentences from PubMed using the keywords of either ‘autoantigen’ or ‘autoantibody’ or their lexical variants, which were further refined to 25 520 abstracts, 43 253 sentences and 3984 candidates by our bio-entity recognizer based on the Protein Ontology. Finally, we identified 1126 genes as human autoantigens and 1071 related human diseases, with which we constructed a human autoantigen database (AAgAtlas database 1.0). The database provides a user-friendly interface to conveniently browse, retrieve and download human autoantigens as well as their associated diseases. The database is freely accessible at http://biokb.ncpsb.org/aagatlas/ . We believe this database will be a valuable resource to track and understand human autoantigens as well as to investigate their functions in basic and translational research.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 92
    Publication Date: 2017-01-05
    Description: COSMIC, the Catalogue of Somatic Mutations in Cancer ( http://cancer.sanger.ac.uk ) is a high-resolution resource for exploring targets and trends in the genetics of human cancer. Currently the broadest database of mutations in cancer, the information in COSMIC is curated by expert scientists, primarily by scrutinizing large numbers of scientific publications. Over 4 million coding mutations are described in v78 (September 2016), combining genome-wide sequencing results from 28 366 tumours with complete manual curation of 23 489 individual publications focused on 186 key genes and 286 key fusion pairs across all cancers. Molecular profiling of large tumour numbers has also allowed the annotation of more than 13 million non-coding mutations, 18 029 gene fusions, 187 429 genome rearrangements, 1 271 436 abnormal copy number segments, 9 175 462 abnormal expression variants and 7 879 142 differentially methylated CpG dinucleotides. COSMIC now details the genetics of drug resistance, novel somatic gene mutations which allow a tumour to evade therapeutic cancer drugs. Focusing initially on highly characterized drugs and genes, COSMIC v78 contains wide resistance mutation profiles across 20 drugs, detailing the recurrence of 301 unique resistance alleles across 1934 drug-resistant tumours. All information from the COSMIC database is available freely on the COSMIC website.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 93
    Publication Date: 2017-01-05
    Description: De novo germline mutations (DNMs) are the rarest genetic variants proven to cause a considerable number of sporadic genetic diseases, such as autism spectrum disorders, epileptic encephalopathy, schizophrenia, congenital heart disease, type 1 diabetes, and hearing loss. However, it is difficult to accurately assess the cause of DNMs and identify disease-causing genes from the considerable number of DNMs in probands. A common method to this problem is to identify genes that harbor significantly more DNMs than expected by chance, with accurate background DNM rate (DNMR) required. Therefore, in this study, we developed a novel database named mirDNMR for the collection of gene-centered background DNMRs obtained from different methods and population variation data. The database has the following functions: (i) browse and search the background DNMRs of each gene predicted by four different methods, including GC content (DNMR-GC), sequence context (DNMR-SC), multiple factors (DNMR-MF) and local DNA methylation level (DNMR-DM); (ii) search variant frequencies in publicly available databases, including ExAC, ESP6500, UK10K, 1000G and dbSNP and (iii) investigate the DNM burden to prioritize candidate genes based on the four background DNMRs using three statistical methods (TADA, Binomial and Poisson test). As a case study, we successfully employed our database in candidate gene prioritization for a sporadic complex disease: intellectual disability. In conclusion, mirDNMR ( https://www.wzgenomics.cn/mirdnmr/ ) can be widely used to identify the genetic basis of sporadic genetic diseases.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 94
    Publication Date: 2017-01-05
    Description: The information about the genetic basis of human diseases lies at the heart of precision medicine and drug discovery. However, to realize its full potential to support these goals, several problems, such as fragmentation, heterogeneity, availability and different conceptualization of the data must be overcome. To provide the community with a resource free of these hurdles, we have developed DisGeNET ( http://www.disgenet.org ), one of the largest available collections of genes and variants involved in human diseases. DisGeNET integrates data from expert curated repositories, GWAS catalogues, animal models and the scientific literature. DisGeNET data are homogeneously annotated with controlled vocabularies and community-driven ontologies. Additionally, several original metrics are provided to assist the prioritization of genotype–phenotype relationships. The information is accessible through a web interface, a Cytoscape App, an RDF SPARQL endpoint, scripts in several programming languages and an R package. DisGeNET is a versatile platform that can be used for different research purposes including the investigation of the molecular underpinnings of specific human diseases and their comorbidities, the analysis of the properties of disease genes, the generation of hypothesis on drug therapeutic action and drug adverse effects, the validation of computationally predicted disease genes and the evaluation of text-mining methods performance.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 95
    Publication Date: 2017-01-05
    Description: The MalaCards human disease database ( http://www.malacards.org/ ) is an integrated compendium of annotated diseases mined from 68 data sources. MalaCards has a web card for each of ~20 000 disease entries, in six global categories. It portrays a broad array of annotation topics in 15 sections, including Summaries, Symptoms, Anatomical Context, Drugs, Genetic Tests, Variations and Publications. The Aliases and Classifications section reflects an algorithm for disease name integration across often-conflicting sources, providing effective annotation consolidation. A central feature is a balanced Genes section, with scores reflecting the strength of disease-gene associations. This is accompanied by other gene-related disease information such as pathways, mouse phenotypes and GO-terms, stemming from MalaCards’ affiliation with the GeneCards Suite of databases. MalaCards’ capacity to inter-link information from complementary sources, along with its elaborate search function, relational database infrastructure and convenient data dumps, allows it to tackle its rich disease annotation landscape, and facilitates systems analyses and genome sequence interpretation. MalaCards adopts a ‘flat’ disease-card approach, but each card is mapped to popular hierarchical ontologies (e.g. International Classification of Diseases, Human Phenotype Ontology and Unified Medical Language System) and also contains information about multi-level relations among diseases, thereby providing an optimal tool for disease representation and scrutiny.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 96
    Publication Date: 2017-01-05
    Description: The FlyRNAi database of the Drosophila RNAi Screening Center (DRSC) and Transgenic RNAi Project (TRiP) at Harvard Medical School and associated DRSC/TRiP Functional Genomics Resources website ( http://fgr.hms.harvard.edu ) serve as a reagent production tracking system, screen data repository, and portal to the community. Through this portal, we make available protocols, online tools, and other resources useful to researchers at all stages of high-throughput functional genomics screening, from assay design and reagent identification to data analysis and interpretation. In this update, we describe recent changes and additions to our website, database and suite of online tools. Recent changes reflect a shift in our focus from a single technology (RNAi) and model species ( Drosophila ) to the application of additional technologies (e.g. CRISPR) and support of integrated, cross-species approaches to uncovering gene function using functional genomics and other approaches.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 97
    Publication Date: 2017-01-05
    Description: The Human Induced Pluripotent Stem Cell Initiative (HipSci) isf establishing a large catalogue of human iPSC lines, arguably the most well characterized collection to date. The HipSci portal enables researchers to choose the right cell line for their experiment, and makes HipSci's rich catalogue of assay data easy to discover and reuse. Each cell line has genomic, transcriptomic, proteomic and cellular phenotyping data. Data are deposited in the appropriate EMBL-EBI archives, including the European Nucleotide Archive (ENA), European Genome-phenome Archive (EGA), ArrayExpress and PRoteomics IDEntifications (PRIDE) databases. The project will make 500 cell lines from healthy individuals, and from 150 patients with rare genetic diseases; these will be available through the European Collection of Authenticated Cell Cultures (ECACC). As of August 2016, 238 cell lines are available for purchase. Project data is presented through the HipSci data portal ( http://www.hipsci.org/lines ) and is downloadable from the associated FTP site ( ftp://ftp.hipsci.ebi.ac.uk/vol1/ftp ). The data portal presents a summary matrix of the HipSci cell lines, showing available data types. Each line has its own page containing descriptive metadata, quality information, and links to archived assay data. Analysis results are also available in a Track Hub, allowing visualization in the context of public genomic annotations ( http://www.hipsci.org/data/trackhubs ).
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 98
    Publication Date: 2017-01-05
    Description: A cornerstone of modern biomedical research is the use of animal models to study disease mechanisms and to develop new therapeutic approaches. In order to help the research community to better explore the similarities and differences of genomic response between human inflammatory diseases and murine models, we developed KERIS: kaleidoscope of gene responses to inflammation between species (available at http://www.igenomed.org/keris/ ). As of June 2016, KERIS includes comparisons of the genomic response of six human inflammatory diseases (burns, trauma, infection, sepsis, endotoxin and acute respiratory distress syndrome) and matched mouse models, using 2257 curated samples from the Inflammation and the Host Response to Injury Glue Grant studies and other representative studies in Gene Expression Omnibus. A researcher can browse, query, visualize and compare the response patterns of genes, pathways and functional modules across different diseases and corresponding murine models. The database is expected to help biologists choosing models when studying the mechanisms of particular genes and pathways in a disease and prioritizing the translation of findings from disease models into clinical studies.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 99
    Publication Date: 2017-01-05
    Description: The Mouse Genome Database (MGD: http://www.informatics.jax.org ) is the primary community data resource for the laboratory mouse. It provides a highly integrated and highly curated system offering a comprehensive view of current knowledge about mouse genes, genetic markers and genomic features as well as the associations of those features with sequence, phenotypes, functional and comparative information, and their relationships to human diseases. MGD continues to enhance access to these data, to extend the scope of data content and visualizations, and to provide infrastructure and user support that ensures effective and efficient use of MGD in the advancement of scientific knowledge. Here, we report on recent enhancements made to the resource and new features.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 100
    Publication Date: 2017-01-05
    Description: The correlation of phenotypic outcomes with genetic variation and environmental factors is a core pursuit in biology and biomedicine. Numerous challenges impede our progress: patient phenotypes may not match known diseases, candidate variants may be in genes that have not been characterized, model organisms may not recapitulate human or veterinary diseases, filling evolutionary gaps is difficult, and many resources must be queried to find potentially significant genotype–phenotype associations. Non-human organisms have proven instrumental in revealing biological mechanisms. Advanced informatics tools can identify phenotypically relevant disease models in research and diagnostic contexts. Large-scale integration of model organism and clinical research data can provide a breadth of knowledge not available from individual sources and can provide contextualization of data back to these sources. The Monarch Initiative ( monarchinitiative.org ) is a collaborative, open science effort that aims to semantically integrate genotype–phenotype data from many species and sources in order to support precision medicine, disease modeling, and mechanistic exploration. Our integrated knowledge graph, analytic tools, and web services enable diverse users to explore relationships between phenotypes and genotypes across species.
    Print ISSN: 0305-1048
    Electronic ISSN: 1362-4962
    Topics: Biology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
Close ⊗
This website uses cookies and the analysis tool Matomo. More information can be found here...