ALBERT

All Library Books, journals and Electronic Records Telegrafenberg

Your email was sent successfully. Check your inbox.

An error occurred while sending the email. Please try again.

Proceed reservation?

Export
Filter
  • Articles  (46)
  • Latest Papers from Table of Contents or Articles in Press  (46)
  • Oxford University Press  (46)
  • 2020-2024  (46)
  • Medicine  (46)
  • Architecture, Civil Engineering, Surveying
Collection
  • Articles  (46)
Source
  • Latest Papers from Table of Contents or Articles in Press  (46)
Years
Year
Journal
  • 1
    Publication Date: 2021-08-06
    Description: Motivation Constraint-based modeling approaches allow the estimation of maximal in vivo enzyme catalytic rates that can serve as proxies for enzyme turnover numbers. Yet, genome-scale flux profiling remains a challenge in deploying these approaches to catalogue proxies for enzyme catalytic rates across organisms. Results Here, we formulate a constraint-based approach, termed NIDLE-flux, to estimate fluxes at a genome-scale level by using the principle of efficient usage of expressed enzymes. Using proteomics data from Escherichia coli, we show that the fluxes estimated by NIDLE-flux and the existing approaches are in excellent qualitative agreement (Pearson correlation 〉 0.9). We also find that the maximal in vivo catalytic rates estimated by NIDLE-flux exhibits a Pearson correlation of 0.74 with in vitro enzyme turnover numbers. However, NIDLE-flux results in a 1.4-fold increase in the size of the estimated maximal in vivo catalytic rates in comparison to the contenders. Integration of the maximum in vivo catalytic rates with publically available proteomics and metabolomics data provide a better match to fluxes estimated by NIDLE-flux. Therefore, NIDLE-flux facilitates more effective usage of proteomics data to estimate proxies for kcatomes. Availability and implementation https://github.com/Rudan-X/NIDLE-flux-code. Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 2
    Publication Date: 2021-08-06
    Description: Motivation Tumour heterogeneity is being increasingly recognized as an important characteristic of cancer and as a determinant of prognosis and treatment outcome. Emerging spatial transcriptomics data hold the potential to further our understanding of tumour heterogeneity and its implications. However, existing statistical tools are not sufficiently powerful to capture heterogeneity in the complex setting of spatial molecular biology. Results We provide a statistical solution, the HeTerogeneity Average index (HTA), specifically designed to handle the multivariate nature of spatial transcriptomics. We prove that HTA has an approximately normal distribution, therefore lending itself to efficient statistical assessment and inference. We first demonstrate that HTA accurately reflects the level of heterogeneity in simulated data. We then use HTA to analyze heterogeneity in two cancer spatial transcriptomics datasets: spatial RNA sequencing by 10x Genomics and spatial transcriptomics inferred from H&E. Finally, we demonstrate that HTA also applies to 3D spatial data using brain MRI. In spatial RNA sequencing, we use a known combination of molecular traits to assert that HTA aligns with the expected outcome for this combination. We also show that HTA captures immune-cell infiltration at multiple resolutions. In digital pathology, we show how HTA can be used in survival analysis and demonstrate that high levels of heterogeneity may be linked to poor survival. In brain MRI, we show that HTA differentiates between normal ageing, Alzheimer’s disease and two tumours. HTA also extends beyond molecular biology and medical imaging, and can be applied to many domains, including GIS. Availability and implementation Python package and source code are available at: https://github.com/alonalj/hta. Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 3
    Publication Date: 2021-03-11
    Description: Motivation Genomic DNA replicates according to a reproducible spatiotemporal program, with some loci replicating early in S phase while others replicate late. Despite being a central cellular process, DNA replication timing studies have been limited in scale due to technical challenges. Results We present TIGER (Timing Inferred from Genome Replication), a computational approach for extracting DNA replication timing information from whole genome sequence data obtained from proliferating cell samples. The presence of replicating cells in a biological specimen leads to non-uniform representation of genomic DNA that depends on the timing of replication of different genomic loci. Replication dynamics can hence be observed in genome sequence data by analyzing DNA copy number along chromosomes while accounting for other sources of sequence coverage variation. TIGER is applicable to any species with a contiguous genome assembly and rivals the quality of experimental measurements of DNA replication timing. It provides a straightforward approach for measuring replication timing and can readily be applied at scale. Availability and implementation TIGER is available at https://github.com/TheKorenLab/TIGER. Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 4
    Publication Date: 2021-03-02
    Description: Summary We describe improvements to BAli-Phy, a Markov chain Monte Carlo (MCMC) program that jointly estimates phylogeny, alignment and other parameters from unaligned sequence data. Version 3 is substantially faster for large trees, and implements covarion models, additional codon models and other new models. It implements ancestral state reconstruction, allows prior selection for all model parameters, and can also analyze multiple genes simultaneously. Availability and implementation Software is available for download at http://www.bali-phy.org. C++ source code is freely available on Github under the GPL2 License. Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 5
    Publication Date: 2021-10-08
    Description: Motivation Over the past decades, vast amounts of genome sequencing data have been produced, requiring an enormous level of storage capacity. The time and resources needed to store and transfer such data cause bottlenecks in genome sequencing analysis. To resolve this issue, various compression techniques have been proposed to reduce the size of original FASTQ raw sequencing data, but these remain suboptimal. Long-read sequencing has become dominant in genomics, whereas most existing compression methods focus on short-read sequencing only. Results We designed a compression algorithm based on read reordering using a novel scoring model for reducing FASTQ file size with no information loss. We integrated all data processing steps into a software package called FastqCLS and provided it as a Docker image for ease of installation and execution to help users easily install and run. We compared our method with existing major FASTQ compression tools using benchmark datasets. We also included new long-read sequencing data in this validation. As a result, FastqCLS outperformed in terms of compression ratios for storing long-read sequencing data. Availability and implementation FastqCLS can be downloaded from https://github.com/krlucete/FastqCLS. Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 6
    Publication Date: 2021-10-02
    Description: Motivation Knowing the number and the exact locations of multiple change points in genomic sequences serves several biological needs. The cumulative segmented algorithm (cumSeg) has been recently proposed as a computationally efficient approach for multiple change-points detection, which is based on a simple transformation of data and provides results quite robust to model mis-specifications. However, the errors are also accumulated in the transformed model so that heteroscedasticity and serial correlation will show up, and thus the variations of the estimated change points will be quite different, while the locations of the change points should be of the same importance in the original genomic sequences. Results In this study, we develop two new change-points detection procedures in the framework of cumulative segmented regression. Simulations reveal that the proposed methods not only improve the efficiency of each change point estimator substantially but also provide the estimators with similar variations for all the change points. By applying these proposed algorithms to Coriel and SNP genotyping data, we illustrate their performance on detecting copy number variations. Supplementary information The proposed algorithms are implemented in R program and are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 7
    Publication Date: 2021-09-24
    Description: We analyzed early brain metabolic adaptations in response to mitochondrial dysfunction in a mouse model of mitochondrial encephalopathy with complex IV deficiency [neuron-specific COX10 knockout (KO)]. In this mouse model, the onset of the mitochondrial defect did not coincide with immediate cell death, suggesting early adaptive metabolic responses to compensate for the energetic deficit. Metabolomic analysis in the KO mice revealed increased levels of glycolytic and pentose phosphate pathway intermediates, amino acids and lysolipids. Glycolysis was modulated by enhanced activity of glycolytic enzymes, and not by their overexpression, suggesting the importance of post-translational modifications in the adaptive response. Glycogen synthase kinase 3 inactivation was the most upstream regulation identified, implying that it is a key event in this adaptive mechanism. Because neurons are thought not to rely on glycolysis for adenosine triphosphate production in normal conditions, our results indicate that neurons still maintain their ability to upregulate this pathway when under mitochondrial respiration stress.
    Print ISSN: 0964-6906
    Electronic ISSN: 1460-2083
    Topics: Biology , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 8
    Publication Date: 2021-09-23
    Description: Summary Studying biological systems generally relies on computational modeling and simulation, e.g., model-driven discovery and hypothesis testing. Progress in standardization efforts led to the development of interrelated file formats to exchange and reuse models in systems biology, such as SBML, the Simulation Experiment Description Markup Language (SED-ML) or the Open Modeling EXchange format. Conducting simulation experiments based on these formats requires efficient and reusable implementations to make them accessible to the broader scientific community and to ensure the reproducibility of the results. The Systems Biology Simulation Core Library (SBSCL) provides interpreters and solvers for these standards as a versatile open-source API in JavaTM. The library simulates even complex bio-models and supports deterministic Ordinary Differential Equations; Stochastic Differential Equations; constraint-based analyses; recent SBML and SED-ML versions; exchange of results, and visualization of in silico experiments; open modeling exchange formats (COMBINE archives); hierarchically structured models; and compatibility with standard testing systems, including the Systems Biology Test Suite and published models from the BioModels and BiGG databases. Availability and implementation SBSCL is freely available at https://draeger-lab.github.io/SBSCL/ and via Maven Central. Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 9
    Publication Date: 2021-09-08
    Description: Summary We present the LipidQuant 1.0 tool for automated data processing workflows in lipidomic quantitation based on lipid class separation coupled with high-resolution mass spectrometry. Lipid class separation workflows, such as hydrophilic interaction liquid chromatography or supercritical fluid chromatography, should be preferred in lipidomic quantitation due to the coionization of lipid class internal standards with analytes from the same class. The individual steps in the LipidQuant workflow are explained, including lipid identification, quantitation, isotopic correction and reporting results. We show the application of LipidQuant data processing to a small cohort of human serum samples. Availability and implementation The LipidQuant 1.0 is freely available at Zenodo https://doi.org/10.5281/zenodo.5151201 and https://holcapek.upce.cz/lipidquant.php. Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 10
    Publication Date: 2021-09-06
    Description: Motivation Genetic variations of expression quantitative trait loci (eQTLs) play a critical role in influencing complex traits and diseases development. Two main factors that affect the statistical power of detecting eQTLs are: (i) relatively small size of samples available, and (ii) heavy burden of multiple testing due to a very large number of variants to be tested. The later issue is particularly severe when one tries to identify trans-eQTLs that are far away from the genes they influence. If one can exploit co-expressed genes jointly in eQTL-mapping, effective sample size can be increased. Furthermore, using the structure of the gene regulatory network (GRN) may help to identify trans-eQTLs without increasing multiple testing burden. Results In this article, we use the structure equation model (SEM) to model both GRN and effect of eQTLs on gene expression, and then develop a novel algorithm, named sparse SEM for eQTL mapping (SSEMQ), to conduct joint eQTL mapping and GRN inference. The SEM can exploit co-expressed genes jointly in eQTL mapping and also use GRN to determine trans-eQTLs. Computer simulations demonstrate that our SSEMQ significantly outperforms nine existing eQTL mapping methods. SSEMQ is further used to analyze two real datasets of human breast and whole blood tissues, yielding a number of cis- and trans-eQTLs. Availability and implementation R package ssemQr is available at https://github.com/Ivis4ml/ssemQr.git. Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 11
    Publication Date: 2021-07-01
    Description: Motivation Personalized medicine aims at providing patient-tailored therapeutics based on multi-type data toward improved treatment outcomes. Chronotherapy that consists in adapting drug administration to the patient’s circadian rhythms may be improved by such approach. Recent clinical studies demonstrated large variability in patients’ circadian coordination and optimal drug timing. Consequently, new eHealth platforms allow the monitoring of circadian biomarkers in individual patients through wearable technologies (rest-activity, body temperature), blood or salivary samples (melatonin, cortisol) and daily questionnaires (food intake, symptoms). A current clinical challenge involves designing a methodology predicting from circadian biomarkers the patient peripheral circadian clocks and associated optimal drug timing. The mammalian circadian timing system being largely conserved between mouse and humans yet with phase opposition, the study was developed using available mouse datasets. Results We investigated at the molecular scale the influence of systemic regulators (e.g. temperature, hormones) on peripheral clocks, through a model learning approach involving systems biology models based on ordinary differential equations. Using as prior knowledge our existing circadian clock model, we derived an approximation for the action of systemic regulators on the expression of three core-clock genes: Bmal1, Per2 and Rev-Erbα. These time profiles were then fitted with a population of models, based on linear regression. Best models involved a modulation of either Bmal1 or Per2 transcription most likely by temperature or nutrient exposure cycles. This agreed with biological knowledge on temperature-dependent control of Per2 transcription. The strengths of systemic regulations were found to be significantly different according to mouse sex and genetic background. Availability and implementation https://gitlab.inria.fr/julmarti/model-learning-mb21eccb. Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 12
    Publication Date: 2021-10-12
    Description: Motivation Immune cells are important components of the immune system and are crucial for disease initiation, progression, prognosis, and survival. Although several computational methods have been designed for predicting the abundance of immune cells, very few tools are applicable to mouse. Given that mouse is the most widely used animal model in biomedical research, there is an urgent need to develop a precise algorithm for predicting mouse immune cells. Results We developed a tool named ImmuCellAI-mouse (Immune Cell Abundance Identifier for mouse), for estimating the abundance of 36 immune cell (sub)types from gene expression data in a hierarchical strategy of three layers. Reference expression profile and robust marker gene sets of immune cell types were curated. The abundance of cells in three layers was predicted separately by calculating the ssGSEA enrichment score of the expression deviation profile per cell type. Benchmark results showed high accuracy of ImmuCellAI-mouse in predicting most immune cell types, with correlation coefficients between predicted value and real cell proportion of most cell types being larger than 0.8. We applied ImmuCellAI-mouse to a mouse breast tumor dataset and revealed the dynamic change of immune cell infiltration during treatment, which is consistent with the findings of the original study but with more details. We also constructed an online server for ImmuCellAI-mouse, on which users can upload expression matrices for analysis. ImmuCellAI-mouse will be a useful tool for studying the immune microenvironment, cancer immunology, and immunotherapy in mouse models, providing an indispensable supplement for human disease studies. Availability Software is available at http://bioinfo.life.hust.edu.cn/ImmuCellAI-mouse/ Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 13
    Publication Date: 2021-10-13
    Description: Motivation As one of the most important post-translational modifications (PTMs), protein lysine crotonylation (Kcr) has attracted wide attention, which involves in important physiological activities, such as cell differentiation and metabolism. However, experimental methods are expensive and time-consuming for Kcr identification. Instead, computational methods can predict Kcr sites in silico with high efficiency and low cost. Results In this study, we proposed a novel predictor, BERT-Kcr, for protein Kcr sites prediction, which was developed by using a transfer learning method with pre-trained bidirectional encoder representations from transformers (BERT) models. These models were originally used for natural language processing (NLP) tasks, such as sentence classification. Here, we transferred each amino acid into a word as the input information to the pre-trained BERT model. The features encoded by BERT were extracted and then fed to a BiLSTM network to build our final model. Compared with the models built by other machine learning and deep learning classifiers, BERT-Kcr achieved the best performance with AUROC of 0.983 for 10-fold cross-validation. Further evaluation on the independent test set indicates that BERT-Kcr outperforms the state-of-the-art model Deep-Kcr with an improvement of about 5% for AUROC. The results of our experiment indicate that the direct use of sequence information and advanced pre-trained models of natural language processing could be an effective way for identifying post-translational modification sites of proteins. Availability The BERT-Kcr model is publicly available on http://zhulab.org.cn/BERT-Kcr_models/. Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 14
    Publication Date: 2021-10-08
    Description: Summary We present several recent improvements to minimap2, a versatile pairwise aligner for nucleotide sequences. Now minimap2 v2.22 can more accurately map long reads to highly repetitive regions and align through insertions or deletions up to 100 kb by default, addressing major weakness in minimap2 v2.18 or earlier. Availability and implementation https://github.com/lh3/minimap2.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 15
    Publication Date: 2021-10-08
    Description: Motivation Deciphering nucleosome–nucleosome interactions is an important step toward mesoscale description of chromatin organization but computational tools to perform such analyses are not publicly available. Results We developed iNucs, a user-friendly and efficient Python-based bioinformatics tool to compute and visualize nucleosome-resolved interactions using standard pairs format input generated from pairtools. Availabilityand implementation https://github.com/Karimi-Lab/inucs/. Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 16
    Publication Date: 2021-09-11
    Description: GNAO1 encephalopathy is a neurodevelopmental disorder with a spectrum of symptoms that include dystonic movements, seizures and developmental delay. While numerous GNAO1 mutations are associated with this disorder, the functional consequences of pathological variants are not completely understood. Here, we deployed the invertebrate C. elegans as a whole-animal behavioral model to study the functional effects of GNAO1 disorder-associated mutations. We tested several pathological GNAO1 mutations for effects on locomotor behaviors using a combination of CRISPR/Cas9 gene editing and transgenic overexpression in vivo. We report that all three mutations tested (G42R, G203R and R209C) result in strong loss of function defects when evaluated as homozygous CRISPR alleles. In addition, mutations produced dominant negative effects assessed using both heterozygous CRISPR alleles and transgenic overexpression. Experiments in mice confirmed dominant negative effects of GNAO1 G42R, which impaired numerous motor behaviors. Thus, GNAO1 pathological mutations result in conserved functional outcomes across animal models. Our study further establishes the molecular genetic basis of GNAO1 encephalopathy, and develops a CRISPR-based pipeline for functionally evaluating mutations associated with neurodevelopmental disorders.
    Print ISSN: 0964-6906
    Electronic ISSN: 1460-2083
    Topics: Biology , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 17
    Publication Date: 2021-10-12
    Description: Summary Large scale pre-trained language models (PLMs) have advanced state-of-the-art (SOTA) performance on various biomedical text mining tasks. The power of such PLMs can be combined with the advantages of deep generative models. These are examples of these combinations. However, they are trained only on general domain text, and biomedical models are still missing. In this work, we describe BioVAE, the first large scale pre-trained latent variable language model for the biomedical domain, which uses the OPTIMUS framework to train on large volumes of biomedical text. The model shows SOTA performance on several biomedical text mining tasks when compared to existing publicly available biomedical PLMs. Additionally, our model can generate more accurate biomedical sentences than the original OPTIMUS output. Availability Our source code and pre-trained models are freely available: https://github.com/aistairc/BioVAE Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 18
    Publication Date: 2021-10-12
    Description: Summary Non-coding RNAs are often neglected during genome annotation due to their difficulty of detection relative to protein coding genes. FindNonCoding takes a pattern mining approach to capture the essential sequence motifs and hairpin loops representing a non-coding RNA family and quickly identify matches in genomes. FindNonCoding was designed for ease of use and accurately finds non-coding RNAs with a low false discovery rate. Availability and implementation FindNonCoding is implemented within the DECIPHER package (v2.19.3) for R (v4.1) available from Bioconductor. Pre-trained models of common non-coding RNA families are included for bacteria, archaea and eukarya. Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 19
    Publication Date: 2021-10-02
    Description: Motivation Genome-wide association studies (GWAS) summary statistics have popularized and accelerated genetic research. However, a lack of standardization of the file formats used has proven problematic when running secondary analysis tools or performing meta-analysis studies. Results To address this issue, we have developed MungeSumstats, a Bioconductor R package for the standardization and quality control of GWAS summary statistics. MungeSumstats can handle the most common summary statistic formats, including variant call format (VCF) producing a reformatted, standardized, tabular summary statistic file, VCF or R native data object. Availability and implementation MungeSumstats is available on Bioconductor (v 3.13) and can also be found on Github at: https://neurogenomics.github.io/MungeSumstats. Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 20
    Publication Date: 2021-09-15
    Description: Summary Recent efforts to identify novel bacterial structured noncoding RNA (ncRNA) motifs through searching long, GC-rich intergenic regions (IGRs) have revealed several new classes, including the recently validated HMP-PP riboswitch. The DIMPL (Discovery of Intergenic Motifs PipeLine) discovery pipeline described herein enables rapid extraction and selection of bacterial IGRs that are enriched for structured ncRNAs. Moreover, DIMPL automates the subsequent computational steps necessary for their functional identification. Availability and implementation The DIMPL pipeline is freely available as a Docker image with an accompanying set of Jupyter notebooks. Full instructions for download and use are available at https://github.com/breakerlab/dimpl. Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 21
    Publication Date: 2021-09-14
    Description: The metabolic needs for postnatal growth of the human nervous system are vast. Recessive loss-of-function mutations in the mitochondrial enzyme glutamate pyruvate transaminase 2 (GPT2) in humans cause postnatal undergrowth of brain, and cognitive and motor disability. We demonstrate that GPT2 governs critical metabolic mechanisms in neurons required for neuronal growth and survival. These metabolic processes include neuronal alanine synthesis and anaplerosis, the replenishment of tricarboxylic acid (TCA) cycle intermediates. We performed metabolomics across postnatal development in Gpt2-null mouse brain to identify the trajectory of dysregulated metabolic pathways: alterations in alanine occur earliest; followed by reduced TCA cycle intermediates and reduced pyruvate; followed by elevations in glycolytic intermediates and amino acids. Neuron-specific deletion of GPT2 in mice is sufficient to cause motor abnormalities and death pre-weaning, a phenotype identical to the germline Gpt2-null mouse. Alanine biosynthesis is profoundly impeded in Gpt2-null neurons. Exogenous alanine is necessary for Gpt2-null neuronal survival in vitro but is not needed for Gpt2-null astrocytes. Dietary alanine supplementation in Gpt2-null mice enhances animal survival and improves the metabolic profile of Gpt2-null brain but does not alone appear to correct motor function. In surviving Gpt2-null animals, we observe smaller upper and lower motor neurons in vivo. We also observe selective death of lower motor neurons in vivo with worsening motor behavior with age. In conclusion, these studies of the pathophysiology of GPT2 Deficiency have identified metabolic mechanisms that are required for neuronal growth and that potentially underlie selective neuronal vulnerabilities in motor neurons.
    Print ISSN: 0964-6906
    Electronic ISSN: 1460-2083
    Topics: Biology , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 22
    Publication Date: 2021-10-19
    Description: Motivation Discover is an algorithm developed to identify mutually exclusive genomic events. Its main contribution is a statistical analysis based on the Poisson-Binomial (PB) distribution to take into account the mutation rate of genes and samples. Discover is very effective for identifying mutually exclusive mutations at the expense of speed in large datasets: the Poisson-Binomial is computationally costly to estimate, and checking all the potential mutually exclusive alterations requires millions of tests. Results We have implemented a new version of the package called Rediscover that implements exact and approximate computations of the PB. Rediscover exact implementation is slightly faster than Discover for large and medium-sized datasets. The approximation is 100 to 1,000 times faster for them making it possible to get results in less than a minute with a standard desktop. The memory footprint is also smaller in Rediscover. The new package is available at CRAN and provides some functions to integrate its usage with other R packages such as maftools and TCGAbiolinks. Availability Rediscover is available at CRAN (https://cran.r-project.org/web/packages/Rediscover/index.html). Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 23
    Publication Date: 2021-08-09
    Description: Fanconi anemia (FA) is a rare human genetic disorder characterized by bone marrow failure, predisposition to cancer and developmental defects including hypogonadism. Reproductive defects leading to germ cell aplasia are the most consistent phenotypes seen in FA mouse models. We examined the role of the nuclear FA core complex gene Fancg in the development of primordial germ cells (PGCs), the embryonic precursors of adult gametes, during fetal development. PGC maintenance was severely impaired in Fancg−/− embryos. We observed a defect in the number of PGCs starting at E9.5 and a strong attrition at E11.5 and E13.5. Remarkably, we observed a mosaic pattern reflecting a portion of testicular cords devoid of PGCs in E13.5 fetal gonads. Our in vitro and in vivo data highlight a potential role of Fancg in the proliferation and in the intrinsic cell motility abilities of PGCs. The random migratory process is abnormally activated in Fancg−/− PGCs, altering the migration of cells. Increased cell death and PGC attrition observed in E11.5 Fancg−/− embryos are features consistent with delayed migration of PGCs along the migratory pathway to the genital ridges. Moreover, we show that an inhibitor of RAC1 mitigates the abnormal migratory pattern observed in Fancg−/− PGCs.
    Print ISSN: 0964-6906
    Electronic ISSN: 1460-2083
    Topics: Biology , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 24
    Publication Date: 2021-10-08
    Description: Motivation Deep learning approaches have empowered single-cell omics data analysis in many ways and generated new insights from complex cellular systems. As there is an increasing need for single cell omics data to be integrated across sources, types, and features of data, the challenges of integrating single-cell omics data are rising. Here, we present an unsupervised deep learning algorithm that learns discriminative representations for single-cell data via maximizing mutual information, SMILE (Single-cell Mutual Information Learning). Results Using a unique cell-pairing design, SMILE successfully integrates multi-source single-cell transcriptome data, removing batch effects and projecting similar cell types, even from different tissues, into the shared space. SMILE can also integrate data from two or more modalities, such as joint profiling technologies using single-cell ATAC-seq, RNA-seq, DNA methylation, Hi-C, and ChIP data. When paired cells are known, SMILE can integrate data with unmatched feature, such as genes for RNA-seq and genome wide peaks for ATAC-seq. Integrated representations learned from joint profiling technologies can then be used as a framework for comparing independent single source data. Supplementary information Supplementary data are available at Bioinformatics online. The source code of SMILE including analyses of key results in the study can be found at: https://github.com/rpmccordlab/SMILE.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 25
    Publication Date: 2021-10-30
    Description: Motivation Alignment-free (AF) distance/similarity functions are a key tool for sequence analysis. Experimental studies on real datasets abound and, to some extent, there are also studies regarding their control of false positive rate (Type I error). However, assessment of their power, i.e., their ability to identify true similarity, has been limited to some members of the D2 family. The corresponding experimental studies have concentrated on short sequences, a scenario no longer adequate for current applications, where sequence lengths may vary considerably. Such a State of the Art is methodologically problematic, since information regarding a key feature such as power is either missing or limited. Results By concentrating on a representative set of word-frequency based AF functions, we perform the first coherent and uniform evaluation of the power, involving also Type I error for completeness. Two Alternative models of important genomic features (CIS Regulatory Modules and Horizontal Gene Transfer), a wide range of sequence lengths from a few thousand to millions, and different values of k have been used. As a result, we provide a characterization of those AF functions that is novel and informative. Indeed, we identify weak and strong points of each function considered, which may be used as a guide to choose one for analysis tasks. Remarkably, of the fifteen functions that we have considered, only four stand out, with small differences between small and short sequence length scenarios. Finally, in order to encourage the use of our methodology for validation of future AF functions, the Big Data platform supporting it is public. Availability The software is available at: https://github.com/pipp8/power_statistics Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 26
    Publication Date: 2021-10-27
    Description: The form of Charcot–Marie-Tooth type 4B (CMT4B) disease caused by mutations in myotubularin-related 5 (MTMR5; also called SET Binding Factor 1; SBF1) shows a spectrum of axonal and demyelinating nerve phenotypes. This contrasts with the CMT4B subtypes caused by MTMR2 or MTMR13 (SBF2) mutations, which are characterized by myelin outfoldings and classic demyelination. Thus, it is unclear whether MTMR5 plays an analogous or distinct role from that of its homolog, MTMR13, in the peripheral nervous system (PNS). MTMR5 and MTMR13 are pseudophosphatases predicted to regulate endosomal trafficking by activating Rab GTPases and binding to the phosphoinositide 3-phosphatase MTMR2. In the mouse PNS, Mtmr2 was required to maintain wild type levels of Mtmr5 and Mtmr13, suggesting that these factors function in discrete protein complexes. Genetic elimination of both Mtmr5 and Mtmr13 in mice led to perinatal lethality, indicating that the two proteins have partially redundant functions during embryogenesis. Loss of Mtmr5 in mice did not cause CMT4B-like myelin outfoldings. However, adult Mtmr5−/− mouse nerves contained fewer myelinated axons than control nerves, likely as a result of axon radial sorting defects. Consistently, Mtmr5 levels were highest during axon radial sorting and fell sharply after postnatal day seven. Our findings suggest that Mtmr5 and Mtmr13 ensure proper axon radial sorting and Schwann cell myelination, respectively, perhaps through their direct interactions with Mtmr2. This study enhances our understanding of the non-redundant roles of the endosomal regulators MTMR5 and MTMR13 during normal peripheral nerve development and disease.
    Print ISSN: 0964-6906
    Electronic ISSN: 1460-2083
    Topics: Biology , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 27
    Publication Date: 2021-10-30
    Description: Motivation Coiled-coil is composed of two or more helices that are wound around each other. It widely exists in proteins and has been discovered to play a variety of critical roles in biology processes. Generally, there are three types of structural features in coiled-coil: coiled-coil domain (CCD), oligomeric state, and register. However, most of the existing computational tools only focus on one of them. Results Here, we describe a new deep learning model, CoCoPRED, which is based on convolutional layers, bidirectional long short-term memory, and attention mechanism. It has three networks, i.e., CCD network, oligomeric state network, and register network, corresponding to the three types of structural features in coiled-coil. This means CoCoPRED has the ability of fulfilling comprehensive prediction for coiled-coil proteins. Through the 5-fold cross-validation experiment, we demonstrate that CoCoPRED can achieve better performance than the state-of-the-art models on both CCD prediction and oligomeric state prediction. Further analysis suggests the CCD prediction may be a performance indicator of the oligomeric state prediction in CoCoPRED. The attention heads in CoCoPRED indicate that registers a, b, and e are more crucial for the oligomeric state prediction. Availability CoCoPRED is available at http://www.csbio.sjtu.edu.cn/bioinf/CoCoPRED. Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 28
    Publication Date: 2021-10-28
    Description: Motivation Inference of Identity-by-descent (IBD) sharing along the genome between pairs of individuals has important uses. But all existing inference methods are based on genotypes, which is not ideal for low-depth Next Generation Sequencing (NGS) data from which genotypes can only be called with high uncertainty. Results We present a new probabilistic software tool, LocalNgsRelate, for inferring IBD sharing along the genome between pairs of individuals from low-depth NGS data. Its inference is based on genotype likelihoods instead of genotypes, and thereby it takes the uncertainty of the genotype calling into account. Using real data from the 1000 Genomes project, we show that LocalNgsRelate provides more accurate IBD inference for low-depth NGS data than two state-of-the-art genotype based methods, Albrechtsen et al. (2009) and hap-IBD. We also show that the method works well for NGS data down to a depth of 2X. Availability LocalNgsRelate is freely available at https://github.com/idamoltke/LocalNgsRelate Supplementary Data Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 29
    Publication Date: 2021-10-28
    Description: The molecular mechanisms leading to high altitude pulmonary hypertension (HAPH) remains poorly understood. We previously analyzed the whole genome sequence of Kyrgyz highland population and identified eight genomic intervals having a potential role in HAPH. Tropomodulin 3 gene (TMOD3) which encodes a protein that binds and caps the pointed ends of actin filaments and inhibits cell migration, was one of the top candidates. Here we systematically sought additional evidence to validate the functional role of TMOD3. In-silico analysis reveals that some of the SNPs in HAPH associated genomic intervals were positioned in a regulatory region that could result in alternative splicing of TMOD3. In order to functionally validate the role of TMOD3 in HAPH, we exposed Tmod3−/+ mice to 4 weeks of constant hypoxia, i.e. 10% O2 and analyzed both functional (hemodynamic measurements) and structural (angiography) parameters related to HAPH. The hemodynamic measurements, such as right ventricular systolic pressure, a surrogate measure for pulmonary arterial systolic pressure, and right ventricular contractility (RV- ± dP/dt), increases with hypoxia did not separate between Tmod3−/+ and control mice. Remarkably, there was a significant increase in the number of lung vascular branches and total length of pulmonary vascular branches (p 
    Print ISSN: 0964-6906
    Electronic ISSN: 1460-2083
    Topics: Biology , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 30
    Publication Date: 2021-10-20
    Description: Summary Whole genome assembly (WGA) of bacterial genomes with short reads is a quite common task as DNA sequencing has become cheaper with the advances of its technology. The process of assembling a genome has no absolute golden standard and it requires to perform a sequence of steps each of which can involve combinations of many different tools. However, the quality of the final assembly is always strongly related to the quality of the input data. With this in mind we built WGA-LP, a package that connects state-of-the-art programs for microbial analysis and novel scripts to check and improve the quality of both samples and resulting assemblies. WGA-LP, with its conservative decontamination approach, has shown to be capable of creating high quality assemblies even in the case of contaminated reads. Availability and implementation WGA-LP is available on GitHub (https://github.com/redsnic/WGA-LP) and Docker Hub (https://hub.docker.com/r/redsnic/wgalp). The web app for node visualization is hosted by shinyapps.io (https://redsnic.shinyapps.io/ContigCoverageVisualizer/). Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 31
    Publication Date: 2021-10-27
    Description: We have previously described two flow cytometry-based in vitro genotoxicity tests: micronucleus (MN) scoring (MicroFlow ®) and a multiplexed DNA damage response biomarker assay (MultiFlow ®). Here, we describe a strategy for combining the assays in order to efficiently supplement MN analyses with a panel of biomarkers that comment on cytotoxicity (i.e., relative nuclei count, relative increased nuclei count, cleaved PARP-positive chromatin, and ethidium monoazide-positive chromatin) and genotoxic mode of action (i.e., γH2AX, phospho-histone H3, p53 activation, and polyploidy). For these experiments, human TK6 cells were exposed to each of 32 well-studied reference chemicals in 96-well plates for 24 continuous hr. The test chemicals were evaluated over a range of concentrations in the presence and absence of a rat liver S9-based metabolic activation system. MultiFlow assay data were acquired at 4 and 24 hr, and MN were scored at 24 hr. Testing 32 chemicals in two metabolic activation arms translated into 64 a priori calls: 42 genotoxicants and 22 non-genotoxicants. The MN assay showed high sensitivity and moderate specificity (90% and 68%, respectively). When a genotoxic call required significant MN and MultiFlow responses, specificity increased to 95% without adversely affecting sensitivity. The dose response data were analyzed with PROAST Benchmark Dose (BMD) software in order to calculate potency metrics for each endpoint, and ToxPi software was used to synthesize the resulting lower and upper bound 90% confidence intervals into visual profiles. The BMD/ToxPi combination was found to represent a powerful strategy for synthesizing multiple BMD confidence intervals, as the software output provided MoA information as well as insights into genotoxic potency.
    Print ISSN: 0267-8357
    Electronic ISSN: 1464-3804
    Topics: Biology , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 32
    Publication Date: 2021-10-28
    Description: Pathogenic variants that disrupt human mitochondrial protein synthesis are associated with a clinically heterogenous group of diseases. Despite an impairment in oxidative phosphorylation being a common phenotype, the underlying molecular pathogenesis is more complex than simply a bioenergetic deficiency. Currently, we have limited mechanistic understanding on the scope by which a primary defect in mitochondrial protein synthesis contributes to organelle dysfunction. Since the proteins encoded in the mitochondrial genome are hydrophobic and need co-translational insertion into a lipid bilayer, responsive quality control mechanisms are required to resolve aberrations that arise with the synthesis of truncated and misfolded proteins. Here, we show that defects in the OXA1L-mediated insertion of MT-ATP6 nascent chains into the mitochondrial inner membrane are rapidly resolved by the AFG3L2 protease complex. Using pathogenic MT-ATP6 variants, we then reveal discrete steps in this quality control mechanism and the differential functional consequences to mitochondrial gene expression. The inherent ability of a given cell type to recognize and resolve impairments in mitochondrial protein synthesis may in part contribute at the molecular level to the wide clinical spectrum of these disorders.
    Print ISSN: 0964-6906
    Electronic ISSN: 1460-2083
    Topics: Biology , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 33
    Publication Date: 2021-10-28
    Description: Motivation The adoption of current single-cell DNA methylation sequencing protocols is hindered by incomplete coverage, outlining the need for effective imputation techniques. The task of imputing single-cell (methylation) data requires models to build an understanding of underlying biological processes. Results We adapt the transformer neural network architecture to operate on methylation matrices through combining axial attention with sliding window self-attention. The obtained CpG Transformer displays state-of-the-art performances on a wide range of scBS-seq and scRRBS-seq datasets. Furthermore, we demonstrate the interpretability of CpG Transformer and illustrate its rapid transfer learning properties, allowing practitioners to train models on new datasets with a limited computational and time budget. Availability and Implementation CpG Transformer is freely available at https://github.com/gdewael/cpg-transformer. Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 34
    Publication Date: 2021-10-28
    Description: Motivation With the advancement of sequencing technologies, genomic data sets are constantly being expanded by high volumes of different data types. One recently introduced data type in genomic science is genomic signals, which are usually short-read coverage measurements over the genome. To understand and evaluate the results of such studies, one needs to understand and analyze the characteristics of the input data. Results SigTools is an R-based genomic signals visualization package developed with two objectives: 1) to facilitate genomic signals exploration in order to uncover insights for later model training, refinement, and development by including distribution and autocorrelation plots. 2) to enable genomic signals interpretation by including correlation, and aggregation plots. In addition, our corresponding web application, SigTools-Shiny, extends the accessibility scope of these modules to people who are more comfortable working with graphical user interfaces instead of command-line tools. Availability SigTools source code, installation guide, and manual is freely available on http://github.com/shohre73.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 35
    Publication Date: 2021-10-27
    Description: Introduction In the era of personalized medicine with more and more patient specific targeted therapies being used, we need reliable, dynamic, faster, and sensitive biomarkers both to track the causes of disease and to develop and evolve therapies during the course of treatment. Metabolomics recently has shown substantial evidence to support its emerging role in disease diagnosis and prognosis. Aside from biomarkers and development of therapies, it is also an important goal to understand the involvement of mitochondrial DNA mtDNA in metabolic regulation, aging, and disease development. Somatic mutations of the mitochondrial genome are also heavily implicated in age-related disease and aging. The general hypothesis is that an alteration in the concentration of metabolite profiles (possibly conveyed by lifestyle and environmental factors) influences the increase of mutation rate in the mtDNA, and thereby contributes to a range of pathophysiological alterations observed in complex diseases. Methods We performed an inverted mitochondrial genome wide association analysis between mitochondrial nucleotide variants (mtSNVs) and concentration of metabolites. We used 151 metabolites and the whole sequenced mitochondrial genome from 2718 individuals to identify genetic variants associated with metabolite profiles. Because of the high coverage, next generation sequencing-based analysis of the mitochondrial genome allows for an accurate detection of mitochondrial heteroplasmy and for identification of variants associated with the metabolome. Results The strongest association was found for mt715G 〉 A located in the MT-12SrRNA with the metabolite ratio C2/C10:1 (p-value = 6.82*10−09, β = 0.909). The second most significant mtSNV was found for mt3714A 〉 G located in the MT-ND1 with the metabolite ratio PC ae C42:5/PC ae C44:5 (p-value = 1.02*10−08, β = 3.631). A large number of significant metabolite ratios were observed involving PC aa C36:6 and the variant mt10689G 〉 A, located in the MT-ND4L gene. Conclusion These results show an important interconnection between mitochondria and metabolite concentrations. Considering that some of the significant metabolites found in this study have been previously related to complex diseases such as neurological disorders and metabolic conditions, these associations found here might play a crucial role for further investigations of such complex diseases. Understanding the mechanisms that control human health and disease, in particular the role of genetic predispositions and their interaction with environmental factors is a prerequisite for the development of safe and efficient therapies for complex disorders.
    Print ISSN: 0964-6906
    Electronic ISSN: 1460-2083
    Topics: Biology , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 36
    Publication Date: 2021-10-30
    Description: Motivation Adverse Outcome Pathways (AOPs) are a conceptual framework developed to support the use of alternative toxicology approaches in the risk assessment. AOPs are structured linear organizations of existing knowledge illustrating causal pathways from the initial molecular perturbation triggered by various stressors, through key events (KEs) at different levels of biology, to the ultimate health or ecotoxicological adverse outcome. Results Artificial intelligence can be used to systematically explore available toxicological data that can be parsed in the scientific literature. Recently a tool called AOP-helpFinder was developed to identify associations between stressors and KEs supporting thus documentation of AOPs. To facilitate the utilization of this advanced bioinformatics tool by the scientific and the regulatory community, a webserver was created. The proposed AOP-helpFinder webserver uses better performing version of the tool which reduces the need for manual curation of the obtained results. As an example, the server was successfully applied to explore relationships of a set of endocrine disruptors with metabolic-related events. The AOP-helpFinder webserver assists in a rapid evaluation of existing knowledge stored in the PubMed database, a global resource of scientific information, to build AOPs and Adverse Outcome Networks (AONs) supporting the chemical risk assessment. Availability and implementation AOP-helpFinder is available at http://aop-helpfinder.u-paris-sciences.fr/index.php
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 37
    Publication Date: 2021-10-30
    Description: Motivation Differential network inference is a fundamental and challenging problem to reveal gene interactions and regulation relationships under different conditions. Many algorithms have been developed for this problem; however, they do not consider the differences between the importance of genes, which may not fit the real-world situation. Different genes have different mutation probabilities, and the vital genes associated with basic life activities have less fault tolerance to mutation. Equally treating all genes may bias the results of differential network inference. Thus, it is necessary to consider the importance of genes in the models of differential network inference. Results Based on the Gaussian graphical model with adaptive gene importance regularization, we develop a novel importance-penalized joint graphical Lasso method, IPJGL, for differential network inference. The presented method is validated by the simulation experiments as well as the real datasets. Furthermore, to precisely evaluate the results of differential network inference, we propose a new metric named APC2 for the differential levels of gene pairs. We apply IPJGL to analyze the TCGA colorectal and breast cancer datasets and find some candidate cancer genes with significant survival analysis results, including SOST for colorectal cancer and RBBP8 for breast cancer. We also conduct further analysis based on the interactions in the Reactome database and confirm the utility of our method. Availability R source code of importance-penalized joint graphical lasso is freely available at https://github.com/Wu-Lab/IPJGL. Supplementary information Supplementary materials are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 38
    Publication Date: 2021-10-30
    Description: Background Almost half of aromatase inhibitor (AI)-treated breast cancer patients experience AI-associated musculoskeletal symptoms (AIMSS); 20-30% discontinue treatment because of severe symptoms. We hypothesized that we could identify predictors of pain reduction in AIMSS intervention trials by combining data from previously conducted trials. Methods We pooled patient-level data from 3 randomized trials testing interventions (omega-3 fatty acids, acupuncture, and duloxetine) for AIMSS that had similar eligibility criteria and the same patient-reported outcome measures. Only patients with baseline Brief Pain Inventory (BPI) average pain score of ≥ 4 of 10 were included. The primary outcome examined was 2-point reduction in average pain from baseline to week 12. Variable cut-point selection and logistic regression were used. Risk models were built by summing the number of factors statistically significantly associated with pain reduction. Analyses were stratified by study and adjusted for treatment arm. Results For the 583 analyzed patients, the four factors statistically significantly associated with pain reduction were FACT Functional Well-Being 〉24 and Physical Well-Being 〉14 (higher scores reflect better function), and WOMAC
    Electronic ISSN: 2515-5091
    Topics: Chemistry and Pharmacology , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 39
    Publication Date: 2021-10-28
    Description: Motivation DNA Methylation plays a key role in a variety of biological processes. Recently, Nanopore long-read sequencing has enabled direct detection of these modifications. As a consequence, a range of computational methods have been developed to exploit Nanopore data for methylation detection. However, current approaches rely on a human-defined threshold to detect the methylation status of a genomic position and are not optimized to detect sites methylated at low frequency. Furthermore, most methods employ either the Nanopore signals or the basecalling errors as the model input and do not take advantage of their combination. Results Here we present DeepMP, a convolutional neural network (CNN)-based model that takes information from Nanopore signals and basecalling errors to detect whether a given motif in a read is methylated or not. Besides, DeepMP introduces a threshold-free position modification calling model sensitive to sites methylated at low frequency across cells. We comprehensively benchmarked DeepMP against state-of-the-art methods on E. coli, human and pUC19 datasets. DeepMP outperforms current approaches at read-based and position-based methylation detection across sites methylated at different frequencies in the three datasets. Availability DeepMP is implemented and freely available under MIT license at https://github.com/pepebonet/DeepMP Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 40
    Publication Date: 2021-10-27
    Description: Background To observe a long-term prognosis in late-onset multiple acyl-coenzyme-A dehydrogenation deficiency(MADD) patients and to determine whether riboflavin should be administrated in the long-term and high-dosage manner. Methods We studied the clinical, pathological and genetic features of 110 patients with late-onset MADD in a single neuromuscular center. The plasma riboflavin levels and a long-term follow-up were performed. Results Fluctuating proximal muscle weakness, exercise intolerance and dramatic responsiveness to riboflavin treatment were essential clinical features for all 110 MADD patients. Among them, we identified 106 cases with ETFDH variants, 1 case with FLAD1 variants and 3 cases without causal variants. On muscle pathology, fibers with cracks, atypical ragged red fibers(aRRFs) and diffuse decrease of SDH activity were the distinctive features of these MADD patients. The plasma riboflavin levels before treatment were significantly decreased in these patients as compared to healthy controls. Among 48 MADD patients with a follow-up of 6.1 years on average, 31 patients were free of muscle weakness recurrence, while 17 patients had episodes of slight muscle weakness upon riboflavin withdrawal, but recovered after retaking a small-dose of riboflavin for a short-term. Multivariate Cox regression analysis showed vegetarian diet and masseter weakness were independent risk factors for muscle weakness recurrence. Conclusion Fibers with cracks, aRRFs and diffuse decreased SDH activity distinguish MADD from other genotypes of lipid storage myopathy. For late-onset MADD, increased fatty acid oxidation and reduced riboflavin levels can induce episodes of muscle symptoms, which can be treated by short-term and small-dose of riboflavin therapy.
    Print ISSN: 0964-6906
    Electronic ISSN: 1460-2083
    Topics: Biology , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 41
    Publication Date: 2021-10-19
    Description: Motivation Reverse engineering of gene regulatory networks (GRNs) has long been an attractive research topic in system biology. Computational prediction of gene regulatory interactions has remained a challenging problem due to the complexity of gene expression and scarce information resources. The high-throughput spatial gene expression data, like in situ hybridization images that exhibit temporal and spatial expression patterns, has provided abundant and reliable information for the inference of GRNs. However, computational tools for analyzing the spatial gene expression data are highly underdeveloped. Results In this study, we develop a new method for identifying gene regulatory interactions from gene expression images, called ConGRI. The method is featured by a contrastive learning scheme and deep Siamese convolutional neural network architecture, which automatically learns high-level feature embeddings for the expression images and then feeds the embeddings to an artificial neural network to determine whether or not the interaction exists. We apply the method to a Drosophila embryogenesis dataset and identify GRNs of eye development and mesoderm development. Experimental results show that ConGRI outperforms previous traditional and deep learning methods by a large margin, which achieves accuracies of 76.7% and 68.7% for the GRNs of early eye development and mesoderm development, respectively. It also reveals some master regulators for Drosophila eye development. Availabilityand implementation https://github.com/lugimzheng/ConGRI. Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 42
    Publication Date: 2021-10-28
    Description: The regeneration-associated gene (RAG) expression program is activated in injured peripheral neurons after axotomy and enables long-distance axon re-growth. Over 1000 genes are regulated, and many transcription factors are upregulated or activated as part of this response. However, a detailed picture of how RAG expression is regulated is lacking. In particular the transcriptional targets and specific functions of the various transcription factors are unclear. Jun was the first regeneration-associated transcription factor identified and the first shown to be functionally important. Here we fully define the role of Jun in the RAG expression program in regenerating facial motor neurons. At 1, 4, and 14 days after axotomy, Jun upregulates 11%, 23% and 44% of the RAG program, respectively. Jun functions relevant to regeneration include cytoskeleton production, metabolic functions and cell activation, and the down-regulation of neurotransmission machinery. In silico analysis of promoter regions of Jun targets identifies stronger over-representation of AP1-like sites than CRE-like sites, although CRE sites were also over-represented in regions flanking AP1 sites. Strikingly, in motor neurons lacking Jun, an alternative SRF-dependent gene expression program is initiated after axotomy. The promoters of these newly expressed genes exhibit over-representation of CRE sites in regions near to SRF target sites. This alternative gene expression program includes plasticity-associated transcription factors, and leads to an aberrant early increase in synapse density on motor neurons. Jun thus has the important function in the early phase after axotomy of pushing the injured neuron away from a plasticity response and towards a regenerative phenotype.
    Print ISSN: 0964-6906
    Electronic ISSN: 1460-2083
    Topics: Biology , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 43
    Publication Date: 2021-10-21
    Description: Summary Eukaryotic gene expression requires coordination among hundreds of transcriptional regulators. To characterize a specific transcriptional regulator, identifying how it shares genomic-binding profiles with others can generate important insights into its action. As genomic data such as ChIP-Seq are being rapidly generated from individual labs, there is a demand for timely integration and analysis of these new data. We have developed an R package, GPSmatch (Genomic-binding Profile Similarity match), for calculating the Jaccard index to compare ChIP-Seq peaks from one experiment to the peaks of other ChIP-Seq experiments stored in a user-supplied customizable database. GPSmatch also evaluates the statistical significance of the calculated Jaccard index using a nonparametric Monte Carlo procedure. We show that GPSmatch is suitable for identifying transcriptional regulators that share similar genomic-binding profiles, which may unravel potential mechanistic actions of gene regulation. Availability The software is freely available at https://github.com/Bao-Lab/GPSmatch.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 44
    Publication Date: 2021-02-01
    Description: Motivation Even within well-studied organisms, many genes lack useful functional annotations. One way to generate such functional information is to infer biological relationships between genes/proteins, using a network of gene coexpression data that includes functional annotations. However, the lack of trustworthy functional annotations can impede the validation of such networks. Hence, there is a need for a principled method to construct gene coexpression networks that capture biological information and are structurally stable even in the absence of functional information. Results We introduce the concept of signed distance correlation as a measure of dependency between two variables, and apply it to generate gene coexpression networks. Distance correlation offers a more intuitive approach to network construction than commonly used methods, such as Pearson correlation and mutual information. We propose a framework to generate self-consistent networks using signed distance correlation purely from gene expression data, with no additional information. We analyse data from three different organisms to illustrate how networks generated with our method are more stable and capture more biological information compared to networks obtained from Pearson correlation or mutual information. Availability and implementation Code is available online (https://github.com/javier-pardodiaz/sdcorGCN). Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 45
    Publication Date: 2021-10-27
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 46
    Publication Date: 2020-10-27
    Description: Motivation As genomic data becomes more abundant, efficient algorithms and data structures for sequence alignment become increasingly important. The suffix array is a widely used data structure to accelerate alignment, but the binary search algorithm used to query, it requires widespread memory accesses, causing a large number of cache misses on large datasets. Results Here, we present Sapling, an algorithm for sequence alignment, which uses a learned data model to augment the suffix array and enable faster queries. We investigate different types of data models, providing an analysis of different neural network models as well as providing an open-source aligner with a compact, practical piecewise linear model. We show that Sapling outperforms both an optimized binary search approach and multiple widely used read aligners on a diverse collection of genomes, including human, bacteria and plants, speeding up the algorithm by more than a factor of two while adding
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
Close ⊗
This website uses cookies and the analysis tool Matomo. More information can be found here...