ALBERT

All Library Books, journals and Electronic Records Telegrafenberg

Your email was sent successfully. Check your inbox.

An error occurred while sending the email. Please try again.

Proceed reservation?

Export
Filter
  • Articles  (36,775)
  • Molecular Diversity Preservation International  (25,745)
  • Oxford University Press  (6,336)
  • Institute of Electrical and Electronics Engineers (IEEE)  (4,694)
  • 2020-2022  (13,884)
  • 2015-2019  (16,465)
  • 2010-2014  (6,426)
  • 1990-1994
  • 1945-1949
  • 2021  (13,884)
  • 2018  (16,465)
  • 2012  (6,426)
  • Electrical Engineering, Measurement and Control Technology  (15,640)
  • Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition  (12,066)
  • Computer Science  (7,349)
  • Economics  (4,002)
  • Political Science
Collection
  • Articles  (36,775)
Publisher
Years
  • 2020-2022  (13,884)
  • 2015-2019  (16,465)
  • 2010-2014  (6,426)
  • 1990-1994
  • 1945-1949
  • +
Year
Topic
  • 1
    Publication Date: 2012-02-02
    Description: This paper presents a benchmarking, performance analysis and optimization study of the OP2 ‘active’ library, which provides an abstraction framework for the parallel execution of unstructured mesh applications. OP2 aims to decouple the scientific specification of the application from its parallel implementation, and thereby achieve code longevity and near-optimal performance through re-targeting the application to execute on different multi-core/many-core hardware. Runtime performance results are presented for a representative unstructured mesh application on a variety of many-core processor systems, including traditional X86 architectures from Intel (Xeon based on the older Penryn and current Nehalem micro-architectures) and GPU offerings from NVIDIA (GTX260, Tesla C2050). Our analysis demonstrates the contrasting performance between the use of CPU (OpenMP) and GPU (CUDA) parallel implementations for the solution of an industrial-sized unstructured mesh consisting of about 1.5 million edges. Results show the significance of choosing the correct partition and thread-block configuration, the factors limiting the GPU performance and insights into optimizations for improved performance.
    Print ISSN: 0010-4620
    Electronic ISSN: 1460-2067
    Topics: Computer Science
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 2
    Publication Date: 2012-02-02
    Description: In this paper, we describe the integrated power, area and thermal modeling framework in the structural simulation toolkit (SST) for large-scale high performance computer simulation. It integrates various power and thermal modeling tools and computes run-time energy dissipation for core, network on chip, memory controller and shared cache. It also provides functionality to update the leakage power as temperature changes. We illustrate the utilization of the framework by applying it to explore interconnect options in manycore systems with consideration of temperature variation and leakage feedback. We compare power, energy-delay-area product (EDAP) and energy-delay product (EDP) of four manycore configurations-1 core, 2 cores, 4 cores and 8 cores per cluster. Results from simulation with or without consideration of temperature variation both show that the 4-core per cluster configuration has the best EDAP and EDP. Even so, considering that temperature variation increases total power dissipation, we demonstrate the importance of considering temperature variation in the design flow. With this power, area and thermal modeling capability, the SST can be used for hardware/software co-design of future exascale systems.
    Print ISSN: 0010-4620
    Electronic ISSN: 1460-2067
    Topics: Computer Science
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 3
    Publication Date: 2012-02-02
    Description: With the advent of heterogeneous computing systems consisting of multi-core central processing units (CPUs) and many-core graphics processing units (GPUs), robust methods are needed to facilitate fair benchmark comparisons between different systems. In this paper, we present a benchmarking methodology for measuring a number of performance metrics for heterogeneous systems. Methods for comparing performance and energy efficiency are included. Consideration is given to further metrics, such as associated running costs and carbon emissions. We give a case study for these metrics applied to Bristol University Docking engine, a molecular mechanics-based docking application that has been ported to open computing language at the University of Bristol. Results are included for both AMD and NVIDIA GPUs, and for a highly optimized code on the latest x86 CPUs.
    Print ISSN: 0010-4620
    Electronic ISSN: 1460-2067
    Topics: Computer Science
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 4
    Publication Date: 2012-02-02
    Description: Java embedded systems often include Java middleware classes installed on the client device. For higher performance, they can be compiled into machine code before runtime using an ahead-of-time compiler (AOTC). There are many approaches to AOTC, yet a bytecode-to-C (b-to-C) AOTC which translates the bytecode into the C code and then compiles it using an existing optimizing compiler such as gcc would be the most straightforward one. This paper explores a few important design and optimization issues of a b-to-C AOTC, including the compilation form for the translated C code, the call interfaces among translated and interpreted Java methods, and Java-specific optimizations by the AOTC that can complement the gcc optimizations. We evaluate these issues with our b-to-C AOTC implemented on the MIPS platform for the Sun's CDC VM to understand their performance impact.
    Print ISSN: 0010-4620
    Electronic ISSN: 1460-2067
    Topics: Computer Science
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 5
    Publication Date: 2012-02-02
    Print ISSN: 0010-4620
    Electronic ISSN: 1460-2067
    Topics: Computer Science
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 6
    Publication Date: 2012-02-11
    Description: Longitudinal or hierarchical data are often observed in forestry, which can pose both challenges and opportunities when performing statistical analyses. The current standard approach for analysing these types of data is mixed-effects models under the frequentist paradigm. Bayesian techniques have several advantages when compared with traditional approaches, but their use in forestry has been relatively limited. In this paper, we propose a Bayesian solution to non-linear mixed-effects models for longitudinal data in forestry. We demonstrate the Bayesian modelling process using individual tree height–age data for balsam fir ( Abies balsamea (L.)) collected from eastern Maine. Due to its frequent utilization in modelling dominant tree height growth over time, we choose to examine models based on the Chapman–Richards function. We established four different model formulations, each having varying subject-specific parameters, which we estimated using both frequentist and Bayesian approaches. We found the estimation results to be quite close between the two methods. In addition, an important feature of the Bayesian method is the unified manner in which estimation and prediction are handled. Specifically, local parameters can be predicted for a new dataset after setting the posterior distributions from the estimation stage as new priors in the prediction phase.
    Print ISSN: 0015-752X
    Electronic ISSN: 1464-3626
    Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 7
    Publication Date: 2012-02-11
    Description: This study compares the financial return of converting old-growth boreal stands into even-aged stands to that of two approaches of selection cutting in Quebec (Canada). In this region, old-growth stands are usually harvested by completely removing the canopy while protecting the abundant advance regeneration, an approach known as careful logging around advance growth (CLAAG). These approaches were compared using a time frame of over 200 years. Consideration is given to the majority of the operating costs leading to end products. The financial analysis integrates Monte Carlo simulations, making it possible to consider the uncertainty associated with variables. The net present values (NPVs) are then associated with a distribution of probabilities. The results show that the probabilities of obtaining positive NPVs are high for all treatments, suggesting that selection cutting approaches can be appropriate alternatives to CLAAG for some stands. Depending on the criteria used, the CLAAG cut or one of the selection cuts show the best performances. In fact, the results of the financial study show that in the future, selection cutting approaches will be more profitable than CLAAG but still less than present CLAAG operations. This occurs because, according to the current yield curves and rotation ages, future stands managed with CLAAG will have smaller and less valuable trees than in the primary forest and in stands managed with the selection system.
    Print ISSN: 0015-752X
    Electronic ISSN: 1464-3626
    Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 8
    Publication Date: 2012-02-11
    Description: There is a growing interest in the feasibility of establishing ‘Carbon’ forests where carbon sinks are created by maintaining forest stands out to considerable ages. In New Zealand, Pinus radiata (D. Don) is usually grown over 25- to 30-year rotations; the main aim of this paper is to examine the potential to maintain stands to 60 years or more. There were over 140 permanent sample plots in New Zealand that were maintained for at least 50 years. These data were examined to verify that growth can be sustained over this period. Net basal area per hectare and mean-top-height are graphically demonstrated to follow expected growth paths with no signs of senescence occurring. Stems per hectare loss is shown to be sometimes high, especially with dense stockings, but virtually all dying trees are small suppressed stems, so the impact on basal area yield at maturity is minimal. It is concluded that growing radiata pine on a rotation of 60 years is feasible, and it may be possible to use much longer rotations.
    Print ISSN: 0015-752X
    Electronic ISSN: 1464-3626
    Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 9
    Publication Date: 2012-02-11
    Description: The effects of site preparation practices (drainage, mounding and fertilization) on the fluxes from the soil surface of carbon dioxide (CO 2 ), methane (CH 4 ) and nitrous oxide (N 2 O) were studied on an organic-rich peaty gley soil at Harwood Forest, north-east England. Drained plots had significantly higher CO 2 fluxes but significantly lower CH 4 fluxes compared with undrained plots, while N 2 O emissions were not affected by drainage. Mounding caused significantly higher CH 4 emissions, while it significantly reduced N 2 O emissions. Fertilization caused significant increases in emissions of CO 2 , CH 4 and N 2 O. In terms of overall greenhouse warming potential, drainage and fertilization increased CO 2 -equivalent emissions by ~18–29 and 7–23 per cent, respectively, while mounding reduced CO 2 -equivalent emissions by ~8 per cent in year 1, but had no effect on emissions in year 2 of study. Soil temperature was the main environmental variable controlling CO 2 emissions, while CH 4 was controlled primarily by water table depth. Nitrous oxide emissions responded to changes in soil temperature and water table depth. In the short term, drainage and fertilization contributed to accelerating greenhouse gas emissions significantly, although their long-term effects are likely moderated by accelerating carbon accumulation in the tree biomass. Long-term studies are required to assess the cumulative impacts of site preparation practices during the whole rotation cycle.
    Print ISSN: 0015-752X
    Electronic ISSN: 1464-3626
    Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 10
    Publication Date: 2012-02-11
    Description: As signatories to the United Nation Framework Convention on Climate Change, the US has been estimating standing dead tree (SDT) carbon (C) stocks using a model based on live tree attributes. The USDA Forest Service began sampling SDTs nationwide in 1999. With comprehensive field data now available, the objective of this study was to compare field- and model-based estimates of SDT C stocks across the US to evaluate potential directions for improving National Greenhouse Gas Inventory (NGHGI) reporting and C dynamics research. Field inventory data indicated that most forests have relatively little SDT C stocks (〈1 Mg/ha), whereas large SDT C stocks (〉25 Mg/ha) are infrequent. Models used for past NGHGIs to predict SDT C stocks do not accurately reflect what was observed in inventory plots, resulting in an overestimation (~100 per cent) of SDT C stocks at the national scale. These results indicate that the current estimate of the Nation’s total forest C stock is overestimated by ~4.2 per cent due to overestimation of SDT C stocks that are a relatively small component of the total forest C stock. A field-based approach is suggested for use in future C reporting efforts to reduce estimation bias.
    Print ISSN: 0015-752X
    Electronic ISSN: 1464-3626
    Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 11
    Publication Date: 2012-02-11
    Description: Airborne laser scanning data and corresponding field data were acquired from boreal forests in Norway and Sweden, coniferous and broadleaved forests in Germany and tropical pulpwood plantations in Brazil. Treetop positions were extracted using six different algorithms developed in Finland, Germany, Norway and Sweden, and the accuracy of tree detection and height estimation was assessed. Furthermore, the weaknesses and strengths of the methods under different types of forest were analyzed. The results showed that forest structure strongly affected the performance of all algorithms. Particularly, the success of tree detection was found to be dependent on tree density and clustering. The differences in performance between methods were more pronounced for tree detection than for height estimation. The algorithms showed a slightly better performance in the conditions for which they were developed, while some could be adapted by different parameterization according to training with local data. The results of this study may help guiding the choice of method under different forest types and may be of great value for future refinement of the single-tree detection algorithms.
    Print ISSN: 0015-752X
    Electronic ISSN: 1464-3626
    Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 12
    Publication Date: 2012-02-11
    Description: Silvicultural systems which retain canopy cover during the regeneration phase have become an increasingly important form of management, but their effectiveness in controlling weed species has not been extensively studied. The development of bramble, which is a widespread competitive weed species, was observed within a c . 35-year-old Corsican pine stand thinned to remove 10, 20, 40 and 80 per cent of basal area. Cover, height and numbers of inflorescences and berries were recorded in each of the 3 years following thinning and were generally ranked according to intensity of thinning, but there were no significant differences between treatments. Shoot length, estimated using a grid-intersection method, was significantly lower in the 10 per cent compared with the 40 and 80 per cent thinning treatments. The initial length of bramble shoot present, and basal area and number of trees remaining could be used in various combinations to predict cover, height and shoot length. Although the bramble thicket was generally less well developed in the less-intense thinning treatments, these did not appear to enhance the establishment of trees. Seedlings grew best in the 80 per cent treatment and overall their mean heights were generally lower than that of bramble. Retention of overstorey cover in order to suppress bramble growth and promote tree seedling establishment during forest regeneration may not succeed.
    Print ISSN: 0015-752X
    Electronic ISSN: 1464-3626
    Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 13
    Publication Date: 2012-02-11
    Description: Three experiments were established in the 1990s to examine the impact of complete residue (brash) and above-ground biomass removal (i.e. ‘whole-tree harvesting’, WTH) at clearfelling on the subsequent growth and yield of replanted Sitka spruce ( Picea sitchensis ). The sites were of varying fertility; two would now be considered to be of ‘medium’ risk for brash removal, while one would be a ‘high’-risk site. The interactions between brash removal and regular remedial fertilizer applications and weed control regimes were also investigated at each site. After 10 years, trees had been successfully established at all sites, and in most cases, the treatments were close to canopy closure. The main effects observed at all sites were due to brash retention and fertilizer application. The benefits from the former were not evident until the last stages of the establishment period, whereas benefits from fertilizer application were evident once the trees had reached 5–6 years of age. The impacts of weed control were inconsistent, providing temporary benefits on the more fertile sites, and having a negative effect on the poorest site, primarily because the herbicide regime favoured the development of ericaceous vegetation which competed with the planted trees for nutrients. After 10 years at the two medium-risk sites, the difference in growth between plots with brash retained and those where brash was removed was 5–9 per cent for height growth and 5–7 per cent for diameter. However, at the poorest site, the equivalent differences were ~9 per cent and 19 per cent. The results show that the impact of brash removal due to WTH are significantly affected by site type and soil fertility and also that it may take nearly a decade before the impacts of such practices are evident.
    Print ISSN: 0015-752X
    Electronic ISSN: 1464-3626
    Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 14
    Publication Date: 2012-02-11
    Description: Policy makers, scientists and civil society are involved in the development of criteria and indicators for sustainable forest management (SFM), reflecting the growing worldwide demand for addressing sustainable development and environmental governance management goals. Such frameworks have been largely derived from expert-led or community-based approaches. This article details the identification process of criteria and indicators (C & I) based on the international principles of SFM through the analysis of a hybrid approach that uses both a top-down (TD) and a bottom-up (BU) approach. The aim of this article is to discuss how the two approaches have worked to incorporate the different views, opinions and experiences of experts and stakeholders. National-level C & I are then compared with those at the local level, making specific reference to sustainable community forest (CF) management. For the TD approach, a Delphi survey was conducted where 121 experts shared their knowledge, experience and judgements in assessing a set of 72 indicators with regard to the applicability, practicality and importance of national, regional and CF management in Nepal. For the BU approach, C & I for CF management were developed with the direct involvement of various stakeholders. It was shown that such a hybrid approach is feasible from a methodological point of view, but a framework is needed by the government to more fully utilize the opportunities of the C & I development process in the SFM context. The results of this study also help to bridge the gap between the ad hoc planning of decision makers and the requirement for a holistic management system, which includes participatory processes. Based on the conclusions of this study, general recommendations for the methodological design of C & I development in similar studies are given.
    Print ISSN: 0015-752X
    Electronic ISSN: 1464-3626
    Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 15
    facet.materialart.
    Unknown
    Oxford University Press
    In: Forestry
    Publication Date: 2012-02-11
    Print ISSN: 0015-752X
    Electronic ISSN: 1464-3626
    Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 16
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-02-11
    Print ISSN: 0885-8969
    Electronic ISSN: 1558-0059
    Topics: Electrical Engineering, Measurement and Control Technology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 17
    Publication Date: 2012-02-17
    Description: Motivation: High-throughput sequencing (HTS) technologies have made low-cost sequencing of large numbers of samples commonplace. An explosion in the type, not just number, of sequencing experiments has also taken place including genome re-sequencing, population-scale variation detection, whole transcriptome sequencing and genome-wide analysis of protein-bound nucleic acids. Results: We present Artemis as a tool for integrated visualization and computational analysis of different types of HTS datasets in the context of a reference genome and its corresponding annotation. Availability: Artemis is freely available (under a GPL licence) for download (for MacOSX, UNIX and Windows) at the Wellcome Trust Sanger Institute websites: http://www.sanger.ac.uk/resources/software/artemis/ . Contact: artemis@sanger.ac.uk ; tjc@sanger.ac.uk
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 18
    Publication Date: 2012-02-17
    Description: : microRibonucleic acid (miRNAs) are small regulatory molecules that act by mRNA degradation or via translational repression. Although many miRNAs are ubiquitously expressed, a small subset have differential expression patterns that may give rise to tissue-specific complexes. Motivation: This work studies gene targeting patterns amongst miRNAs with differential expression profiles, and links this to control and regulation of protein complexes. Results: We find that, when a pair of miRNAs are not expressed in the same tissues, there is a higher tendency for them to target the direct partners of the same hub proteins. At the same time, they also avoid targeting the same set of hub-spokes. Moreover, the complexes corresponding to these hub-spokes tend to be specific and nonoverlapping. This suggests that the effect of miRNAs on the formation of complexes is specific. Contact: wongls@comp.nus.edu.sg Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 19
    Publication Date: 2012-02-17
    Description: Motivation: Small interfering RNAs (siRNAs) are produced from much longer sequences of double-stranded RNA precursors through cleavage by Dicer or a Dicer-like protein. These small RNAs play a key role in genetic and epigenetic regulation; however, a full understanding of the mechanisms by which they operate depends on the characterization of the precursors from which they are derived. Results: High-throughput sequencing of small RNA populations allows the locations of the double-stranded RNA precursors to be inferred. We have developed methods to analyse small RNA sequencing data from multiple biological sources, taking into account replicate information, to identify robust sets of siRNA precursors. Our methods show good performance on both a set of small RNA sequencing data in Arabidopsis thaliana and simulated datasets. Availability: Our methods are available as the Bioconductor ( www.bioconductor.org ) package segmentSeq (version 1.5.6 and above). Contact: tjh48@cam.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 20
    Publication Date: 2012-02-17
    Description: Motivation: Intrinsically disordered regions are key for the function of numerous proteins, and the scant available experimental annotations suggest the existence of different disorder flavors. While efficient predictions are required to annotate entire genomes, most existing methods require sequence profiles for disorder prediction, making them cumbersome for high-throughput applications. Results: In this work, we present an ensemble of protein disorder predictors called ESpritz. These are based on bidirectional recursive neural networks and trained on three different flavors of disorder, including a novel NMR flexibility predictor. ESpritz can produce fast and accurate sequence-only predictions, annotating entire genomes in the order of hours on a single processor core. Alternatively, a slower but slightly more accurate ESpritz variant using sequence profiles can be used for applications requiring maximum performance. Two levels of prediction confidence allow either to maximize reasonable disorder detection or to limit expected false positives to 5%. ESpritz performs consistently well on the recent CASP9 data, reaching a S w measure of 54.82 and area under the receiver operator curve of 0.856. The fast predictor is four orders of magnitude faster and remains better than most publicly available CASP9 methods, making it ideal for genomic scale predictions. Conclusions: ESpritz predicts three flavors of disorder at two distinct false positive rates, either with a fast or slower and slightly more accurate approach. Given its state-of-the-art performance, it can be especially useful for high-throughput applications. Availability: Both a web server for high-throughput analysis and a Linux executable version of ESpritz are available from: http://protein.bio.unipd.it/espritz/ Contact: silvio.tosatto@unipd.it Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 21
    Publication Date: 2012-02-17
    Description: Motivation: Clustering protein structures is an important task in structural bioinformatics. De novo structure prediction, for example, often involves a clustering step for finding the best prediction. Other applications include assigning proteins to fold families and analyzing molecular dynamics trajectories. Results: We present Pleiades, a novel approach to clustering protein structures with a rigorous mathematical underpinning. The method approximates clustering based on the root mean square deviation by first mapping structures to Gauss integral vectors—which were introduced by Røgen and co-workers—and subsequently performing K-means clustering. Conclusions: Compared to current methods, Pleiades dramatically improves on the time needed to perform clustering, and can cluster a significantly larger number of structures, while providing state-of-the-art results. The number of low energy structures generated in a typical folding study, which is in the order of 50 000 structures, can be clustered within seconds to minutes. Contact: thamelry@binf.ku.dk ; harder@binf.ku.dk Supplementary Information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 22
    Publication Date: 2012-02-17
    Description: Motivation: Transmembrane β barrel proteins (TMBs) are found in the outer membrane of Gram-negative bacteria, chloroplast and mitochondria. They play a major role in the translocation machinery, pore formation, membrane anchoring and ion exchange. TMBs are also promising targets for antimicrobial drugs and vaccines. Given the difficulty in membrane protein structure determination, computational methods to identify TMBs and predict the topology of TMBs are important. Results: Here, we present BOCTOPUS; an improved method for the topology prediction of TMBs by employing a combination of support vector machines (SVMs) and Hidden Markov Models (HMMs). The SVMs and HMMs account for local and global residue preferences, respectively. Based on a 10-fold cross-validation test, BOCTOPUS performs better than all existing methods, reaching a Q3 accuracy of 87%. Further, BOCTOPUS predicted the correct number of strands for 83% proteins in the dataset. BOCTOPUS might also help in reliable identification of TMBs by using it as an additional filter to methods specialized in this task. Availability: BOCTOPUS is freely available as a web server at: http://boctopus.cbr.su.se/ . The datasets used for training and evaluations are also available from this site. Contact: arne@bioinfo.se Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 23
    Publication Date: 2012-02-17
    Description: Motivation: High-dimensional data such as microarrays have created new challenges to traditional statistical methods. One such example is on class prediction with high-dimension, low-sample size data. Due to the small sample size, the sample mean estimates are usually unreliable. As a consequence, the performance of the class prediction methods using the sample mean may also be unsatisfactory. To obtain more accurate estimation of parameters some statistical methods, such as regularizations through shrinkage, are often desired. Results: In this article, we investigate the family of shrinkage estimators for the mean value under the quadratic loss function. The optimal shrinkage parameter is proposed under the scenario when the sample size is fixed and the dimension is large. We then construct a shrinkage-based diagonal discriminant rule by replacing the sample mean by the proposed shrinkage mean. Finally, we demonstrate via simulation studies and real data analysis that the proposed shrinkage-based rule outperforms its original competitor in a wide range of settings. Contact: tongt@hkbu.edu.hk
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 24
    Publication Date: 2012-02-17
    Description: Motivation: The advent of high-throughput sequencing technologies is revolutionizing our ability in discovering and genotyping DNA copy number variants (CNVs). Read count-based approaches are able to detect CNV regions with an unprecedented resolution. Although this computational strategy has been recently introduced in literature, much work has been already done for the preparation, normalization and analysis of this kind of data. Results: Here we face the many aspects that cover the detection of CNVs by using read count approach. We first study the characteristics and systematic biases of read count distributions, focusing on the normalization methods designed for removing these biases. Subsequently, we compare the algorithms designed to detect the boundaries of CNVs and we investigate the ability of read count data to predict the exact number of DNA copy. Finally, we review the tools publicly available for analysing read count data. To better understand the state of the art of read count approaches, we compare the performance of the three most widely used sequencing technologies (Illumina Genome Analyzer, Roche 454 and Life Technologies SOLiD) in all the analyses that we perform. Contact: albertomagi@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 25
    Publication Date: 2012-02-17
    Description: Motivation: We investigate and quantify the generalizability of the white blood cell (WBC) transcriptome to the general, multiorgan transcriptome. We use data from the NCBI's Gene Expression Omnibus (GEO) public repository to define two datasets for comparison, WBC and OO (Other Organ) sets. Results: Comprehensive pair-wise correlation and expression level profiles are calculated for both datasets (with sizes of 81 and 1463, respectively). We have used mapping and ranking across the Gene Ontology (GO) categories to quantify similarity between the two sets. GO mappings of the most correlated and highly expressed genes from the two datasets tightly match, with the notable exceptions of components of the ribosome, cell adhesion and immune response. That is, 10 877 or 48.8% of all measured genes do not change 〉10% of rank range between WBC and OO; only 878 (3.9%) change rank 〉50%. Two trans -tissue gene lists are defined, the most changing and the least changing genes in expression rank. We also provide a general, quantitative measure of the probability of expression rank and correlation profile in the OO system given the expression rank and correlation profile in the WBC dataset. Contact: vvaltchinov@partners.org Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 26
    Publication Date: 2012-02-17
    Description: Motivation: The understanding of the molecular sources for diseases like cancer can be significantly improved by computational models. Recently, Boolean networks have become very popular for modeling signaling and regulatory networks. However, such models rely on a set of Boolean functions that are in general not known. Unfortunately, while detailed information on the molecular interactions becomes available in large scale through electronic databases, the information on the Boolean functions does not become available simultaneously and has to be included manually into the models, if at all known. Results: We propose a new Boolean approach which can directly utilize the mechanistic network information available through modern databases. The Boolean function is implicitly defined by the reaction mechanisms. Special care has been taken for the treatment of kinetic features like inhibition. The method has been applied to a signaling model combining the Wnt and MAPK pathway. Availability: A sample C++ implementation of the proposed method is available for Linux and compatible systems through http://code.google.com/p/libscopes/wiki/Paper2011 Contact: handorf@physik.hu-berlin.de Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 27
    Publication Date: 2012-02-17
    Description: Motivation: Multiple sequence alignment (MSA) is a core method in bioinformatics. The accuracy of such alignments may influence the success of downstream analyses such as phylogenetic inference, protein structure prediction, and functional prediction. The importance of MSA has lead to the proliferation of MSA methods, with different objective functions and heuristics to search for the optimal MSA. Different methods of inferring MSAs produce different results in all but the most trivial cases. By measuring the differences between inferred alignments, we may be able to develop an understanding of how these differences (i) relate to the objective functions and heuristics used in MSA methods, and (ii) affect downstream analyses. Results: We introduce four metrics to compare MSAs, which include the position in a sequence where a gap occurs or the location on a phylogenetic tree where an insertion or deletion (indel) event occurs. We use both real and synthetic data to explore the information given by these metrics and demonstrate how the different metrics in combination can yield more information about MSA methods and the differences between them. Availability: MetAl is a free software implementation of these metrics in Haskell. Source and binaries for Windows, Linux and Mac OS X are available from http://kumiho.smith.man.ac.uk/whelan/software/metal/ . Contact: simon.whelan@manchester.ac.uk
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 28
    Publication Date: 2012-02-17
    Description: Motivation: Peptide detection is a crucial step in mass spectrometry (MS) based proteomics. Most existing algorithms are based upon greedy isotope template matching and thus may be prone to error propagation and ineffective to detect overlapping peptides. In addition, existing algorithms usually work at different charge states separately, isolating useful information that can be drawn from other charge states, which may lead to poor detection of low abundance peptides. Results: BPDA2d models spectra as a mixture of candidate peptide signals and systematically evaluates all possible combinations of possible peptide candidates to interpret the given spectra. For each candidate, BPDA2d takes into account its elution profile, charge state distribution and isotope pattern, and it combines all evidence to infer the candidate's signal and existence probability. By piecing all evidence together—especially by deriving information across charge states—low abundance peptides can be better identified and peptide detection rates can be improved. Instead of local template matching, BPDA2d performs global optimization for all candidates and systematically optimizes their signals. Since BPDA2d looks for the optimal among all possible interpretations of the given spectra, it has the capability in handling complex spectra where features overlap. BPDA2d estimates the posterior existence probability of detected peptides, which can be directly used for probability-based evaluation in subsequent processing steps. Our experiments indicate that BPDA2d outperforms state-of-the-art detection methods on both simulated data and real liquid chromatography–mass spectrometry data, according to sensitivity and detection accuracy. Availability: The BPDA2d software package is available at http://gsp.tamu.edu/Publications/supplementary/sun11a/ Contact: Michelle.Zhang@utsa.edu ; edward@ece.tamu.edu Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 29
    Publication Date: 2012-02-17
    Description: Motivation: The continued progress in developing technological platforms, availability of many published experimental datasets, as well as different statistical methods to analyze those data have allowed approaching the same research question using various methods simultaneously. To get the best out of all these alternatives, we need to integrate their results in an unbiased manner. Prioritized gene lists are a common result presentation method in genomic data analysis applications. Thus, the rank aggregation methods can become a useful and general solution for the integration task. Results: Standard rank aggregation methods are often ill-suited for biological settings where the gene lists are inherently noisy. As a remedy, we propose a novel robust rank aggregation (RRA) method. Our method detects genes that are ranked consistently better than expected under null hypothesis of uncorrelated inputs and assigns a significance score for each gene. The underlying probabilistic model makes the algorithm parameter free and robust to outliers, noise and errors. Significance scores also provide a rigorous way to keep only the statistically relevant genes in the final list. These properties make our approach robust and compelling for many settings. Availability: All the methods are implemented as a GNU R package R obust R ank A ggreg , freely available at the Comprehensive R Archive Network http://cran.r-project.org/ . Contact: vilo@ut.ee Supplementary information Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 30
    Publication Date: 2012-02-17
    Description: : CLARE is a computational method designed to reveal sequence encryption of tissue-specific regulatory elements. Starting with a set of regulatory elements known to be active in a particular tissue/process, it learns the sequence code of the input set and builds a predictive model from features specific to those elements. The resulting model can then be applied to user-supplied genomic regions to identify novel candidate regulatory elements. CLARE's model also provides a detailed analysis of transcription factors that most likely bind to the elements, making it an invaluable tool for understanding mechanisms of tissue-specific gene regulation. Availability: CLARE is freely accessible at http://clare.dcode.org/ . Contact: taherl@ncbi.nlm.nih.gov ; ovcharen@nih.gov Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 31
    Publication Date: 2012-02-17
    Description: Motivation: We present a pipeline for the pre-processing, quality assessment, read distribution and methylation estimation for methylated DNA immunoprecipitation (MeDIP)-sequence datasets. This is the first MeDIP-seq-specific analytic pipeline that starts at the output of the sequencers. This pipeline will reduce the data analysis load on staff and allows the easy and straightforward analysis of sequencing data for DNA methylation. The pipeline integrates customized scripting and several existing tools, which can deal with both paired and single end data. Availability: The package and extensive documentation, and comparison to public data is available at http://life.tongji.edu.cn/meqa/ Contact: jhuang@cephb.fr
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 32
    Publication Date: 2012-02-17
    Description: Motivation: A plethora of bioinformatics analysis has led to the discovery of numerous gene sets, which can be interpreted as discrete measurements emitted from latent signaling pathways. Their potential to infer signaling pathway structures, however, has not been sufficiently exploited. Existing methods accommodating discrete data do not explicitly consider signal cascading mechanisms that characterize a signaling pathway. Novel computational methods are thus needed to fully utilize gene sets and broaden the scope from focusing only on pairwise interactions to the more general cascading events in the inference of signaling pathway structures. Results: We propose a gene set based simulated annealing (SA) algorithm for the reconstruction of signaling pathway structures. A signaling pathway structure is a directed graph containing up to a few hundred nodes and many overlapping signal cascades, where each cascade represents a chain of molecular interactions from the cell surface to the nucleus. Gene sets in our context refer to discrete sets of genes participating in signal cascades, the basic building blocks of a signaling pathway, with no prior information about gene orderings in the cascades. From a compendium of gene sets related to a pathway, SA aims to search for signal cascades that characterize the optimal signaling pathway structure. In the search process, the extent of overlap among signal cascades is used to measure the optimality of a structure. Throughout, we treat gene sets as random samples from a first-order Markov chain model. We evaluated the performance of SA in three case studies. In the first study conducted on 83 KEGG pathways, SA demonstrated a significantly better performance than Bayesian network methods. Since both SA and Bayesian network methods accommodate discrete data, use a ‘search and score’ network learning strategy and output a directed network, they can be compared in terms of performance and computational time. In the second study, we compared SA and Bayesian network methods using four benchmark datasets from DREAM. In our final study, we showcased two context-specific signaling pathways activated in breast cancer. Availibility: Source codes are available from http://dl.dropbox.com/u/16000775/sa_sc.zip Contact: dzhu@wayne.edu Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 33
    Publication Date: 2012-02-17
    Description: : We provide a Bioconductor package with quality assessment, processing and visualization tools for high-throughput sequencing data, with emphasis in ChIP-seq and RNA-seq studies. It includes detection of outliers and biases, inefficient immuno-precipitation and overamplification artifacts, de novo identification of read-rich genomic regions and visualization of the location and coverage of genomic region lists. Availability: www.bioconductor.org Contact: david.rossell@irbbarcelona.org Supplementary information: Supplementary data available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 34
    Publication Date: 2012-02-17
    Description: Motivation: We study a stochastic method for approximating the set of local minima in partial RNA folding landscapes associated with a bounded-distance neighbourhood of folding conformations. The conformations are limited to RNA secondary structures without pseudoknots. The method aims at exploring partial energy landscapes p L induced by folding simulations and their underlying neighbourhood relations. It combines an approximation of the number of local optima devised by Garnier and Kallel (2002) with a run-time estimation for identifying sets of local optima established by Reeves and Eremeev (2004). Results: The method is tested on nine sequences of length between 50 nt and 400 nt, which allows us to compare the results with data generated by RNAsubopt and subsequent barrier tree calculations. On the nine sequences, the method captures on average 92% of local minima with settings designed for a target of 95%. The run-time of the heuristic can be estimated by O ( n 2 D ln), where n is the sequence length, is the number of local minima in the partial landscape p L under consideration and D is the maximum number of steepest descent steps in attraction basins associated with p L . Contact: a.albrecht@qub.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 35
    Publication Date: 2012-02-17
    Description: Motivation: RNA-seq is a powerful technology for the study of transcriptome profiles that uses deep-sequencing technologies. Moreover, it may be used for cellular phenotyping and help establishing the etiology of diseases characterized by abnormal splicing patterns. In RNA-Seq, the exact nature of splicing events is buried in the reads that span exon–exon boundaries. The accurate and efficient mapping of these reads to the reference genome is a major challenge. Results: We developed PASSion, a pattern growth algorithm-based pipeline for splice site detection in paired-end RNA-Seq reads. Comparing the performance of PASSion to three existing RNA-Seq analysis pipelines, TopHat, MapSplice and HMMSplicer, revealed that PASSion is competitive with these packages. Moreover, the performance of PASSion is not affected by read length and coverage. It performs better than the other three approaches when detecting junctions in highly abundant transcripts. PASSion has the ability to detect junctions that do not have known splicing motifs, which cannot be found by the other tools. Of the two public RNA-Seq datasets, PASSion predicted ~ 137 000 and 173 000 splicing events, of which on average 82 are known junctions annotated in the Ensembl transcript database and 18% are novel. In addition, our package can discover differential and shared splicing patterns among multiple samples. Availability: The code and utilities can be freely downloaded from https://trac.nbic.nl/passion and ftp://ftp.sanger.ac.uk/pub/zn1/passion Contact: y.zhang@lumc.nl ; k.ye@lumc.nl Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 36
    Publication Date: 2012-02-17
    Description: Motivation: The completion of 168 genome sequences from a single population of Drosophila melanogaster provides a global view of genomic variation and an understanding of the evolutionary forces shaping the patterns of DNA polymorphism and divergence along the genome. Results: We present the ‘Population Drosophila Browser’ (PopDrowser), a new genome browser specially designed for the automatic analysis and representation of genetic variation across the D. melanogaster genome sequence. PopDrowser allows estimating and visualizing the values of a number of DNA polymorphism and divergence summary statistics, linkage disequilibrium parameters and several neutrality tests. PopDrowser also allows performing custom analyses on-the-fly using user-selected parameters. Availability: PopDrowser is freely available from http://PopDrowser.uab.cat . Contact: miquel.ramia@uab.cat
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 37
    Publication Date: 2012-02-17
    Description: Motivation: Probabilistic approaches for inferring transcription factor binding sites (TFBSs) and regulatory motifs from DNA sequences have been developed for over two decades. Previous work has shown that prediction accuracy can be significantly improved by incorporating features such as the competition of multiple transcription factors (TFs) for binding to nearby sites, the tendency of TFBSs for co-regulated TFs to cluster and form cis -regulatory modules and explicit evolutionary modeling of conservation of TFBSs across orthologous sequences. However, currently available tools only incorporate some of these features, and significant methodological hurdles hampered their synthesis into a single consistent probabilistic framework. Results: We present MotEvo, a integrated suite of Bayesian probabilistic methods for the prediction of TFBSs and inference of regulatory motifs from multiple alignments of phylogenetically related DNA sequences, which incorporates all features just mentioned. In addition, MotEvo incorporates a novel model for detecting unknown functional elements that are under evolutionary constraint, and a new robust model for treating gain and loss of TFBSs along a phylogeny. Rigorous benchmarking tests on ChIP-seq datasets show that MotEvo's novel features significantly improve the accuracy of TFBS prediction, motif inference and enhancer prediction. Availability: Source code, a user manual and files with several example applications are available at www.swissregulon.unibas.ch . Contact: erik.vannimwegen@unibas.ch Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 38
    Publication Date: 2012-02-17
    Description: : We present LaTcOm, a new web tool, which offers several alternative methods for ‘rare codon cluster’ (RCC) identification from a single and simple graphical user interface. In the current version, three RCC detection schemes are implemented: the recently described %MinMax algorithm and a simplified sliding window approach, along with a novel modification of a linear-time algorithm for the detection of maximally scoring subsequences tailored to the RCC detection problem. Among a number of user tunable parameters, several codon-based scales relevant for RCC detection are available, including tRNA abundance values from Escherichia coli and several codon usage tables from a selection of genomes. Furthermore, useful scale transformations may be performed upon user request (e.g. linear, sigmoid). Users may choose to visualize RCC positions within the submitted sequences either with graphical representations or in textual form for further processing. Availability: LaTcOm is freely available online at the URL http://troodos.biol.ucy.ac.cy/latcom.html . Contact: vprobon@ucy.ac.cy Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 39
    Publication Date: 2012-02-17
    Description: : Many existing databases annotate experimentally characterized single nucleotide polymorphisms (SNPs). Each non-synonymous SNP (nsSNP) changes one amino acid in the gene product (single amino acid substitution;SAAS). This change can either affect protein function or be neutral in that respect. Most polymorphisms lack experimental annotation of their functional impact. Here, we introduce SNPdbe—SNP database of effects, with predictions of computationally annotated functional impacts of SNPs. Database entries represent nsSNPs in dbSNP and 1000 Genomes collection, as well as variants from UniProt and PMD. SAASs come from 〉2600 organisms; ‘human’ being the most prevalent. The impact of each SAAS on protein function is predicted using the SNAP and SIFT algorithms and augmented with experimentally derived function/structure information and disease associations from PMD, OMIM and UniProt. SNPdbe is consistently updated and easily augmented with new sources of information. The database is available as an MySQL dump and via a web front end that allows searches with any combination of organism names, sequences and mutation IDs. Availability: http://www.rostlab.org/services/snpdbe Contact: schaefer@rostlab.org ; snpdbe@rostlab.org
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 40
    Publication Date: 2012-02-17
    Description: : We have implemented in a single package all the features required for extracting, visualizing and manipulating fully conserved positions as well as those with a family-dependent conservation pattern in multiple sequence alignments. The program allows, among other things, to run different methods for extracting these positions, combine the results and visualize them in protein 3D structures and sequence spaces. Availability and implementation: JDet is a multiplatform application written in Java. It is freely available, including the source code, at http://csbg.cnb.csic.es/JDet . The package includes two of our recently developed programs for detecting functional positions in protein alignments ( Xdet and S3Det ), and support for other methods can be added as plug-ins. A help file and a guided tutorial for JDet are also available. Contact: pazos@cnb.csic.es
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 41
    Publication Date: 2012-02-17
    Description: : VarSifter is a graphical software tool for desktop computers that allows investigators of varying computational skills to easily and quickly sort, filter, and sift through sequence variation data. A variety of filters and a custom query framework allow filtering based on any combination of sample and annotation information. By simplifying visualization and analyses of exome-scale sequence variation data, this program will help bring the power and promise of massively-parallel DNA sequencing to a broader group of researchers. Availability and Implementation: VarSifter is written in Java, and is freely available in source and binary versions, along with a User Guide, at http://research.nhgri.nih.gov/software/VarSifter/ . Contact: mullikin@mail.nih.gov Supplementary Information: Additional figures and methods available online at the journal's website.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 42
    Publication Date: 2012-12-28
    Description: Sometimes, authorities are unable to rapidly identify the origin of a tainted product. In such cases, recalls or warnings often apply to all suppliers, even to those that had not contributed to the contamination. Traceability enables more targeted recalls by identifying the product's origin more specifically. In this article, we show how increased traceability protects the reputation of industries by limiting the size of recalls. We show the relationships between traceability and the level of food safety with many identical small farms in a competitive industry and for an industry using collective action to set rules and standards.
    Keywords: D21 - Firm Behavior, Q10 - General, Q18 - Agricultural Policy ; Food Policy
    Print ISSN: 0165-1587
    Electronic ISSN: 1464-3618
    Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition , Economics
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 43
    Publication Date: 2012-12-28
    Description: We investigate the impact of decentralised decision-making on product quality. Comparing a cooperative and an investor-owned firm suggests that members of the cooperative have an incentive to produce too much and to free-ride on quality. Whether or not cooperatives deliver higher quality products depends on the way in which the quality of the final product is determined from the quality levels of the inputs delivered (quality aggregation) as well as the number of members of the cooperative. Empirical evidence on the Austrian wine market suggests that wines produced by cooperatives tend to be of significantly lower quality, ceteris paribus .
    Keywords: D22 - Firm Behavior: Empirical Analysis, D23 - Organizational Behavior ; Transaction Costs ; Property Rights, Q13 - Agricultural Markets and Marketing ; Cooperatives ; Agribusiness
    Print ISSN: 0165-1587
    Electronic ISSN: 1464-3618
    Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition , Economics
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 44
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-12-19
    Print ISSN: 0885-8985
    Electronic ISSN: 1557-959X
    Topics: Electrical Engineering, Measurement and Control Technology , Mechanical Engineering, Materials Science, Production Engineering, Mining and Metallurgy, Traffic Engineering, Precision Mechanics
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 45
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-12-19
    Description: Good behavior for the altitude estimation has been validated in terms of accuracy using instrumented GPS targets, as well as opportunistic ones (coherence of the altitude history with respect to the speed indicator). In most cases, this altitude estimation satisfied our objectives, even though they were quite ambitious objectives. A system such as this one should clearly distinguish the different altitudes of different flight patterns, and the difference between the two altitudes was equal to 330 m. This multistatic DVB-T SFN tracker is exploiting detections based on a 2D planar DVB-T receiving array: the parameters estimated are the bistatic range and Doppler, as well as the azimuth and elevation measurements. Some opportunistic targets have also been analyzed for validation of the 3D localization capacities, especially for low speed targets at very low altitude.
    Print ISSN: 0885-8985
    Electronic ISSN: 1557-959X
    Topics: Electrical Engineering, Measurement and Control Technology , Mechanical Engineering, Materials Science, Production Engineering, Mining and Metallurgy, Traffic Engineering, Precision Mechanics
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 46
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-12-19
    Description: A novel method enabling single-site localization of wireless emitters in a rich multipath environment is presented. The localization is based on a novel fingerprinting technique exploiting the spatial-temporal characteristics of the multipath signals received by the base station antenna array. The fingerprint is based on a lower dimensional signal subspace of the spatial-temporal covariance matrix, capturing the dominant multipath signals. The performance is validated with both simulated and real data, demonstrating localization accuracy of about 1 m in typical indoor environments.
    Print ISSN: 1053-587X
    Electronic ISSN: 1941-0476
    Topics: Electrical Engineering, Measurement and Control Technology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 47
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-12-19
    Description: A quasi-optical interferometric technique capable of measuring antenna phase patterns without the need for a heterodyne receiver is presented. It is particularly suited to the characterization of terahertz antennas feeding power detectors or mixers employing quasi-optical local oscillator injection. Examples of recorded antenna phase patterns at frequencies of 1.4 and 2.5 THz using homodyne detectors are presented. To our knowledge, these are the highest frequency antenna phase patterns ever recovered. Knowledge of both the amplitude and phase patterns in the far field enable a Gauss–Hermite or Gauss–Laguerre beam-mode analysis to be carried out for the antenna, of importance in performance optimization calculations, such as antenna gain and beam efficiency parameters at the design and prototype stage of antenna development. A full description of the beam would also be required if the antenna is to be used to feed a quasi-optical system in the near-field to far-field transition region. This situation could often arise when the device is fitted directly at the back of telescopes in flying observatories. A further benefit of the proposed technique is simplicity for characterizing systems in situ, an advantage of considerable importance as in many situations, the components may not be removable for further characterization once assembled. The proposed methodology is generic and should be useful across the wider sensing community, e.g., in single detector acoustic imaging or in adaptive imaging array applications. Furthermore, it is applicable across other frequencies of the EM spectrum, provided adequate spatial and temporal phase stability of the source can be maintained throughout the measurement process. Phase information retrieval is also of importance to emergent research areas, such as band-gap structure characterization, meta-materials research, electromagnetic cloaking, slow light, super-lens design as well as near-field and virtual imaging app- ications.
    Print ISSN: 1530-437X
    Electronic ISSN: 1558-1748
    Topics: Electrical Engineering, Measurement and Control Technology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 48
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-12-19
    Description: Direction of arrival (DOA) estimation is a classical problem in signal processing with many practical applications. Its research has recently been advanced owing to the development of methods based on sparse signal reconstruction. While these methods have shown advantages over conventional ones, there are still difficulties in practical situations where true DOAs are not on the discretized sampling grid. To deal with such an off-grid DOA estimation problem, this paper studies an off-grid model that takes into account effects of the off-grid DOAs and has a smaller modeling error. An iterative algorithm is developed based on the off-grid model from a Bayesian perspective while joint sparsity among different snapshots is exploited by assuming a Laplace prior for signals at all snapshots. The new approach applies to both single snapshot and multi-snapshot cases. Numerical simulations show that the proposed algorithm has improved accuracy in terms of mean squared estimation error. The algorithm can maintain high estimation accuracy even under a very coarse sampling grid.
    Print ISSN: 1053-587X
    Electronic ISSN: 1941-0476
    Topics: Electrical Engineering, Measurement and Control Technology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 49
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-12-19
    Description: This paper investigates reliable and covert transmission strategies in a multiple-input multiple-output (MIMO) wiretap channel with a transmitter, receiver and an adversarial wiretapper, each equipped with multiple antennas. In a departure from existing work, the wiretapper possesses a novel capability to act either as a passive eavesdropper or as an active jammer, under a half-duplex constraint. The transmitter therefore faces a choice between allocating all of its power for data, or broadcasting artificial interference along with the information signal in an attempt to jam the eavesdropper (assuming its instantaneous channel state is unknown). To examine the resulting trade-offs for the legitimate transmitter and the adversary, we model their interactions as a two-person zero-sum game with the ergodic MIMO secrecy rate as the payoff function. We first examine conditions for the existence of pure-strategy Nash equilibria (NE) and the structure of mixed-strategy NE for the strategic form of the game. We then derive equilibrium strategies for the extensive form of the game where players move sequentially under scenarios of perfect and imperfect information. Finally, numerical simulations are presented to examine the equilibrium outcomes of the various scenarios considered.
    Print ISSN: 1053-587X
    Electronic ISSN: 1941-0476
    Topics: Electrical Engineering, Measurement and Control Technology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 50
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-12-19
    Description: We propose a sparsity-based space-time adaptive processing (STAP) algorithm to detect a slowly-moving target using an orthogonal frequency division multiplexing (OFDM) radar. We observe that the target and interference spectra are inherently sparse in the spatio-temporal domain. Hence, we exploit that sparsity to develop an efficient STAP technique that utilizes considerably lesser number of secondary data and produces an equivalent performance as the other existing STAP techniques. In addition, the use of an OFDM signal increases the frequency diversity of our system, as different scattering centers of a target resonate at different frequencies, and thus improves the target detectability. First, we formulate a realistic sparse-measurement model for an OFDM radar considering both the clutter and jammer as the interfering sources. Then, we apply a residual sparse-recovery technique based on the LASSO estimator to estimate the target and interference covariance matrices, and subsequently compute the optimal STAP-filter weights. Our numerical results demonstrate a comparative performance analysis of the proposed sparse-STAP algorithm with four other existing STAP methods. Furthermore, we discover that the OFDM-STAP filter-weights are adaptable to the frequency-variabilities of the target and interference responses, in addition to the spatio-temporal variabilities. Hence, by better utilizing the frequency variabilities, we propose an adaptive OFDM-waveform design technique, and consequently gain a significant amount of STAP-performance improvement.
    Print ISSN: 1053-587X
    Electronic ISSN: 1941-0476
    Topics: Electrical Engineering, Measurement and Control Technology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 51
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-12-20
    Description: A narrowband mid-infrared (mid-IR) guided-mode resonance filter (GMRF) is designed and fabricated using a Hafnium Dioxide film/quartz wafer material system. The fabricated GMRF is then integrated into an erbium (Er)-doped Zr-Ba-La-Al-Na (ZBLAN) fluoride glass fiber laser as a wavelength selective feedback element. The laser operated at 2782 nm with a line-width less than 2 nm demonstrates the viability of GMRFs for wavelength selection in the mid-IR.
    Print ISSN: 1041-1135
    Electronic ISSN: 1941-0174
    Topics: Electrical Engineering, Measurement and Control Technology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 52
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-12-20
    Description: We propose a mutually injection-locked tunable (MILT) laser for L-band generation based on arrayed waveguide grating and gain-saturated reflective semiconductor optical amplifiers. The laser generates a 3-dB bandwidth less than 0.05 nm and a high and uniform sidemode suppression ratio $({rm SMSR}>52~{rm dB},~Delta{rm SMSR}>3~{rm dB})$ over 40 L-band wavelengths. Using an MILT laser greatly reduced the phase noise, which is measured to be ${-}{rm 107.83}~{rm dBc}/{rm Hz}$ at a frequency of 10-kHz offset from the center. Furthermore, the 1.25-Gb/s directly modulated transmission performance of the MILT laser shows that there is a power penalty of ${sim}{rm 0.2}~{rm dB}$ over a 20-km single mode fiber.
    Print ISSN: 1041-1135
    Electronic ISSN: 1941-0174
    Topics: Electrical Engineering, Measurement and Control Technology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 53
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-12-20
    Print ISSN: 1053-5888
    Electronic ISSN: 1558-0792
    Topics: Electrical Engineering, Measurement and Control Technology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 54
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-12-20
    Description: Remember James Bond and the Sony Ericsson cell phone he used to drive a BMW 750 iL in the 2008 Bond movie, Tomorrow Never Dies? Your smartphone and your car (and very likely you) aren't quite ready for that kind of driving experience, but the technology and automakers are getting closer to that level of capability
    Print ISSN: 1053-5888
    Electronic ISSN: 1558-0792
    Topics: Electrical Engineering, Measurement and Control Technology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 55
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-12-20
    Description: IEEE Signal Processing Magazine (SPM) is a tutorial-style magazine that brings to the signal processing community bimonthly survey articles of the most recent advances in signal processing theory and applications. For this issue, the Editor-in-Chief invited Andrea Cavallaro and Andres Kwasinski, the area editors for columns and forum, to write a brief on their editorial work.
    Print ISSN: 1053-5888
    Electronic ISSN: 1558-0792
    Topics: Electrical Engineering, Measurement and Control Technology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 56
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-12-20
    Description: Cooperation is known as an effective strategy in nature to achieve individual or common goals and is a key element in evolution for optimization, as Darwin pointed out in his famous work. Cooperation among weaker individuals (such as hyenas) has the potential to outperform stronger individuals (cheetahs) agnostic to the principle of cooperation. The concept of cooperation has been applied to mobile communication networks with the first appearance of so-called ad hoc networks or meshed networks. But this kind of cooperation is a special form of cooperation, later referred to as forced cooperation, controlled by an overlay unit and not built up by selfish individuals. The research field of social mobile clouds is looking into the possibility to increase the potential of cooperation among selfishly driven users by introducing novel technologies such as network coding as well as socially driven incentives introduced by several well known social networks.
    Print ISSN: 1053-5888
    Electronic ISSN: 1558-0792
    Topics: Electrical Engineering, Measurement and Control Technology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 57
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-12-20
    Description: High Efficiency Video Coding (HEVC) is a new video compression standard developed jointly by ITU-T Video Coding Experts Group (VCEG) and ISO/IEC Moving Pictures Expert Group (MPEG) through their Joint Collaborative Team on Video Coding (JCT-VC). The first version of HEVC will be finalized by the JCT-VC in January 2013. The HEVC project was launched to achieve major savings—e.g., reduction by about half for 1,280 # 720 high-definition (HD) and higher-resolution progressives can video—for equivalent visual quality relative to the bit rate needed by the widely used H.264/ MPEG-4 Advanced Video Coding (AVC) standard. For high resolution video where such additional compression is most urgently required, implementations of the current draft standard are already meeting or exceeding the targeted goal. We review the architecture and building blocks of HEVC, which were carefully selected with regard to compression capability versus complexity and to enable parallelism for the signal processing operations. Given the benefits that HEVC provides, it is likely to become the new primary reference for video compression.
    Print ISSN: 1053-5888
    Electronic ISSN: 1558-0792
    Topics: Electrical Engineering, Measurement and Control Technology
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 58
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-12-20
    Description: A central biological process in all living organisms is gene translation. Developing a deeper understanding of this complex process may have ramifications to almost every biomedical discipline. Reuveni et al. recently proposed a new computational model of gene translation called the Ribosome Flow Model (RFM). In this paper, we consider a particular case of this model, called the Homogeneous Ribosome Flow Model (HRFM). From a biological viewpoint, this corresponds to the case where the transition rates of all the coding sequence codons are identical. This regime has been suggested recently based on experiments in mouse embryonic cells. We consider the steady-state distribution of the HRFM. We provide formulas that relate the different parameters of the model in steady state. We prove the following properties: 1) the ribosomal density profile is monotonically decreasing along the coding sequence; 2) the ribosomal density at each codon monotonically increases with the initiation rate; and 3) for a constant initiation rate, the translation rate monotonically decreases with the length of the coding sequence. In addition, we analyze the translation rate of the HRFM at the limit of very high and very low initiation rate, and provide explicit formulas for the translation rate in these two cases. We discuss the relationship between these theoretical results and biological findings on the translation process.
    Print ISSN: 1545-5963
    Electronic ISSN: 1557-9964
    Topics: Biology , Computer Science
    Published by Institute of Electrical and Electronics Engineers (IEEE) on behalf of The IEEE Computational Intelligence Society ; The IEEE Computer Society ; The IEEE Control Systems Society ; The IEEE Engineering in Medicine and Biology Society ; The Association for Computing Machinery.
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 59
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-12-20
    Description: Ranking genes in functional networks according to a specific biological function is a challenging task raising relevant performance and computational complexity problems. To cope with both these problems we developed a transductive gene ranking method based on kernelized score functions able to fully exploit the topology and the graph structure of biomolecular networks and to capture significant functional relationships between genes. We run the method on a network constructed by integrating multiple biomolecular data sources in the yeast model organism, achieving significantly better results than the compared state-of-the-art network-based algorithms for gene function prediction, and with relevant savings in computational time. The proposed approach is general and fast enough to be in perspective applied to other relevant node ranking problems in large and complex biological networks.
    Print ISSN: 1545-5963
    Electronic ISSN: 1557-9964
    Topics: Biology , Computer Science
    Published by Institute of Electrical and Electronics Engineers (IEEE) on behalf of The IEEE Computational Intelligence Society ; The IEEE Computer Society ; The IEEE Control Systems Society ; The IEEE Engineering in Medicine and Biology Society ; The Association for Computing Machinery.
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 60
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-12-20
    Description: An important objective of modeling biological phenomena is to develop therapeutic intervention strategies to move an undesirable state of a diseased network toward a more desirable one. Such transitions can be achieved by the use of drugs to act on some genes/metabolites that affect the undesirable behavior. Due to the fact that biological phenomena are complex processes with nonlinear dynamics that are impossible to perfectly represent with a mathematical model, the need for model-free nonlinear intervention strategies that are capable of guiding the target variables to their desired values often arises. In many applications, fuzzy systems have been found to be very useful for parameter estimation, model development and control design of nonlinear processes. In this paper, a model-free fuzzy intervention strategy (that does not require a mathematical model of the biological phenomenon) is proposed to guide the target variables of biological systems to their desired values. The proposed fuzzy intervention strategy is applied to three different biological models: a glycolytic-glycogenolytic pathway model, a purine metabolism pathway model, and a generic pathway model. The simulation results for all models demonstrate the effectiveness of the proposed scheme.
    Print ISSN: 1545-5963
    Electronic ISSN: 1557-9964
    Topics: Biology , Computer Science
    Published by Institute of Electrical and Electronics Engineers (IEEE) on behalf of The IEEE Computational Intelligence Society ; The IEEE Computer Society ; The IEEE Control Systems Society ; The IEEE Engineering in Medicine and Biology Society ; The Association for Computing Machinery.
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 61
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-12-20
    Description: Advertisement: This publication offers open access options for authors. IEEE open access Publishing.
    Print ISSN: 1545-5963
    Electronic ISSN: 1557-9964
    Topics: Biology , Computer Science
    Published by Institute of Electrical and Electronics Engineers (IEEE) on behalf of The IEEE Computational Intelligence Society ; The IEEE Computer Society ; The IEEE Control Systems Society ; The IEEE Engineering in Medicine and Biology Society ; The Association for Computing Machinery.
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 62
    facet.materialart.
    Unknown
    Institute of Electrical and Electronics Engineers (IEEE)
    Publication Date: 2012-12-20
    Description: Advertisement: The ACM Special Interest Group on Bioinformatics, Computational Biology, and Biomedical Informatics (SIGBioinformatics) bridges computer science, mathematics, statistics with biology and biomedicine. The mission of the ACM SIGBioinformatics is to improve our ability to develop advanced research, training, and outreach in Bioinformatics, Computational Biology, and Biomedical Informatics by stimulating interactions among researchers, educators and practitioners from related multi-disciplinary fields. The Association for Computing Machinery (ACM) is an educational and scientific computing society working to advance computing as a science and a profession. Benefits include subscriptions to Communications of the ACM, MemberNet, TechNews and CareerNews, full access to online courses and books, discounts on conferences and the option to subscribe to the ACM Digital Library.
    Print ISSN: 1545-5963
    Electronic ISSN: 1557-9964
    Topics: Biology , Computer Science
    Published by Institute of Electrical and Electronics Engineers (IEEE) on behalf of The IEEE Computational Intelligence Society ; The IEEE Computer Society ; The IEEE Control Systems Society ; The IEEE Engineering in Medicine and Biology Society ; The Association for Computing Machinery.
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 63
    Publication Date: 2012-12-21
    Description: Motivation: Given the current costs of next-generation sequencing, large studies carry out low-coverage sequencing followed by application of methods that leverage linkage disequilibrium to infer genotypes. We propose a novel method that assumes study samples are sequenced at low coverage and genotyped on a genome-wide microarray, as in the 1000 Genomes Project (1KGP). We assume polymorphic sites have been detected from the sequencing data and that genotype likelihoods are available at these sites. We also assume that the microarray genotypes have been phased to construct a haplotype scaffold. We then phase each polymorphic site using an MCMC algorithm that iteratively updates the unobserved alleles based on the genotype likelihoods at that site and local haplotype information. We use a multivariate normal model to capture both allele frequency and linkage disequilibrium information around each site. When sequencing data are available from trios, Mendelian transmission constraints are easily accommodated into the updates. The method is highly parallelizable, as it analyses one position at a time. Results: We illustrate the performance of the method compared with other methods using data from Phase 1 of the 1KGP in terms of genotype accuracy, phasing accuracy and downstream imputation performance. We show that the haplotype panel we infer in African samples, which was based on a trio-phased scaffold, increases downstream imputation accuracy for rare variants (R2 increases by 〉0.05 for minor allele frequency 〈1%), and this will translate into a boost in power to detect associations. These results highlight the value of incorporating microarray genotypes when calling variants from next-generation sequence data. Availability: The method (called MVNcall) is implemented in a C++ program and is available from http://www.stats.ox.ac.uk/~marchini/#software . Contact: marchini@stats.ox.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 64
    Publication Date: 2012-12-21
    Description: Motivation: Accurate alignment of high-throughput RNA-seq data is a challenging and yet unsolved problem because of the non-contiguous transcript structure, relatively short read lengths and constantly increasing throughput of the sequencing technologies. Currently available RNA-seq aligners suffer from high mapping error rates, low mapping speed, read length limitation and mapping biases. Results: To align our large (〉80 billon reads) ENCODE Transcriptome RNA-seq dataset, we developed the Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed clustering and stitching procedure. STAR outperforms other aligners by a factor of 〉50 in mapping speed, aligning to the human genome 550 million 2 x 76 bp paired-end reads per hour on a modest 12-core server, while at the same time improving alignment sensitivity and precision. In addition to unbiased de novo detection of canonical junctions, STAR can discover non-canonical splices and chimeric (fusion) transcripts, and is also capable of mapping full-length RNA sequences. Using Roche 454 sequencing of reverse transcription polymerase chain reaction amplicons, we experimentally validated 1960 novel intergenic splice junctions with an 80–90% success rate, corroborating the high precision of the STAR mapping strategy. Availability and implementation: STAR is implemented as a standalone C++ code. STAR is free open source software distributed under GPLv3 license and can be downloaded from http://code.google.com/p/rna-star/ . Contact: dobin@cshl.edu .
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 65
    Publication Date: 2012-12-21
    Description: Motivation: Local motifs are patterns of DNA or protein sequences that occur within a sequence interval relative to a biologically defined anchor or landmark. Current protein motif discovery methods do not adequately consider such constraints to identify biologically significant motifs that are only weakly over-represented but spatially confined. Using negatives, i.e. sequences known to not contain a local motif, can further increase the specificity of their discovery. Results: This article introduces the method DLocalMotif that makes use of positional information and negative data for local motif discovery in protein sequences. DLocalMotif combines three scoring functions, measuring degrees of motif over-representation, entropy and spatial confinement, specifically designed to discriminatively exploit the availability of negative data. The method is shown to outperform current methods that use only a subset of these motif characteristics. We apply the method to several biological datasets. The analysis of peroxisomal targeting signals uncovers several novel motifs that occur immediately upstream of the dominant peroxisomal targeting signal-1 signal. The analysis of proline-tyrosine nuclear localization signals uncovers multiple novel motifs that overlap with C2H2 zinc finger domains. We also evaluate the method on classical nuclear localization signals and endoplasmic reticulum retention signals and find that DLocalMotif successfully recovers biologically relevant sequence properties. Availability: http://bioinf.scmb.uq.edu.au/dlocalmotif/ Contact: m.boden@uq.edu.au Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 66
    Publication Date: 2012-12-21
    Description: Motivation: Protein–protein interaction (PPI) plays an important role in understanding gene functions, and many computational PPI prediction methods have been proposed in recent years. Despite the extensive efforts, PPI prediction still has much room to improve. Sequence-based co-evolution methods include the substitution rate method and the mirror tree method, which compare sequence substitution rates and topological similarity of phylogenetic trees, respectively. Although they have been used to predict PPI in species with small genomes like Escherichia coli , such methods have not been tested in large scale proteome like Homo sapiens . Result: In this study, we propose a novel sequence-based co-evolution method, co-evolutionary divergence (CD), for human PPI prediction. Built on the basic assumption that protein pairs with similar substitution rates are likely to interact with each other, the CD method converts the evolutionary information from 14 species of vertebrates into likelihood ratios and combined them together to infer PPI. We showed that the CD method outperformed the mirror tree method in three independent human PPI datasets by a large margin. With the arrival of more species genome information generated by next generation sequencing, the performance of the CD method can be further improved. Availability: Source code and support are available at http://mib.stat.sinica.edu.tw/LAP/tmp/CD.rar . Contact: syuan@stat.sinica.edu.tw Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 67
    Publication Date: 2012-12-21
    Description: : In higher eukaryotes, the identification of translation initiation sites (TISs) has been focused on finding these signals in cDNA or mRNA sequences. Using Arabidopsis thaliana ( A.t. ) information, we developed a prediction tool for signals within genomic sequences of plants that correspond to TISs. Our tool requires only genome sequence, not expressed sequences. Its sensitivity/specificity is for A.t. (90.75%/92.2%), for Vitis vinifera (66.8%/94.4%) and for Populus trichocarpa (81.6%/94.4%), which suggests that our tool can be used in annotation of different plant genomes. We provide a list of features used in our model. Further study of these features may improve our understanding of mechanisms of the translation initiation. Availability and implementation: Our tool is implemented as an artificial neural network. It is available as a web-based tool and, together with the source code, the list of features, and data used for model development, is accessible at http://cbrc.kaust.edu.sa/dts . Contact: vladimir.bajic@kaust.edu.sa Supplementary information : Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 68
    facet.materialart.
    Unknown
    Oxford University Press
    Publication Date: 2012-12-21
    Description: Motivation: Pathway or gene set analysis has been widely applied to genomic data. Many current pathway testing methods use univariate test statistics calculated from individual genomic markers, which ignores the correlations and interactions between candidate markers. Random forests-based pathway analysis is a promising approach for incorporating complex correlation and interaction patterns, but one limitation of previous approaches is that pathways have been considered separately, thus pathway cross-talk information was not considered. Results: In this article, we develop a new pathway hunting algorithm for survival outcomes using random survival forests, which prioritize important pathways by accounting for gene correlation and genomic interactions. We show that the proposed method performs favourably compared with five popular pathway testing methods using both synthetic and real data. We find that the proposed methodology provides an efficient and powerful pathway modelling framework for high-dimensional genomic data. Availability: The R code for the analysis used in this article is available upon request. Contact: xi.steven.chen@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 69
    Publication Date: 2012-12-21
    Description: Motivation: PacBio sequencers produce two types of characteristic reads (continuous long reads: long and high error rate and circular consensus sequencing: short and low error rate), both of which could be useful for de novo assembly of genomes. Currently, there is no available simulator that targets the specific generation of PacBio libraries. Results: Our analysis of 13 PacBio datasets showed characteristic features of PacBio reads (e.g. the read length of PacBio reads follows a log-normal distribution). We have developed a read simulator, PBSIM, that captures these features using either a model-based or sampling-based method. Using PBSIM, we conducted several hybrid error correction and assembly tests for PacBio reads, suggesting that a continuous long reads coverage depth of at least 15 in combination with a circular consensus sequencing coverage depth of at least 30 achieved extensive assembly results. Availability: PBSIM is freely available from the web under the GNU GPL v2 license ( http://code.google.com/p/pbsim/ ). Contact: mhamada@k.u-tokyo.ac.jp Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 70
    Publication Date: 2012-12-21
    Description: : Drugster is a fully interactive pipeline designed to break the command line barrier and introduce a new user-friendly environment to perform drug design, lead and structure optimization experiments through an efficient combination of the PDB2PQR, Ligbuilder, Gromacs and Dock suites. Our platform features a novel workflow that guides the user through each logical step of the iterative 3D structural optimization setup and drug design process, by providing a seamless interface to all incorporated packages. Availability: Drugster can be freely downloaded via our dedicated server system at http://www.bioacademy.gr/bioinformatics/drugster/ . Contact: dvlachakis@bioacademy.gr .
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 71
    Publication Date: 2012-12-21
    Description: XiP (eXtensible integrative Pipeline) is a flexible, editable and modular environment with a user-friendly interface that does not require previous advanced programming skills to run, construct and edit workflows. XiP allows the construction of workflows by linking components written in both R and Java, the analysis of high-throughput data in grid engine systems and also the development of customized pipelines that can be encapsulated in a package and distributed. XiP already comes with several ready-to-use pipeline flows for the most common genomic and transcriptomic analysis and ~300 computational components. Availability: XiP is open source, freely available under the Lesser General Public License (LGPL) and can be downloaded from http://xip.hgc.jp . Contact: nagasaki@megabank.tohoku.ac.jp
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 72
    Publication Date: 2012-12-21
    Description: Existing repositories for experimental datasets typically capture snapshots of data acquired using a single experimental technique and often require manual population and continual curation. We present a storage system for heterogeneous research data that performs dynamic automated indexing to provide powerful search, discovery and collaboration features without the restrictions of a structured repository. ADAM is able to index many commonly used file formats generated by laboratory assays and therefore offers specific advantages to the experimental biology community. However, it is not domain specific and can promote sharing and re-use of working data across scientific disciplines. Availability and implementation: ADAM is implemented using Java and supported on Linux. It is open source under the GNU General Public License v3.0. Installation instructions, binary code, a demo system and virtual machine image and are available at http://www.imperial.ac.uk/bioinfsupport/resources/software/adam . Contact: m.woodbridge@imperial.ac.uk
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 73
    Publication Date: 2012-12-21
    Description: : Drug versus Disease (DvD) provides a pipeline, available through R or Cytoscape, for the comparison of drug and disease gene expression profiles from public microarray repositories. Negatively correlated profiles can be used to generate hypotheses of drug-repurposing, whereas positively correlated profiles may be used to infer side effects of drugs. DvD allows users to compare drug and disease signatures with dynamic access to databases Array Express, Gene Expression Omnibus and data from the Connectivity Map. Availability and implementation: R package (submitted to Bioconductor) under GPL 3 and Cytoscape plug-in freely available for download at www.ebi.ac.uk/saezrodriguez/DVD/ . Contact: saezrodriguez@ebi.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 74
    Publication Date: 2012-11-09
    Print ISSN: 0829-318X
    Electronic ISSN: 1758-4469
    Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 75
    Publication Date: 2012-11-09
    Description: Norway maple ( Acer platanoides L), which is among the most invasive tree species in forests of eastern North America, is associated with reduced regeneration of the related native species, sugar maple ( Acer saccharum Marsh) and other native flora. To identify traits conferring an advantage to Norway maple, we grew both species through an entire growing season under simulated light regimes mimicking a closed forest understorey vs. a canopy disturbance (gap). Dynamic shade-houses providing a succession of high-intensity direct-light events between longer periods of low, diffuse light were used to simulate the light regimes. We assessed seedling height growth three times in the season, as well as stem diameter, maximum photosynthetic capacity, biomass allocation above- and below-ground, seasonal phenology and phenotypic plasticity. Given the north European provenance of Norway maple, we also investigated the possibility that its growth in North America might be increased by delayed fall senescence. We found that Norway maple had significantly greater photosynthetic capacity in both light regimes and grew larger in stem diameter than sugar maple. The differences in below- and above-ground biomass, stem diameter, height and maximum photosynthesis were especially important in the simulated gap where Norway maple continued extension growth during the late fall. In the gap regime sugar maple had a significantly higher root : shoot ratio that could confer an advantage in the deepest shade of closed understorey and under water stress or browsing pressure. Norway maple is especially invasive following canopy disturbance where the opposite (low root : shoot ratio) could confer a competitive advantage. Considering the effects of global change in extending the potential growing season, we anticipate that the invasiveness of Norway maple will increase in the future.
    Print ISSN: 0829-318X
    Electronic ISSN: 1758-4469
    Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 76
    Publication Date: 2012-11-09
    Description: Fungal colonization can significantly affect the secondary metabolism of the host plants. We tested the impact of a common below-ground symbiosis, i.e., ectomycorrhiza formation, on poplar leaf chemical components that are involved in the defence against a common disease, i.e., rust fungi, in N-deficient soil. A rust-susceptible poplar clone ( Populus trichocarpa x deltoides ‘Beaupré’) was (a) non-associated with ectomycorrhizal fungus (EM) Hebeloma mesophaeum (Pers.) Quélet MÜN and non-infected with rust fungus Melampsora larici-populina Kleb. (isolate 98AG31), (b) associated with EM, (c) inoculated with rust fungus and (d) associated with EM and inoculated with rust fungus. Poplar leaves were analysed by photometric and mass spectrometric techniques (liquid chromatography–tandem mass spectrometry (LC-MS/MS), pyrolysis–field ionization mass spectrometry (Py-FIMS)). Both rust infection and mycorrhiza formation led to increased proportions of condensed tannins in relation to total phenolics (13% in the control, 18–19% in the fungal treatments). In contrast, salicylic acid concentration (6.8 µg g –1 in the control) was higher only in the rust treatments (17.9 and 25.4 µg g –1 with rust infection). The Py-FIMS analysis revealed that the rust-infected treatments were significantly separated from the non-rust-infected treatments on the basis of six flavonoids and one lipid. The relative abundance of these components, which have known functions in plant defence, was decreased after rust infection of non-mycorrhizal plants, but not in mycorrhizal plants. The results indicate that the ectomycorrhizal formation compensated the rust infection by a decrease in the flavonoid syntheses. The study provides new evidence for an interactive response of mycorrhizal colonization and infection with rust fungi in the metabolism of poplar.
    Print ISSN: 0829-318X
    Electronic ISSN: 1758-4469
    Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 77
    Publication Date: 2012-11-09
    Description: Apple proliferation (AP) represents a serious threat to several fruit-growing areas and is responsible for great economic losses. Several studies have highlighted the key role played by the cell wall in response to pathogen attack. The existence of a cell wall integrity signaling pathway which senses perturbations in the cell wall architecture upon abiotic/biotic stresses and activates specific defence responses has been widely demonstrated in plants. More recently a role played by cell wall-related genes has also been reported in plants infected by phytoplasmas. With the aim of shedding light on the cell wall response to AP disease in the economically relevant fruit-tree Malus x domestica Borkh., we investigated the expression of the cellulose ( CesA ) and callose synthase ( CalS ) genes in different organs (i.e., leaves, roots and branch phloem) of healthy and infected symptomatic outdoor-grown trees, sampled over the course of two time points (i.e., spring and autumn 2011), as well as in in vitro micropropagated control and infected plantlets. A strong up-regulation in the expression of cell wall biosynthetic genes was recorded in roots from infected trees. Secondary cell wall CesA s showed up-regulation in the phloem tissue from branches of infected plants, while either a down-regulation of some genes or no major changes were observed in the leaves. Micropropagated plantlets also showed an increase in cell wall-related genes and constitute a useful system for a general assessment of gene expression analysis upon phytoplasma infection. Finally, we also report the presence of several ‘knot’-like structures along the roots of infected apple trees and discuss the occurrence of this interesting phenotype in relation to the gene expression results and the modalities of phytoplasma diffusion.
    Print ISSN: 0829-318X
    Electronic ISSN: 1758-4469
    Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 78
    Publication Date: 2012-11-09
    Description: The availability of a system for direct transfer of antifungal candidate genes into European chestnut ( Castanea sativa Mill.) would offer an alternative approach to conventional breeding for production of chestnut trees tolerant to ink disease caused by Phytophthora spp. For the first time, a chestnut thaumatin-like protein gene ( CsTL1 ), isolated from chestnut cotyledons, has been overexpressed in three chestnut somatic embryogenic lines. Transformation experiments have been performed using an Agrobacterium tumefaciens Smith and Townsend vector harboring the neomycin phosphotransferase ( NPTII ) selectable and the green fluorescent protein ( EGFP ) reporter genes. The transformation efficiency, determined on the basis of the fluorescence of surviving explants, was clearly genotype dependent and ranged from 32.5% in the CI-9 line to 7.1% in the CI-3 line. A total of 126 independent transformed lines were obtained. The presence and integration of chestnut CsTL1 in genomic DNA was confirmed by polymerase chain reaction (PCR) and Southern blot analyses. Quantitative real-time PCR revealed that CsTL1 expression was up to 13.5-fold higher in a transgenic line compared with its corresponding untransformed line. In only one of the 11 transformed lines tested, expression of the CsTL1 was lower than the control. The remaining 115 transformed lines were successfully subjected to cryopreservation. Embryo proliferation was achieved in all of the transgenic lines regenerated and the transformed lines showed a higher mean number of cotyledonary stage embryos and total number of embryos per embryo clump than their corresponding untransformed lines. Transgenic plants were regenerated after maturation and germination of transformed somatic embryos. Furthermore, due to the low plantlet conversion achieved, axillary shoot proliferation cultures were established from partially germinated embryos (only shoot development), which were multiplied and rooted according to procedures already established. Transgenic plants were acclimatized and grown in a greenhouse. No phenotypic differences were found with control plants, suggesting no potential cytotoxic effects of the green fluorescent protein. The results reported in the present work could be considered as a first step toward the production of fungal-disease tolerant cisgenic chestnut plants.
    Print ISSN: 0829-318X
    Electronic ISSN: 1758-4469
    Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 79
    Publication Date: 2012-11-09
    Description: Studying the dynamics of stem water content (   ) in living trees has an outstanding physiological interest but all the available techniques to measure exhibit major drawbacks. In this work, we present a new methodology to estimate variations in along with sap velocity using the compensated heat pulse (CHP) technique. One lab experiment was performed on several wooden blocks obtained from three different tree species. Samples were slowly dried and their moisture loss was monitored by both gravimetric approaches and time-domain reflectometry (TDR) or CHP probes in order to contrast the validity of our methodology (volumetric specific heat (VSH)-CHP) over a range of water contents. In addition, a field experiment was conducted to monitor fluctuations in standing olive trees ( Olea europaea L. cv. ‘Arbequina’) growing under three different irrigation regimes. In the lab test, the actual values deduced gravimetrically differed from the estimates yielded by the VSH-CHP method. However, it could successfully track relative changes in the water stored for the range of expected in living wood. Furthermore, the field experiment showed a seasonal change in , which was similar in shape and magnitude to those reported in the literature for olive and other Mediterranean tree species. On the other hand, differences in the seasonal patterns of between irrigation treatments strongly corresponded with those of sap flow and some leaf water potential measurements. The results of this work suggest that the CHP technique could be employed to monitor the dynamics of both and sap flow simultaneously in standing trees and evidence that seasonal changes in might be used as a long-term water status indicator.
    Print ISSN: 0829-318X
    Electronic ISSN: 1758-4469
    Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 80
    Publication Date: 2012-11-11
    Description: Motivation: RNA-Seq uses the high-throughput sequencing technology to identify and quantify transcriptome at an unprecedented high resolution and low cost. However, RNA-Seq reads are usually not uniformly distributed and biases in RNA-Seq data post great challenges in many applications including transcriptome assembly and the expression level estimation of genes or isoforms. Much effort has been made in the literature to calibrate the expression level estimation from biased RNA-Seq data, but the effect of biases on transcriptome assembly remains largely unexplored. Results: Here, we propose a statistical framework for both transcriptome assembly and isoform expression level estimation from biased RNA-Seq data. Using a quasi-multinomial distribution model, our method is able to capture various types of RNA-Seq biases, including positional, sequencing and mappability biases. Our experimental results on simulated and real RNA-Seq datasets exhibit interesting effects of RNA-Seq biases on both transcriptome assembly and isoform expression level estimation. The advantage of our method is clearly shown in the experimental analysis by its high sensitivity and precision in transcriptome assembly and the high concordance of its estimated expression levels with quantitative reverse transcription–polymerase chain reaction data. Availability: CEM is freely available at http://www.cs.ucr.edu/~liw/cem.html . Contact: liw@cs.ucr.edu Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 81
    Publication Date: 2012-11-11
    Description: Motivation: A number of studies of individual proteins have shown that post-translational modifications (PTMs) are associated with structural rearrangements of their target proteins. Although such studies provide critical insights into the mechanics behind the dynamic regulation of protein function, they usually feature examples with relatively large conformational changes. However, with the steady growth of Protein Data Bank (PDB) and available PTM sites, it is now possible to more systematically characterize the role of PTMs as conformational switches. In this study, we ask (1) what is the expected extent of structural change upon PTM, (2) how often are those changes in fact substantial, (3) whether the structural impact is spatially localized or global and (4) whether different PTMs have different signatures. Results: We exploit redundancy in PDB and, using root-mean-square deviation, study the conformational heterogeneity of groups of protein structures corresponding to identical sequences in their unmodified and modified forms. We primarily focus on the two most abundant PTMs in PDB, glycosylation and phosphorylation, but show that acetylation and methylation have similar tendencies. Our results provide evidence that PTMs induce conformational changes at both local and global level. However, the proportion of large changes is unexpectedly small; only 7% of glycosylated and 13% of phosphorylated proteins undergo global changes 〉2 Å. Further analysis suggests that phosphorylation stabilizes protein structure by reducing global conformational heterogeneity by 25%. Overall, these results suggest a subtle but common role of allostery in the mechanisms through which PTMs affect regulatory and signaling pathways. Contact: predrag@indiana.edu Supplementary Information : Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 82
    Publication Date: 2012-11-11
    Description: Motivation: Current methods in diagnostic microbiology typically focus on the detection of a single genomic locus or protein in a candidate agent. The presence of the entire microbe is then inferred from this isolated result. Problematically, the presence of recombination in microbial genomes would go undetected unless other genomic loci or protein components were specifically assayed. Microarrays lend themselves well to the detection of multiple loci from a given microbe; furthermore, the inherent nature of microarrays facilitates highly parallel interrogation of multiple microbes. However, none of the existing methods for analyzing diagnostic microarray data has the capacity to specifically identify recombinant microbes. In previous work, we developed a novel algorithm, VIPR, for analyzing diagnostic microarray data. Results: We have expanded upon our previous implementation of VIPR by incorporating a hidden Markov model (HMM) to detect recombinant genomes. We trained our HMM on a set of non-recombinant parental viruses and applied our method to 11 recombinant alphaviruses and 4 recombinant flaviviruses hybridized to a diagnostic microarray in order to evaluate performance of the HMM. VIPR HMM correctly identified 95% of the 62 inter-species recombination breakpoints in the validation set and only two false-positive breakpoints were predicted. This study represents the first description and validation of an algorithm capable of detecting recombinant viruses based on diagnostic microarray hybridization patterns. Availability: VIPR HMM is freely available for academic use and can be downloaded from http://ibridgenetwork.org/wustl/vipr . Contact: davewang@borcim.wustl.edu Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 83
    Publication Date: 2012-11-11
    Description: Motivation: The boost of next-generation sequencing technologies provides us with an unprecedented opportunity for elucidating genetic mysteries, yet the short-read length hinders us from better assembling the genome from scratch. New protocols now exist that can generate overlapping pair-end reads. By joining the 3' ends of each read pair, one is able to construct longer reads for assembling. However, effectively joining two overlapped pair-end reads remains a challenging task. Result: In this article, we present an efficient tool called Connecting Overlapped Pair-End (COPE) reads, to connect overlapping pair-end reads using k -mer frequencies. We evaluated our tool on 30 x simulated pair-end reads from Arabidopsis thaliana with 1% base error. COPE connected over 99% of reads with 98.8% accuracy, which is, respectively, 10 and 2% higher than the recently published tool FLASH. When COPE is applied to real reads for genome assembly, the resulting contigs are found to have fewer errors and give a 14-fold improvement in the N50 measurement when compared with the contigs produced using unconnected reads. Availability and implementation: COPE is implemented in C++ and is freely available as open-source code at ftp://ftp.genomics.org.cn/pub/cope . Contact: twlam@cs.hku.hk or luoruibang@genomics.org.cn
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 84
    Publication Date: 2012-11-11
    Description: Motivation: It becomes widely accepted that human cancer is a disease involving dynamic changes in the genome and that the missense mutations constitute the bulk of human genetic variations. A multitude of computational algorithms, especially the machine learning-based ones, has consequently been proposed to distinguish missense changes that contribute to the cancer progression (‘driver’ mutation) from those that do not (‘passenger’ mutation). However, the existing methods have multifaceted shortcomings, in the sense that they either adopt incomplete feature space or depend on protein structural databases which are usually far from integrated. Results: In this article, we investigated multiple aspects of a missense mutation and identified a novel feature space that well distinguishes cancer-associated driver mutations from passenger ones. An index (DX score) was proposed to evaluate the discriminating capability of each feature, and a subset of these features which ranks top was selected to build the SVM classifier. Cross-validation showed that the classifier trained on our selected features significantly outperforms the existing ones both in precision and robustness. We applied our method to several datasets of missense mutations culled from published database and literature and obtained more reasonable results than previous studies. Availability : The software is available online at http://www.methodisthealth.com/software and https://sites.google.com/site/drivermutationidentification/ . Contact : xzhou@tmhs.org Supplementary information : Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 85
    Publication Date: 2012-11-11
    Description: Motivation: Assembling peptides identified from tandem mass spectra into a list of proteins, referred to as protein inference, is an important issue in shotgun proteomics. The objective of protein inference is to find a subset of proteins that are truly present in the sample. Although many methods have been proposed for protein inference, several issues such as peptide degeneracy still remain unsolved. Results: In this article, we present a linear programming model for protein inference. In this model, we use a transformation of the joint probability that each peptide/protein pair is present in the sample as the variable. Then, both the peptide probability and protein probability can be expressed as a formula in terms of the linear combination of these variables. Based on this simple fact, the protein inference problem is formulated as an optimization problem: minimize the number of proteins with non-zero probabilities under the constraint that the difference between the calculated peptide probability and the peptide probability generated from peptide identification algorithms should be less than some threshold. This model addresses the peptide degeneracy issue by forcing some joint probability variables involving degenerate peptides to be zero in a rigorous manner. The corresponding inference algorithm is named as ProteinLP. We test the performance of ProteinLP on six datasets. Experimental results show that our method is competitive with the state-of-the-art protein inference algorithms. Availability: The source code of our algorithm is available at: https://sourceforge.net/projects/prolp/ . Contact: zyhe@dlut.edu.cn Supplementary information: Supplementary data are available at Bioinformatics Online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 86
    Publication Date: 2012-11-11
    Description: Motivation: Methylation of cytosines in DNA is an important epigenetic mechanism involved in transcriptional regulation and preservation of genome integrity in a wide range of eukaryotes. Immunoprecipitation of methylated DNA followed by hybridization to genomic tiling arrays (MeDIP-chip) is a cost-effective and sensitive method for methylome analyses. However, existing bioinformatics methods only enable a binary classification into unmethylated and methylated genomic regions, which limit biological interpretations. Indeed, DNA methylation levels can vary substantially within a given DNA fragment depending on the number and degree of methylated cytosines. Therefore, a method for the identification of more than two methylation states is highly desirable. Results: Here, we present a three-state hidden Markov model (MeDIP-HMM) for analyzing MeDIP-chip data. MeDIP-HMM uses a higher-order state-transition process improving modeling of spatial dependencies between chromosomal regions, allows a simultaneous analysis of replicates and enables a differentiation between unmethylated, methylated and highly methylated genomic regions. We train MeDIP-HMM using a Bayesian Baum–Welch algorithm, integrating prior knowledge on methylation levels. We apply MeDIP-HMM to the analysis of the Arabidopsis root methylome and systematically investigate the benefit of using higher-order HMMs. Moreover, we also perform an in-depth comparison study with existing methods and demonstrate the value of using MeDIP-HMM by comparisons to current knowledge on the Arabidopsis methylome. We find that MeDIP-HMM is a fast and precise method for the analysis of methylome data, enabling the identification of distinct DNA methylation levels. Finally, we provide evidence for the general applicability of MeDIP-HMM by analyzing promoter DNA methylation data obtained for chicken. Availability: MeDIP-HMM is available as part of the open-source Java library Jstacs ( www.jstacs.de/index.php/MeDIP-HMM ). Data files are available from the Jstacs website. Contact: seifert@ipk-gatersleben.de Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 87
    Publication Date: 2012-11-11
    Description: Motivation: Automated annotation of neuroanatomical connectivity statements from the neuroscience literature would enable accessible and large-scale connectivity resources. Unfortunately, the connectivity findings are not formally encoded and occur as natural language text. This hinders aggregation, indexing, searching and integration of the reports. We annotated a set of 1377 abstracts for connectivity relations to facilitate automated extraction of connectivity relationships from neuroscience literature. We tested several baseline measures based on co-occurrence and lexical rules. We compare results from seven machine learning methods adapted from the protein interaction extraction domain that employ part-of-speech, dependency and syntax features. Results: Co-occurrence based methods provided high recall with weak precision. The shallow linguistic kernel recalled 70.1% of the sentence-level connectivity statements at 50.3% precision. Owing to its speed and simplicity, we applied the shallow linguistic kernel to a large set of new abstracts. To evaluate the results, we compared 2688 extracted connections with the Brain Architecture Management System (an existing database of rat connectivity). The extracted connections were connected in the Brain Architecture Management System at a rate of 63.5%, compared with 51.1% for co-occurring brain region pairs. We found that precision increases with the recency and frequency of the extracted relationships. Availability and implementation: The source code, evaluations, documentation and other supplementary materials are available at http://www.chibi.ubc.ca/WhiteText . Contact: paul@chibi.ubc.ca Supplementary information: Supplementary data are available at Bioinformatics Online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 88
    Publication Date: 2012-11-11
    Description: Motivation: The first step for clinical diagnostics, prognostics and targeted therapeutics of cancer is to comprehensively understand its molecular mechanisms. Large-scale cancer genomics projects are providing a large volume of data about genomic, epigenomic and gene expression aberrations in multiple cancer types. One of the remaining challenges is to identify driver mutations, driver genes and driver pathways promoting cancer proliferation and filter out the unfunctional and passenger ones. Results: In this study, we propose two methods to solve the so-called maximum weight submatrix problem, which is designed to de novo identify mutated driver pathways from mutation data in cancer. The first one is an exact method that can be helpful for assessing other approximate or/and heuristic algorithms. The second one is a stochastic and flexible method that can be employed to incorporate other types of information to improve the first method. Particularly, we propose an integrative model to combine mutation and expression data. We first apply our methods onto simulated data to show their efficiency. We further apply the proposed methods onto several real biological datasets, such as the mutation profiles of 74 head and neck squamous cell carcinomas samples, 90 glioblastoma tumor samples and 313 ovarian carcinoma samples. The gene expression profiles were also considered for the later two data. The results show that our integrative model can identify more biologically relevant gene sets. We have implemented all these methods and made a package called mutated driver pathway finder, which can be easily used for other researchers. Availability: A MATLAB package of MDPFinder is available at http://zhangroup.aporc.org/ShiHuaZhang Contact: zsh@amss.ac.cn Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 89
    Publication Date: 2012-11-11
    Description: Motivation: Biologistics provides data for quantitative analysis of transport (diffusion) processes and their spatio-temporal correlations in cells. Mobility of proteins is one of the few parameters necessary to describe reaction rates for gene regulation. Although understanding of diffusion-limited biochemical reactions in vivo requires mobility data for the largest possible number of proteins in their native forms, currently, there is no database that would contain the complete information about the diffusion coefficients (DCs) of proteins in a given cell type. Results: We demonstrate a method for the determination of in vivo DCs for any molecule—regardless of its molecular weight, size and structure—in any type of cell. We exemplify the method with the database of in vivo DC for all proteins (4302 records) from the proteome of K12 strain of Escherichia coli , together with examples of DC of amino acids, sugars, RNA and DNA. The database follows from the scale-dependent viscosity reference curve (sdVRC). Construction of sdVRC for prokaryotic or eukaryotic cell requires ~20 in vivo measurements using techniques such as fluorescence correlation spectroscopy (FCS), fluorescence recovery after photobleaching (FRAP), nuclear magnetic resonance (NMR) or particle tracking. The shape of the sdVRC would be different for each organism, but the mathematical form of the curve remains the same. The presented method has a high predictive power, as the measurements of DCs of several inert, properly chosen probes in a single cell type allows to determine the DCs of thousands of proteins. Additionally, obtained mobility data allow quantitative study of biochemical interactions in vivo . Contact: rholyst@ichf.edu.pl Supplementary information: Supplementary data are available at Bioinformatics Online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 90
    Publication Date: 2012-11-11
    Description: Motivation: In modern sequencing studies, one can improve the confidence of genotype calls by phasing haplotypes using information from an external reference panel of fully typed unrelated individuals. However, the computational demands are so high that they prohibit researchers with limited computational resources from haplotyping large-scale sequence data. Results: Our graphics processing unit based software delivers haplotyping and imputation accuracies comparable to competing programs at a fraction of the computational cost and peak memory demand. Availability: Mendel-GPU , our OpenCL software, runs on Linux platforms and is portable across AMD and nVidia GPUs. Users can download both code and documentation at http://code.google.com/p/mendel-gpu/ . Contact: gary.k.chen@usc.edu Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 91
    Publication Date: 2012-11-11
    Description: : Phylogenetics, likelihood, evolution and complexity ( PLEX ) is a flexible and fast Bayesian Markov chain Monte Carlo software program for large-scale analysis of nucleotide and amino acid data using complex evolutionary models in a phylogenetic framework. The program gains large speed improvements over standard approaches by implementing ‘partial sampling of substitution histories’, a data augmentation approach that can reduce data analysis times from months to minutes on large comparative datasets. A variety of nucleotide and amino acid substitution models are currently implemented, including non-reversible and site-heterogeneous mixture models. Due to efficient algorithms that scale well with data size and model complexity, PLEX can be used to make inferences from hundreds to thousands of taxa in only minutes on a desktop computer. It also performs probabilistic ancestral sequence reconstruction. Future versions will support detection of co-evolutionary interactions between sites, probabilistic tests of convergent evolution and rigorous testing of evolutionary hypotheses in a Bayesian framework. Availability and implementation: PLEX v1.0 is licensed under GPL. Source code and documentation will be available for download at www.evolutionarygenomics.com/ProgramsData/PLEX . PLEX is implemented in C++ and supported on Linux, Mac OS X and other platforms supporting standard C++ compilers. Example data, control files, documentation and accessory Perl scripts are available from the website. Contact: David.Pollock@UCDenver.edu Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 92
    Publication Date: 2012-11-11
    Description: : PrIME-DLRS (or colloquially: ‘Delirious’) is a phylogenetic software tool to simultaneously infer and reconcile a gene tree given a species tree. It accounts for duplication and loss events, a relaxed molecular clock and is intended for the study of homologous gene families, for example in a comparative genomics setting involving multiple species. PrIME-DLRS uses a Bayesian MCMC framework, where the input is a known species tree with divergence times and a multiple sequence alignment, and the output is a posterior distribution over gene trees and model parameters. Availability and implementation : PrIME-DLRS is available for Java SE 6+ under the New BSD License, and JAR files and source code can be downloaded from http://code.google.com/p/jprime/ . There is also a slightly older C++ version available as a binary package for Ubuntu, with download instructions at http://prime.sbc.su.se . The C++ source code is available upon request. Contact: joel.sjostrand@scilifelab.se or jens.lagergren@scilifelab.se . Supplementary Information : PrIME-DLRS is based on a sound probabilistic model (Åkerborg et al. , 2009) and has been thoroughly validated on synthetic and biological datasets ( Supplementary Material online ).
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 93
    Publication Date: 2012-11-11
    Description: : Computational Structural Biology Toolbox (CSB) is a cross-platform Python class library for reading, storing and analyzing biomolecular structures with rich support for statistical analyses. CSB is designed for reusability and extensibility and comes with a clean, well-documented API following good object-oriented engineering practice. Availability: Stable release packages are available for download from the Python Package Index (PyPI) as well as from the project’s website http://csb.codeplex.com . Contacts: ivan.kalev@gmail.com or michael.habeck@tuebingen.mpg.de
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 94
    Publication Date: 2012-11-11
    Description: : GREVE has been developed to assist with the identification of recurrent genomic aberrations across cancer samples. The exact characterization of such aberrations remains a challenge despite the availability of increasing amount of data, from SNParray to next-generation sequencing. Furthermore, genomic aberrations in cancer are especially difficult to handle because they are, by nature, unique to the patients. However, their recurrence in specific regions of the genome has been shown to reflect their relevance in the development of tumors. GREVE makes use of previously characterized events to identify such regions and focus any further analysis. Availability: GREVE is available through a web interface and open-source application ( http://www.well.ox.ac.uk/GREVE ).
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 95
    Publication Date: 2012-11-11
    Description: : Protein interaction networks are widely used to depict the relationships between proteins. These networks often lack the information on physical binary interactions, and they do not inform whether there is incompatibility of structure between binding partners. Here, we introduce SAPIN, a framework dedicated to the structural analysis of protein interaction networks. SAPIN first identifies the protein parts that could be involved in the interaction and provides template structures. Next, SAPIN performs structural superimpositions to identify compatible and mutually exclusive interactions. Finally, the results are displayed using Cytoscape Web. Availability: The SAPIN server is available at http://sapin.crg.es . Contact: jae-seong.yang@crg.eu or christina.kiel@crg.eu Supplementary information: Supplementary data are available at Bioinformatics Online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 96
    Publication Date: 2012-11-11
    Description: : ChemBioServer is a publicly available web application for effectively mining and filtering chemical compounds used in drug discovery. It provides researchers with the ability to (i) browse and visualize compounds along with their properties, (ii) filter chemical compounds for a variety of properties such as steric clashes and toxicity, (iii) apply perfect match substructure search, (iv) cluster compounds according to their physicochemical properties providing representative compounds for each cluster, (v) build custom compound mining pipelines and (vi) quantify through property graphs the top ranking compounds in drug discovery procedures. ChemBioServer allows for pre-processing of compounds prior to an in silico screen, as well as for post-processing of top-ranked molecules resulting from a docking exercise with the aim to increase the efficiency and the quality of compound selection that will pass to the experimental test phase. Availability: The ChemBioServer web application is available at: http://bioserver-3.bioacademy.gr/Bioserver/ChemBioServer/ . Contact: gspyrou@bioacademy.gr
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 97
    Publication Date: 2012-11-11
    Description: Motivation: Cell growth and division affect the kinetics of internal cellular processes and the phenotype diversity of cell populations. Since the effects are complex, e.g. different cellular components are partitioned differently in cell division, to account for them in silico, one needs to simulate these processes in great detail. Results : We present SGNS2, a simulator of chemical reaction systems according to the Stochastic Simulation Algorithm with multi-delayed reactions within hierarchical, interlinked compartments which can be created, destroyed and divided at runtime. In division, molecules are randomly segregated into the daughter cells following a specified distribution corresponding to one of several partitioning schemes, applicable on a per-molecule-type basis. We exemplify its use with six models including a stochastic model of the disposal mechanism of unwanted protein aggregates in Escherichia coli , a model of phenotypic diversity in populations with different levels of synchrony, a model of a bacteriophage’s infection of a cell population and a model of prokaryotic gene expression at the nucleotide and codon levels. Availability : SGNS2, instructions and examples available at www.cs.tut.fi/~lloydpri/sgns2/ (open source under New BSD license). Contact : jason.lloyd-price@tut.fi Supplementary information: Supplementary data are available at Bioinformatics online.
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 98
    Publication Date: 2012-11-11
    Description: : There is an immediate need for tools to both analyse and visualize in real-time single-nucleotide polymorphisms, insertions and deletions, and other structural variants from new sequence file formats. We have developed VarB software that can be used to visualize variant call format files in real time, as well as identify regions under balancing selection and informative markers to differentiate user-defined groups (e.g. populations). We demonstrate its utility using sequence data from 50 Plasmodium falciparum isolates comprising two different continents and confirm known signals from genomic regions that contain important antigenic and anti-malarial drug-resistance genes. Availability and implementation: The C++-based software VarB and user manual are available from www.pathogenseq.org/varb . Contact: taane.clark@lshtm.ac.uk
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 99
    Publication Date: 2012-11-11
    Description: : comb-p is a command-line tool and a python library that manipulates BED files of possibly irregularly spaced P -values and (1) calculates auto-correlation, (2) combines adjacent P -values, (3) performs false discovery adjustment, (4) finds regions of enrichment (i.e. series of adjacent low P -values) and (5) assigns significance to those regions. In addition, tools are provided for visualization and assessment. We provide validation and example uses on bisulfite-seq with P -values from Fisher’s exact test, tiled methylation probes using a linear model and Dam-ID for chromatin binding using moderated t -statistics. Because the library accepts input in a simple, standardized format and is unaffected by the origin of the P -values, it can be used for a wide variety of applications. Availability: comb-p is maintained under the BSD license. The documentation and implementation are available at https://github.com/brentp/combined-pvalues . Contact: bpederse@gmail.com
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 100
    Publication Date: 2012-11-11
    Description: : ImgLib2 is an open-source Java library for n -dimensional data representation and manipulation with focus on image processing. It aims at minimizing code duplication by cleanly separating pixel-algebra, data access and data representation in memory. Algorithms can be implemented for classes of pixel types and generic access patterns by which they become independent of the specific dimensionality, pixel type and data representation. ImgLib2 illustrates that an elegant high-level programming interface can be achieved without sacrificing performance. It provides efficient implementations of common data types, storage layouts and algorithms. It is the data model underlying ImageJ2, the KNIME Image Processing toolbox and an increasing number of Fiji-Plugins. Availability : ImgLib2 is licensed under BSD. Documentation and source code are available at http://imglib2.net and in a public repository at https://github.com/imagej/imglib . Supplementary Information: Supplementary data are available at Bioinformatics Online. Contact : saalfeld@mpi-cbg.de
    Print ISSN: 1367-4803
    Electronic ISSN: 1460-2059
    Topics: Biology , Computer Science , Medicine
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
Close ⊗
This website uses cookies and the analysis tool Matomo. More information can be found here...