ALBERT

All Library Books, journals and Electronic Records Telegrafenberg

feed icon rss

Your email was sent successfully. Check your inbox.

An error occurred while sending the email. Please try again.

Proceed reservation?

Export
  • 1
    Publication Date: 2022-09-29
    Description: For the last 15 years, Biodiversity Information Standards (TDWG) has recognized two competing standards for organism occurrence data, ABCD (Access to Biological Collections Data; Holetschek et al. 2012) and DarwinCore (Wieczorek et al. 2012). These two representations emerged from contrasting strategies for mobilizing information about organism occurrences (also commonly called species occurrence data). ABCD was capable of representing details of more kinds of information, but was necessarily more complicated. DarwinCore, on the other hand, was simpler but more limited in its ability to represent data of different kinds and formats. TDWG endorsed both standards because the different projects and communities that generated them remained dedicated to their different strategies and tool sets, and the Global Biodiversity Information Facility (GBIF) developed the ability to integrate data published in either standard. Since their inceptions, DarwinCore and ABCD have become more similar. DarwinCore has gotten more complicated through the addition of terms and has begun to assign terms to classes. ABCD is now expressed in RDF (Resource Description Framework), potentially enabling re-use of terms with alternative structures among classes. At the same time, methodologies for conceptual modeling and representing complex scientific data have continued to evolve. In particular, a suite of modeling and data representation methods related to linked data and the semantic web, i.e., RDF, SKOS (Simple Knowledge Organization System), and OWL (web Ontology Language), promise to make it easier for us to reconcile shared concepts among different representations or schemas. A mapping between ABCD 2.1 and DarwinCore has existed since before 2005.*1 ABCD 3.0 and DarwinCore are both now represented in RDF. In addition, the BioCollections Ontology (BCO) covers many of the shared concepts and is derived from the Basic Formal Ontology (BFO), an upper level ontology that has oriented many other biomedical ontologies. Reconciling ABCD and DarwinCore through alignment with BCO (in the OBO Foundry; Smith et al. 2007) would better connect TDWG standards to other domains in biology. We appreciate that many working scientists and data managers perceive ontologies as overly complicated. To mitigate the steep learning curve associated with ontologies, we expect to create simpler application profiles or schemas to guide and serve narrower communities of practice within the wider biodiversity domain. We also plan to integrate the current work of the Taxonomic Names and Concepts Interest Group and thereby eliminate the redundancy between DarwinCore and Taxonomic Concepts Transfer Schema (TCS; Kennedy et al. 2006). At the time of this writing, we have only agreements from the authors (i.e., conveners of relevant TDWG Interest Groups and other key stakeholders) to collaborate in pursuit of these common goals. In this presentation we will give a more detailed description of our objectives and products, the methods we are using to achieve them, and our progress to date.
    Repository Name: EPIC Alfred Wegener Institut
    Type: Conference , notRev
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 2
    Publication Date: 2024-01-12
    Description: DiSSCo, the Distributed System of Scientific Collections, is a pan-European Research \nInfrastructure (RI) mobilising, unifying bio- and geo-diversity information connected to the \nspecimens held in natural science collections and delivering it to scientific communities and \nbeyond. Bringing together 120 institutions across 21 countries and combining earlier \ninvestments in data interoperability practices with technological advancements in \ndigitisation, cloud services and semantic linking, DiSSCo makes the data from natural \nscience collections available as one virtual data cloud, connected with data emerging from \nnew techniques and not already linked to specimens. These new data include DNA \nbarcodes, whole genome sequences, proteomics and metabolomics data, chemical data, \ntrait data, and imaging data (Computer-assisted Tomography (CT), Synchrotron, etc.), to name but a few; and will lead to a wide range of end-user services that begins with finding, \naccessing, using and improving data. DiSSCo will deliver the diagnostic information \nrequired for novel approaches and new services that will transform the landscape of what \nis possible in ways that are hard to imagine today. \nWith approximately 1.5 billion objects to be digitised, bringing natural science collections to \nthe information age is expected to result in many tens of petabytes of new data over the \nnext decades, used on average by 5,000 \xe2\x80\x93 15,000 unique users every day. This requires \nnew skills, clear policies and robust procedures and new technologies to create, work with \nand manage large digital datasets over their entire research data lifecycle, including their \nlong-term storage and preservation and open access. Such processes and procedures \nmust match and be derived from the latest thinking in open science and data management, \nrealising the core principles of \'findable, accessible, interoperable and reusable\' (FAIR). \nSynthesised from results of the ICEDIG project ("Innovation and Consolidation for Large \nScale Digitisation of Natural Heritage", EU Horizon 2020 grant agreement No. 777483) the \nDiSSCo Conceptual Design Blueprint covers the organisational arrangements, processes \nand practices, the architecture, tools and technologies, culture, skills and capacity building \nand governance and business model proposals for constructing the digitisation \ninfrastructure of DiSSCo. In this context, the digitisation infrastructure of DiSSCo must be \ninterpreted as that infrastructure (machinery, processing, procedures, personnel, \norganisation) offering Europe-wide capabilities for mass digitisation and digitisation-ondemand, \nand for the subsequent management (i.e., curation, publication, processing) and \nuse of the resulting data. The blueprint constitutes the essential background needed to \ncontinue work to raise the overall maturity of the DiSSCo Programme across multiple \ndimensions (organisational, technical, scientific, data, financial) to achieve readiness to \nbegin construction. \nToday, collection digitisation efforts have reached most collection-holding institutions \nacross Europe. Much of the leadership and many of the people involved in digitisation and \nworking with digital collections wish to take steps forward and expand the efforts to benefit \nfurther from the already noticeable positive effects. The collective results of examining \ntechnical, financial, policy and governance aspects show the way forward to operating a \nlarge distributed initiative i.e., the Distributed System of Scientific Collections (DiSSCo) for \nnatural science collections across Europe. Ample examples, opportunities and need for \ninnovation and consolidation for large scale digitisation of natural heritage have been \ndescribed. The blueprint makes one hundred and four (104) recommendations to be \nconsidered by other elements of the DiSSCo Programme of linked projects (i.e., \nSYNTHESYS+, COST MOBILISE, DiSSCo Prepare, and others to follow) and the DiSSCo \nProgramme leadership as the journey towards organisational, technical, scientific, data and \nfinancial readiness continues. \nNevertheless, significant obstacles must be overcome as a matter of priority if DiSSCo is to \nmove beyond its Design and Preparatory Phases during 2024. Specifically, these include: \nOrganisational: \n\xe2\x80\xa2 Strengthen common purpose by adopting a common framework for policy \nharmonisation and capacity enhancement across broad areas, especially in respect \nof digitisation strategy and prioritisation, digitisation processes and techniques, data \nand digital media publication and open access, protection of and access to \nsensitive data, and administration of access and benefit sharing. \n\xe2\x80\xa2 Pursue the joint ventures and other relationships necessary to the successful \ndelivery of the DiSSCo mission, especially ventures with GBIF and other \ninternational and regional digitisation and data aggregation organisations, in the \ncontext of infrastructure policy frameworks, such as EOSC. Proceed with the \nexplicit aim of avoiding divergences of approach in global natural science \ncollections data management and research. \nTechnical: \n\xe2\x80\xa2 Adopt and enhance the DiSSCo Digital Specimen Architecture and, specifically as \na matter of urgency, establish the persistent identifier scheme to be used by \nDiSSCo and (ideally) other comparable regional initiatives. \n\xe2\x80\xa2 Establish (software) engineering development and (infrastructure) operations team \nand direction essential to the delivery of services and functionalities expected from \nDiSSCo such that earnest engineering can lead to an early start of DiSSCo \noperations. \nScientific: \n\xe2\x80\xa2 Establish a common digital research agenda leveraging Digital (extended) \nSpecimens as anchoring points for all specimen-associated and -derived \ninformation, demonstrating to research institutions and policy/decision-makers the \nnew possibilities, opportunities and value of participating in the DiSSCo research \ninfrastructure. \nData: \n\xe2\x80\xa2 Adopt the FAIR Digital Object Framework and the International Image \nInteroperability Framework as the low entropy means to achieving uniform access \nto rich data (image and non-image) that is findable, accessible, interoperable and \nreusable (FAIR). \n\xe2\x80\xa2 Develop and promote best practice approaches towards achieving the best \ndigitisation results in terms of quality (best, according to agreed minimum \ninformation and other specifications), time (highest throughput, fast), and cost \n(lowest, minimal per specimen). \nFinancial \n\xe2\x80\xa2 Broaden attractiveness (i.e., improve bankability) of DiSSCo as an infrastructure to \ninvest in. \n\xe2\x80\xa2 Plan for finding ways to bridge the funding gap to avoid disruptions in the critical \nfunding path that risks interrupting core operations; especially when the gap opens \nbetween the end of preparations and beginning of implementation due to unsolved \npolitical difficulties. \nStrategically, it is vital to balance the multiple factors addressed by the blueprint against \none another to achieve the desired goals of the DiSSCo programme. Decisions cannot be \ntaken on one aspect alone without considering other aspects, and here the various \ngovernance structures of DiSSCo (General Assembly, advisory boards, and stakeholder \nforums) play a critical role over the coming years.
    Keywords: DiSSCo ; Distributed System of Scientific Collections ; Design ; Blueprint ; ICEDIG ; Deliverable
    Repository Name: National Museum of Natural History, Netherlands
    Type: info:eu-repo/semantics/article
    Format: application/pdf
    Location Call Number Expected Availability
    BibTip Others were also interested in ...
  • 3
Close ⊗
This website uses cookies and the analysis tool Matomo. More information can be found here...