ALBERT

All Library Books, journals and Electronic Records Telegrafenberg


Filter
  • Publication period: 2020-2023 (2)
  • 1
    Publication date: 2022-07-28
    Description: Motivation: Several recent studies showed that the application of deep neural networks advanced the state-of-the-art in named entity recognition (NER), including biomedical NER. However, the impact on performance and the robustness of improvements crucially depends on the availability of sufficiently large training corpora, which is a problem in the biomedical domain with its often rather small gold standard corpora. Results: We evaluate different methods for alleviating the data sparsity problem by pretraining a deep neural network (LSTM-CRF), followed by a rather short fine-tuning phase focusing on a particular corpus. Experiments were performed using 34 different corpora covering five different biomedical entity types, yielding an average increase in F1-score of ∼2 pp compared to learning without pretraining. We experimented both with supervised and semi-supervised pretraining, leading to interesting insights into the precision/recall trade-off. Based on our results, we created the stand-alone NER tool HUNER incorporating fully trained models for five entity types. On the independent CRAFT corpus, which was not used for creating HUNER, it outperforms the state-of-the-art tools GNormPlus and tmChem by 5-13 pp on the entity types chemicals, species, and genes. Availability: HUNER is freely available at https://hu-ner.github.io. HUNER comes in containers, making it easy to install and use, and it can be applied off-the-shelf to arbitrary texts. We also provide an integrated tool for obtaining and converting all 34 corpora used in our evaluation, including fixed training, development and test splits to enable fair comparisons in the future. (A schematic pretraining/fine-tuning sketch follows the result list below.)
    Material type: info:eu-repo/semantics/article
    Format: application/pdf
  • 2
    In: Journal of Chemical Theory and Computation
    Publication date: 2022-07-29
    Description: Machine learning (ML) approaches have demonstrated the ability to predict molecular spectra at a fraction of the computational cost of traditional theoretical chemistry methods while maintaining high accuracy. Graph neural networks (GNNs) are particularly promising in this regard, but different types of GNNs have not yet been systematically compared. In this work, we benchmark and analyze five different GNNs for the prediction of excitation spectra from the QM9 dataset of organic molecules. We compare the GNN performance in terms of runtime, prediction accuracy, and analysis of outliers in the test set. Moreover, through TMAP clustering and statistical analysis, we are able to highlight clear hotspots of high prediction errors as well as optimal spectra prediction for molecules with certain functional groups. This in-depth benchmarking and subsequent analysis protocol lays down a recipe for comparing different ML methods and evaluating dataset quality. (A schematic GNN sketch follows the result list below.)
    Material type: info:eu-repo/semantics/article
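
The pretrain-then-fine-tune scheme described in the first record can be illustrated with a short, hypothetical Python/PyTorch sketch. A plain BiLSTM tagger stands in for the LSTM-CRF (the CRF layer, the real biomedical corpora, and HUNER itself are not shown): the model is first trained for several epochs on pooled "pretraining" batches and then briefly fine-tuned on a small target corpus with a lower learning rate. All names, data, and hyperparameters below are illustrative assumptions, not the authors' implementation.

    # Minimal sketch: pretrain a tagger on pooled corpora, then fine-tune on one
    # target corpus. Illustrative only; not the HUNER implementation.
    import torch
    import torch.nn as nn

    class BiLSTMTagger(nn.Module):
        """BiLSTM token tagger; the CRF output layer of an LSTM-CRF is omitted."""
        def __init__(self, vocab_size, num_tags, emb_dim=64, hidden=128):
            super().__init__()
            self.emb = nn.Embedding(vocab_size, emb_dim)
            self.lstm = nn.LSTM(emb_dim, hidden, batch_first=True, bidirectional=True)
            self.out = nn.Linear(2 * hidden, num_tags)

        def forward(self, tokens):                 # tokens: (batch, seq_len)
            h, _ = self.lstm(self.emb(tokens))
            return self.out(h)                     # (batch, seq_len, num_tags)

    def run_epochs(model, batches, epochs, lr):
        """One training phase (used for both pretraining and fine-tuning)."""
        opt = torch.optim.Adam(model.parameters(), lr=lr)
        loss_fn = nn.CrossEntropyLoss()
        for _ in range(epochs):
            for tokens, tags in batches:
                opt.zero_grad()
                logits = model(tokens)
                loss = loss_fn(logits.flatten(0, 1), tags.flatten())
                loss.backward()
                opt.step()

    # Toy random data standing in for pooled pretraining corpora and one target corpus.
    def toy_batches(n, vocab_size=100, num_tags=5, seq_len=20, batch=8):
        return [(torch.randint(0, vocab_size, (batch, seq_len)),
                 torch.randint(0, num_tags, (batch, seq_len))) for _ in range(n)]

    model = BiLSTMTagger(vocab_size=100, num_tags=5)
    run_epochs(model, toy_batches(50), epochs=3, lr=1e-3)   # long pretraining phase
    run_epochs(model, toy_batches(5), epochs=1, lr=1e-4)    # short fine-tuning phase

In practice the pretraining batches would come from the pooled gold-standard corpora and the fine-tuning batches from the single target corpus, which is the setup the abstract evaluates.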
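
The graph-neural-network setup in the second record (predicting excitation spectra for molecules from QM9) can likewise be sketched generically. The example below is a hypothetical message-passing GNN in PyTorch that maps a toy molecular graph to a fixed-length spectrum vector via sum pooling; it is not one of the five benchmarked architectures, and loading QM9 or the TMAP clustering step is omitted.

    # Minimal sketch: a message-passing GNN that maps a molecular graph to a
    # fixed-length excitation-spectrum vector. Illustrative assumptions only.
    import torch
    import torch.nn as nn

    class MPLayer(nn.Module):
        """One round of message passing with sum aggregation over edges."""
        def __init__(self, dim):
            super().__init__()
            self.msg = nn.Linear(2 * dim, dim)
            self.upd = nn.GRUCell(dim, dim)

        def forward(self, h, edge_index):          # h: (num_nodes, dim)
            src, dst = edge_index                  # edge_index: (2, num_edges)
            m = torch.relu(self.msg(torch.cat([h[src], h[dst]], dim=-1)))
            agg = torch.zeros_like(h).index_add_(0, dst, m)   # sum messages per node
            return self.upd(agg, h)

    class SpectrumGNN(nn.Module):
        def __init__(self, num_atom_types=10, dim=64, spectrum_bins=32, layers=3):
            super().__init__()
            self.emb = nn.Embedding(num_atom_types, dim)
            self.layers = nn.ModuleList([MPLayer(dim) for _ in range(layers)])
            self.readout = nn.Sequential(nn.Linear(dim, dim), nn.ReLU(),
                                         nn.Linear(dim, spectrum_bins))

        def forward(self, atom_types, edge_index):
            h = self.emb(atom_types)
            for layer in self.layers:
                h = layer(h, edge_index)
            return self.readout(h.sum(dim=0))      # graph-level sum pooling

    # Toy molecule: 5 atoms in a chain, edges listed in both directions.
    atoms = torch.tensor([0, 1, 1, 2, 0])
    edges = torch.tensor([[0, 1, 1, 2, 2, 3, 3, 4],
                          [1, 0, 2, 1, 3, 2, 4, 3]])
    spectrum = SpectrumGNN()(atoms, edges)          # predicted 32-bin spectrum

Training such a model against reference spectra (e.g. with a mean-squared-error loss) and then comparing runtime, accuracy, and outliers across architectures is the kind of benchmarking protocol the abstract describes.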