Investigation of BERT Model on Biomedical Relation Extraction Based on Revised Fine-tuning Mechanism
P. Su, K. Vijay-Shanker
2020
Optimization of Biomedical Language Model with Optuna and a Sentencepiece Tokenization for NER
Chérubin Mugisha, Incheon Paik
2022
Self-Supervised Contextual Language Representation of Radiology Reports to Improve the Identification of Communication Urgency
Xing Meng, C. Ganoe, R. Sieberg, Yvonne Y. Cheung, Saeid Hassanpour
2019
Oversampling effect in pretraining for bidirectional encoder representations from transformers (BERT) to localize medical BERT and enhance biomedical BERT
Shoya Wada, Toshihiro Takeda, Katsuki Okada, S. Manabe, Shozo Konishi, Jun Kamohara, Y. Matsumura
2020
exBERT: Extending Pre-trained Models with Domain-specific Vocabulary Under Constrained Training Resources
Wen-Hsin Tai, H. T. Kung, Xin Dong
2020
Investigation of improving the pre-training and fine-tuning of BERT model for biomedical relation extraction
P. Su, K. Vijay-Shanker
2021
Transferability of Natural Language Inference to Biomedical Question Answering
Minbyul Jeong, Mujeen Sung, Gangwoo Kim, Donghyeon Kim, Wonjin Yoon, J. Yoo, Jaewoo Kang
2020
Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name Recognition
Yun He, Ziwei Zhu, Yin Zhang, Qin Chen, James Caverlee
2020
Inexpensive Domain Adaptation of Pretrained Language Models: A Case Study on Biomedical Named Entity Recognition
Nina Poerner, Ulli Waltinger, Hinrich Schütze
2020
Chemical-protein Interaction Extraction via Gaussian Probability Distribution and External Biomedical Knowledge
Cong Sun, Zhihao Yang, Leilei Su, Lei Wang, Yin Zhang, Hongfei Lin, Jian Wang
2019
Re-train or Train from Scratch? Comparing Pre-training Strategies of BERT in the Medical Domain
Hicham El Boukkouri, Olivier Ferret, T. Lavergne, Pierre Zweigenbaum
2022
Biomedical named entity recognition using BERT in the machine reading comprehension framework
Cong Sun, Zhihao Yang, Lei Wang, Yin Zhang, Hongfei Lin, Jian Wang
The overwhelming amount of biomedical scientific texts calls for the development of effective language models able to tackle a wide range of biomedical natural language processing (NLP) tasks. The most recent dominant approaches are domain-specific models, initialized with general-domain textual data and then trained on a variety of scientific corpora. However, it has been observed that for specialized domains in which large corpora exist, training a model from scratch with just in-domain knowledge may yield better results. Moreover, the increasing focus on the compute costs for pre-training recently led to the design of more efficient architectures, such as ELECTRA. In this paper, we propose a pre-trained domain-specific language model, called ELECTRAMed, suited for the biomedical field. The novel approach inherits the learning framework of the general-domain ELECTRA architecture, as well as its computational advantages. Experiments performed on benchmark datasets for several biomedical NLP tasks support the usefulness of ELECTRAMed, which sets the novel state-of-the-art result on the BC5CDR corpus for named entity recognition, and provides the best outcome in 2 over the 5 runs of the 7th BioASQ-factoid Challange for the question answering task.