An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition.

George Tsatsaronis , Georgios Balikas , Prodromos Malakasiotis , Ioannis Partalas , Matthias Zschunke , Michael R Alvers , Dirk Weissenborn , Anastasia Krithara , Sergios Petridis , Dimitris Polychronopoulos , Yannis Almirantis , John Pavlopoulos , Nicolas Baskiotis , Patrick Gallinari , Thierry Artiéres , Axel-Cyrille Ngonga Ngomo , Norman Heino , Eric Gaussier , Liliana Barrio-Alvers , Michael Schroeder

BMC Bioinformatics

NCSR Demokritos, Ag. Paraskevi, Athens, 60228, Greece.

Published: April 2015

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Background: This article provides an overview of the first BIOASQ challenge, a competition on large-scale biomedical semantic indexing and question answering (QA), which took place between March and September 2013. BIOASQ assesses the ability of systems to semantically index very large numbers of biomedical scientific articles, and to return concise and user-understandable answers to given natural language questions by combining information from biomedical articles and ontologies.

Results: The 2013 BIOASQ competition comprised two tasks, Task 1a and Task 1b. In Task 1a participants were asked to automatically annotate new PUBMED documents with MESH headings. Twelve teams participated in Task 1a, with a total of 46 system runs submitted, and one of the teams performing consistently better than the MTI indexer used by NLM to suggest MESH headings to curators. Task 1b used benchmark datasets containing 29 development and 282 test English questions, along with gold standard (reference) answers, prepared by a team of biomedical experts from around Europe and participants had to automatically produce answers. Three teams participated in Task 1b, with 11 system runs. The BIOASQ infrastructure, including benchmark datasets, evaluation mechanisms, and the results of the participants and baseline methods, is publicly available.

Conclusions: A publicly available evaluation infrastructure for biomedical semantic indexing and QA has been developed, which includes benchmark datasets, and can be used to evaluate systems that: assign MESH headings to published articles or to English questions; retrieve relevant RDF triples from ontologies, relevant articles and snippets from PUBMED Central; produce "exact" and paragraph-sized "ideal" answers (summaries). The results of the systems that participated in the 2013 BIOASQ competition are promising. In Task 1a one of the systems performed consistently better from the NLM's MTI indexer. In Task 1b the systems received high scores in the manual evaluation of the "ideal" answers; hence, they produced high quality summaries as answers. Overall, BIOASQ helped obtain a unified view of how techniques from text classification, semantic indexing, document and passage retrieval, question answering, and text summarization can be combined to allow biomedical experts to obtain concise, user-understandable answers to questions reflecting their real information needs.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4450488	PMC
http://dx.doi.org/10.1186/s12859-015-0564-6	DOI Listing

Publication Analysis

Top Keywords

semantic indexing

biomedical semantic

question answering

2013 bioasq

mesh headings

benchmark datasets

overview bioasq

large-scale biomedical

indexing question

concise user-understandable

Similar Publications

From semantic knowledge to semantic features: French semantic feature production norms for 360 concepts.

Behav Res Methods

September 2025

Laboratoire de Psychologie, Université de Bordeaux, LabPsy UR 4139, 3 Place de la Victoire, 33076, Bordeaux Cedex, France.

Widad Rachdi , Stéphanie Mathey , Christelle Robert

This article presents a new set of semantic feature production norms, collected from 580 young adults, for 360 French concepts across various semantic categories. Although empirically derived feature norms have been developed for several languages and have been shown to be useful for investigating semantic memory and providing assessment tools, none are currently available for native French-speaking populations. In this study, the participants performed a property generation task in which they were asked to list features to describe the characteristics of each given concept (e.

View Article and Find Full Text PDF

Similar Publications

TRANSLATION, CULTURAL ADAPTATION, AND SEMANTIC VALIDATION OF THE PEDIATRIC NEUROGENIC BOWEL DYSFUNCTION SCORE (NDBS) INTO BRAZILIAN PORTUGUESE AND A PILOT STUDY.

Arq Gastroenterol

September 2025

Escola Superior de Ciências da Saúde, Fundação de Ensino e Pesquisa em Ciências da Saúde, Brasília, DF, Brasil.

Lenamaris Mendes Rocha Duarte , Ana Lúcia Ribeiro Salomon , Carmelia Matos Santiago Reis

Objectives: This study aimed to translate the Neurogenic Bowel Dysfunction Score into Brazilian Portuguese, adapting it culturally and validating it semantically.

Methods: The process followed international guidelines for translation, back-translation, cultural adaptation, and semantic validation, involving a committee of specialists and a pre-test with 10 Brazilian pediatric patients with neurogenic bowel dysfunction (mean age: 11 years). Participants were divided into two groups, depending on whether they used transanal irrigation for intestinal management.

View Article and Find Full Text PDF

Similar Publications

Retrieval-Augmented Generation for ICD-10 Coding in German Clinical Texts - A Technical Case Report.

Stud Health Technol Inform

September 2025

Department of Computer Science, Kempten University of Applied Sciences, Kempten, Germany.

Mario Krumscheid , Johannes Blömer , Matthias Becker

Introduction: Manual ICD-10 coding of German clinical texts is time-consuming and error-prone. This project aims to develop a semi-automated pipeline for efficient coding of unstructured medical documentation.

State Of The Art: Existing approaches often rely on fine-tuned language models that require large datasets and perform poorly on rare codes, particularly in low-resource languages such as German.

View Article and Find Full Text PDF

Similar Publications

ARCADE: Controllable Codon Design from Foundation Models via Activation Engineering.

bioRxiv

August 2025

Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, 15217, US.

Jiayi Li , Litian Liang , Shiyi Du , Shijie Tang , Hong-Sheng Lai

Codon sequence design is crucial for generating mRNA sequences with desired functional properties for tasks such as creating novel mRNA vaccines or gene editing therapies. Yet existing methods lack flexibility and controllability to adapt to various design objectives. We propose a novel framework, ARCADE, that enables flexible control over generated codon sequences.

View Article and Find Full Text PDF

Similar Publications

Enabling inclusive systematic reviews: incorporating preprint articles with large language model-driven evaluations.

J Am Med Inform Assoc

August 2025

Department of Biostatistics and Bioinformatics, Duke School of Medicine, Durham, NC, 27710, United States.

Rui Yang , Jiayi Tong , Haoyuan Wang , Hui Huang , Ziyang Hu

Objectives: Systematic reviews in comparative effectiveness research require timely evidence synthesis. With the rapid advancement of medical research, preprint articles play an increasingly important role in accelerating knowledge dissemination. However, as preprint articles are not peer-reviewed before publication, their quality varies significantly, posing challenges for evidence inclusion in systematic reviews.

View Article and Find Full Text PDF

Similar Publications