Useful Resources

DATA

All versions of the datasets for all BioASQ tasks are available on the BioASQ Datasets page in the BioASQ Participants Area.

BioASQ task b: Biomedical Semantic QA (involves IR, QA, summarization)

5,386 English questions, annotated with relevant documents, snippets, "exact" and "ideal" answers.

BioASQ task Synergy: Biomedical Semantic QA for developing issues

369 questions on developing topics, incrementally annotated with assessed system responses.

BioASQ task MultiClinSum: Multilingual Clinical Summarization

Manual summaries of lengthy clinical case reports written in English, Spanish, French, and Portuguese.

BioASQ task BioNNE-L: Nested Named Entity Linking in Russian and English

700 documents in Russian and 100 in English with nested-NER annotations.

BioASQ task ELCardioCC: Clinical Coding in Cardiology

500 cardiology discharge letters in Greek, annotated with ICD-10 codes at both the document level and the concept-mention level.

BioASQ task GutBrainIE: Gut-Brain interplay Information Extraction

1,000 PubMed abstracts manually annotated with entity mentions, corresponding ontology concepts, and binary relations.

BioASQ Participants Area

TOOLS

HEMKit software (zip), a collection of hierarchical evaluation measures.
Continuous-space word vectors released by BioASQ, obtained by applying Word2Vec to PubMed abstracts.
BioASQ Annotation and assessment tools (tutorial)
BioASQ social network (tutorial)