Title: Biomedical text summarisation using concept chains

Authors: Lawrence H. Reeve, Hyoil Han, Ari D. Brooks

Addresses: College of Information Science and Technology, Drexel University, Philadelphia, PA, USA. ' College of Information Science and Technology, Drexel University, Philadelphia, PA, USA. ' College of Medicine, Drexel University, Philadelphia, PA, USA

Abstract: BioChainSumm is a biomedical text summariser utilising concept chaining (called BioChain) to link semantically-related concepts within biomedical text together. The BioChain process is adapted from existing lexical chaining approaches which chain semantically-related terms rather than concepts. The BioChain concept chains are used to identify salient candidate sentences which are extracted to produce a summary of the biomedical text. The Unified Medical Language System Metathesaurus and Semantic Network semantic resources identify related biomedical concepts. BioChainSumm is evaluated using the ROUGE system along with several existing, publicly-available summarisers. Our results show BioChain provides a promising methodology for biomedical text summarisation.

Keywords: text summarisation; concept chaining; lexical chaining; biomedical text; data mining; bioinformatics; semantics.

DOI: 10.1504/IJDMB.2007.012967

International Journal of Data Mining and Bioinformatics, 2007 Vol.1 No.4, pp.389 - 407

Published online: 02 Apr 2007 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article