- Open Access
2015, big data in healthcare: for whom the bell tolls?
© Van Poucke et al.; licensee BioMed Central. 2015
- Published: 2 April 2015
- Cloud Computing
- Readmission Rate
- Clinical Decision Support
- Data Mining Technique
- Control Vocabulary
The health care sector generates bountiful data around the clock, which can paradoxically complicate our quest for information, knowledge, and ‘wisdom’. It may be prudent for medical end-users to consider seriously a fundamental change that would allow us to gain full value from the ‘big data’ that the health care sector is generating. Proponents of the big data revolution suggest that the value for physicians rests on the added information provided by big data analysis. Indeed, supplementary information could clarify areas for improvement, such as optimization of treatments, reduction of adverse events and readmission rates, earlier identification of patients whose health is worsening, and more efficient identification of populations in need. Cloud computing has recently turned computing and software into commodity services, and such big data processing seems to be forging a technology revolution [3,4]. However, opponents of the big data revolution argue that validation and impact analyses of big data in health care are still in their infancy, and that approaches such as Google’s baseline study may therefore not be effective in preventing disease and may even lead to unnecessary, if not harmful, interventions.
The value of any kind of data is greatly enhanced when it exists in a form that allows for integration with other data. One problem with large data sets in general is the risk of ‘GIGO’ (garbage in, garbage out): the many errors of large-scale data capture must be ruled out through very careful and thoughtful investigation before any of the data can be used. Thus, an essential step for data integration is the annotation of multiple bodies of data using common controlled vocabularies or ‘ontologies’ that incorporate accurate representations of biological reality. Data mining in health care is not new, and initiatives for data acquisition, analysis, storage, and retrieval have all been presented before [8,9]. Yet, to our knowledge, no medical specialty has established a subcommittee addressing ontology.
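The annotation step described above can be illustrated with a minimal sketch. It assumes two hypothetical sources that record the same measurements under different local labels and in the same units; the concept codes, the `VOCABULARY` table, and the function names are invented placeholders, not identifiers from any real ontology or library. Records whose label cannot be mapped are set aside for review rather than merged, which is one simple guard against ‘GIGO’.

```python
from collections import defaultdict

# Hypothetical controlled vocabulary: local label -> shared concept code.
# The codes are invented placeholders, not real ontology identifiers.
VOCABULARY = {
    "hb": "CONCEPT:0001",
    "hgb": "CONCEPT:0001",
    "hemoglobin": "CONCEPT:0001",
    "haemoglobin": "CONCEPT:0001",
    "hr": "CONCEPT:0002",
    "heart rate": "CONCEPT:0002",
}


def annotate(records):
    """Attach a concept code to each record; return (annotated, unmapped)."""
    annotated, unmapped = [], []
    for record in records:
        # Normalize case and surrounding whitespace before the lookup.
        key = record["label"].strip().lower()
        code = VOCABULARY.get(key)
        if code is None:
            unmapped.append(record)  # keep for manual review, do not merge
        else:
            annotated.append({**record, "concept": code})
    return annotated, unmapped


def integrate(*sources):
    """Group values from all sources by shared concept code."""
    grouped = defaultdict(list)
    rejected = []
    for source in sources:
        annotated, unmapped = annotate(source)
        rejected.extend(unmapped)
        for record in annotated:
            grouped[record["concept"]].append(record["value"])
    return dict(grouped), rejected


# Two hypothetical sources using different local labels.
hospital_a = [{"label": "Hgb", "value": 13.2}, {"label": "HR", "value": 72}]
hospital_b = [{"label": "haemoglobin ", "value": 11.8},
              {"label": "temp?", "value": 37.1}]

merged, rejected = integrate(hospital_a, hospital_b)
print(merged)
print(rejected)
```

In this sketch the two hemoglobin values end up under one concept code even though the sources spell the label differently, while the unrecognized `temp?` record is returned separately instead of silently entering the merged data. A real ontology would also encode units and relationships between concepts, which this flat lookup table does not attempt.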
As clinicians, we apply general principles of risk stratification and risk modification to individual patients based on our education and experience. The proliferation of biomedical research makes it difficult to keep abreast of current knowledge, so clinical decision support technologies based on data mining techniques are knocking at our doors. Although their implementation seems inevitable, the lack of standardization persists. A dramatic paradigm shift toward controlled ontologies is needed in order to optimize the technologies that integrate big data into medical decision making and practice.
All authors (SVP, MT, AH) declare that this letter was written with no funding. No role was played by any funding body in the design, collection, analysis, and interpretation of data; in the writing of the manuscript; and in the decision to submit the manuscript for publication.
- Henry NL. Knowledge management: a new concern for public administration. Public Administration Rev. 1974;34:189–96.
- Groves P, Kayyali B, Knott D, Van Kuiken S. The ‘big data’ revolution in healthcare. Accelerating value and innovation. McKinsey Global Institute: New York, NY; 2013.
- Pathak J, Shah ND. Why health care may finally be ready for big data. Harvard Business Review. 2014. https://hbr.org/2014/12/why-health-care-may-finally-be-ready-for-big-data.
- Herland M, Khoshgoftaar TM, Wald R. A review of data mining using big data in health informatics. J Big Data. 2014;1:2.
- Diamandis EP. The hundred person wellness project and Google’s baseline study: medical revolution or unnecessary and potentially harmful over-testing? BMC Med. 2015;13:5.
- Raghavan D. Big data: not really the same as level 1 data. Oncology (Williston Park). 2015;29:70, 72.
- Smith B, Ashburner M, Rosse C, Bard J, Bug W, Ceusters W, et al. The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration. Nat Biotechnol. 2007;25:1251–5.
- Lee J, Scott DJ, Villarroel M, Clifford GD, Saeed M, Mark RG. Open-access MIMIC-II database for intensive care research. Conf Proc IEEE Eng Med Biol Soc. 2011;2011:8315–8.
- Dejam A, Malley BE, Feng M, Cismondi F, Park S, Samani S. The effect of age and clinical circumstances on the outcome of red blood cell transfusion in critically ill patients. Crit Care. 2014;18:487.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.