Statistics from Altmetric.com
- health policy
- organisation of health services
- health informatics
- information management
- information technology
‘Big data’ is defined by ‘7 V’s’: volume (most frequently cited1), velocity, veracity, variety, volatility, validity and value. In healthcare, ‘big data’ is associated with a step-change in the way information is gathered, analysed and used to facilitate disease management and prevention. With greater electronic data capture, there is enthusiasm for increased safety, efficiency and effectiveness in health and social care through, for example, machine learning and other forms of artificial intelligence (AI). However, factors maintaining and widening the gap between the promise and the reality need to be addressed.
Can ‘big’ be evidence-based?
Current best practice has its foundation in evidence-based healthcare, with growth in publications, but poorly managed scientific insights, poor recording of care and poor use of evidence.2 Big data could improve the status quo and support learning health systems (LHS).3
Computational methods can contribute to evidence management with automation of literature searching, critical appraisal and guidelines.4 5 Similarly, big data already contribute to aetiological, diagnostic, prognostic and therapeutic research, from -omics to electronic health records (EHR) trials.6 Critics emphasise lack of quality and validation of routinely collected clinical data, and risk of bias in observational studies, where scale cannot compensate for poor design.7 Conversely, data-driven approaches could transform a predominantly retrospective into a prospective or real-time paradigm, across disease boundaries.
Infrastructure and analytic tools are necessary but often poorly understood and underdeveloped. Automated extraction of necessary data fields in pseudonymised/anonymised format into curated warehouses is required with robust metadata catalogues and understanding of clinical context. To be available at the point-of-care for clinicians, data will have to be extracted, cleaned and processed promptly.
Preventing excesses and addressing deficiencies
Medicine grapples with conflicting challenges of overdiagnosis and overtreatment as well as underdiagnosis and undertreatment. Widening inequalities also manifest across many health systems, despite improving medications and technologies. Efforts to optimise healthcare delivery …
Contributors AB produced the original draft manuscript and all authors were responsible for revisions and the final version.
Competing interests AB has received honoraria from Novo Nordisk and Boehringer Ingelheim. The other authors have no competing interests to report.
Patient consent Not required.
Provenance and peer review Not commissioned; externally peer reviewed.
Data sharing statement No data were used for this manuscript.
Presented at This manuscript was influenced by the outcomes of three workshops which AB designed and coordinated on the theme of big data in healthcare: ‘Big data and Evidence-Based Medicine-the promises and the pitfalls’ at Evidence Live, Oxford in June 2016; ‘Learning health systems-the only way to do translational data science?’ at the International Population Data Linkage Network (IPDLN) Conference, Swansea in August 2016 and ‘Big data-part of the problem of over-diagnosis or part of the solution?’ at the Preventing Overdiagnosis Conference, Barcelona in September 2016.
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.