Data Aggregation of HEDIS Measures using ETL Technologies

Authors

  • Avinash Dulam

DOI:

https://doi.org/10.22399/ijcesen.5034

Keywords:

HEDIS Measures, ETL (Extract, Transform, Load), Healthcare Data Integration, Quality Performance Reporting, Data Warehousing

Abstract

The Healthcare Effectiveness Data and Information Set (HEDIS) has standardized performance measures. These measures aim to confirm that the health care services provided and utilized are of high quality. For reporting HEDIS measures, the data must be collected and aggregated from a variety of sources, including electronic health records (EHRs), claims data, pharmacy data, and lab information systems. This article uses Extract, Transform, Load (ETL) technologies to orchestrate data integration processes and provide trustable measure calculations. In the extract process, heterogeneous data sources are connected to the ETL pipelines by database interfaces, application programming interfaces (APIs), and streaming ingestion protocols, with security and compliance requirements. This phase includes cleansing and validation checks, as well as standardizing and recoding all of the different coding systems and heterogeneous constructs into a common structure that is consistent with the HEDIS specifications. The loading phase involves   complex calculations and performance reporting. Business benefits include improved data quality via data validation at the source, process efficiency via workflow automation and scalability, and better decision support with improved visibility for all stakeholders in real-time operational performance. Implementation considerations include enterprise platforms, open-source and cloud-native applications, interoperability via Fast Healthcare Interoperability Resources (FHIR) standards, and available software development kits (SDKs). As new technologies like real-time data processing, cloud computing, and AI-based validation develop, healthcare organizations will choose their reporting technology based on their size, technical skills, and available resources, while also improving data management and adapting to new standards and regulations in a data-heavy environment.

References

[1] Julia Adler-Milstein et al., "Electronic Health Record Adoption In US Hospitals: Progress Continues, But Challenges Persist,” Health Affairs Vol. 34, No. 12: Affordability, Access, Models Of Care, & More, 2015. Available: https://www.healthaffairs.org/doi/10.1377/hlthaff.2015.0992

[2] Sharon Silow-Carroll et al., "Using Electronic Health Records to Improve Quality and Efficiency: The Experiences of Leading Hospitals," Commonwealth Fund Issue Briefs 17:1-40 (2012). Available: https://www.researchgate.net/publication/230570249

[3] Kristiina Häyrinen et al., "Definition, structure, content, use and impacts of electronic health records: A review of the research literature," International Journal of Medical Informatics, 2008. Available: https://www.sciencedirect.com/science/article/abs/pii/S1386505607001682

[4] Clemens Scott Kruse et al., "Challenges and Opportunities of Big Data in Health Care: A Systematic Review," JMIR Publications Advancing Digital Health & Open Sciences, 2016. Available: https://medinform.jmir.org/2016/4/e38/

[5] Panos Vassiliadis et al., "A Survey of Extract-Transform-Load Technology," International Journal of Data Warehousing and Mining 5:1-27, 2009. Available: https://www.researchgate.net/publication/220613761_A_Survey_of_Extract-Transform-Load_Technology

[6] Moh'd Alsqour et al., "A survey of data warehouse architectures—Preliminary results," IEEE 2012 Federated Conference on Computer Science and Information Systems (FedCSIS), 2012. Available: https://ieeexplore.ieee.org/document/6354451

[7] Erhard Rahm and Hong Hai Do, "Data Cleaning: Problems and Current Approaches," Research Gate, 2000. Available: https://www.researchgate.net/publication/220282831

[8] Carlo Batini et al., "Methodologies for data quality assessment and improvement," AACM Computing Surveys (CSUR), Volume 41, Issue 3, 2009. Available: https://dl.acm.org/doi/10.1145/1541880.1541883

[9] I.R. Mansuri and S. Sarawagi, "Integrating Unstructured Data into Relational Databases," IEEE : 22nd International Conference on Data Engineering (ICDE'06), 2006. Available: https://ieeexplore.ieee.org/document/1617397

[10] Michael Armbrust et al., "A View of Cloud Computing," Communications of the ACM, Volume 53, Issue 4, 2010. Available: https://dl.acm.org/doi/10.1145/1721654.1721672

Downloads

Published

2026-03-11

How to Cite

Avinash Dulam. (2026). Data Aggregation of HEDIS Measures using ETL Technologies. International Journal of Computational and Experimental Science and Engineering, 12(1). https://doi.org/10.22399/ijcesen.5034

Issue

Section

Research Article