With 2.5 million deaths1,2 no age group other than the neonatal period has higher mortality whilst remaining largely unrecognized.3 Despite a consistent achievement in under-5 mortality, the Millennium Development Goal reached least for newborns.4 Neonatal mortality (during the first 28 days) still represents the most significant fraction of the under-five mortality and its contribution to infant mortality is even on the rise: 40% in 1990, 45% in 20155 and 47% in 2020.2 Regional differences, unrealistic numbers acquired under governmental pressure and artificially decreased neonatal mortality by late declaration of surviving newborns only6,7 put doubt on reported figures and likely underestimate official mortality figures. Although ‘late abortion’ rates should mirror such practice, these notoriously remain un- or underreported.6,8,9
Most neonatal deaths occur in low and middle-income countries (LMIC), close to 80% in two single areas: sub-Saharan Africa (43%) and Central & South Asia (36%).2 In 2019, neonatal mortality was reported 7 to 13 times higher in low compared to high-income countries.1 Close to 80% of all neonatal deaths are linked to three leading conditions, prematurity and low birth weight, perinatal complications and asphyxia, as well as sepsis and infection.5,10–12 However, associations are tightly interlinked and not necessarily causes. Prematurity, for instance, the most significant contributor to neonatal death, is not per se a treatable cause of death.13 Quality data on specific underlying or contributing morbidities, and precise circumstances of death are urgently needed, a prerogative for any targeted intervention. Neonatal mortality (the first 28 days) is a particularly narrow-defined temporal intervention target, and three quarters of deaths even concentrate during the first week of life.4,10 Mortality, however, is only the tip of a much greater underlying morbidity11,14–17 on which limited quality data is available in LMIC.18–22 Facing neonatal morbidity will also influence morbidity and mortality after the neonatal period, and long-term handicap and adult disease resulting from neonatal disease.
Whether due to lack of resources,23 political or academic interest, current neonatal LMIC data are frequently unavailable, incomplete, or non-standardized. Where data are available, they are generally collected retrospectively, regionally (clusters) or in areas of practicality, and extrapolated on a broader scale. Such data only poorly represent the intra- and inter-regional and national variabilities to geographically target health strategies.9,24 As a result, neonates are systematically unrepresented, even more so during epidemics and war. Despite top ranking in mortality, neonates usually get the least attention and the slimmest support of Maternal-Newborn and Child Health programs. Within these programs, reports rarely focus on anything other than crude neonatal mortality. Lack of quality data may be one of the reasons why financial supports largely skip this most vulnerable age group where the burden remains unmeasured and thus hard to defend. To promote, target, leverage, and follow-up interventions, standardized data representative of the specific health challenges of the neonatal populations in LMIC are urgently required.25 Such data must be locally available, context-adapted, and assembling feasible. Standardized data should in priority target south-south quality comparison and improvement. Our report is structured according to the Standards for Quality Improvement Reporting Excellence (SQUIRE 2.0).
To design and development a quality improvement dataset for prospective collection of standardized data on neonatal health and healthcare structures representative of LMIC.
METHODS
Context
Regionalization in high-competence centers is essential to improve population-based neonatal care,26 and requires excellence in terms of competence, education, and reputation. Today however, to treat newborns, many reference centers in LMIC do hardly have more staff, equipment, and infrastructure than lower-level centers, despite their obligation to accept the sickest. Delays in decision making in the community before referral, inappropriate transport conditions and delays at admission to the reference centers further worsen outcomes. The resulting very high mortality undermines the population’s trust in these reference centers and with it, often also government support.
Focusing on high-level healthcare facilities can generate high-quality data that supports public understanding of the risks leading to this concentrated mortality and morbidity and advocates government support in a virtuous quality improvement circle.
Intervention: development of variable sets
In 2018, 267 newborn variables collected through a literature search and from two established high-income neonatal databases, the Swiss Neonatal Network (SNN) data collection and Vermont Oxford Network (VON), were submitted to a two-step Delphi selection process. Experts were all except one, physicians or nurses with broad experience in LMIC neonatal care, 44% actively practicing in such countries. One expert was a neonatal database manager. Of the invited 42 experts, 28 answered to the first and 25 to the second Delphi round.
Experts graded variables according to feasibility, reliability, and representability. We retained variables with a mean greater than or equal to +1 SD of its respective subgroup (the 14 subgroups and rating scale for variable selection are presented in Appendices S1 and S2 in the Online Supplementary Document), and discarded if less than -2 SD. The in-between variables were re-submitted for the second round, together with experts’ suggestions of new items. The empirical target was a dataset of 60 clinical variables and 20 structural health care facility variables.
Variable definitions were maintained equivalent to the original databases (SNN, VON) whenever possible.
Study of the intervention: field testing
We identified three demographically diverse centers in Burkina Faso for field-testing and obtained local ethics committee clearance in August 2020. All centers cared for in- and out-born neonatal patients at high level (table 1). Two centers are in the capital city of Ouagadougou, one a university unit and the other a confessional maternity hospital with the highest premature admission rate in the country. The third is a maternity hospital in northern Ouahigouya.
We instructed part-time local data managers (1 midwife; 2 physicians) and provided continuous remote support with two-weekly checks. Training of local managers included one-to-one guided recording of 10 cases and regular focused discussions of problematic item definitions.
Data sets were entered ‘continuously’ into a spreadsheet according to the predefined written data definitions and structure. During the first 3 months, feed-back was provided ‘on demand’ often several times a week. Recurrent definition issues were discussed with participating centers, and definitions improved if necessary. In addition, they were invited to propose new variables that they felt were necessary. Data completeness/availability was also reviewed with local data managers and heads of units during three on-site visits to determine appropriateness of identified variables and common data handling strategies.
Measures: variable reliability assessment
At 3 months of data collection, a quality report of selected variables (table 2) was shared with participating centers. One of the Geneva investigators, a Burkina Faso national physician, visited all participating centers in January 2021 for consolidation of the definitions. Issues with data plausibility, structure, and some definitions were individually discussed based on concrete examples to confirm/optimize data completeness and quality. Systematic definition issues, detected from the mismatch between expected and reported results, were also clarified with clinicians on site.
Thereafter, improved definitions and local expertise allowed data submission in bunches every two weeks for an interim plausibility and quality check, and data back-up in Geneva. Co-operative epidemiological work rounds (virtual and on-site) were held on a 3-monthly schedule to smooth identified or potential hurdles.
After 12 months of data entry, we requested data managers to report time requirements for data gathering, recording and transmission, including the time for regular briefings. In addition, all questions/answers that circulated between Burkina Faso and Geneva essentially through smartphone messaging, were accounted for, estimating 5 minutes per message.
Analysis
The analysis focused on quality improvement of the database and did not evaluate patient or population outcome. Database content and quality were first analyzed quantitatively and qualitatively through a two-round Delphi process, followed by qualitative focus group evaluations, fed by individual problem-solving sessions.
Ethics considerations
The protocol was submitted to the Ethics Committee of Burkina Faso, specifying that anonymity of the patients would be preserved by means of an individual code and the participating centers would remain anonymous. We received written clearance without need to request patient consent, on August 12, 2020 (No. 2020-8-173). This work does not report patient data in any way, but merely used it to identify and improve variable definitions. Any personal patient data remained within the hospital files. Nominative data was available to participating local investigators for their patients only, and to the two primary investigators (PZ, RP) in Geneva during the test period. Participating centers gave their written agreement to participate and were anonymized for comparison of center data against both other centers. All authors and heads of participating centers gave consent for publication.
RESULTS
Variable Selection (phase 1)
The two-round Delphi analysis identified 71 patient variables (7 administrative and 64 clinical) with roughly one-third that required context-adapted re-definition from the original VON or SNN definitions, 2/7 (28%) for administrative and 27/64 (42%) for clinical variables.
For structural unit data, the Delphi ranking identified 20 variables, 9/20 (45%) identical to the VON and SNN definitions and 11/20 (55%) defined to account for typical LMIC unit structure characteristics.
Appendices S1-S2 in the Online Supplementary Document report all variable subgroups and rating criteria that identified the 71 clinical and administrative, and 20 structural variables submitted to the subsequent two improvement steps.
Primary variable improvement (phase 2)
On-site focus groups concluded after 3 months of data collection on addition and clarification of some clinical and administrative variables, yielding a total of 78 variables. One variable was entirely recoded, 14 variable definitions were adapted, 5 variables split into two, 4 variables merged into two, 5 variables deleted as generally unavailable, and 9 new variables added.
For the structural unit data, the definition of one variable was adapted, and one new variable was added based on a convergent needs-assessment of the three local co-researchers. As a result, the structural unit data are now composed of 21 variables. For details see Appendix S4 in the Online Supplementary Document.
Secondary variable improvement (phase 3)
After 9 months of data entry, it became clear that the sepsis definition needed specifications with perceived severity of the infection being an essential element. Therefore, antibiotic therapy duration and “circulatory compromise” were added. We also improved the variable definition of “pregnancy dating” and two unit structure variables. For details see Appendix S4 in the Online Supplementary Document.
Finally, our mND for LMIC is composed of 79 patient data, 73 clinical and 6 administrative. The additional 21 unit structure variables need update after unit modifications only (table 3).
Estimated worktime for data managers
After 12 months of use, overall 2039 newborns were included. The average data entry time for 100 newborn files was approximatively 40 hours (table 4).
DISCUSSION
Summary
Our standardized neonatal database (mND) is one of the first neonatal databases designed specifically for LMIC. A pilot project seems to be ongoing in Ethiopia, though little information and no published reference is available so far. Our mND neonatal database is inspired by neonatal databases used in high-income countries, such as the SNN and VON. After a selection and improvement process, it is composed of 79 clinical and 21 structural variables. Although we report the English version of the mND, the original version is in French. French-speaking African countries generally receive less international attention for linguistic reasons, limiting their access to the broader Anglo-Saxon support.
The strength of the developed mND is its context-adapted set of variables that remain internationally comparable for 2/3 of them with identical definitions. They were chosen for their availability and completed by context-representative and reliable variables for neonatal health in LMIC context. The dataset has been thoroughly field-tested with incremental quality improvement phases within three geographic areas of Burkina Faso with differing neonatal populations.
Interpretation
The mND provides a standardized frame for south-south comparison between neonatal healthcare facilities in LMIC, including French speaking units. It presently provides a tool for cross-sectional quality control to identify weaknesses and strengths in neonatal facility care. As neonatal units vary considerably and currently do not use international level of care definitions, the structural unit characteristics, developed explicitly for LMIC, support inter-unit comparability, and allow disease-targeted appreciation of structural deficiencies. Over time, longitudinal data will provide healthcare facilities with follow-up, particularly after corrective interventions.
When data entry is cumbersome and time consuming, data quality suffers. ‘Real-life’ studies have been done to assess the time required to complete a form and longitudinal studies are planned for exactly this purpose. The present spreadsheet entry required close to 40 hours of work per 100 cases. We opted within the testing phase to hire dedicated data managers on site and evaluated their required time investment, allowing comparison with future data entry developments. Indeed, we are currently working on facilitated data entry routines to replace the spreadsheet data entry with a branching algorithm that should provide more efficient and faster, whilst secure, electronic data entry.
The challenges of patient data confidentiality were solved through secondary anonymization during the testing and will now be encrypted with a clear separation of administrative patient identifications and clinical data. Data ownership entirely remains within participating centers.
Limitations
As increasing numbers of variables tends to reduce data quality, we empirically targeted 60 clinical variables. We finished up with 73 clinical and 6 administrative patient variables, a number slightly higher than planned, due to definition issues and local requests. In comparison, most databases in high-income countries, such as the SNN and the VON, present quite larger variable numbers and tend to increase their variable numbers over time. Within the database development process and field-testing, dedicated data managers on site and a tight feed-back loop with the central data management ensured variable definition and data quality. Extension of the mND to additional centers will need continuous quality monitoring by increasing automatic plausibility routines such as used in high-income databases.
For patient data, the Delphi selection of the variables was based on a broad contribution of diverse experts from various French-speaking LMIC. The final field-testing and definition improvement was performed in one single African country, Burkina Faso. This choice was favored by an excellent clinical, educational and academic interaction between the Geneva University Hospitals and the three Burkina Faso centers, allowing easy and frequent team exchanges thought essential for the test phase. Although all definition issues have been addressed extensively during field-testing with over 2000 clinical data entries, our mND may still have some limitations for generalizability. Some of the French and possibly locally developed definitions may not directly translate to other LMIC. However, two-thirds of the variables maintained “international” definitions used in high-income countries and already have English and French translations.
We acknowledge that structural data was tested on three units only. After the Delphi process, few adaptations were made to these variables during field-testing. As the units considerably differ in terms of geographical area, administrative context, and populations characteristics, we feel the structural variables to be representative too. However, the small number and single-country (Burkina Faso) field-testing may limit its generalizability.
CONCLUSIONS
In conclusion, we provide with our mND probably the first comprehensive, standardized neonatal database conceived and field-tested for LMIC, including French-speaking sub-Saharan Africa. It is a tool for comparative south-south quality control, for improved leverage on bottlenecks in neonatal care, support, and follow-up. Despite LMIC specificity, the elevated comparability with high-income neonatal databases allows for international comparability and development.
We are currently working on automated data plausibility controls and an online life streaming quality control output for participating centers. Gradual inclusion of additional centers will increase inter-center comparability and simultaneously increase center confidentiality. The use of comparative data for quality control, publications and research will follow an established request and acknowledgment process similar to the ones currently used by the SNN. A broad use of the data will be encouraged and supported by the Geneva University team with optional possibilities to add limited research variables for specific projects.
Our mND should provide, through high-quality data, a better understanding of in-hospital neonatal morbidity and mortality, allowing through south-south comparison, identification and follow-up of the most cost-effective interventions for each user’s healthcare setting.
Acknowledgements
We thank local data managers Francois Niada in Ouahigouya, Eulalie Kaboré, Maïmouna Samandoulougou and Fiacre Agbodossindji in Ouagadougou for data entry, regular feed-back and active variable improvement collaboration.
We are also grateful to our experts from Africa: Aminata Sow, Anne Ester Njom Nlend, Cyprien Kouakou, Daniele Kedy Koum, Eliane Kuissi, Evelyn Amine, Solange Ouédraogo, François Dianouni Niada, Jonas Ayeroue, Madeleine Folquet, Nadia Slitine, Serge Berlin Dzeukou, Mrs Endygil and experts from Switzerland: Catherine Jorgensen, Cristina Exhenry, Eric Giannoni, Flavia Rosa-Mangeret, Francisca Barcos, Jean-Claude Fauchère, Maria Julia Rodriguez, Marie Müller, Mario Gehri, Stephane Sizonenko, Thomas Berger et Vincent Muehlethaler, Myrtha Martinet et Sylvie Loiseau and Yann Levy-Jamet for their time given to rank and select the most relevant and available variables.
Finally, we thank Mark Adams, data-manager of the SNN for his support in establishing the original data set and participation to the Delphi analysis.
Disclaimer
none.
Ethics statement
The protocol was submitted to the Ethics Committee of Burkina Faso, specifying that anonymity of the patients would be preserved by means of an individual code and the participating centers would remain anonymous. We received written clearance without need to request patient consent, on August 12, 2020 (No. 2020-8-173). This work does not report patient data in any way, but merely used it to identify and improve variable definitions. Any personal patient data remained within the hospital files. Nominative data was available to participating local investigators for their patients only, and to the two primary investigators (PZ, RP) in Geneva during the test period. Participating centers gave their written agreement to participate and were anonymized for comparison of center data against both other centers. All authors and heads of participating centers gave consent for publication.
Data availability
Our database mND composed of 100 variables (79 clinical and administrative and 21 structural) figures in table 3 but individual data is not publicly available.
Funding
The project was supported by a Swiss Government grant (Bourse d’Excellence de la Confédération Suisse 2019.0800 / Burkina Faso / OP), and the non-profit 4earlylife Association Geneva, Switzerland for financially supporting local data managers. The content is solely the responsibility of the authors and does not reflect the view of the funding bodies. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.The APC was funded by Geneva University.
Authorship contributions
Design: ZP, SS, RP. Supervision: RP. Contact with local data managers: ZP. Contribution to the improvement of the variables: ZP, RP, SS, SO, FRM, PO. Writing: PZ, RP. Proofreading and corrections: RP, SO, FRM, SS, PO. All authors have read and approved the final manuscript.
Disclosure of interest
The authors completed the ICMJE Disclosure of Interest Form (available upon request from the corresponding author) and disclose no relevant interests.
RP is secretary of the non-profit Association 4earlylife, Geneva, an NGO with the sole aim to support neonatal health.
Additional material
Our article contains additional information as an Online Supplementary Document.
Correspondence to:
Pfister Riccardo E
Maternity Hospital
32 Boulevard de la Cluse, 1211 Geneva
Swizerland
[email protected]
[email protected]