Introduction
Increasing attention is being been paid by doctors and researchers to gastroesophageal reflux disease (GERD). A large number of practical recommendations and guidelines for the diagnosis and treatment of GERD have been published in various countries [1–4].
The standard for the diagnosis of this disease is upper endoscopy and oesophageal pH monitoring [5], and oesophageal impedance-pH monitoring, as the most sensitive method of research because this allows the identification of not only acidic but also weakly acidic and gas refluxes, which also play a role in the onset of GERD symptoms [6–8]. However, the above methods of diagnosis are expensive, invasive, and not well tolerated by patients. This complicates their use in routine practice in the examination of patients with symptoms of GERD.
For standardisation of GERD definition, symptoms, diagnosis, and treatment approaches, the International GERD Consensus Group was created to develop a global definition and basic concepts for GERD. This group has been working for 2 years, and its results were outlined in the Montreal Consensus in 2006. The experts found it possible to establish a diagnosis of GERD and prescribe treatment without further examination, based on characteristic clinical symptoms (both oesophageal and non-oesophageal). The need for additional examination methods (upper endoscopy, pH- and impedance-pH monitoring) is determined by the physician, depending on the severity of the complaints and the presence of alarm symptoms [9].
A similar algorithm for the diagnosis of GERD is presented in the three-level reflux treatment guidelines (2008). At the first level (the self-care level), an available treatment option for GERD patients is self-administration of drugs for heartburn. If heartburn and regurgitation occur more than two times a week, patients are advised to contact their family doctor or therapist. In the presence of atypical symptoms, alarm symptoms, or ineffective treatment, the patient is directed to the third level, the gastroenterologist, who considers the feasibility of additional methods of examination and optimisation of treatment [10].
However, such an approach is fraught with difficulties associated with the subjective assessment of complaints by both the patient and the doctor. This is because in routine practice, there are patients with significant changes of the mucous membrane of the oesophagus but with unexpressed symptoms, or the patient is unaware of the possible complications and does not pay due attention to their symptoms [11, 12]. Conversely, sometimes a physician relying only on symptoms may misinterpret the severity of the disease and make diagnostic errors [11, 13]. This is facilitated by the combination of classical GERD symptoms with gastric dyspepsia symptoms, irritable bowel syndrome, and a weak correlation between endoscopic results and severity of symptoms.
All of the above require the development of an accurate and objective diagnostic technique that would be non-invasive and possible for use in everyday practice by both gastroenterologists and primary care physicians (GPs and therapists). The most appropriate variant of such a technique is questionnaires. It should be noted that such questionnaires should be sensitive and specific on one hand and short and easy to fill on the other, so as not to cause problems for the patient when filling them out.
A large number of questionnaires for the diagnosis of GERD have been developed, but most of them are used for scientific research. Among those that can be widely used and are currently used in practice are: GSRS (Gastrointestinal Symptom Rating Scale), GSFQ (Gastrointestinal Short Form Questionnaire) [14], CDQ (Carlsson-Dent Questionnaire), and GerdQ. In order to ensure the use of questionnaire data, they must be translated and validated.
Thus, in the Lyon Consensus 2018, clinical questionnaires for the diagnosis of GERD are considered based on the assessment of the presence and frequency of typical manifestations such as heartburn and regurgitation. While these instruments have some limitations compared to instrumental diagnosis of GERD, clinical diagnosis and treatment of GERD based on typical symptoms are pragmatic and well-established public guidelines. Among these, CDQ and GerdQ questionnaires are recommended as having similar diagnostic value [15].
The following questionnaires were selected for our study:
the GSRS (Gastrointestinal Symptom Rating Scale) is a classic questionnaire for patients with gastrointestinal disorders. As a result of a study in six countries, its usefulness for the evaluation of dyspeptic symptoms and reflux symptoms has been established; it has acceptable reliability and validity [16]. It consists of 15 questions grouped into five syndromes (clusters): abdominal, reflux, dyspeptic, diarrhoeal, and constipation. It is not specific for the diagnosis of GERD, but the presence of reflux syndrome allows its use;
the GerdQ (Gastroesophageal Reflux Disease Questionnaire) was chosen by us as the most promising because it was created on the basis of three other statistically valid questionnaires for the evaluation of reflux syndrome (GIS, GSRS, RDQ) as a result of a large-scale international DIAMOND study [17]. The purpose of this study was to create a questionnaire and prove its diagnostic value by comparing it with the results of instrumental examinations (upper endoscopy and pH monitoring). This questionnaire has shown its effectiveness in many studies [18–20]. In 2013 it was translated into Russian and validated for use [21].
As already mentioned, the data from the questionnaires were evaluated in comparison with the oesophageal pH monitoring data and/or upper endoscopy results [19, 20, 22–27], but not with the results of oesophageal impedance-pH monitoring, which allows the identification of weakly acidic and gas refluxes and is currently one of the best and most informative instrumental methods for the diagnosis of pathological gastroesophageal reflux.
Aim
The aim of the study was to study the diagnostic value and provide comparative analysis of GerdQ and GSRS questionnaires regarding the GERD diagnostics based on a comparison of the questionnaires with the results of intraluminal oesophageal impedance-pH monitoring in Ukraine.
Material and methods
Twenty-eight patients (11 men and 17 women with a mean age of 47 ±2.4 years and a mean body mass index (BMI) 25.6 ±1.1 kg) were enrolled in the study. All patients, after an explanation and after giving their informed consent, filled out the GerdQ and GSRS questionnaires, which included questions about their well-being during the last 7 days, after which the questionnaire scores were evaluated and calculated. While processing the GerdQ questionnaire, a total questionnaire score and a separate score on each of the six questions were noted: presence and frequency of heartburn, regurgitation, epigastric pain, nausea, heartburn at night, and frequency of medication intake. In the GSRS questionnaire all five syndromes were evaluated separately: abdominal, reflux, dyspeptic, diarrhoeal, and constipation.
According to the results of the GerdQ questionnaire, we identified 12 patients who had ≥ 8 points, which is a criterion for GERD, and 16 patients who had < 8 points, which was insufficient to establish a diagnosis of GERD in them.
All patients underwent 24-hour multichannel intraluminal oesophageal impedance-pH monitoring. We used a computer-based impedance-pH monitoring system, Acidogastrograph AG-3pH-4Z, developed by the medical-engineering staff of the Department of Internal and Family Medicine, National Pirogov Memorial Medical University, Vinnytsya, led by Prof. Viacheslav Chernobroviy. The microprobe electrode used in this study had a pH sensor and six impedance sensors (Figure 1). The microprobe was introduced transnasally, so that the pH sensor was located in the oesophagus 5 cm above the lower oesophageal sphincter (LOS), at the same time impedance sensors were formed by four segments of registration (3 cm, 5 cm, 7 cm, and 12 cm above the LES).
The results of the 24-hour oesophageal impedance-pH monitoring were evaluated as follows:
number of episodes of acid refluxes (pH < 4);
number of episodes of weakly acidic refluxes (pH 4–7);
number of episodes of weakly alkaline refluxes (pH > 7);
number of episodes of gas refluxes;
number of episodes of all liquid refluxes;
number of episodes of all mixed reflux (gas – liquid);
number of episodes of acid reflux lasting more than 5 min;
total time with oesophageal pH < 4 (acid exposure time);
total time with oesophageal pH > 7.
The results of the monitoring are presented in graphical and digital conclusions (Figure 2).
Statistical analysis
Statistica 7.0 (StatSoft Inc.) software was used for the statistical analysis. While assessing the study data, in addition to the descriptive statistical methods (mean, standard deviation, Student’s t-test, Mann-Whitney U test, Spearman’s Rank correlation coefficient). The sensitivity, specificity, and diagnostic accuracy of the method were calculated using the classical method of two-way contingency tables. The p-value < 0.05 was considered significant.
Results
Relationship between the results of the GerdQ and GSRS questionnaires and the results of the 24-hour oesophageal impedance-pH monitoring
We evaluated the correlation between the results of the 24-hour oesophageal impedance-pH monitoring and the total score of the GerdQ questionnaire. According to the data obtained, a statistically significant strong positive correlation was established between the frequency of episodes of acid refluxes (pH < 4) and the total points of the GerdQ questionnaire (rs = 0.796), between the frequency of episodes of all fluid refluxes and the total points of the GerdQ questionnaire (rs = 0.730), and between AET and the total score of the GerdQ questionnaire (rs = 0.793). While assessing reflux syndrome according to the GSRS questionnaire, a statistically significant moderate correlation was established between the acid reflux episodes (pH < 4) and the score of the reflux syndrome (rs = 0.530), between the episodes of all fluid refluxes and the score of the reflux syndrome (rs = 0,598), and between AET and the reflux syndrome score (rs = 0.560). The data are shown in Table I.
Table I
GerdQ total score and 24-hour impedance-pH monitoring results
As a result of the study, a statistically significant difference (p < 0.05) was established between the mean values of acid refluxes (55.38 ±6.12 in the GERD group according to the GerdQ questionnaire vs. 23.29 ±6.53 in the almost healthy group, according to the GerdQ questionnaire), gas refluxes (1.46 ±0.43 vs. 3.46 ±0.86, respectively), total fluid refluxes (71.15 ±8.37 vs. 33.8 ±5.97, respectively), AET (12.42 ±3.65 vs. 3.00 ±0.92, respectively), and percentage of the time with pH > 7 in the oesophagus (12.13 ±27.42 vs. 31.5 ±5.43, respectively). The data are shown in Table II.
Table II
Relationship between the different issues addressed by GerdQ and results of 24-hour impedance-pH monitoring
We also sought to identify questions from the GerdQ questionnaire that would be sensitive to 24-hour oesophageal impedance-pH monitoring results. For this purpose, the correlation coefficient between the patients’ responses to each of the six questions on the GerdQ questionnaires and the 24-hour oesophageal-impedance pH-monitoring results was determined. A statistically significant strong correlation was found between the AET and the incidence of heartburn (rs = 0.74). Correlation of medium strength was found between acid refluxes and the incidence of heartburn (rs = 0.66), regurgitation (rs = 0.66), epigastric pain (rs = 0.56), and nocturnal heartburn (rs = 0.55); between the total number of fluid refluxes and the incidence of heartburn (rs = 0.50), and regurgitation (rs = 0.66); the number of prolonged acid refluxes (> 5 min) and the incidence of nausea (rs = 0.57); and between the AET and the incidence of regurgitation (rs = 0.51), epigastric pain (rs = 0.57), and nocturnal episodes of heartburn (rs = 0.51). A weak correlation was found between the number of acid refluxes and the incidence of nausea (rs = 0.41); the total number of fluid refluxes and the incidence of epigastric pain (rs = 0.44), nausea (rs = 0.41), and nocturnal episodes of heartburn (rs = 0.48); and between the AET and the incidence of nausea (rs = 0.40). Other indicators did not show a statistically significant relationship (Table III).
Table III
GerdQ Questionnaire sensitivity and specificity
While using the GerdQ questionnaire, GERD was diagnosed (total score ≥ 8) in 12 patients out of 28 (42.9%). Subsequently, in 11 of them the diagnosis of GERD was confirmed with 24-hour oesophageal impedance-pH monitoring, and in 1 patient was excluded (Table IV). In 16 patients, GERD was not diagnosed (score < 8). The percentage of patients with an overall score of ≥ 8 among those who had instrument-confirmed GERD (11 out of 14) was 78.6% – this is the sensitivity of the questionnaire. The percentage of patients with a score of < 8 (13) among those who had GERD excluded by the instrumental method (14) was 92.9% – this is the specificity of the GerdQ questionnaire. The accuracy of the questionnaire (diagnostic efficiency) was 85.7%.
Discussion
The greater correlation for the GerdQ questionnaire suggests that this questionnaire is more sensitive to GERD compared to GSRS. The high level of correlation between the sum of GerdQ questionnaire scores and the key indicators of the 24-hour impedance-pH monitoring indicates that the questionnaire is sufficiently sensitive to the presence of pathological acidic and fluid refluxes, and also that it exceeds the time limit during which the pH in the oesophagus is < 4 (AET) and can be useful for routine diagnosis of GERD. Our results suggest that the frequency of reflux symptoms (from the GerdQ questionnaire) is more correlated with abnormal reflux than with the intensity of GSRS reflux complaints. The correlation between individual issues addressed by the questionnaire and 24-hour oesophageal impedance-pH monitoring data has less strength than the total score of the GerdQ questionnaire, which indicates that the total questionnaire score is more informative when used for the purpose of diagnosing GERD.
The results of our study are similar to those found in the validations and baseline estimates of GerdQ in other populations. For example, in the initial study where the questionnaire was developed (n = 308 subjects), a score ≥ 8 showed a sensitivity and specificity of 64.6% and 71.4%, respectively [17]. In a study by Lacy et al. [22], where the comparison of the questionnaire was conducted with 24-hour Bravo pH monitoring, sensitivity and specificity were indicated at 71% and 41%, respectively. Such low rates are probably related to the fact that the study was conducted at the tertiary level, in which there is a large percentage of patients with refractory GERD, patients with atypical GERD, and patients with functional heartburn. The results of the study by Suzuki et al. indicate a sensitivity of the GerdQ questionnaire of 34.3% and specificity of 82.5% when comparing the results with those of upper endoscopy. In a study by Zavala-Gonzales et al. [25] comparisons of the results of the questionnaire with the results of endoscopy and/or pH monitoring of the oesophagus were performed, indicating the GerdQ sensitivity and specificity to be 71.6% and 72.2%, respectively. The results of a multicentre study in Russia showed a sensitivity of 65.4% and a specificity of 91.7%. In this study, upper endoscopy and pH monitoring were used as comparative methods [21]. The somewhat higher sensitivity and specificity of GerdQ in our study compared to similar studies is, in our opinion, a result of comparing the results of the questionnaire with impedance-pH monitoring of the oesophagus, which gives a more accurate assessment of pathological GER, and including a relatively small number of patients in the study.
Finally, it is important to note that, although GerdQ is a useful diagnostic tool, it should not be considered as a unique diagnostic test. It can, however, be used as a baseline test in the absence of alarm symptoms. It can also be used to control the therapeutic effect of treatment of GERD, but in the absence of response to treatment it is necessary to confirm the diagnosis of GERD by impedance-pH monitoring and/or endoscopy, respectively.
Conclusions
The use of the GerdQ questionnaire for the diagnosis of GERD is optimal in everyday practice for deciding whether to prescribe ex juvantibus therapy, or whether to use instrumental examination methods (upper endoscopy, oesophageal pH monitoring, oesophageal impedance-pH monitoring) because it showed a significantly strong correlation between the total score of the questionnaire and the key indicators of 24-hour impedance-pH monitoring (AET, acid reflux episodes, fluid reflux episodes). Our results suggest that the frequency of reflux symptoms (from the GerdQ questionnaire) is bettter correlated with abnormal reflux than the intensity of GSRS reflux complaints.
The correlation between the evaluation of individual issues of GerdQ questionnaire and impedance-pH monitoring data was weaker than the total score, which indicates the advantage of using the total questionnaire score for diagnostic purposes in order to obtain the most reliable results.
As a result of this study, the GerdQ questionnaire showed a sensitivity of 78.6%, specificity of 92.9%, and accuracy (diagnostic efficiency) of 85.7%. It should be noted that the patients who participated in the study were patients who had been referred to a secondary level of care.