This manuscript compares 30-day outcomes between sleeve gastrectomy and Roux-en-Y gastric bypass and concludes the two procedures are “equally safe.” That conclusion is not supported by the design. The study is a single-center retrospective cohort with non-randomized allocation; it can describe an association but cannot establish safety equivalence, and no equivalence framework or margin is specified.
Two issues are central to the claim and should be resolved before submission: a statistical-consistency error (a reported significant p-value that does not reconcile with the data, and Table 2 denominators that exceed the analytic N), and a design–claim mismatch (equivalence language from a study not designed or powered for equivalence). Four further findings are summarized below.
What the manuscript states, against what the evidence can support.
| Domain | Severity | Principal finding | |
|---|---|---|---|
| 01 | Design / claim fit | Serious | Equivalence language from a non-randomized retrospective cohort. |
| 02 | Results / conclusion alignment | Moderate | Abstract reframes a null secondary endpoint as “comparable.” |
| 03 | Statistical appropriateness | Serious | No multiplicity adjustment; no a-priori equivalence power. |
| 04 | Reporting guideline adherence | Mild | STROBE items 9 and 12c partially addressed. |
| 05 | Numerical / statistical consistency | Critical | Table 2 denominators exceed analytic N; p-value discordant. |
| 06 | Clinical interpretability / verdict | Moderate | Association plausible; safety equivalence is not. |
| Severity | Domain | Finding | Evidence | Locus |
|---|---|---|---|---|
| Critical | 05 · Consistency | Reported p = 0.04 does not reconcile with the table (recomputed 0.078); Table 2 denominators exceed the stated analytic N. | Deterministic | Central |
| Serious | 01 · Design | Safety-equivalence claim made from a non-randomized retrospective cohort with no equivalence margin. | Quote | Central |
| Serious | 03 · Statistics | Six co-primary comparisons tested without multiplicity control; study not powered for equivalence. | Quote | Central |
| Moderate | 02 · Alignment | Abstract describes a non-significant secondary endpoint as “comparable,” implying a positive finding. | Quote | Peripheral |
| Mild | 04 · Reporting | Missing-data handling and sources of bias not described per STROBE 12c / 9. | Checklist | Peripheral |
Recomputed directly from the manuscript's reported values.
“…no significant difference in 30-day readmission (p = 0.04).”Table 3 · sleeve (n=212) vs bypass (n=198)
Complication rates by group: 214 / 205 patients across two arms.Analytic cohort stated as N = 410
In order. Address the central findings first.
Calibrated to what the design supports — your wording, corrected.
“This study demonstrates that sleeve gastrectomy is equally safe as gastric bypass.”
“In this single-center cohort, 30-day complication rates were similar between procedures; residual confounding limits causal interpretation.”
“Readmission was comparable between groups.”
“Readmission did not differ significantly; the study was not powered to establish equivalence.”
What could be checked from the submitted files — and what could not.