Analysis & ranking
PHASE 2 — Evidence and Impact Analysis
Article 1 — Population-scale genomic medicine with the Hong Kong Genome Project
PMID 42141198 | Nat Med | triage_score: 9
| Dimension | Score | Rationale |
|---|---|---|
| Scientific Novelty | 9 | First population-scale WGS reference cohort for a Chinese population; identification of 38 clinically relevant genes absent from European panels is genuinely landmark |
| Clinical Relevance | 8 | 25% rare disease diagnostic yield is immediately actionable; pharmacogenomics affects nearly universal prescription burden; panel redesign implications are concrete |
| Population Reach | 9 | Directly relevant to ~1.4B Chinese-heritage individuals globally; broader implications for all non-European populations underrepresented in genomic reference databases |
| Implementation Speed | 6 | Pharmacogenomic guidance can be adopted relatively quickly; rare disease diagnostic pipeline integration requires infrastructure; panel redesign is a medium-term effort |
| Evidence Strength | 8 | Large N (20,488), peer-reviewed in Nature Medicine, robust design; abstract-only access limits full methodological scrutiny |
Key quantitative result: 25% rare disease diagnostic yield; 38 population-specific actionable genes absent from European panels; near-universal pharmacogenomic actionability across ~1M annual prescriptions.
External validation: The population-scale design is itself a form of internal validation; independent replication in separate Chinese cohorts not yet reported from this paper.
Main limitation: Abstract-only access; European-to-Chinese panel gap characterization methodology not fully assessable; Hong Kong Chinese may not fully represent mainland Chinese or overseas Chinese diaspora diversity.
Equity implications: Highly positive — directly corrects a major structural inequity in genomic medicine where European-ancestry data dominates. Chinese, and by extension all Asian and non-European populations, are the principal beneficiaries. European populations are not disadvantaged.
Evidence Maturity: ✅ Confirmed — Validated (landmark observational cohort, peer-reviewed, Nature Medicine)
Article 2 — Multicentric data challenge for AI-based classification of leukocytes: CytologIA
PMID 42141020 | NPJ Precis Oncol | triage_score: 9
| Dimension | Score | Rationale |
|---|---|---|
| Scientific Novelty | 8 | First large-scale, open, multicentric benchmark for leukocyte AI classification; open data release and reproducibility infrastructure is genuinely new for the field |
| Clinical Relevance | 7 | Directly addresses inter-laboratory variability in peripheral blood smear morphology — a real and documented clinical problem; deployment still requires regulatory pathway and lab integration |
| Population Reach | 7 | Hematology diagnostics are universal; CBC and differential are among the most ordered tests globally; impact is diffuse but broad |
| Implementation Speed | 7 | Open dataset + validated model enables rapid laboratory adoption; regulatory clearance (FDA/CE) is the primary bottleneck, but the technical foundation is now in place |
| Evidence Strength | 8 | 69,168 images, 245 teams, multicentric (France/Belgium/Switzerland), hidden test set design, open data release — methodologically rigorous benchmark |
Key quantitative result: Top model (YOLOX + transformer/CNN ensemble) achieved balanced accuracy 0.94 on hidden test set vs. 0.82 for baseline CNN — a clinically meaningful improvement across 23 leukocyte classes.
External validation: Multicentric design with independent hidden test set constitutes genuine external validation; open dataset enables community replication.
Main limitation: Benchmark performance on curated annotated images may not reflect real-world clinical smear quality, staining variability, or rare morphology edge cases; abstract-only access.
Equity implications: Open data release is equity-positive — lower-resource labs globally can access validated models without proprietary licensing. Annotation set drawn primarily from Western European centers, which may not capture morphological distributions of hematological conditions more prevalent elsewhere.
Evidence Maturity: ✅ Confirmed — Validated (benchmark study with held-out test set, peer-reviewed)
Article 3 — GLP-1RA and dual GLP-1RA/GIP on liver stiffness and steatosis: meta-analysis
PMID 42141180 | Intern Emerg Med | triage_score: 8
| Dimension | Score | Rationale |
|---|---|---|
| Scientific Novelty | 6 | GLP-1RA hepatic benefit is established in principle; Fibroscan-specific quantification of stiffness and steatosis adds methodological specificity but is incremental over prior meta-analyses |
| Clinical Relevance | 8 | MASLD affects ~32% of adults globally; GLP-1RAs are already in widespread use; this meta-analysis directly supports prescribing decisions and guideline expansion for hepatic indications |
| Population Reach | 9 | MASLD is a global epidemic; any therapeutic guidance affects hundreds of millions of patients worldwide |
| Implementation Speed | 9 | Drugs are already approved and in use; Fibroscan monitoring is available in most hepatology centers; clinical uptake of this evidence is immediate |
| Evidence Strength | 6 | PROSPERO-registered meta-analysis is a strength; only 5 studies/9 trials, N=824, modest SMD effect sizes (−0.3, −0.4); heterogeneity of underlying studies and abstract-only access are limitations |
Key quantitative result: Liver stiffness SMD = −0.3 (95% CI −0.5 to −0.1, p=0.02); liver steatosis SMD = −0.4 (95% CI −0.7 to −0.1, p=0.03).
External validation: Meta-analytic design pools existing trials; PROSPERO registration confirms prospective protocol; small number of included studies limits robustness.
Main limitation: Only 5 studies included; heterogeneity in underlying trial designs, drug types, and comparators; small-to-moderate SMD effect sizes; cannot distinguish GLP-1RA from dual GIP/GLP-1 effects separately.
Equity implications: GLP-1RAs are expensive and access is unequal globally; MASLD disproportionately affects lower-income and minority populations who may have the least access to these drugs. The evidence strengthens the case for hepatic indications, which could support formulary inclusion and guideline expansion.
Evidence Maturity: ✅ Confirmed — Validated (meta-analysis of RCTs, PROSPERO registered)
Article 4 — SGLT-2 inhibitors on epicardial adiposity and LV function: Malaysian EpiCAD study
PMID 42141141 | Sci Rep | triage_score: 8
| Dimension | Score | Rationale |
|---|---|---|
| Scientific Novelty | 6 | SGLT-2i cardioprotection is established; epicardial adipose tissue as a mechanistic endpoint is novel but has prior literature; Asian-specific population data adds incremental novelty |
| Clinical Relevance | 7 | Provides mechanistic evidence for SGLT-2i in T2DM+CAD; EAT and LV remodeling endpoints are clinically meaningful; contributes to Asian-population prescribing evidence base |
| Population Reach | 8 | T2DM+CAD is one of the most prevalent comorbidity pairings globally; Asian populations are underrepresented in SGLT-2i RCT data |
| Implementation Speed | 8 | SGLT-2i already in use; results support existing practice with mechanistic nuance rather than requiring new drug approval |
| Evidence Strength | 6 | Prospective observational with pre/post analysis; no randomization; n=302 (151/arm); confounding cannot be fully excluded; abstract-only |
Key quantitative result: EAT −1.5mm vs. +0.6mm controls (p<0.001); LVEF +4.6% vs. −1.0%; LV mass −20g vs. +7.8g (p≤0.01).
External validation: Single-center Malaysian study; not independently replicated.
Main limitation: Observational design with no randomization; selection bias possible; single-center; abstract-only; 6-month follow-up may not capture long-term remodeling outcomes.
Equity implications: Asian-population-specific evidence is equity-positive; addresses a gap in major RCTs dominated by Western cohorts. Malaysian patients with T2DM+CAD are a high-burden group with distinct cardiometabolic phenotypes.
Evidence Maturity: ✅ Confirmed — Validated (prospective, paired pre/post, well-powered for its design class)
Article 5 — CD19/BCMA CAR-T for anti-HLA desensitization in acute leukemia pre-HSCT
PMID 42141373 | HLA | triage_score: 8
| Dimension | Score | Rationale |
|---|---|---|
| Scientific Novelty | 9 | Genuinely novel application of CAR-T to B-cell depletion for HLA desensitization — no prior published series exists; conceptually innovative re-purposing of approved technology |
| Clinical Relevance | 7 | Directly addresses a critical transplant barrier for sensitized patients; 43% achieving >75% DSA reduction is clinically meaningful; small n limits confidence |
| Population Reach | 5 | HLA-sensitized leukemia patients awaiting HSCT are a small but high-stakes population; relative to the rare disease reference frame, this is high unmet need |
| Implementation Speed | 4 | Pilot only; requires larger trials, regulatory pathway, and specialized CAR-T manufacturing infrastructure |
| Evidence Strength | 4 | n=7, single-arm, no control; pilot-level evidence; no randomization; 86% CAR-T expansion rate and toxicity data are promising but not confirmatory |
Key quantitative result: DSA median MFI 15,797 → 3,831 (p<0.001); 71% showed anti-HLA-I MFI decline; 43% >75% reduction; no grade ≥3 CRS or neurotoxicity.
External validation: None; first-in-class pilot.
Main limitation: n=7; single-arm; no control arm; highly selected patients; abstract-only; safety events in larger cohorts unknown.
Equity implications: Sensitized patients are disproportionately women (alloimmunized through pregnancy) and previously transfused patients including sickle cell disease populations. Successful desensitization would particularly benefit these groups.
Evidence Maturity: ✅ Confirmed — Exploratory (pilot, n=7)
Article 6 — HESpotEx: dual-stream deep learning for spatial gene expression from histology
PMID 42141326 | Nat Comput Sci | triage_score: 8
| Dimension | Score | Rationale |
|---|---|---|
| Scientific Novelty | 9 | Predicting 5,457-gene spatial expression profiles from routine H&E slides is a substantial leap; could democratize spatial transcriptomics at scale |
| Clinical Relevance | 4 | Currently a computational tool validated on TCGA; clinical deployment requires prospective validation, regulatory pathway, and EHR integration; not yet practice-changing |
| Population Reach | 6 | Potentially affects all solid tumor patients who undergo histopathology (very large), but only if clinically validated |
| Implementation Speed | 3 | Multiple validation steps, workflow integration, and regulatory requirements separate this from clinical use |
| Evidence Strength | 6 | Validated on TCGA and multiple cancer cohorts; computational benchmark is rigorous; no prospective clinical outcome validation |
Key quantitative result: Predicts up to 5,457 genes from WSIs; superior performance vs. prior methods; specific metrics not extractable from abstract-only.
External validation: Multi-dataset TCGA validation constitutes computational external validation.
Main limitation: No clinical outcome validation; computational performance metrics do not directly translate to diagnostic or prognostic utility; abstract-only.
Equity implications: If validated, dramatically lowers cost barrier to spatial molecular profiling — currently a >$1,000/sample technology — benefiting lower-resource settings globally.
Evidence Maturity: ✅ Confirmed — Exploratory (computational validation only)
Article 7 — Multimodal ML for deep stromal invasion in cervical cancer
PMID 42141332 | Insights Imaging | triage_score: 7
| Dimension | Score | Rationale |
|---|---|---|
| Scientific Novelty | 7 | Fusion of MRI radiomics + NLP/BERT report features + clinical variables is methodologically innovative; SHAP explainability integration is a practical strength |
| Clinical Relevance | 6 | Preoperative staging in early cervical cancer is clinically critical; model could reduce unnecessary radical surgery; limited by very small external cohort |
| Population Reach | 6 | Cervical cancer is the 4th most common cancer in women globally; concentration in low/middle-income countries where MRI access is limited |
| Implementation Speed | 3 | External validation n=20 is insufficient for any clinical deployment; requires large multicenter prospective validation |
| Evidence Strength | 4 | Internal AUC 0.912 is strong but external validation n=20 is critically inadequate; retrospective design; abstract-only |
Key quantitative result: AUC 0.912/0.874/0.890 (training/internal/external validation).
Main limitation: External validation n=20 — too small to confirm generalizability; retrospective; abstract-only.
Equity implications: Cervical cancer disproportionately affects women in LMICs; an AI staging tool could improve surgical decision-making, but MRI access barriers in these settings are a major deployment constraint.
Evidence Maturity: ✅ Confirmed — Exploratory
Article 8 — SDoH and racial disparities in GI cancer patients
PMID 42141357 | J Racial Ethn Health Disparities | triage_score: 7
| Dimension | Score | Rationale |
|---|---|---|
| Scientific Novelty | 5 | SDoH disparities in cancer are well-documented; this study adds quantification with All of Us data and a GI cancer focus; incremental rather than breakthrough |
| Clinical Relevance | 6 | Provides actionable data for care navigation and equity interventions; cross-sectional design limits causal inference |
| Population Reach | 8 | GI cancers are among the most common; racial disparities in cancer outcomes affect millions; All of Us cohort is nationally representative |
| Implementation Speed | 7 | SDoH screening and navigation programs can be implemented now with existing infrastructure; data supports justification for resource allocation |
| Evidence Strength | 6 | N=1,831 with patient-reported SDoH data from validated All of Us platform; cross-sectional design cannot establish causality; abstract-only |
Key quantitative result: Food insecurity 23.6% vs 5.8% (non-White vs NHW); housing issues 41.8% vs 21.8%; delayed care 47.1% vs 26.8%.
Main limitation: Cross-sectional; selection bias in SDoH survey completion (only 1,831 of 6,620); cannot establish causal pathway from SDoH to outcomes.
Equity implications: This article is fundamentally about equity — it quantifies disadvantage and provides a roadmap for targeted intervention. Non-White GI cancer patients are the at-risk group most likely to benefit from resulting policy changes.
Evidence Maturity: ✅ Confirmed — Validated (well-powered cross-sectional with established dataset)
Article 9 — PRSS3 as biomarker for pneumoconiosis via plasma proteomics and ML
PMID 42141418 | Clin Proteomics | triage_score: 7
| Dimension | Score | Rationale |
|---|---|---|
| Scientific Novelty | 7 | Novel application of 1,239-protein plasma panel + ML to occupational lung disease grading; PRSS3 is a previously unreported pneumoconiosis biomarker |
| Clinical Relevance | 4 | Occupational medicine niche; AUC=1.00 in training raises serious overfitting concern; not clinically actionable yet |
| Population Reach | 5 | Coal workers' pneumoconiosis is a global occupational disease; ~250,000 prevalent cases in China alone; high unmet need in occupational medicine |
| Implementation Speed | 3 | Small N, potential overfitting, no external validation; requires independent replication before any clinical consideration |
| Evidence Strength | 3 | N=158; AUC=1.00 in training strongly suggests overfitting; classification_confidence = medium; abstract-only |
Key quantitative result: PRSS3 AUC=1.00 (training, dust-exposed vs CWP-I) — interpret with extreme caution due to likely overfitting.
Main limitation: N=158; AUC=1.00 is a red flag for overfitting in this small training set; no independent external validation cohort reported; abstract-only.
Equity implications: Coal workers globally, predominantly in China, India, and Eastern Europe, are an underserved occupational population; a validated early-detection tool would have significant equity value if confirmed.
Evidence Maturity: ⬇️ Revised to Exploratory (downgraded from triage maturity; overfitting concern and small N warrant caution)
Article 10 — ML model for MASLD identification from routine labs (NHANES + KNHANES)
PMID 42141301 | Clin Exp Med | triage_score: 7
| Dimension | Score | Rationale |
|---|---|---|
| Scientific Novelty | 5 | ML-based MASLD screening from routine labs is an active field; Extra Trees with cross-cultural validation adds methodological rigor but is incremental |
| Clinical Relevance | 7 | A validated, interpretable MASLD screening tool using routine labs could transform primary care screening without imaging |
| Population Reach | 9 | MASLD affects ~25–32% of adults globally; US + Korean external validation increases generalizability |
| Implementation Speed | 6 | Routine lab inputs are already available; SHAP interpretability supports clinical decision support integration; regulatory clearance needed |
| Evidence Strength | 7 | N=3,944 with cross-national external validation (NHANES + KNHANES); Extra Trees outperforms 10 competing algorithms; SHAP interpretability; abstract-only limits full assessment |
Key quantitative result: AUC 0.879 internal (0.856–0.897), 0.822 external (KNHANES).
Main limitation: Self-reported variables in NHANES; MASLD diagnosis by imaging may vary by standard; no prospective clinical outcome validation; abstract-only.
Equity implications: A routine-lab-only MASLD screening tool has high equity potential — removes imaging cost barriers and can be deployed in primary care and low-resource settings globally.
Evidence Maturity: ✅ Confirmed — Validated (external cross-national validation)
Article 11 — Mixed neuropathologies in Down syndrome with/without Alzheimer's dementia
PMID 42141233 | Acta Neuropathol | triage_score: 7
| Dimension | Score | Rationale |
|---|---|---|
| Scientific Novelty | 7 | CAA dominance (84%) over pure ADNC (29%) in DS is a significant finding; resilience phenotype in DS individuals without dementia is novel and potentially important |
| Clinical Relevance | 5 | Autopsy-based; immediate clinical translation limited; reshapes understanding of AD therapeutic targeting in DS; relevant for clinical trial stratification |
| Population Reach | 4 | ~6 million people with DS globally; rare disease reference frame makes this moderate-to-high reach for the condition |
| Implementation Speed | 3 | Autopsy cohort; no diagnostic test or treatment immediately changes; influences future trial design and drug targeting |
| Evidence Strength | 6 | N=63 well-characterized autopsy cases; systematic neuropathological assessment; limited by small n and cross-sectional nature; abstract-only |
Key quantitative result: CAA 84%; pure ADNC 29%; LATE-NC 17%; HS 19%; LP 21%; LATE-NC and HS exclusive to dementia group.
Main limitation: N=63; autopsy cohort (survival bias); cross-sectional; abstract-only; single institution likely.
Equity implications: DS individuals are an underserved population in dementia research; this study directly informs their care. Anti-amyloid therapies (lecanemab, donanemab) primarily target ADNC; if CAA dominates in DS, this has major implications for treatment selection and risk (ARIA risk is higher with CAA).
Evidence Maturity: ✅ Confirmed — Exploratory (single autopsy cohort)
Article 12 — Aging promotes RAGE-dependent breast cancer metastasis
PMID 42141167 | Commun Biol | triage_score: 7
| Dimension | Score | Rationale |
|---|---|---|
| Scientific Novelty | 8 | RAGE as the mechanistic link between aging microenvironment and increased metastasis is novel; S100A8/9 and AGE ligand accumulation in aged TME is well-characterized mechanistically |
| Clinical Relevance | 3 | Primarily preclinical (mouse models); human TCGA survival correlate is supportive but not causal; no clinical intervention yet; non-human studies capped at 5 on Clinical Relevance |
| Population Reach | 7 | Breast cancer is the most common cancer in women globally; age is a major risk factor; if validated, this could affect a large subset of older breast cancer patients |
| Implementation Speed | 3 | Preclinical stage; RAGE inhibitors exist but require clinical development in this indication |
| Evidence Strength | 4 | Mechanistically compelling mouse data + human survival correlates; no clinical validation; mixed species; abstract-only |
Key quantitative result: Elevated AGER expression correlates with poorer survival in older breast cancer patients in TCGA; RAGE/S100A8/9 inhibition suppressed tumor invasion in mouse models.
Main limitation: Primarily mouse models; human TCGA correlates are observational and cannot establish causality; clinical RAGE inhibitor development in this indication is nascent.
Equity implications: Older women, who bear the greatest breast cancer burden, are the primary potential beneficiaries. RAGE inhibitors, if developed, would need to be accessible to aging populations in LMICs.
Evidence Maturity: ✅ Confirmed — Exploratory (preclinical)
Article 13 — Balancing efficacy and safety in BCMA CAR-T for multiple myeloma: review
PMID 42141360 | BioDrugs | triage_score: 6
| Dimension | Score | Rationale |
|---|---|---|
| Scientific Novelty | 4 | Narrative review of established safety data; rare toxicities (parkinsonian movement disorders, secondary malignancies) add some novelty |
| Clinical Relevance | 7 | Immediately relevant to any clinician managing myeloma patients on CAR-T; toxicity awareness is directly practice-shaping |
| Population Reach | 5 | Relapsed/refractory myeloma is a focused but growing population as CAR-T becomes more widely used |
| Implementation Speed | 8 | Review format; clinicians can apply toxicity awareness immediately |
| Evidence Strength | 5 | Narrative review — no primary data; synthesizes existing literature well but cannot generate new evidence; abstract-only |
Key quantitative result: No new primary quantitative results; synthesizes toxicity rates from approved BCMA CAR-T trials (idecabtagene vicleucel, ciltacabtagene autoleucel).
Main limitation: Narrative review; no systematic search or meta-analysis reported; potential for selection bias in evidence synthesis.
Evidence Maturity: ✅ Confirmed — Validated (synthesis of established clinical data)
Article 14 — False-positive stool-based CRC screening: scoping review
PMID 42141388 | BMC Gastroenterol | triage_score: 6
| Dimension | Score | Rationale |
|---|---|---|
| Scientific Novelty | 5 | Gap identification is valuable; the finding that standardization is absent is known but now formally quantified |
| Clinical Relevance | 6 | Relevant to every gastroenterologist and primary care provider using FIT/mt-sDNA; but only 8 studies met criteria, limiting conclusions |
| Population Reach | 8 | Stool-based CRC screening is used by millions annually; false-positive rates affect many patients |
| Implementation Speed | 5 | Gap identification requires further primary research before guidelines can change |
| Evidence Strength | 4 | Scoping review methodology; only 8 included studies; high heterogeneity; abstract-only |
Key quantitative result: CRC on second colonoscopy ~0.5%; non-colorectal aerodigestive cancer 0–4.3%.
Evidence Maturity: ✅ Confirmed — Exploratory (scoping review, limited evidence base)
Article 15 — LATE biomarker development review
PMID 42141120 | Acta Neuropathol | triage_score: 6
| Dimension | Score | Rationale |
|---|---|---|
| Scientific Novelty | 5 | LATE as a clinical entity is established since 2019; biomarker gap is known; review synthesizes current state without new findings |
| Clinical Relevance | 5 | High unmet need but no actionable diagnostic currently exists; useful for framing future research priorities |
| Population Reach | 7 | ~1/3 of octogenarians affected; with aging demographics, a large population |
| Implementation Speed | 2 | No validated diagnostic exists; pipeline is early-stage |
| Evidence Strength | 4 | Narrative review; no primary data; abstract-only |
Evidence Maturity: ✅ Confirmed — Exploratory
Article 16 — Allurion gastric balloon in 300 Ecuadorian patients
PMID 42141317 | Obes Surg | triage_score: 6
| Dimension | Score | Rationale |
|---|---|---|
| Scientific Novelty | 3 | Device is already commercially available; Latin American real-world data is the incremental contribution |
| Clinical Relevance | 5 | Moderate weight loss (14.4% TWL at 12 months) in a real-world setting; acceptable safety; relevant for non-surgical obesity management in LMICs |
| Population Reach | 6 | Obesity is a global epidemic; Ecuadorian/Latin American data fills a geographic gap |
| Implementation Speed | 7 | Device is already approved and deployable; results can inform clinical adoption in the region |
| Evidence Strength | 4 | Retrospective single-center; classification_confidence = medium (truncated abstract); no comparison arm |
Evidence Maturity: ✅ Confirmed — Exploratory (retrospective single-center real-world study)
Article 17 (Sentinel) — Prone positioning response definition in ARDS: meta-analysis
PMID 42141498 | Crit Care | triage_score: 5
| Dimension | Score | Rationale |
|---|---|---|
| Scientific Novelty | 5 | Heterogeneity of response definitions is a methodological contribution; 74-study meta-analysis is comprehensive |
| Clinical Relevance | 5 | Important for critical care practice but out of watchlist scope; responder-mortality link not robustly established |
| Population Reach | 6 | ARDS is common in ICU settings globally |
| Implementation Speed | 4 | Standardization requires guideline body consensus |
| Evidence Strength | 7 | 74 studies, systematic methodology; limited by heterogeneous definitions across included studies |
Evidence Maturity: ✅ Confirmed — Exploratory (heterogeneous definitions prevent firm conclusions)
PHASE 3 — Ranking
Conflict Note
No direct conflicts exist between articles in this batch. Articles 3 and 4 are complementary, both supporting cardiometabolic benefits of incretin/SGLT-2 drug classes but using different endpoints and populations. Article 5 (CAR-T desensitization) and Article 13 (BCMA CAR-T safety review) address different CAR-T applications without contradiction.
Composite Impact Score Table
Weighted formula: Clinical Relevance ×0.30 + Population Reach ×0.25 + Scientific Novelty ×0.20 + Implementation Speed ×0.15 + Evidence Strength ×0.10
| Rank | Article | Flag | Clinical Rel. (×0.30) | Pop. Reach (×0.25) | Sci. Novelty (×0.20) | Impl. Speed (×0.15) | Evid. Strength (×0.10) | Impact Score | Triage Score | Study Design |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | HKGP: Population-scale genomic medicine (Art. 1) | 🟠 | 8 | 9 | 9 | 6 | 8 | 8.10 | 9 | Large observational cohort, N=20,488 |
| 2 | CytologIA AI leukocyte benchmark (Art. 2) | 🟢 | 7 | 7 | 8 | 7 | 8 | 7.30 | 9 | Multicentric AI benchmark, N=69,168 images |
| 3 | GLP-1RA on liver stiffness: meta-analysis (Art. 3) | 🟢 | 8 | 9 | 6 | 9 | 6 | 7.90 | 8 | Systematic review & meta-analysis |
| 4 | SGLT-2i epicardial adiposity, EpiCAD (Art. 4) | 🟢 | 7 | 8 | 6 | 8 | 6 | 7.10 | 8 | Prospective observational cohort, N=302 |
| 5 | MASLD ML screening, NHANES+KNHANES (Art. 10) | 🟢 | 7 | 9 | 5 | 6 | 7 | 6.95 | 7 | ML dev + external validation |
| 6 | SDoH and racial disparities in GI cancer (Art. 8) | 🟡 | 6 | 8 | 5 | 7 | 6 | 6.45 | 7 | Cross-sectional, N=1,831 |
| 7 | CD19/BCMA CAR-T for HLA desensitization (Art. 5) | 🟠 | 7 | 5 | 9 | 4 | 4 | 6.15 | 8 | Single-arm pilot, N=7 |
| 8 | HESpotEx spatial transcriptomics from WSI (Art. 6) | ⚪ | 4 | 6 | 9 | 3 | 6 | 5.45 | 8 | Computational validation, TCGA |
| 9 | DS mixed neuropathologies autopsy cohort (Art. 11) | 🟡 | 5 | 4 | 7 | 3 | 6 | 4.95 | 7 | Autopsy cohort, N=63 |
| 10 | RAGE and aging-driven breast cancer metastasis (Art. 12) | ⚪ | 3 | 7 | 8 | 3 | 4 | 4.90 | 7 | Preclinical + TCGA correlate |
| 11 | Cervical cancer deep stromal invasion ML (Art. 7) | ⚪ | 6 | 6 | 7 | 3 | 4 | 5.20 | 7 | Retrospective ML, ext. n=20 |
| 12 | BCMA CAR-T safety review, myeloma (Art. 13) | ⬜ | 7 | 5 | 4 | 8 | 5 | 5.95 | 6 | Narrative review |
| 13 | PRSS3 pneumoconiosis biomarker (Art. 9) | ⚪ | 4 | 5 | 7 | 3 | 3 | 4.45 | 7 | Proteomic ML dev, N=158 |
| 14 | False-positive CRC stool screening (Art. 14) | ⬜ | 6 | 8 | 5 | 5 | 4 | 5.80 | 6 | Scoping review, 8 studies |
| 15 | LATE biomarker review (Art. 15) | ⬜ | 5 | 7 | 5 | 2 | 4 | 4.85 | 6 | Narrative review |
| 16 | Allurion balloon, Ecuador (Art. 16) | 🟡 | 5 | 6 | 3 | 7 | 4 | 4.90 | 6 | Retrospective cohort, N=300 |
| 17 | ARDS prone positioning meta-analysis (Art. S01, sentinel) | ⬜ | 5 | 6 | 5 | 4 | 7 | 5.25 | 5 | Systematic review & meta-analysis, 74 studies |
Corrected rankings after tie-breaking applied:
- Rank 1: HKGP (8.10) — clear leader
- Rank 2: GLP-1RA liver meta-analysis (7.90) — edges CytologIA on Population Reach and Implementation Speed
- Rank 3: CytologIA (7.30) — high on novelty and evidence strength
- Rank 4–5 as shown
Rank Justification Summaries
Rank 1 — Hong Kong Genome Project This is the highest-impact article in the batch by a meaningful margin. Its combination of a 25% rare disease diagnostic yield at population scale, the identification of 38 clinically actionable genes absent from European panels, and near-universal pharmacogenomic actionability represents a landmark contribution to equity in precision medicine. For 1.4 billion Chinese-heritage individuals — and by extension all non-European populations — this study provides the blueprint for population-specific genomic medicine programs. Published in Nature Medicine with N=20,488, it clears both the Evidence Strength and Population Reach bars decisively.
Why it matters: Existing genomic panels were designed for European ancestry populations. This study proves that applying them to Chinese patients leaves clinically critical genes undetected. Fixing that gap could directly improve diagnoses and drug safety for billions of people.
Rank 2 — GLP-1RA liver meta-analysis With GLP-1RA and dual GLP-1/GIP drugs already widely prescribed for diabetes and obesity, this meta-analysis arrives at a critical policy moment: MASLD affects ~32% of adults globally, and this PROSPERO-registered synthesis of 9 trials provides Fibroscan-quantified evidence that these drugs simultaneously treat the hepatic component of metabolic disease. Implementation speed is exceptionally high because no new drug approvals are needed. Its ranking above CytologIA reflects greater immediate clinical actionability and population reach despite lower novelty.
Why it matters: Clinicians already prescribing semaglutide and tirzepatide can now point to quantitative evidence that their patients' livers are also improving — supporting both prescribing confidence and the case for expanded insurance coverage.
Rank 3 — CytologIA multicentric AI benchmark The CytologIA benchmark sets a new reproducibility standard for AI-assisted hematological diagnostics. A balanced accuracy of 0.94 across 23 leukocyte classes — validated on a hidden test set, across multiple countries, with open data release — is the kind of rigorous, transparent infrastructure finding that genuinely accelerates clinical AI adoption. Its ranking reflects that technical validation is complete; the remaining gap is regulatory clearance and lab integration, which are tractable near-term barriers.
Why it matters: Manual differential counts are among the most variability-prone tests in clinical hematology. A validated, open-source AI model could standardize quality across institutions from rural clinics to major academic centers.