This retrospective cohort research was permitted by the Mass Basic Brigham Institutional Evaluate Board, and the requirement for written knowledgeable consent was waived. Consent was waived as a result of the research posed minimal threat, used retrospective information collected throughout routine care, and didn’t have an effect on affected person care. A considerable proportion of the cohort have been decedents, and all entry occurred inside safe, encrypted Mass Basic Brigham environments with restricted entry.
Dataset
Facial pictures have been collected from most cancers sufferers present process radiation remedy at Brigham and Girls’s Hospital in Boston between 2012 and 2023. The dataset initially included 2763 sufferers who obtained a number of radiation remedy programs throughout their most cancers care and had corresponding facial pictures taken as a part of the routine medical workflow for identification functions at the beginning of every course. By means of a sequential filtering course of, as detailed in Supplementary Fig. 1, sufferers with a number of faces detected per picture, incomplete Digital Well being Information (EHR), and people age ≤ 20 years have been excluded, leading to a remaining cohort of 2276 sufferers for evaluation. For every affected person, we chosen two facial pictures with the widest doable time span between them: one from the newest radiation remedy course and one from the oldest remedy course. This strategy maximized the temporal vary for calculating FAR.
The sufferers have been additional stratified for analyses based mostly on the time interval between pictures: short-term (10–twelve months), mid-term (366–730 days), and long-term (731–1460 days) interval cohorts.
Intercourse is self-reported. Race is self-reported utilizing institutional classes with an choice to say no.
FaceAge mannequin description and efficiency metrics
We utilized the validated Basis Synthetic Intelligence Mannequin for Well being Recognition (FAHR-Face)25. FAHR-FaceAge makes use of a Imaginative and prescient Transformer structure pretrained by way of masked autoencoder self-supervised studying on the WebFace42M49 dataset, consisting of over 40 million facial photographs. The mannequin was subsequently fine-tuned particularly for organic age estimation in a two-stage, age-balanced coaching strategy involving 10 publicly obtainable facial picture datasets of presumed wholesome people.
FAHR-FaceAge demonstrated excessive accuracy and strong generalizability, reaching a imply absolute error (MAE) of 5.1 years and imply error (ME) of 0.2 years on the exterior public APPA-REAL50 dataset. Efficiency was excessive throughout the total grownup age vary (20–100) that will be most related for this present evaluation of adults with most cancers25.
FAR threshold choice
We employed a scientific, twin strategy to outline FAR thresholds that mirror the markedly totally different measurement variabilities noticed throughout short-term (10–twelve months), mid-term (366–730 days), and long-term (731–1460 days) intervals.
First, for the short-term cohort, we randomly cut up the dataset into coaching (1/3) and take a look at (2/3) subsets and evaluated minimize factors from −20 to twenty (in increments of 5). For every potential minimize level, we carried out log-rank assessments evaluating survival outcomes between sufferers above and beneath the edge. FAR > 20 achieved the strongest prognostic separation within the coaching information, as indicated by the bottom log-rank take a look at p-value; this threshold was subsequently validated within the take a look at set, confirming its skill to establish sufferers with totally different survival outcomes.
Second, to account for the considerably decrease variability within the long-term cohort (customary deviation (SD) 14.1 instances smaller), we set FAR > 1 because the long-term threshold. This selection is in keeping with scaling the short-term threshold by the noticed noise discount issue; it additionally avoids inflated false positives in longer follow-up intervals the place FAR measurements are extra secure. We chosen FAR > 10 for mid-term intervals to take care of an intermediate cutoff aligned with these variability patterns.
The thresholds have been then utilized to the cohorts. Kaplan-Meier survival curves have been generated for the ensuing teams, and a log-rank take a look at assessed the statistical significance of survival variations.
Statistical evaluation
Intercourse was thought-about at research design and included in univariate and multivariate Cox fashions.
We carried out Cox proportional hazards regression analyses to judge the affiliation between FAR and survival outcomes.
For univariate evaluation, we carried out Cox regression for every variable of curiosity, together with excessive versus low FAR group, age at first picture (in a long time), time between photographs (in months), intercourse, race, most cancers threat group, analysis group. FAR was categorized for every time interval cohort utilizing the info pushed strategy described above: FAR > 20 vs. ≤20 for short-term, FAR > 10 vs. ≤10 for mid-term, and FAR > 1 vs. ≤1 for long-term intervals. Hazard ratios (HR) with 95% confidence intervals (CI) and p-values have been calculated for every variable.
To evaluate the impartial prognostic worth of FAR, we carried out multivariable Cox proportional hazards regression analyses; we included all covariates that demonstrated statistical significance in univariate evaluation. Most cancers threat group, derived from analysis, was excluded from multivariate fashions as redundant. We constructed fashions with rising ranges of adjustment: FAR unadjusted, adjusted for the time span between photographs, additional adjusted for intercourse, then race, and at last, a totally adjusted mannequin together with most cancers analysis at radiation course two. For every mannequin, we calculated HR, 95% CI, and p-values for FAR group variable.
All analyses have been carried out utilizing R model 4.3.1. Statistical significance was set at P < 0.05.
Strategies for producing hazard ratio contour plots
To visualise the mixed results of FAD and FAR on survival outcomes, we generated adjusted hazard ratio contour plots utilizing Cox proportional hazards regression fashions in R (model 4.3.1). The evaluation was carried out individually for the three cohorts with distinct time intervals between radiation remedy programs: short-term (10–twelve months), mid-term (366–730 days), and long-term (731–1,460 days).
Covariates included within the evaluation have been age on the first {photograph} (in a long time), time distinction between pictures (in months), intercourse, race, and most cancers analysis at radiation course two.
For every time interval, we filtered the dataset to incorporate solely sufferers whose time distinction between pictures fell throughout the specified vary. We then fitted a Cox proportional hazards mannequin for every of the three interval cohorts:
$${{{rm{Survival; Time}}}}sim {{{{rm{FAD}}}}}_{{{{rm{RT}}}}1}+{{{rm{FAR}}}}+{{{{rm{Age}}}}}_{{{{rm{RT}}}}1}+{{{rm{Time; Distinction}}}}+{{{rm{Intercourse}}}} +{{{rm{Race}}}}+{{{{rm{Analysis; Group}}}}}_{{{{rm{RT}}}}2}$$
This mannequin estimated the log hazard ratios related to FAD and FAR whereas adjusting for the covariates.
To create the contour plots, we generated a grid of FAD and FAR values protecting the noticed ranges within the information for every time interval. For every mixture of FAD and FAR within the grid, we predicted the log hazard ratio utilizing the fitted Cox mannequin, holding different covariates fixed at their median or reference ranges (median age, median time distinction, reference classes for intercourse, race, and analysis group). The anticipated log hazard ratios have been then exponentiated to acquire hazard ratios.
To make sure consistency throughout all contour plots, we collected hazard ratio values from the prediction grids of every time interval and calculated the general minimal and most hazard ratios. We outlined hazard ratio breaks at intervals of 0.2, adjusted to embody the total vary of noticed hazard ratios. This allowed us to use a constant colour scale to all plots, facilitating direct comparisons between totally different time intervals.
Strategies for Harrell’s C-index evaluation
To match the prognostic worth of FAR versus single time-point FAD measurements, we calculated Harrell’s C-index for every metric throughout totally different time intervals. The evaluation was carried out utilizing R (model 4.3.1) with the survival package deal. For every time interval cohort, we evaluated three predictors: FADRT1, FADRT2, and FAR. Fashions have been assessed each unadjusted and with full adjustment for medical covariates (time span between pictures, intercourse, race, and most cancers analysis on the second radiation remedy).
Median follow-up time
The median follow-up time was calculated utilizing reverse Kaplan Meier51.
Reporting abstract
Additional info on analysis design is on the market within the Nature Portfolio Reporting Abstract linked to this text.



