west china medical publishers
Keyword
  • Title
  • Author
  • Keyword
  • Abstract
Advance search
Advance search

Search

find Keyword "predictive model" 24 results
  • Construction and validation of risk prediction model for breast cancer bone metastasis

    ObjectiveTo identify the risk factors of bone metastasis in breast cancer and construct a predictive model. MethodsThe data of breast cancer patients met inclusion and exclusion criteria from 2010 to 2015 were obtained from the SEER*Stat database. Additionally, the data of breast cancer patients diagnosed with distant metastasis in the Affiliated Hospital of Southwest Medical University from 2021 to 2023 were collected. The patients from the SEER database were randomly divided into training (70%) and validation (30%) sets using R software, and the breast cancer patients from the Affiliated Hospital of Southwest Medical University were included in the validation set. The univariate and multivariate logistic regressions were used to identify risk factors of breast cancer bone metastasis. A nomogram predictive model was then constructed based on these factors. The predictive effect of the nomogram was evaluated using the area under the receiver operating characteristic curve (AUC), calibration curve, and decision curve analysis. ResultsThe study included 8 637 breast cancer patients, with 5 998 in the training set and 2 639 (including 68 patients in the Affiliated Hospital of Southwest Medical University) in the validation set. The statistical differences in the race and N stage were observed between the training and validation sets (P<0.05). The multivariate logistic regression analysis revealed that being of white race, having a low histological grade (Ⅰ–Ⅱ), positive estrogen and progesterone receptors status, negative human epidermal growth factor receptor 2 status, and non-undergoing surgery for the primary breast cancer site increased the risk of breast cancer bone metastasis (P<0.05). The nomogram based on these risk factors showed that the AUC (95% CI) of the training and validation sets was 0.676 (0.533, 0.744) and 0.690 (0.549, 0.739), respectively. The internal calibration using 1 000 Bootstrap samples demonstrated that the calibration curves for both sets closely approximated the ideal 45-degree reference line. The decision curve analysis indicated a stronger clinical utility within a certain probability threshold range. ConclusionsThis study constructs a nomogram predictive model based on factors related to the risk of breast cancer bone metastasis, which demonstrates a good consistency between actual and predicted outcomes in both training and validation sets. The nomogram shows a stronger clinical utility, but further analysis is needed to understand the reasons of the lower differentiation of nomogram in both sets.

    Release date:2024-02-28 02:42 Export PDF Favorites Scan
  • The predictive value of four inflammatory indices for postoperative survival prognosis of Siewert type Ⅱ esophagogastric junction adenocarcinoma

    Objective To evaluate the clinical application value of four inflammatory indices [monocyte-to-lymphocyte ratio (MLR), platelet-to-lymphocyte ratio (PLR), systemic immune-inflammation index (SII), neutrophil-to-lymphocyte ratio (NLR)] in predicting postoperative mortality risk in patients with Siewert type Ⅱ esophagogastric junction adenocarcinoma, and to explore the predictive performance of four inflammatory indices. Methods This retrospective study collected clinical data from 310 patients with Siewert typeⅡ esophagogastric junction adenocarcinoma who were admitted to the Second Hospital of Lanzhou University between October 2016 and March 2023, and met the inclusion and exclusion criteria. Univariate analysis was used to initially screen variables related to postoperative mortality risk. The variance inflation factor (VIF) analysis was performed to assess multicollinearity issues, and multivariate regression analysis was used to further reveal the independent effects of key variables on postoperative mortality risk. The performance of the predictive models was evaluated using receive operatior characteristic curves and Kaplan-Meier survival analysis, and the effects of different inflammatory indices on patient survival were explored. Finally, machine learning methods such as Light GBM, random forest, support vector machine (SVM), and XGBoost were used to evaluate the predictive performance of the four inflammatory indices. Results The four inflammatory indices were significantly associated with postoperative mortality risk in patients with Siewert type Ⅱ esophagogastric junction adenocarcinoma (MLR: HR=2.6884, 95% CI 1.4559 to 4.9642, P=0.002; PLR: HR=1.0022, 95% CI1.0001 to 1.0043, P=0.041; SII: HR=1.0003, 95% CI1.0001 to 1.0006, P=0.002; NLR: HR=1.0697, 95% CI 1.0277 to 1.1134, P=0.001). Machine learning model results showed that NLR had the best performance in the random forest model, with an AUC of 0.863 in the training set and an AUC of 0.834 in the test set. Conclusion Preoperative clinical indicators, especially the NLR inflammatory factor, are of significant importance in predicting the postoperative mortality risk of patients with Siewert typeⅡ esophagogastric junction adenocarcinoma.

    Release date: Export PDF Favorites Scan
  • Establishment and validation of a risk prediction model based on CT and serum markers for disease progression in CTD-ILD patients

    Objective To clarify the specific clinical predictive efficacy of CT and serological indicators for the progression of connective tissue disease-associated interstitial lung disease (CTD-ILD) to progressive pulmonary fibrosis (PPF). Methods Patients who were diagnosed with CTD-ILD in Chest Hospital of Zhengzhou University Between January 2020 and December 2021 were recruited in the study. Clinical data and high-resolution CT results of the patients were collected. The patients were divided into a stable group and a progressive group (PPF group) based on whether PPF occurred during follow-up. COX proportional hazards regression was used to identify risk factors affecting the progression of CTD-ILD to PPF, and a risk prediction model was established based on the results of the COX regression model. The predictive efficacy of the model was evaluated through internal cross-validation. Results A total of 194 patients diagnosed with CTD-ILD were enrolled based on the inclusion and exclusion criteria. Among them, 34 patients progressed to PPF during treatment, and 160 patients did not progress. The variables obtained at lambda$1se in LASSO regression were ANCA associated vasculitis, lymphocytes, albumin, erythrocyte sedimentation rate, and serum ferritin. Multivariate COX regression analysis showed that the extent of fibrosis, serum ferritin, albumin, and age were independent risk factors for the progression of CTD-ILD to PPF (all P<0.05). A prediction model was established based on the results of the multivariate COX regression analysis. The area under the receiver operator characteristic curve at 6 months, 9 months, and 12 months was 0.989, 0.931, and 0.797, respectively, indicating that the model has good discrimination and sensitivity, and good predictive efficacy. The calibration curve showed a good overlap between predicted and actual values. Conclusions The extent of fibrosis, serum ferritin, albumin, and age are independent risk factors for the progression of CTD-ILD to PPF. The model established based on this and externally validated shows good predictive efficacy.

    Release date:2024-06-21 05:13 Export PDF Favorites Scan
  • Construction of a predictive model for poorly differentiated adenocarcinoma in pulmonary nodules using CT combined with tumor markers

    ObjectiveTo establish and internally validate a predictive model for poorly differentiated adenocarcinoma based on CT imaging and tumor marker results. MethodsPatients with solid and partially solid lung nodules who underwent lung nodule surgery at the Department of Thoracic Surgery, the Affiliated Brain Hospital of Nanjing Medical University in 2023 were selected and randomly divided into a training set and a validation set at a ratio of 7:3. Patients' CT features, including average density value, maximum diameter, pleural indentation sign, and bronchial inflation sign, as well as patient tumor marker results, were collected. Based on postoperative pathological results, patients were divided into a poorly differentiated adenocarcinoma group and a non-poorly differentiated adenocarcinoma group. Univariate analysis and logistic regression analysis were performed on the training set to establish the predictive model. The receiver operating characteristic (ROC) curve was used to evaluate the model's discriminability, the calibration curve to assess the model's consistency, and the decision curve to evaluate the clinical value of the model, which was then validated in the validation set. ResultsA total of 299 patients were included, with 103 males and 196 females, with a median age of 57.00 (51.00, 67.25) years. There were 211 patients in the training set and 88 patients in the validation set. Multivariate analysis showed that carcinoembryonic antigen (CEA) value [OR=1.476, 95%CI (1.184, 1.983), P=0.002], cytokeratin 19 fragment antigen (CYFRA21-1) value [OR=1.388, 95%CI (1.084, 1.993), P=0.035], maximum tumor diameter [OR=6.233, 95%CI (1.069, 15.415), P=0.017], and average density [OR=1.083, 95%CI (1.020, 1.194), P=0.040] were independent risk factors for solid and partially solid lung nodules as poorly differentiated adenocarcinoma. Based on this, a predictive model was constructed with an area under the ROC curve of 0.896 [95%CI (0.810, 0.982)], a maximum Youden index corresponding cut-off value of 0.103, sensitivity of 0.750, and specificity of 0.936. Using the Bootstrap method for 1000 samplings, the calibration curve predicted probability was consistent with actual risk. Decision curve analysis indicated positive benefits across all prediction probabilities, demonstrating good clinical value. ConclusionFor patients with solid and partially solid lung nodules, preoperative use of CT to measure tumor average density value and maximum diameter, combined with tumor markers CEA and CYFRA21-1 values, can effectively predict whether it is poorly differentiated adenocarcinoma, allowing for early intervention.

    Release date:2024-12-25 06:06 Export PDF Favorites Scan
  • Construction of a prediction model for postoperative recurrence of granulomatous mastitis in the mass stage based on machine learning

    ObjectiveTo predict the risk factors affecting postoperative recurrence of granulomatous lobular mastitis (GLM) in the mass stage by machine learning algorithm, and to provide a reference for the early identification and prevention of postoperative recurrence of GLM in the mass stage. MethodsThe electronic medical records and follow-up data of patients with GLM in the Department of Breast Disease Unit, the First Affiliated Hospital of Henan University of Traditional Chinese Medicine from October 2020 to January 2023 were selected. A total of 340 patients with GLM in the mass stage who met the inclusion and exclusion criteria were selected as the research subjects. According to whether the patients relapsed after surgery, they were divided into recurrence group and non-recurrence group. The collected cases were randomly divided into training set and test set according to the ratio of 7:3. In the training set, the recurrence prediction model was constructed by using traditional logistic regression and three machine learning algorithms: artificial neural network, random forest and XGBoost (extrem gradient boosting). In the test set, the performance of the model was evaluated by sensitivity, specificity, accuracy,positive predictive value, negative predictive value, F1 value and area under the curve (AUC) value. The Shapley Additive exPlanation (SHAP) method was used to explore the important variables that affect the optimal model in identifying postoperative recurrence in the GLM mass phase. The optimal risk cutoff value of the prediction model was determined by the Youden index. Based on this, the postoperative patients in the GLM mass phase of the external test set were divided into high-risk and low-risk groups. ResultsA total of 392 patients who met the GLM mass stage were included, and 52 cases were excluded according to the exclusion criteria, and 340 cases were finally included, including 60 cases in the recurrence group and 280 cases in the non-recurrence group. Based on the results of univariate analysis, correlation analysis and clinically meaningful influencing factors, 12 non-zero coefficient characteristic variables were screened for the construction of the prediction model, and these 12 characteristic variables included other disease history, number of miscarriages, breastfeeding duration of the affected breast, history of milk stasis, lesion location, nipple indentation, fluctuation sensation, low-density lipoprotein, testosterone, previous antibiotic therapy, previous oral hormone medication, and perioperative traditional Chinese medicine treatment duration. The logistic regression prediction model, artificial neural network, random forest and XGBoost prediction models were constructed, and the results showed that the accuracy, positive predictive value and negative predictive value of the four prediction models were all >75%, among which the XGBoost model had the best performance, with accuracy, specificity, sensitivity, AUC, positive predictive value, negative predictive value and F1 values of 0.93, 0.99, 0.65, 0.87, 0.92, 0.93 and 0.76, respectively. SHAP method found that the duration of traditional Chinese medicine treatment during perioperative period, the duration of breast-feeding on the affected side, low density lipoprotein, testosterone and previous hormone drugs were the top five factors affecting XGBoost model to identify postoperative recurrence of GLM in mass stage. ConclusionsCompared with the traditional Logistic regression prediction model, the models based on machine learning for identifying postoperative recurrence in the GLM mass phase showed better performance, among which the XGBoost model performed best. Targeted preventive measures can be given based on the above risk factors to improve the postoperative prognosis of the GLM mass phase.

    Release date:2024-12-27 11:26 Export PDF Favorites Scan
  • Risk factor analysis and prediction model construction for hospital infections in tertiary hospitals in Gansu Province

    Objective To explore the independent risk factors for hospital infections in tertiary hospitals in Gansu Province, and establish and validate a prediction model. Methods A total of 690 patients hospitalized with hospital infections in Gansu Provincial Hospital between January and December 2021 were selected as the infection group; matched with admission department and age at a 1∶1 ratio, 690 patients who were hospitalized during the same period without hospital infections were selected as the control group. The information including underlying diseases, endoscopic operations, blood transfusion and immunosuppressant use of the two groups were compared, the factors influencing hospital infections in hospitalized patients were analyzed through multiple logistic regression, and the logistic prediction model was established. Eighty percent of the data from Gansu Provincial Hospital were used as the training set of the model, and the remaining 20% were used as the test set for internal validation. Case data from other three hospitals in Gansu Province were used for external validation. Sensitivity, specificity, accuracy, and area under the receiver operating characteristic curve (AUC) were used to evaluate the model effectiveness. Results Multiple logistic regression analysis showed that endoscopic therapeutic manipulation [odds ratio (OR)=3.360, 95% confidence interval (CI) (2.496, 4.523)], indwelling catheter [OR=3.100, 95%CI (2.352, 4.085)], organ transplantation/artifact implantation [OR=3.133, 95%CI (1.780, 5.516)], blood or blood product transfusions [OR=3.412, 95%CI (2.626, 4.434)], glucocorticoids [OR=2.253, 95%CI (1.608, 3.157)], the number of underlying diseases [OR=1.197, 95%CI (1.068, 1.342)], and the number of surgical procedures performed during hospitalization [OR=1.221, 95%CI (1.096, 1.361)] were risk factors for hospital infections. The regression equation of the prediction model was: logit(P)=–2.208+1.212×endoscopic therapeutic operations+1.131×indwelling urinary catheters+1.142×organ transplantation/artifact implantation+1.227×transfusion of blood or blood products+0.812×glucocorticosteroids+0.180×number of underlying diseases+0.200×number of surgical procedures performed during the hospitalization. The internal validation set model had a sensitivity of 72.857%, a specificity of 77.206%, an accuracy of 76.692%, and an AUC value of 0.817. The external validation model had a sensitivity of 63.705%, a specificity of 70.934%, an accuracy of 68.669%, and an AUC value of 0.726. Conclusions Endoscopic treatment operation, indwelling catheter, organ transplantation/artifact implantation, blood or blood product transfusion, glucocorticoid, number of underlying diseases, and number of surgical cases during hospitalization are influencing factors of hospital infections. The model can effectively predict the occurrence of hospital infections and guide the clinic to take preventive measures to reduce the occurrence of hospital infections.

    Release date:2024-04-25 02:18 Export PDF Favorites Scan
  • A nomogram model for predicting risk of lung adenocarcinoma by FUT7 methylation combined with CT imaging features

    Objective The management of pulmonary nodules is a common clinical problem, and this study constructed a nomogram model based on FUT7 methylation combined with CT imaging features to predict the risk of adenocarcinoma in patients with pulmonary nodules. Methods The clinical data of 219 patients with pulmonary nodules diagnosed by histopathology at the First Affiliated Hospital of Zhengzhou University from 2021 to 2022 were retrospectively analyzed. The FUT7 methylation level in peripheral blood were detected, and the patients were randomly divided into training set (n=154) and validation set (n=65) according to proportion of 7:3. They were divided into a lung adenocarcinoma group and a benign nodule group according to pathological results. Single-factor analysis and multi-factor logistic regression analysis were used to construct a prediction model in the training set and verified in the validation set. The receiver operating characteristic (ROC) curve was used to evaluate the discrimination of the model, the calibration curve was used to evaluate the consistency of the model, and the clinical decision curve analysis (DCA) was used to evaluate the clinical application value of the model. The applicability of the model was further evaluated in the subgroup of high-risk CT signs (located in the upper lobe, vascular sign, and pleural sign). Results Multivariate logistic regression analysis showed that female, age, FUT7_CpG_4, FUT7_CpG_6, sub-solid nodules, lobular sign and burr sign were independent risk factors for lung adenocarcinoma (P<0.05). A column-line graph prediction model was constructed based on the results of the multifactorial analysis, and the area under the ROC curve was 0.925 (95%CI 0.877 - 0.972 ), and the maximum approximate entry index corresponded to a critical value of 0.562, at which time the sensitivity was 89.25%, the specificity was 86.89%, the positive predictive value was 91.21%, and the negative predictive value was 84.13%. The calibration plot predicted the risk of adenocarcinoma of pulmonary nodules was highly consistent with the risk of actual occurrence. The DCA curve showed a good clinical net benefit value when the threshold probability of the model was 0.02 - 0.80, which showed a good clinical net benefit value. In the upper lobe, vascular sign and pleural sign groups, the area under the ROC curve was 0.903 (95%CI 0.847 - 0.959), 0.897 (95%CI 0.848 - 0.945), and 0.894 (95%CI 0.831 - 0.956). Conclusions This study developed a nomogram model to predict the risk of lung adenocarcinoma in patients with pulmonary nodules. The nomogram has high predictive performance and clinical application value, and can provide a theoretical basis for the diagnosis and subsequent clinical management of pulmonary nodules.

    Release date: Export PDF Favorites Scan
  • Interpretation of the TRIPOD-LLM reporting guideline for studies using large language models

    As the volume of medical research using large language models (LLM) surges, the need for standardized and transparent reporting standards becomes increasingly critical. In January 2025, Nature Medicine published statement titled by TRIPOD-LLM reporting guideline for studies using large language models. This represents the first comprehensive reporting framework specifically tailored for studies that develop prediction models based on LLM. It comprises a checklist with 19 main items (encompassing 50 sub-items), a flowchart, and an abstract checklist (containing 12 items). This article provides an interpretation of TRIPOD-LLM’s development methods, primary content, scope, and the specific details of its items. The goal is to help researchers, clinicians, editors, and healthcare decision-makers to deeply understand and correctly apply TRIPOD-LLM, thereby improving the quality and transparency of LLM medical research reporting and promoting the standardized and ethical integration of LLM into healthcare.

    Release date:2025-06-24 11:15 Export PDF Favorites Scan
  • AI-based diagnostic accuracy and prognosis research reporting guideline: interpretation of the TRIPOD+AI statement

    With the increasing availability of clinical and biomedical big data, machine learning is being widely used in scientific research and academic papers. It integrates various types of information to predict individual health outcomes. However, deficiencies in reporting key information have gradually emerged. These include issues like data bias, model fairness across different groups, and problems with data quality and applicability. Maintaining predictive accuracy and interpretability in real-world clinical settings is also a challenge. This increases the complexity of safely and effectively applying predictive models to clinical practice. To address these problems, TRIPOD+AI (transparent reporting of a multivariable prediction model for individual prognosis or diagnosis+artificial intelligence) introduces a reporting standard for machine learning models. It is based on TRIPOD and aims to improve transparency, reproducibility, and health equity. These improvements enhance the quality of machine learning model applications. Currently, research on prediction models based on machine learning is rapidly increasing. To help domestic readers better understand and apply TRIPOD+AI, we provide examples and interpretations. We hope this will support researchers in improving the quality of their reports.

    Release date:2025-02-08 09:34 Export PDF Favorites Scan
  • Construction of overall survival model for gastric cancer based on clinical characteristics and genomics

    ObjectiveTo construct a new model for predicting the overall survival rate of gastric cancer and to guide the clinical work.MethodsThe clinical information and gene expression information of patients with gastric cancer were downloaded through The Cancer Genome Atlas (TCGA) database. The clinicopathologic characteristics and gene expression information affecting the overall survival rate of gastric cancer patients were screened by univariate COX regression and Lasson regression, then the predictive model was constructed by multiple COX regression model, and the predictive model was tested by receiver operating characteristic curve, calibration curve and decision curve analysis curve. The effect of genes included in the predictive model on the overall survival rate of patients with gastric cancer was discussed, and the predictive model diagram was drawn.ResultsThrough repeated screening and comparison of the model, the patient’s age, T stage, N stage, M stage and 12 genes (INCENP, IGHD3-16, ITFG1-AS1, NEK5, MATN3, YWHABP2, SYT12, LINC01210, ZNF385C, LINC01980, CYMP-AS1 and FAT3) were included in the predictive model. The prediction ability of this model was close to or more than 80%, which was significantly higher than that of the traditional TNM staging prediction system. All the indexes included in the model were significantly different by univariate and multivariate COX regression analysis(P<0.05), and the 12 genes included were the risk factors affecting the overall survival rate of gastric cancer.ConclusionThe gastric cancer prediction model constructed by combining clinical characteristics and genomics has good predictive ability and can guide clinical work.

    Release date:2021-04-30 10:45 Export PDF Favorites Scan
3 pages Previous 1 2 3 Next

Format

Content