Evaluation of circulating hsa-miR-1203 as a diagnostic biomarker for prostate cancer using machine learning approaches
Main Article Content
Abstract
Prostate cancer is one of the most common malignancies among men, and the identification of reliable circulating biomarkers remains essential for improving diagnostic accuracy. This study aimed to evaluate the diagnostic potential of circulating hsa-miR-1203 in distinguishing prostate cancer patients from non-cancer controls using machine learning models. MicroRNA expression data were obtained from the GSE211692 dataset in the Gene Expression Omnibus, comprising 1,027 prostate cancer serum samples and 5,893 non-cancer samples. After preprocessing and log2 transformation, the dataset was divided into training (70%) and internal testing (30%) sets using stratified sampling. hsa-miR-1203 expression was significantly decreased in prostate cancer samples (log2FC = -3.77; p < 0.001). Four classification algorithms, including Extra Trees, Support Vector Machine (RBF kernel), AdaBoost, and Gaussian Naive Bayes, demonstrated excellent discriminative performance on the testing set, with area under the ROC curve (AUC) values approaching 0.98. These findings suggest that circulating hsa-miR-1203 may serve as a promising non-invasive biomarker to support prostate cancer diagnosis.
Article Details
Keywords
Prostate cancer, microRNA, hsa-miR-1203, machine learning, circulating biomarker, diagnosis
References
2. Van Dong H, Lee AH, Nga NH, et al. Epidemiology and prevention of prostate cancer in Vietnam. Asian Pac J Cancer Prev APJCP. 2014; 15(22): 9747-9751. doi:10.7314/apjcp.2014.15.22.9747.
3. Eala MA, Dee EC, Jacomina LE, et al. Prostate Cancer in Southeast Asia: An Analysis of 2022 Incidence and Mortality Data. Int J Radiat Oncol. 2024; 120(2, Supplement):e528. doi:10.1016/j.ijrobp.2024.07.1170.
4. Moyer VA, U.S. Preventive Services Task Force. Screening for prostate cancer: U.S. Preventive Services Task Force recommendation statement. Ann Intern Med. 2012; 157(2): 120-134. doi:10.7326/0003-4819-157-2-201207170-00459.
5. Bartel DP. MicroRNAs: genomics, biogenesis, mechanism, and function. Cell. 2004; 116(2): 281-297. doi:10.1016/s0092-8674(04)00045-5.
6. Nguyen MT, Quang MT. Integrated Bioinformatics Analysis of hsa-miR-4783-3p Target Genes and Functions in Prostate Cancer. Pharm Sci Asia. 2024; 51(3): 233-240. doi:10.29090/psa.2024.03.24.ap0911.
7. Selth LA, Townley S, Gillis JL, et al. Discovery of circulating microRNAs associated with human prostate cancer using a mouse model of disease. Int J Cancer. 2012; 131(3): 652-661. doi:10.1002/ijc.26405.
8. Chen X, Ba Y, Ma L, et al. Characterization of microRNAs in serum: a novel class of biomarkers for diagnosis of cancer and other diseases. Cell Res. 2008; 18(10): 997-1006. doi:10.1038/cr.2008.282.
9. Mitchell PS, Parkin RK, Kroh EM, et al. Circulating microRNAs as stable blood-based markers for cancer detection. Proc Natl Acad Sci U S A. 2008; 105(30): 10513-10518. doi:10.1073/pnas.0804549105.
10. Quang MT, Nguyen MN, Than VT. The role and regulation of cell death in cancer. Prog Mol Biol Transl Sci. 2025; 217: 135-161. doi:10.1016/bs.pmbts.2025.06.014.
11. Ghamlouche F, Yehya A, Zeid Y, et al. MicroRNAs as clinical tools for diagnosis, prognosis, and therapy in prostate cancer. Transl Oncol. 2023; 28: 101613. doi:10.1016/j.tranon.2022.101613.
12. Luo X, Wen W. MicroRNA in prostate cancer: from biogenesis to applicative potential. BMC Urol. 2024; 24(1): 244. doi:10.1186/s12894-024-01634-1.
13. Urabe F, Matsuzaki J, Yamamoto Y, et al. Large-scale Circulating microRNA Profiling for the Liquid Biopsy of Prostate Cancer. Clin Cancer Res Off J Am Assoc Cancer Res. 2019; 25(10): 3016-3025. doi:10.1158/1078-0432.CCR-18-2849.
14. Edgar R, Domrachev M, Lash AE. Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 2002; 30(1): 207-210. doi:10.1093/nar/30.1.207.
15. Deo RC. Machine Learning in Medicine. Circulation. 2015; 132(20): 1920-1930. doi:10.1161/CIRCULATIONAHA.115.001593.
16. Kourou K, Exarchos TP, Exarchos KP, et al. Machine learning applications in cancer prognosis and prediction. Comput Struct Biotechnol J. 2015; 13: 8-17. doi:10.1016/j.csbj.2014.11.005.
17. Davis S, Meltzer PS. GEOquery: a bridge between the Gene Expression Omnibus (GEO) and BioConductor. Bioinformatics. 2007; 23(14): 1846-1847. doi:10.1093/bioinformatics/btm254.
18. Ritchie ME, Phipson B, Wu D, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015; 43(7): e47. doi:10.1093/nar/gkv007.
19. Fawcett T. An introduction to ROC analysis. Pattern Recognit Lett. 2006; 27(8): 861-874. doi:10.1016/j.patrec.2005.10.010.
20. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988; 44(3): 837-845.
21. Ioannidis JPA. Why most discovered true associations are inflated. Epidemiology. 2008; 19(5): 640-648. doi:10.1097/EDE.0b013e31818131e7.
22. Agarwal V, Bell GW, Nam JW, et al. Predicting effective microRNA target sites in mammalian mRNAs. eLife. 2015; 4:e05005. doi:10.7554/eLife.05005.
23. Kanehisa M, Goto S. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 2000; 28(1): 27-30. doi:10.1093/nar/28.1.27.
24. Lao DT, Quang MT, Le TAH. The Role of hsa-miR-21 and Its Target Genes Involved in Nasopharyngeal Carcinoma. Asian Pac J Cancer Prev APJCP. 2021; 22(12): 4075-4083. doi:10.31557/APJCP.2021.22.12.4075.
25. Merriel SWD, Pocock L, Gilbert E, et al. Systematic review and meta-analysis of the diagnostic accuracy of prostate-specific antigen (PSA) for the detection of prostate cancer in symptomatic patients. BMC Med. 2022; 20:54. doi:10.1186/s12916-021-02230-y.