Zhaohui Ruan: VIP Section Department, State Key Laboratory of Oncology in South China, Sun Yat-sen University Cancer Center, Guangzhou, China.
Dongmei Chi: Department of Anesthesiology, State Key Laboratory of Oncology in South China, Sun Yat-sen University Cancer Center, Guangzhou, China.
Qianyu Wang: VIP Section Department, State Key Laboratory of Oncology in South China, Sun Yat-sen University Cancer Center, Guangzhou, China.
Jiaxin Jiang: VIP Section Department, State Key Laboratory of Oncology in South China, Sun Yat-sen University Cancer Center, Guangzhou, China.
Qi Quan: VIP Section Department, State Key Laboratory of Oncology in South China, Sun Yat-sen University Cancer Center, Guangzhou, China.
Jinxin Bei: Department of Experimental Research, State Key Laboratory of Oncology in South China, Sun Yat-sen University Cancer Center, Guangzhou, China.
Roujun Peng: VIP Section Department, State Key Laboratory of Oncology in South China, Sun Yat-sen University Cancer Center, Guangzhou, China.
Background: Breast carcinoma is the most common malignancy among women worldwide. It is characterized by a complex tumor microenvironment (TME), in which there is an intricate combination of different types of cells, which can cause confusion when screening tumor-cell-related signatures or constructing a gene co-expression network. The recent emergence of single-cell RNA sequencing (scRNA-seq) is an effective method for studying the changing omics of cells in complex TMEs. Methods: The Dysregulated genes of malignant epithelial cells was screened by performing a comprehensive analysis of the public scRNA-seq data of 58 samples. Co-expression and Gene Set Enrichment Analysis (GSEA) analysis were performed based on scRNA-seq data of malignant cells to illustrate the potential function of these dysregulated genes. Iterative LASSO-Cox was used to perform a second-round screening among these dysregulated genes for constructing risk group. Finally, a breast cancer prognosis prediction model was constructed based on risk grouping and other clinical characteristics. Results: Our results indicated a transcriptional signature of 1,262 genes for malignant breast cancer epithelial cells. To estimate the function of these genes in breast cancer, we also constructed a co-expression network of these dysregulated genes at single-cell resolution, and further validated the results using more than 300 published transcriptomics datasets and 31 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) screening datasets. Moreover, we developed a reliable predictive model based on the scRNA-seq and bulk-seq datasets. Conclusions: Our findings provide insights into the transcriptomics and gene co-expression networks during breast carcinoma progression and suggest potential candidate biomarkers and therapeutic targets for the treatment of breast carcinoma. Our results are available via a web app (https://prognosticpredictor.shinyapps.io/GCNBC/).