An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients

B. Györffy, Andras Lanczky, Aron C. Eklund, Carsten Denkert, Jan Budczies, Qiyuan Li, Zoltan Szallasi

Research output: Contribution to journalArticle

1142 Citations (Scopus)

Abstract

Validating prognostic or predictive candidate genes in appropriately powered breast cancer cohorts are of utmost interest. Our aim was to develop an online tool to draw survival plots, which can be used to assess the relevance of the expression levels of various genes on the clinical outcome both in untreated and treated breast cancer patients. A background database was established using gene expression data and survival information of 1,809 patients downloaded from GEO (Affymetrix HGU133A and HGU133?2 microarrays). The median relapse free survival is 6.43 years, 968/1,231 patients are estrogenreceptor (ER) positive, and 190/1,369 are lymph-node positive. After quality control and normalization only probes present on both Affymetrix platforms were retained (n = 22,277). In order to analyze the prognostic value of a particular gene, the cohorts are divided into two groups according to the median (or upper/lower quartile) expression of the gene. The two groups can be compared in terms of relapse free survival, overall survival, and distant metastasis free survival. A survival curve is displayed, and the hazard ratio with 95% confidence intervals and logrank P value are calculated and displayed. Additionally, three subgroups of patients can be assessed: systematically untreated patients, endocrine-Treated ER positive patients, and patients with a distribution of clinical characteristics representative of those seen in general clinical practice in the US. Web address: www.kmplot.com. We used this integrative data analysis tool to confirm the prognostic power of the proliferation-related genes TOP2A and TOP2B, MKI67, CCND2, CCND3, CCNDE2, as well as CDKN1A, and TK2. We also validated the capability of microarrays to determine estrogen receptor status in 1,231 patients. The tool is highly valuable for the preliminary assessment of biomarkers, especially for research groups with limited bioinformatic resources.

Original languageEnglish
Pages (from-to)725-731
Number of pages7
JournalBreast Cancer Research and Treatment
Volume123
Issue number3
DOIs
Publication statusPublished - Oct 2010

Fingerprint

Survival Analysis
Breast Neoplasms
Survival
Genes
Hospital Distribution Systems
Gene Expression
Recurrence
Computational Biology
General Practice
Estrogen Receptors
Quality Control
Biomarkers
Lymph Nodes
Databases
Confidence Intervals
Neoplasm Metastasis
Research

Keywords

  • Breast cancer
  • Prognosis
  • Survival analysis

ASJC Scopus subject areas

  • Oncology
  • Cancer Research

Cite this

An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients. / Györffy, B.; Lanczky, Andras; Eklund, Aron C.; Denkert, Carsten; Budczies, Jan; Li, Qiyuan; Szallasi, Zoltan.

In: Breast Cancer Research and Treatment, Vol. 123, No. 3, 10.2010, p. 725-731.

Research output: Contribution to journalArticle

Györffy, B. ; Lanczky, Andras ; Eklund, Aron C. ; Denkert, Carsten ; Budczies, Jan ; Li, Qiyuan ; Szallasi, Zoltan. / An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients. In: Breast Cancer Research and Treatment. 2010 ; Vol. 123, No. 3. pp. 725-731.
@article{a469c7dcd26745ada6c8ba32a145d56d,
title = "An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients",
abstract = "Validating prognostic or predictive candidate genes in appropriately powered breast cancer cohorts are of utmost interest. Our aim was to develop an online tool to draw survival plots, which can be used to assess the relevance of the expression levels of various genes on the clinical outcome both in untreated and treated breast cancer patients. A background database was established using gene expression data and survival information of 1,809 patients downloaded from GEO (Affymetrix HGU133A and HGU133?2 microarrays). The median relapse free survival is 6.43 years, 968/1,231 patients are estrogenreceptor (ER) positive, and 190/1,369 are lymph-node positive. After quality control and normalization only probes present on both Affymetrix platforms were retained (n = 22,277). In order to analyze the prognostic value of a particular gene, the cohorts are divided into two groups according to the median (or upper/lower quartile) expression of the gene. The two groups can be compared in terms of relapse free survival, overall survival, and distant metastasis free survival. A survival curve is displayed, and the hazard ratio with 95{\%} confidence intervals and logrank P value are calculated and displayed. Additionally, three subgroups of patients can be assessed: systematically untreated patients, endocrine-Treated ER positive patients, and patients with a distribution of clinical characteristics representative of those seen in general clinical practice in the US. Web address: www.kmplot.com. We used this integrative data analysis tool to confirm the prognostic power of the proliferation-related genes TOP2A and TOP2B, MKI67, CCND2, CCND3, CCNDE2, as well as CDKN1A, and TK2. We also validated the capability of microarrays to determine estrogen receptor status in 1,231 patients. The tool is highly valuable for the preliminary assessment of biomarkers, especially for research groups with limited bioinformatic resources.",
keywords = "Breast cancer, Prognosis, Survival analysis",
author = "B. Gy{\"o}rffy and Andras Lanczky and Eklund, {Aron C.} and Carsten Denkert and Jan Budczies and Qiyuan Li and Zoltan Szallasi",
year = "2010",
month = "10",
doi = "10.1007/s10549-009-0674-9",
language = "English",
volume = "123",
pages = "725--731",
journal = "Breast Cancer Research and Treatment",
issn = "0167-6806",
publisher = "Springer New York",
number = "3",

}

TY - JOUR

T1 - An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients

AU - Györffy, B.

AU - Lanczky, Andras

AU - Eklund, Aron C.

AU - Denkert, Carsten

AU - Budczies, Jan

AU - Li, Qiyuan

AU - Szallasi, Zoltan

PY - 2010/10

Y1 - 2010/10

N2 - Validating prognostic or predictive candidate genes in appropriately powered breast cancer cohorts are of utmost interest. Our aim was to develop an online tool to draw survival plots, which can be used to assess the relevance of the expression levels of various genes on the clinical outcome both in untreated and treated breast cancer patients. A background database was established using gene expression data and survival information of 1,809 patients downloaded from GEO (Affymetrix HGU133A and HGU133?2 microarrays). The median relapse free survival is 6.43 years, 968/1,231 patients are estrogenreceptor (ER) positive, and 190/1,369 are lymph-node positive. After quality control and normalization only probes present on both Affymetrix platforms were retained (n = 22,277). In order to analyze the prognostic value of a particular gene, the cohorts are divided into two groups according to the median (or upper/lower quartile) expression of the gene. The two groups can be compared in terms of relapse free survival, overall survival, and distant metastasis free survival. A survival curve is displayed, and the hazard ratio with 95% confidence intervals and logrank P value are calculated and displayed. Additionally, three subgroups of patients can be assessed: systematically untreated patients, endocrine-Treated ER positive patients, and patients with a distribution of clinical characteristics representative of those seen in general clinical practice in the US. Web address: www.kmplot.com. We used this integrative data analysis tool to confirm the prognostic power of the proliferation-related genes TOP2A and TOP2B, MKI67, CCND2, CCND3, CCNDE2, as well as CDKN1A, and TK2. We also validated the capability of microarrays to determine estrogen receptor status in 1,231 patients. The tool is highly valuable for the preliminary assessment of biomarkers, especially for research groups with limited bioinformatic resources.

AB - Validating prognostic or predictive candidate genes in appropriately powered breast cancer cohorts are of utmost interest. Our aim was to develop an online tool to draw survival plots, which can be used to assess the relevance of the expression levels of various genes on the clinical outcome both in untreated and treated breast cancer patients. A background database was established using gene expression data and survival information of 1,809 patients downloaded from GEO (Affymetrix HGU133A and HGU133?2 microarrays). The median relapse free survival is 6.43 years, 968/1,231 patients are estrogenreceptor (ER) positive, and 190/1,369 are lymph-node positive. After quality control and normalization only probes present on both Affymetrix platforms were retained (n = 22,277). In order to analyze the prognostic value of a particular gene, the cohorts are divided into two groups according to the median (or upper/lower quartile) expression of the gene. The two groups can be compared in terms of relapse free survival, overall survival, and distant metastasis free survival. A survival curve is displayed, and the hazard ratio with 95% confidence intervals and logrank P value are calculated and displayed. Additionally, three subgroups of patients can be assessed: systematically untreated patients, endocrine-Treated ER positive patients, and patients with a distribution of clinical characteristics representative of those seen in general clinical practice in the US. Web address: www.kmplot.com. We used this integrative data analysis tool to confirm the prognostic power of the proliferation-related genes TOP2A and TOP2B, MKI67, CCND2, CCND3, CCNDE2, as well as CDKN1A, and TK2. We also validated the capability of microarrays to determine estrogen receptor status in 1,231 patients. The tool is highly valuable for the preliminary assessment of biomarkers, especially for research groups with limited bioinformatic resources.

KW - Breast cancer

KW - Prognosis

KW - Survival analysis

UR - http://www.scopus.com/inward/record.url?scp=79151483638&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79151483638&partnerID=8YFLogxK

U2 - 10.1007/s10549-009-0674-9

DO - 10.1007/s10549-009-0674-9

M3 - Article

C2 - 20020197

AN - SCOPUS:79151483638

VL - 123

SP - 725

EP - 731

JO - Breast Cancer Research and Treatment

JF - Breast Cancer Research and Treatment

SN - 0167-6806

IS - 3

ER -