MSC - Resear of Dess
MSC - Resear of Dess
DEPARTMENT OF STATISTICS
BY
DESSALEW SHEFERAW
THESIS SUBMITTED TO
1
SURVIVAL ANALYSIS ON THE RISK FACTORS OF WOMEN’S WITH
CERVICAL CANCER: A CASE STUDY AT BLACK LION HOSPITAL,
ADDIS ABABA, ETHIOPIA.
BY
DESSALEW SHEFERAW
ADVISOR
Co-ADVISOR
TIGEST JEGNAW (MSc)
April, 20
I
APPROVAL SHEET-1
This is to certify that the thesis entitled “SURVIVAL ANALYSIS ON THE RISK FACTORS
OF WOMEN’S WITH CERVICAL CANCER: A CASE STUDY AT BLACK LION
HOSPITAL, ADDIS ABABA, ETHIOPIA.”, submitted in partial fulfillment of the
requirements for the degree of Master of Science in Bio-statistics with the graduate program of
the department of Statistics, University of Gondar and is a record of original research carried out
by DESSALEW SHEFERAW. No. GUN/8972/08 under my supervision and no part of the
thesis has been submitted for any other degree or diploma.
The assistance and the help received during the course of this investigation have been duly
acknowledged. Therefore, I recommend that the thesis would be accepted as partial fulfillment of
the requirement for Master of Science.
i
APPROVAL SHEET-2
We, the undersigned, members of the board of examiners of the final open defense by
DESSALEW SHIFERAW have read and evaluated his thesis entitled “SURVIVAL
ANALYSIS ON THE RISK FACTORS OF WOMEN’S WITH CERVICAL CANCER: A
CASE STUDY AT BLACK LION HOSPITAL, ADDIS ABABA, ETHIOPIA.” and
examined the candidate. This is therefore to certify that the thesis has been accepted in partial
fulfillment of the requirements for the degree of Master of Science in statistics with
specialization of Biostatistics.
ii
Acknowledgment
For the successful completion of this work, I am thankful to Allah. First and for most, I thank my
brother Mekash Sheferaw for giving me the opportunity to pursue my graduate study at the
department of statistics, university of Gondar.
I would like to gratefully and sincerely thank my thesis advisor Kassim Mohammed and Tigest
Jegnaw by giving the patience in repeatedly reading the draft manuscript of this study and for
making constructive comments. By the suggestion of them I have benefited a lot. It is palpable
fact that, without these closer follow-up and continuous encouragements with valuable
comments this thesis would not have been finalized in its present structure.
I am also indebted to take this opportunity to appreciate my family specially my father Mr.
Sheferaw Yesuf and My friend Mr. Mequannent wale for being with me closely throughout my
study.
Finally, I am grateful thanks to Gondar university department of statistics for permission to use
Survival analysis of the risk factors on women’s with cervical cancer in black lion hospital.
iii
LIST OF ABBREVIATIONS
ACS American cancer society
AHR Adjusted hazard ratio
AIC Akaike’s information criteria
AOR Adjusted odd ratio
ASIR Age Specific Incidence Rate
ASMR Age Specific Mortality Rate
BLH Black Lion Hospital
CI Confidence Interval
FIGO Federation of International Gynecology and Obstetrics
GLOBOCAN Global Burden of Cancer
HDI Human development index
HIV Human immune virus
HPV Human Papilloma Virus
HR Hazard ratio
IARC International Agency for Research on Cancer
ICO Institute catalad ’Oncologia
INCTR International Network for Cancer Treatment and Research
MOH Ministry of Health
NCI National cancer institute
RS Relative survival
SCC Squamous Cell Carcinoma
SEER Surveillance, Epidemiology, and End Results
SES Socioeconomic Status
SPSS Statistical Package for Social Scientists
SR Survival Rate
WHO World health organization
iv
ABSTRACT
Cancer is a group of diseases characterized by the uncontrolled growth and spread of abnormal
cells. If the spread is not controlled, it can result in death. The objective of this thesis was to
identify the best-fitted survival regression model from the pool of existing survival
regression models and to identify factors associated with the survival time of cervical cancer
patients in Black Lion Hospital Addis Ababa, Ethiopia. A retrospective cohort study was
conducted in Black Lion Hospital Addis Ababa, Ethiopia. Information on patients enrolled
December 2014 and had at least one follow-up until January 2017 time period in oncology center
was used in this study. Kaplan-Meier survival curves and Log-Rank tests were used to compare
the survival experience of different category of predictors. Parametric survival models were
employed to examine the effect of explanatory variables on survival times. A total of 518
cervical cancer patients in Black Lion Hospital Addis Ababa, Ethiopia were included in the
study. Out of 518 cervical cancer patients, 49.04% were live in urban area and 50.96 % were live
in rural area.
Using AIC criteria, the parametric PH was found to have the lowest AIC value and hence it is the
best model for predicting survival time of cervical cancer patients in Black Lion Hospital Addis
Ababa, Ethiopia. Among cervical cancer patients the instantaneous risk of death for urban place
of residence is 2.04 times the instantaneous risk of death for rural place of residence, after
keeping all other covariates at some constant level. Presence of HTN (hypertension), the
instantaneous risk of death for patients with HTN have 2.15 greater risk of instantaneous death
than those patients without HTN, after keeping all other covariates at some constant level.
Based on the results parametric models lead to more efficient parameter estimates than Cox
model. Improved the survival of patients was an integral part of controlling cervical cancer. This
can be done by having health education on cervical cancer incorporated in the teaching
curriculum's by the Ministry of education just like it has been done for HIV / AIDS. Carrying
out regular screening programs and community mobilization activities among other channels can
be used to create awareness.
v
Table of contents
Acknowledgment....................................................................................................iii
LIST OF ABBREVIATIONS................................................................................iv
ABSTRACT.............................................................................................................v
CHAPTER ONE......................................................................................................1
1. INTRODUCTION...............................................................................................1
1.1. BACKGROUND OF THE STUDY...................................................................................................................................1
1.2. STATEMENT OF THE PROBLEM..................................................................................................................................4
1.3. OBJECTIVES OF THE STUDY...................................................................................................................................... 5
1.3.1. General Objectives:....................................................................................................................................5
1.3.2. Specific Objective:......................................................................................................................................5
1.4. SIGNIFICANCE OF THE STUDY...................................................................................................................................5
CHAPTER TWO.....................................................................................................6
2. LITERATUR REVIEW......................................................................................6
2.1. OVERVIEW OF CERVICAL CANCER..............................................................................................................................6
2.2. RISK FACTORS OF SURVIVAL OF PATIENTS WITH CERVICAL CANCER...................................................................................8
2.2.1. Socio-demographic factors........................................................................................................................8
2.2.2. Clinical factors.........................................................................................................................................12
2.3. COX PROPORTIONAL HAZARDS VERSUS PARAMETRIC MODELS.......................................................................................15
CHAPTER THREE...............................................................................................16
vi
3.5.6. Strategies for analysis of non proportional data.....................................................................................29
3.5.7. Parametric model....................................................................................................................................30
3.5.8. Parametric Regression Model assessment...............................................................................................33
3.5.9. Comparison of Cox PH and Parametric Models.......................................................................................33
CHAPTER FOUR.................................................................................................35
CHAPTER FIVE...................................................................................................47
6. REFERENCE....................................................................................................49
vii
LIST OF TABLES
Table-1: Results of descriptive measures of categorical predicator variables, BLH, 2014-2017.......................37
Table-2: Results of log rank test for the categorical variables in BLH, 2014-2017............................................38
Table-3 Results of univariate Cox analysis, BLH, 2014-2017.............................................................................39
Table-4: Results of multivariate Cox regression model, BLH, 2014-2017.........................................................39
Table-5: Test of proportional hazards assumption based on schoenfeld residuals BLH, 2014-2017.................40
Table-6: Results of univariate Weibull PH, Exponential and Log- normal PH models BLH, 2014-2017..........42
Table-7: Results of multivariate Exponential PH model, Weibull PH and log normal PH models BLH, 2014-
2017......................................................................................................................................................................42
LIST OF FIGURES
Figure-1: Cumulative hazard plot of the cox-snell residual for multivariate Cox PH model BLH, 2014-2017....41
ANNEX 1: plot of log (-log (survival))) versus log survival time for categorical variables in the fitted model.....55
viii
CHAPTER ONE
1. INTRODUCTION
Cancer is a group of diseases characterized by the uncontrolled growth and spread of abnormal
cells. If the spread is not controlled, it can result in death. Cancer is caused by external factors,
such as tobacco, infectious organisms, and an unhealthy diet, and internal factors, such as
inherited genetic mutations, hormones, and immune conditions. Treatments include surgery,
radiation, chemotherapy, hormone therapy, immune therapy, and targeted therapy (drugs that
interfere specifically with cancer cell growth) (ACS, 2016).
According to WHO report cancer is the second leading cause of death globally, and was
responsible for 8.8 million deaths in 2015. Globally, nearly 1 in 6 deaths is due to cancer and
from this approximately 70% deaths of cancer occurring in low-income and middle-income
countries (Organization, 2015). Around one third of deaths from cancer are due to the 5 leading
behavioral and dietary risks: high body mass index, low fruit intake and vegetable intake, lack of
physical activity, tobacco use, and alcohol use. From these leading risk factors tobacco use is the
most important risk factor for cancer and is responsible for approximately 22% of cancer deaths
(Forouzanfar et al., 2016). Cancers that originate in the female reproductive system includes
cancer of the cervix, breast, ovaries, vagina, vulva and endometrial (ACS, 2011).
Worldwide 874 million women age of 15 years and older are at risk of cervical cancer; 530,232
new cervical cancer cases are diagnosed and 275,008 cervical cancer deaths occur annually.
About 86% of the global cervical cancer burden occurs in less developed countries (De Sanjosé
et al., 2012). The ASIR and ASMR were 18 and10 per 100,000, respectively, in developing
countries and 9 and 3 per 100,000, respectively, in more developed countries. The incidence and
mortality in sub-Saharan Africa are among the highest in the world and accounts for over 70% of
the global cervical cancer burden with 70,000 new cases annually. It is a health concern among
women worldwide as it ranks as the second most common cause of cancer among women
1
(WHO, 2012).The ACS estimates indicates that there will be 12,900 new diagnoses and 4100
cervical cancer-related deaths in the United States in 2015 (Pfaendler and Tewari, 2016).
In Africa, which has a population of 267.9 million women aged 15 years and older at risk of
developing cervical cancer, approximately 80,000 women are diagnosed with cervical cancer
each year, and just over 60,000 women die from the disease. However, cervical cancer incidence
in Africa also varies considerably by region. The highest rates in Africa (ASIR >40 per 100,000)
are all found in Eastern, Southern, or Western Africa(Denny and Anorlu, 2012).Cancer causing
infections such as, Hepatitis C virus, HPV and infection with HIV substantially increases the risk
of cervical cancer (Forouzanfar et al., 2016, Plummer et al., 2016). Infection with hepatitis and
HPV, are responsible for up to 25% of cancer cases in low and middle-income
countries(Plummer et al., 2016).
In 2015, only 35% of low-income countries reported having pathology services generally
available in the public sector. More than 90% of high-income countries reported treatment
services are available compared to less than 30% of low-income countries and the number of
deaths from cervical cancer is nearly 10 times greater in these low-income countries than in
developed regions, and this is mainly due to lack of access to anticancer therapy combined with
late presentation(Organization, 2015). The rates of cervical cancer in developed countries have
decreased dramatically because of cytological screening and DNA testing for high-risk HPV
types (Tewari et al., 2014). In fact, Sub-Saharan Africa has the highest incidence of cervical
cancer in the world, with an incidence rate of 50.9 cases per 100,000 women’s (Nelson et al.,
2016).
The burden of cervical cancer among women in sub-Saharan Africa including Ethiopia is very
high. This is on the fact that knowledge and awareness of patients on the continent are very poor
and mortality still very high. Facilities for the prevention and treatment of cervical cancer are
still very inadequate in many countries in the region. Governments in sub-Saharan Africa must
recognize cervical cancer as a major public health concern and allocate appropriate resources for
its prevention and treatment, and for research. Indeed, cervical cancer in this region must be
accorded the same priority as HIV, malaria, tuberculosis and childhood immunizations (Anorlu,
2008).
2
In Ethiopia according to the WHO estimates, there was 7,600 are diagnosed with cervical cancer
and roughly 6,000 women die of the disease each year (WHO, 2011, Abate, 2016). Although
there is no national cancer registry in Ethiopia, reports from a retrospective review showed.
Despite this fact, very few women receive screening services in Ethiopia. Although there is no
national cancer registry, reports from retrospective review of biopsy results have shown that
cervical cancer is the most prevalent cancer among women in the country next to breast cancer,
and low level of awareness, lack of effective screening programs, overshadowed by other health
priorities (such as AIDS, TB, malaria) and insufficient attention to women’s health are one major
determinant for the prognosis of cervical cancer is the stage at which the patient presents
(Alemayehu, 2008).
One major determinant for the prognosis of cervical cancer is the stage at which the patient
presents. Most patients in developing countries including Ethiopia present late with advanced
stage disease, in which treatment may often involve multiple modalities including surgery,
radiotherapy, chemotherapy, and has a markedly diminished chance of success. Several factors
such as educational status, financial capability, location, presence of health care facilities
determine the stage at which patients with cancer present to the health facility(Bailie et al.,
1996).
3
1.2. Statement of the problem
Internationally, the burden of cervical cancer falls most heavily on developing nations. About
85% of the cases and 88% of the deaths due to cervical cancer occur in developing nations with
the majorities in sub-Saharan countries (Globocan, 2012).The highest case load of cervical
cancer is in the eastern Africa region (43%) and lowest in the western Africa region (9.8%). The
frequency of cervical cancer death rate is also proportionally higher in the eastern Africa region
and lower in the western Africa region; 44% and 7.2% respectively(Ibrahim et al., 2011).
Ethiopia has a population of 29.4 million women aged 15 and older who are at risk of developing
cervical cancer. Each year an estimated 7095 women are diagnosed with cervical cancer; 4732
die from the disease and it is an important reproductive health problem and is a major cause of
mortality and morbidity in women than any other cancers. The possible contributing factors are
low level of awareness, cost, limited access to screening services and lack of a national cancer
registry(ICO, 2016).
According to the rationale here is, Studying on the survival of cervical cancers patients has
important practical value for patients, providers, and researchers. Cancer survival data is not
widely available in sub-Saharan Africa countries including Ethiopia. Due to this, there are few
studies that are conducted regarding to HPV burden, screening and survival time of cervical
cancer in Ethiopia, despite the high burden and mortality. The proper understanding of prognosis
may help both of the physicians and the patients decide on treatment options, balancing the
personal values for quality versus quantity of life they provide. Furthermore, this few studies on
the survival of cervical cancer patients is based on semi-parametric proportional hazards model.
Therefore, to prevent and control the risk factors that influence on the survival of cervical cancer
patients for the future it is better to assess the determinants of cervical cancer cases, and this
study will assess the survival time among cervical cancer patients and its determinants at Black
Lion hospital using Cox proportional hazard model and parametric proportional hazards model.
4
1.3. Objectives of the study
1.3.1. General Objectives:
To analyze the risk factors that influences the survival time of cervical cancer patients in Black
Lion Hospital Addis Ababa, Ethiopia.
5
CHAPTER TWO
2. LITERATUR REVIEW
Cervical cancer is a cancer arising from the cervix, which is due to the abnormal growth of cells
that have the ability to invade other part of the body. Early on, typically no symptoms are seen.
Later symptoms may include abnormal vaginal bleeding, pelvic pain, or pain during sexual
intercourse (Dhiman et al., 2014). While bleeding after sex may not be serious, it may also
indicate the presence of cervical cancer (Tarney and Han, 2014).The primary underlying cause of
cervical cancer is infection with HPV, specifically two strains HPV16 and HPV 18. There are
two HPV vaccines (Guardrail and Cervarix) that reduce the risk of cancerous or precancerous
changes of the cervix and perineum by about 93% and 62%, respectively.
The vaccines are between 92% and 100% effective against HPV 16 and 18 up to at least 8 years
(Health, 2015).Other risk factors include smoking, a weak immune system, use of birth control
pills, starting sex at a young age, and having many sexual partners (Louie et al., 2009). Diagnosis
is typically by cervical screening followed by a biopsy and medical imaging is then done to
determine whether or not the cancer has spread (McGuire, 2015). Cervical cancer stage is
classified by FIGO staging system, which is based on clinical examination, rather than surgical
findings (Pecorelli et al., 2009).
The five year survival rate in low HDI countries is less than 20% and more than 65% in
developed countries. There are five-fold or greater differences in incidence between world
regions. In those countries for which reliable temporal data are available, incidence rates appear
to be consistently declining by approximately 2% per annum.
6
There is, however, a lack of information from low HDI countries where screening is less likely to
have been successfully implemented. Estimates of the projected incidence of cervical cancer in
2030, based solely on demographic factors, indicate a 2% increase in the global burden of
cervical cancer, i.e., in balance with the current rate of decline. Due to the relative small numbers
involved, it is difficult to discern temporal trends for the other cancers associated with HPV
infection (Forman et al., 2012).
A systematic review of cervical cancer prevention and treatment that included publication
between 2004 and 2014 assess a range of cervical cancer research in Africa. This includes totals
of 380 research articles/reports. The majority (54.6 %) of cervical cancer research in Africa
focused on secondary prevention (i.e., screening). The number of publication focusing on
primary prevention (23.4 %), particularly HPV vaccination, increased significantly in the past
decade. Research regarding the treatment of precancerous lesions and invasive cervical cancer is
emerging (17.6 %), but infrastructure and feasibility challenges in many countries have impeded
efforts to provide and evaluate treatment. Studies assessing aspects of quality of life among
women living with cervical cancer are severely limited (4.1 %). Across all categories, 11.3 % of
publications focused on cervical cancer among HIV-infected women, while 17.1 % focused on
aspects of feasibility for cervical cancer control efforts (Finocchario-Kessler et al., 2016).
In Ethiopia the majority of HPV related cancers of the cervix histological type are squamous cell
carcinoma followed by adenocarcinomas and adenosquamous cell carcinoma. This is due to the
cause of HPV and the prevalence of HPV16 and/or HPV18 among women with cervical cancer
in Ethiopia are 90.8% (Papillomavirus, 2014, ICO, 2016).
A study conducted by (Tadesse, 2016) for the purpose of evaluating preventive mechanisms and
treatment of cervical cancer in Ethiopia, that were attending care in BLH. This qualitative design
study found that due to the inefficient attention paid to cervical cancer, prevention mechanisms
and treatment were found to be largely inadequate and underdeveloped. The lack of proper data
and other competing health care needs have been stated as the main reason behind the lack of
attention paid to cervical cancer. Though steps are currently being taken to expand screening,
pre-cancer treatment and invasive cancer treatment sites, the study found all the steps being
taken to be in preliminary stages.
7
Another study done in BLH, Addis Ababa, Ethiopia to estimate economic burden of cervical
cancer among patients and their family members and to determine predictors for variation in
patient related cost of cervical cancer. The average outpatient cost per patient for cervical cancer
was Birr 5,905 ($407.2) (median = 3,000 ($206.9)). Mean direct outpatient cost (Birr 4,845.3
($334.2) takes the largest share compared with the indirect counterpart (Birr 2,173.7 ($150)).
The outpatient cost for almost half of the respondent fails in a range between Birr 6,933 ($478)
and Birr 1,359 ($93.7). Mean inpatient cost for hospitalized patients was Birr 5,863.2 ($404.4).
The average direct inpatient cost was Birr 4,771 ($329) (74% medical costs and 26% non
medical costs). The mean value for total inpatient cost for nearly half of the respondent were in a
range of Birr 7,161($493.9) and 1,936 ($133.5). For every single day increment in inpatient
hospital stay there is equivalent increment of Birr 61.5 ($4.2) on total inpatient patient side cost
(Desalegne, 2011).
8
And also survival was higher in urban (57.3%) than rural residents (54.7%). Only 46% illiterate
patients had survived (p = 0.001). Survival in class I socio-economic status was 69.2% where as
in class V it was 46.7%.
In a related combined prospective and retrospective cohort study conducted in Mysuru, India for
the purpose of estimating prognostic factors for the survival of cervical cancer patients. The
result of Kaplan-Meier and the Cox proportional hazards regression model showed age at
diagnosis increases the risk of deaths with older age that is the survival rate was 53.8% among
patients aged < 35 years when followed up for five years. 65.8% and 56.8% of the patients
survived in the 35-49 and 50-64 years age group. Whereas among patients aged more than 65
years the survival was only 30.9%. The median survival time (14 months) was the highest among
patients in the age group of 50- 64 years. Patients aged < 35 years as well as patients aged > 65
years had a median survival time of 6 months (Vishma et al., 2016).
(Nesrin and KILIC, 2011) employed multivariate logistic regression to identify the risk factors
for Cervical Cancer patients from a Hospital-Based Case-Control Study in Istanbul, Turkish 209
patients with histological confirmed cervical cancer were compared with 1050 controls, who
were admitted to the different departments of the same hospital. The result shows the risk factors
for cervical cancer were found to be early age at first diagnosis (OR = 58.07, 95% CI: 27.88-
120.95). However, compared with controls, it was found that cases including higher education
(OR = 0.18, 95% CI: 0.10-0.30), non-married (OR = 0.07, 95% CI: 0.04-0.12) and non-use of
alcohol (OR =0.30, 95% CI: 0.14-0.65) had a decreased risk for cervical cancer.
9
A hospital-based retrospective study from Visakhapatnam City, Andhra Pradesh conducted by
(Kumari et al., 2010) confirmed that aged women’s have lower survival rate than the younger
one. The result in this study indicates that the overall survival of patients were 100%, 85%, 89,
80%, 75%, and 100% and disease free survival were 100%, 64%, 61%, 63%, 44% and 100% for
the patients with the age group of less than 30 years, 40 years,50 years, 60 years, 70 and 80
years.
(Mitiku and Tefera, 2016) Conducted a community based cross-sectional survey in Dessie town,
Northeast Ethiopia to assess women’s knowledge about cervical cancer and associated factors.
This study employed Binary and multiple logistic regressions to assess the risk factors. After
adjusting for covariates, having sufficient knowledge about cervical cancer was positively
associated with better educational level and income. Women with primary education (AOR: 3.4;
95% CI: 2.2–5.1) and those who had secondary and above education (AOR: 8.7; 95% CI: 5.5–
13.7) were more likely to have sufficient knowledge about cervical cancer compared to those
who had no formal education. Furthermore, women earning an average household monthly
income above 1500 Ethiopian birr (ETB) (~75 U.S. dollars) were more likely to have sufficient
knowledge (AOR: 2.3; 95% CI: 1.3–3.9) than women with an average household monthly
income less than 500 ETB (~25 U.S. dollars).
Similarly, a study conducted by (Teame, 2016) determines factors associated with cervical
precancerous lesion among women screened for cervical cancer in Addis Ababa. The bi-variate
result showed that the magnitude of cervical precancerous lesion was 12.8%. Being in the age
group of 40-49 years (44.9%) were significantly associated with cervical precancerous lesion
than being in age group of 30-39 years (39.07%) (AOR=2.40, 95%CI (1.27-4.54)).
10
This study concludes that the majority of the cervical cancer sufferers at NICRH were from
lower-socioeconomic group having less education with a late cancer presentation mostly with
squamous cell carcinoma (Jabeen et al., 2015).
(Muhamad et al., 2015) employed Kaplan Meier analyses and log-rank test to determine the
survival rates of Malaysian women with cervical cancer and associated factors. The Kaplan
Meier curve the median survival time was 65.8 months and the 5-year survival rate was 71.1%.
There were significant differences in survival rates between patients from different age groups
when tested by using the log-rank test. Women less than 45 years old have a better 5-year
survival rate compared to those 45 years old and above (85.2vs 63.8%). Similarly, women aged
less than 45 years old had higher median survival months compared to women aged 45 years old
and above (75.0vs 64.5 months). Similarly, a study conducted by (Mutai et al., 2013) on the
survival of patients with cancer of the cervix in Nairobi, Kenya. Cox regression analysis showed
the age of patients and level of education significantly affects the survival.
A retrospective study done at Rama thibodi hospital, Bangkok, Thailand on age and Survival of
Cervical Cancer Patients with Bone Metastasis and the results shows that the younger age group
had less median overall survival than the older age group, with a statistically significant
difference (21 months, 95% CI 19.93-22.06; 34 months, 95% CI 23.27-44.72, p = 0.021)
(Nartthanarung et al., 2014).
11
Another study conducted in South India using logistic regression showed marital status and
levels of education were the most significant variables i.e. women who were widowed/divorced
(OR=2.08; 95%CI: 1.24-3.50) and had a lower education (OR=2.62; 95%CI:1.29-5.31 for
women with primary school education only) (Kaku et al., 2008).
A retrospective cohort study conducted in Osaka University found those treatment modalities are
the most significant predicators on survival. The Univariate and multivariate analyses showed
the patients treated with surgery survived significantly longer than those treated with
chemotherapy (P= 0.0318) or palliative care (P= 0.0292). The patients treated with radiotherapy
survived significantly longer than those treated with chemotherapy (P= 0.0004) or palliative care
alone (P= 0.0005) (Mabuchi et al., 2010). Similarly, (Chokunonga et al., 2004) also found that
the type of treatment significantly affects survival of cervix cancer patients in Harare,
Zimbabwe. The result showed that the survival was significantly greater in the first 3 years for
patients who received radiotherapy treatment compared to those that had not.
A study done on the survival of a cohort of women with cervical cancer diagnosed in a Brazilian
cancer center employed Kaplan-Meier curves and a multivariate analysis through Cox model to
assess overall survival of women with cervical cancer and describe prognostic factors associated.
This study showed the 5-year overall survival was 48%. After multivariate analysis, tumor
staging at diagnosis was the single variable significantly associated with prognosis (p<0.001).
12
There was seen a dose-response relationship between mortality and clinical staging, ranging
from 27.8 to 749.6 per 1,000 cases-year in women stage I and IV, respectively (Carmo and Luiz,
2011).
A related study by (Juhan et al., 2013) in Malaysia also found that stage of disease was important
prognostic variables. That is patients who were diagnosed at stage III & IV are at 2.30 times the
risk of death as those in stage I & II.
A retrospective study conducted in Khan Kean University, Thailand for the purpose of
evaluating factors affecting survival of Cervical Cancer Patients Treated at the Radiation Unit.
From multivariate analyses, the factors that statistically affected survival of cervical cancer
patients included stage (p-value<0.001), hemoglobin level (p-value<0.001), interval between
external and intra sanitary radiation (p-value<0.001) and fractionation (p-value=0.024). Stage III
was associated with a 1.65-fold mortality risk compared with stage I (95% CI=1.05-2.59).
Patients with a low hemoglobin level (≤10g/dl.) demonstrated a 1.85-fold mortality risk
compared with patients a value >12 g/dl. (95% CI=1.40-2.44). An interval between external and
intra sanitary radiation >28 days was associated with 2.28-fold mortality risk compared with a
duration of <1 day (95% CI= 1.40-2.44). The fractionation 2 faction was associated with 0.25-
fold mortality risk compared with 1 fraction (95% CI=0.07-0.96) (Pomros et al., 2007).
(Chen et al., 1999) conducted a retrospective analysis to estimate the influence of histological
type on the survival rate of cervical cancer patients in Taiwan. They employed Kaplan–Meier
method, and Cox’s proportional hazards regression analysis and the result of the analysis showed
the 5-year survival rate was lower for patients with adeno carcinoma than for patients with
squamous cell carcinoma (66.5 vs. 74.0%, P= 0.0009). The 5-year survival rates for FIGO stages
I, II, III, and IV squamous cell carcinoma were 81.3, 75.2, 42.7, and 26.1%, respectively, while
for adenocarcinoma they were 75.9, 62.9, 29.2, and 0%, respectively. The difference in survival
rates between squamous cell carcinomand adeno carcinoma was found mainly in stage I (P=
0.0039) and stage II (P= 0.0103), where radiotherapy was used as the primary treatment.
A study conducted by (Intaraphet et al., 2013) in Chiang Mai University on Prognostic Impact of
Histology in Patients with Cervical cancer and the result showed histological type was the most
significant predicators on the survival. Overall, five-year survival was 60.0%, 54.7%, and 48.4%
in patients with Squalors Cell Carcinoma, Adenoma carcinoma and Small Cell Neuron endocrine
13
Carcinoma, respectively. After adjusting for other clinical and pathological factors, patients with
SNEC and ADC had higher risk of cancer-related death compared with SCC patients (HR 2.6;
95% CI, 1.9-3.5 and HR 1.3; 95% CI, 1.1-1.5, respectively). Patients with SNEC were younger
and had higher risk of cancer-related death in both early and advanced stages compared with
SCC patients (HR 4.9; 95% CI, 2.7-9.1 and HR 2.5; 95% CI, 1.7-3.5, respectively). Those with
advanced-stage ADC had a greater risk of cancer-related death (HR 1.4; 95% CI, 1.2-1.7)
compared with those with advanced-stage SCC, while no significant difference was observed in
patients with early stage lesions.
A retrospective study conducted by (Bruno, 1994) in University of North Florida for the purpose
of estimating the survival of cervical cancer patients and the result shows histological type have
prognostic value on the survival of cervical cancer patients. Another study conducted in U.S on
Prognostic model for survival in patients with early stage cervical cancer and the Cox regression
model showed that tumor size, histological type, and lymph node metastasis were independently
associated with the survival of cervical cancer patients (Biewenga et al., 2011).
(Nuranna et al., 2014) conducted a retrospective cohort study which enrolled cervical cancer
patients treated at Cipto Mangunkusumo Hospital, Jakarta. This study revealed that stage III and
IV had lower survival probability (HR 3.27). (Kaverappa et al., 2015a) in India also found that
staging (p = 0.001) are the most significant predicators on the survival. A retrospective hospital-
based study conducted in Nairobi, Kenya on survival of patients with cancer of the cervix and
the result showed the cumulative proportion surviving at the end of the study interval was 0.67 at
stage I, 0.36 at stage II, 0.15 at stage III and 0 at stage IV (Mutai et al., 2013).
14
Another study conducted by (Mascarello et al., 2013) in Brazil showed that women with stages
III and IV at diagnosis showed an increased risk of 4.33 (95%CI: 3.00-6.24) and 15.40 (95%CI:
9.72-24.39), respectively, for lower survival when compared to stage I.
The performance between Cox PH and parametric model have been done in the survival of
patients with cancer. (Pourhoseingholi et al., 2007) compared Cox regression and parametric
Models (exponential, weibull and log-normal) in patients with Gastric Carcinoma in Taleghani
hospital, Tehran from February 2003 through January 2007. They used AIC and standardized of
parameter estimates to compare the efficiency of models. In this study the proportional hazards
assumption found to be hold and they reported both Cox PH model and Exponential models are
fitted well in multivariate analysis. Although it seems that there may not be a single model that is
substantially better than others, in univariate analysis the data strongly supported the log normal
regression among parametric models and it can be lead to more precise results as an alternative
to Cox. Similarly, (Moghimi-Dehkordi et al., 2008) compared Cox regression and Parametric
Models (exponential, weibull and log-normal) in patients with stomach cancer in southern Iran.
They used AIC to compare the performance of Cox regression and Parametric Models in the
analysis of stomach cancer data.
The proportional hazards assumption found to be holding and the result showed that the Hazard
Ratio in Cox model and parametric ones are approximately similar, according to AIC; the
Weibull, Exponential models, log- normal model are the most favorable for survival analysis.
15
CHAPTER THREE
The Black Lion Cancer treatment Center was established 20 years ago by Dr. Bogale Solomon
who was at the time the only radiation and medical oncologist in the country. Black Lion
Hospital aspires to become a center of excellence in the diagnosis, treatment and care of patients
with cancer. With the support of Ethiopia’s governmental institutions, None governmental
organizations and international partners, including INCTR, the hospital is hoping to develop a
comprehensive cancer care program, including cancer registry, early detection, prevention,
standard treatment and palliative care.
3.2. Data
The target population of this study was being patients with cervical cancer at black lion hospital,
Addis Ababa was enrolled from December 2014 to January 2017 time period in oncology center.
In this retrospective cohort study the data was employ all cervical cancer patients, diagnosed in
between 2014-2017 and collect the data by reviewing follow-up charts of patients by using
standardized structured questionnaire.
16
3.4. Variable of the study
3.4.1 Dependent variable
The dependent variable (Y) is the survival time of cervical cancer patients, the length of time
from diagnosis start date until the date of death/censored measured in months; survival status
(alive or censored and dead) were used as the dependent variables.
17
3.5. Methods of statistical analysis
3.5.1. Survival analysis
Survival analysis is a statistical method for data analysis where the outcome variable of interest
is the time to the occurrence of an event. The event can be death, occurrence of disease, married,
divorce etc. Hence, survival analysis is also referred to as "time to event analysis", which is
applied in a number of applied fields, such as medicine, public health, social science, and
engineering etc. In medical science, time to event can be time until recurrence in a cancer study,
time to death, or time until infection(Lee and Wang, 2003).
Survival time is a length of time that is measured from time origin to the time the event of
interest occurred. Hence, survival data often consists of a response variable that measures the
duration of time until a specified event occurs and a set of independent variables thought to be
associated with the event-time variable. The specific difficulties in survival analysis arise largely
from the fact that only some individuals have experienced the event and other individuals have
not had the event in the end of study and thus their actual survival times are unknown. This leads
to the concept of censoring.
Censoring occurred when we have some information about individual survival time, but we do
not know the survival time exactly. There are three types of censoring: right censoring, left
censoring, and interval censoring. Right censoring is said to occur if the event occurs after the
observed survival time. Right censoring is very common in survival time data, but left censoring
is fairly rare the term "censoring" was be used in this thesis to mean in all instances "right
censoring". Censoring can also occur if we observe the presence of a condition but do not know
where it began. In this case we call it left censoring, and the actual survival time is less than the
observed censoring time. If an individual is known to have experienced an event within an
interval of time but the actual survival time is not known, we have interval censoring. The actual
occurrence time of event is known within an interval of time.
18
3.5.2. Survival time distribution
Let T be a random variable denoting the survival time. The distribution of survival times is
characterized by any of three functions: the survival function, the probability density function or
the hazard function.
These three functions give mathematically equivalent specification of the distributions of the
survival time T. If one of them is known, the other two are determined. The survival function is
most useful for comparing the survival progress of two or more groups. The hazard function
gives a more useful description of the risk of failure at any time point.
t
F ( t )=P ( T ≤ t )=∫ f ( u ) du , t ≥ 0
0
d d
f ( t )= ( 1 − S ( t ) )=− S ( t ) , t ≥ 0
dt dt
19
The hazard function is also known as the instantaneous death rate, force of mortality, conditional
mortality rate and hazard rate as it measures the conditional probability of the occurrence of
death per unit time. Hence the hazard function is given as:
P [ t ≤T <t+ ∆ t /T ≥ t ]
h ( t )= lim (3.2)
∆ t →0 ∆t
The hazard function varies from zero indicating no risk to infinity referring the certainty of
failure at that instant. In contrast to the survivor function, which focuses on surviving or not
failing, the hazard function focuses on failing that is the event occurring. Thus, in a way, survival
function and hazard function are complementary to each other. Another function called
cumulative hazard function which measures the total amount of risk that have been accumulated
up to time t can also be used to describe the survival experience.
There is a clearly defined relationship between S ( t )and ℎ ( t )which is given by the formula.
f (t ) f (t) −d
ℎ ( t )= = = lnS ( t ) (3.3)
1− F (t ) s (t) dt
[ ]
t
S ( t )=exp −∫ ℎ [ u ] du =exp ( − H (t) ) , t ≥ 0(3.4)
0
Where H ( t ) =∫ ℎ(u)du the cumulative hazard function, which can be obtained from
0
f ( t )=ℎ ( t ) S ( t ) (3.6)
20
3.5.3.1. Kaplan-Meier estimator of the survival function
The Kaplan-Meier estimator proposed by Kaplan and Meier (1958) is the standard non
parametric estimator of the survival function S(t ). Which is also called the product-limit
estimator incorporates information from all observations available, both censored and
uncensored, by considering any point in time as series of steps defined by the observed survival
and censored times (Kaplan and Meier, 1958).
Ŝ ( t )=∏
t j ≤t
n j− d j
dj jt ≤t
[ ]
=∏ 1 −
dj
nj
(3.7)
Where, n jis the number of individuals who are at risk of dying at timet j, j=1 ,2 , … mwith ^S(t)=1
for t< t1 and d jis the number of individuals who failed (died) at time t j . The variance of the K-M
survival estimator which is also known as the Greenwood’s formula is given by:
dj
var ( Ŝ ( t ) )=(Ŝ(t ))2 ∑ (3.8)
n j (n j −d j)
21
commonly used non-parametric tests for comparison of two or more survival distributions is the
log-rank test(Robins et al., 1986).
Let t 1 ≤ t 2 ≤ , …t m be the m distinct ordered death times across two groups. Suppose that d jfailures
occur at t jand that n jsubjects are at risk just prior tot j ( j=1 ,2 , … , m). Let d ij and nij be the
corresponding numbers in groupi(i=1 , 2). Then the log-rank test compares the observed
number of deaths with the expected number of deaths for group i. Consider the null
hypothesis: S(1)=S(2) i.e. there is no difference between survival curves in two groups. Given
n jand d jthe random variable d 1 j has the hyper geometric distribution
( )( )
dj n j− d j
d1 j n 1 j −d 1j
( ) nj
n1 j
Under the null hypothesis, the probability of experiencing an event at t jdoes not depend on the
dj
group, i.e. the probability of experiencing an event at t j is . So that the expected number of
nj
deaths in group one is
n1 j d j
E ( d 1 j ) =e^ 1 j= ,is the expected number of individuals who experienced an event at time t j in
nj
group 1.
The test statistic is given by the difference between the total observed and expected number of
deaths in group one
m
U L =∑ ( d1 j − v^ 1 j )(3.9)
j=1
n 1 j n2 j d j (n −d )
^v 1 j=Var ( d 1 j ) = 2
j j
(3.10)
n (n j − 1)
j
^v 1 jis the variance of the number of event occurred at time t jin group 1, d 1 j is the observed
number of failure (event occur) at time t jin group 1, n1 j is the number of individuals at risk of
22
event occur in the first group just before timet j , n2 j is the number of individuals at risk in the
second group just before timet j, d jis the total number of events occurred at t j, n j is the total
number of individuals at risk before timet j.
m
Var ( U L ) =∑ v^ 1 j =V L
j =1
Under the null hypothesis, statistic (3.9) has an approximate normal distribution with zero mean
2
UL 2
and varianceV L. This then follows x.
VL 1
The general form of the test statistic to test the equality of survival curves which can also be used
by several alternatives to the log-rank test, such as the Wilcox on test, may be defined as follows:
∑ w j (d 1 j − e ¿1 j)
Q= j=1 m
∑ w 2j v ¿1 j
j=1
Where ℎ o ( t ) is called the baseline hazard function, which is the hazard function for an individual
for whom all the variables included in the model are zero, x=(x 1 , x 2 , … , x p)' is the values of the
23
vector of explanatory variables for a particular individual, and β ' =β 1 , β 2 , … , β p is a vector of
regression coefficients.
This model, also known as the Cox regression model, makes no assumptions about the form of
ℎ o ( t )(non-parametric part of model) but assumes parametric form for the effect of the predictors
on the hazard (parametric part of model). The model is therefore referred to as a semi-parametric
model.
The measure of effect is called hazard ratio. The hazard ratio of two individuals with different
covariates x 1and x 2 is
,
exp (X 1 β)
^ =exp ( ( X 1 − X 2 ) β ) (3.12)
' '
HR= ,
exp (X 2 β)
This hazard ratio is time-independent, which is why this is called the proportional hazards model.
Assumption (2) is what led us to call this a proportional hazards model. To express this
Mathematically, consider two distinct values of the covariate x , say x 1and x 2 in equation (3.12)
HR=exp ( ( X 1 − X 2 ) β )
^ ' '
This equation indicates that the hazard ratio is independent of time t . This shows that the ratio of
the hazard functions for two individuals with different covariate values does not vary with time.
24
Suppose the survival data based on n independent observations are denoted by the triplet (t i,δ i , x i
) i =1, 2..., n
Where
t i Is the survival time for the ith individual?
δ iIs an indicator of censoring for the ith individual? Given by 0 for censored and 1 for
event experience.
Xi = (Xi1, Xi2.......Xim)' is a column vector of m covariates for individual i.
The full likelihood function for right censored data can be constructed as:
n
L ( β ) =∏ ℎ ( ti , X i , β )δi S ( ti , X i, β ) (3.13)
i=1
Whereℎ(ti , X i , β)=ℎ0 ( ti)e β ' Xi is the hazard function for the ith individual.
( )
S ( ti , X i, β ) = [ so ( ti ) ]exp β is the survival function for the ith individual.
' Xi
It follows that:
n
L ( β ) =∏ [ ℎ0 ( ti ) e β Xi ] [ s o ( ti ) ]
' δi exp ( β )
' Xi
( 3.14 )
i=1
The full maximum likelihood estimator of β can be obtained by differentiating the right hand
side of equation (3.14) with respect to the components of β and the base line hazardℎ0 ( t).
This implies that unless we explicitly specify the base line hazard , ℎ0 (t ), we cannot obtain the
maximum likelihood estimators for the full likelihood. To avoid the specification of the base line
hazard, Cox (1972) proposed a partial likelihood approach that treats the baseline hazard as a
nuisance parameter and removes it from the estimating equation (Blakely and Cox, 1972).
25
Suppose that data are available for n individuals, amongst them there are r distinct failure times
and n −r right-censored survival times, and assume that only one individual was died at each
ordered failure time, so that there are no ties. The r ordered failure times will be denoted by
t(1)<t(2)<….< t(r), so that t (i)is the ith ordered failure time.
The set of individuals who are at risk at time t (i)is the ith ordered failure (experiences an event)
time, and denoted by R(t (i )). And let x ibe the vector of explanatory variables for an individual
who experiences an event att i.
The partial likelihood function is derived by taking the product of the conditional probability of a
failure at timet (i), given the number of individuals who are at risk of experiencing the event at
timet (i).
Then the probability that the jth individual will experience an event at time t (i) is given by:
exp ( β' X ( i ) )
L p ( β )= ( 3.15 )
∑ exp ( β Xj )
'
j ∈R ( t ( i ) )
Where, the summation in the denominator is over all individuals in the risk set. Thus the partial
likelihood is the product over all event time t (i)for i=1 , 2 ,… , r of the conditional probability
(3.15) to give the partial likelihood function and can be expressed in the form:
[ exp ( β X ( i ) )
]
r '
L Lp ( β ) = ∏ ( 3.16 )
i=1 ∑ exp ( β Xj )
'
j ∈ R (t ( i ) )
i=1 {
log L p ( β )=∑ β' X ( i ) − log
[ ∑
j ∈R (t (i ))
exp ( β ' Xj) (3.17)
]}
The partial likelihood derived above is valid when there are no ties in the data set. But in most
real situations tied survival times are more likely to occur. In addition to the possibility of more
than one experience an event at a time, there might also be more than one censored observations
26
at a time of event. To handle this real-world fact, partial likelihood algorithms have been adopted
to handle ties.
There are three approaches commonly used to estimate regression parameters when there are
ties. These are Efron (1977), Breslow’s approximation and Cox (1972) approximations. The
most popular and easy approach is Breslow’s approximation (Breslow, 1974).
The Breslow approximation is proposed by Breslow and Peto by modifying the partial likelihood
takes the following form:
k
exp( β ¿ ¿ ' s i)
LB ( β )=∏
¿
i=1 { ∑ exp ( β ' x j ) }
di
(3.18)
jϵR (t i )
Whered i is the number of deaths that occurred sequentially at time t i and siis the vector of sums
of each of the m covariates for those individuals who die at the ith death time?
Where ^
H o (t i) is an estimate of the baseline cumulative hazard function at time t i, the observed
H i(t i) = − log S^ i (t i ),
survival time of that individual. The Cox-Snell residual,rc i, is the value of ^
27
H i(t i) and ^Si (t i) are the estimated values of the cumulative hazard and survivor functions
where ^
of the ith individual at t i.
If the model was well fitted, the value ^Si (t i) would have similar properties to those of Si (t i) . So
rc i=− log ^Si (t i ) will have a unit exponential distribution, and a test of this assumption provides a
test of model adequacy.
Therefore, we use a plot of H (rc i) versus rc i to check the fit of the model. This gives straight line
with unit slope and zero intercept if the fitted model is correct. Note the Cox-Snell residuals will
not be symmetrically distributed about zero and cannot be negative. This residual is also used to
test overall goodness of fit test.
r mi=δ i −r ci (3.20)
Whereδ i=1 for uncensored observation andδ i=0 for censored observation and the martingale
residuals take values between negative infinity and unity. They have a skewed distribution with
mean zero (Andersen and Gill, 1982). The deviance residuals are a normalized transform of the
martingale residuals (Therneau et al., 1990). They also have a mean of zero but are
approximately symmetrically distributed about zero when the fitted model is appropriate. The
plot of the deviance residuals against the covariates can be obtained. Any unusual patterns may
suggest features of the data that have not been adequately fitted for the model. Very large or very
small values suggest that the observation may be an outlier in need of special attention.
The ith partial or Schoenfeld residual for x i, the ith explanatory variable in the model is given by:
28
Where x ji is the value of the jth explanatory variable, j=1, 2… p, for the i th individual in the study,
and if individuals in the risk set are indexed by l, then:
∑ X ji exp ( ^β X 1)
l ∈R (t i )
α^ ji =
∑ exp ( ^β X 1 )
lϵR (t i )
Schoenfeld residuals are also used to check the proportionality of the covariates over time that is
to check the validity of the proportional hazards assumption. If the model fits well then the
residuals are randomly distributed without any systematic pattern around the zero line, reference
line.
Cox (1972) proposed a way of checking the proportional hazards assumption by introducing a
constructed time dependent covariate into the model. This is done by specifying a form for the
time by predictor interaction and testing the coefficients of such interactions for significance .i.e.,
adds interaction terms involving time to the model and test for their significance. The Cox model
extended for time-dependent variables can be specified as follows:
p p
h(t,x)=ℎ0 (t)exp(∑ β i X i+∑ γ i X i gi(t)) (3.22)
i=1 i=1
29
A plot of the scaled Schoenfeld residuals as a function of time is used to check the
proportionality assumption for each covariate. In a ‘well-behaved’ model the Schoenfeld
residuals are scattered around zero and a regression line fitted to the residuals has a slope of
approximately zero. The idea behind this test is that if the proportional hazards assumption holds
for a particular covariate, then the Schoenfeld residuals for that covariate will not be related to
survival time.
Note that the hazards are non-proportional because the baseline hazards may be different
between strata. The coefficients are assumed to be the same for each stratum g. The partial
likelihood function is simply the product of the partial likelihoods in eachstratum. A drawback of
this approach is that we cannot identify the effect of this stratified predictor. This technique is
most useful when the covariate with non-proportionality is categorical and not of direct interest.
30
proportionality by time-dependent covariates. The violations of PH assumptions are equivalent to
interactions between covariates and time. That is, the PH model assumes that the effect of each
covariate is the same at all points in time. If the effect of a variable varies with time, the PH
assumption is violated for that variable. To model a time-dependent effect, one can create a time-
dependent covariate X (t), and then βX ( t )=βX ∗ g ( t ) . g(t) is a function of t, logt or Heaviside
function, etc.
The parametric proportional hazards model is the parametric versions of the Cox proportional
hazards model. It is given with the similar form to the Cox PH models. The key difference
between the two kinds of models is that the baseline hazard function is assumed to follow a
specific distribution when a fully parametric PH model is fitted to the data, whereas the Cox
model has no such constraint. The coefficients are estimated by partial likelihood in Cox model
but maximum likelihood in parametric PH model. Other than this, the two types of models are
equivalent. Hazard ratios have the same interpretation and proportionality of hazards is still
assumed. In this study the following common parametric models are considered.
With λ,γ > 0. The hazard rate increases when γ> 1 and decreases when γ < 1 as time goes on.
When γ = 1, the hazard rate remains constant, which is the special exponential case.
31
Under the Weibull PH model, the hazard function of a particular patient with covariates
'
x=(x 1 , x 2 , … , x p) is given by
We can see that the survival time of this patient has the Weibull distribution with scale parameter
λ and shape parameter γ: Therefore the Weibull family with fixed γ possesses PH property.
This shows that the effects of the explanatory variables in the model alter the scale parameter of
the distribution, while the shape parameter remains constant.
S ( t ; x )=exp ¿ ¿
After a transformation of the survival function for a Weibull distribution, we can obtain the
formula:
The plot of log { − logS (t) } versus log (t)should give approximately a straight line if the
Weibulldistribution assumption is reasonable. The intercept and slope of the line will be rough
estimate of log λ and γ respectively. If the two lines for two groups in this plot are essentially
parallel, this means that the proportional hazards model is valid. Furthermore, if the straight line
has a slope nearly one, the simpler exponential distribution is reasonable.
Another approach to assess the suitability of a parametric model is to estimate the hazard
function using the non-parametric method. If the hazard function were reasonably constant over
time, this would indicate that the exponential distribution might be appropriate. If the hazard
function increased or decreased monotonically with increasing survival time, a Weibull
distribution might be considered.
32
S ( t )=exp ( − λt ) ,ℎ (t )= λ(3.28)
Under the exponential PH model, the hazard function of a particular patient is given by
( β¿¿' x)(3.29)¿
ℎ ( t ; x )= λ exp ( β 1 x 1+ β2 x 2+ …+ β p x p ) =λ e
In the other way, for an exponential distribution, there islogS ( t ) =− λt thus we can consider the
graph of logS ( t ) versust . This should be a line that goes through the origin if exponential
distribution is appropriate.
ℎ ( t )=
ϕ( δ )
log ( t )
(3.30)
[ 1− Φ( δ )]
log ( t )
( )
2 t
1 −t 2 1 −u
( )
Where, ϕ t = exp is the probability density function, andΦ ( t ) ∫ exp du is the
√2 π 2 −∞ √2 π 2
cumulative distribution function of the standard normal distribution. Then, the log-hazard
function of T at any covariate value X can be expressed as:
( )
t
=log ℎ0 ( t e − β x ) − β ' x
'
logℎ (3.31)
X
Obviously we no longer have a proportional hazards model. If the baseline hazard function is
desired, it can be obtained from equation (3.29) by setting x = 0. The survival function S(t/x) at
any covariate x can be expressed as:
1 β
Where,α = , β∗j = j forj=0 , 1.. … p, This the final survival model with intercept depending
δ δ
with t.
33
3.5.8. Parametric Regression Model assessment
Every step during model fitting uses the upcoming statistical procedures and later at the end we
checked all the assumptions needed for the model. One of the statistical procedures that are used
to fit the final model is: Deviance (D) is twice the difference of the likelihoods for the full model
(with higher covariates) and the reduced model (with lower covariate/s) (Hosmer and
Lemeshow, 1989)of adding a new independent variable to a parametric regression model is
tested by subtracting the deviance of the model with the new parameter from the deviance of the
model without the new parameter, the difference is then tested against a chi-square distribution
with degrees of freedom equal to the difference between the degrees of freedom of the old and
new models.
D= {( β p ) − L ( β q ) } (3.33)
Where, p and q are number of covariates such that, p>q .
Finally, those factors significant at 5% level of significance included in the final multivariate
parametric regression model.
The precision of the regression coefficients is another criterion that can be used to compare
different models. The smaller the standard error, the more precise an estimate is expected to be.
A model with more precise coefficients can be considered as a more precise model.
34
1974)may also be used when comparing the variability of different parametric models. The AIC
of a model may be defined a
AIC=−2≪+2 ( c+ k +1 ) (3.34)
Where ¿ , c , k denotes the log-likelihood, number of covariates and the number of model-specific
ancillary parameters respectively. A lower value of AIC suggests a better model.
35
CHAPTER FOUR
The descriptive summary shows a death proportion seems lower for urban women’s (5.02%)
than for rural women’s (11.00%). The married group showed the highest percentage (10.04%)
with respect to death proportions than the other three groups and patients whose religion are
orthodox have highest percentage of death (9.27%) than the others religion groups. Stage III
patients have the highest death proportion (7.72%) as compared to the other groups while stage I
patients show the lowest death rate. The patient whose histological type has Squamous cell
carcinoma was the highest percentage of death (12.36%) than the other types and well differentiated
histological grade patients have highest death proportions (7.53%) than the other grades. A patient who
took oral contraceptive was the highest death proportions (9.27%) and patients who have diabetic
were lower death proportion (1.54%) than the patients who have no diabetics. The death
proportion of patients who have no hypertension was highest (12.55%) than the patients who
have hypertension and HIV negative patients have the highest proportion of death (10.62%) than
the other groups. All the results have been summarized in Table 1 below.
36
Table-1: Results of descriptive measures of categorical predicator variables, 2014-2017
Covariates Category Status
Differences in all key variables were determined using log rank (χ2) test and assessed the
equality of survival functions for the different categorical variables. The null hypothesis to be
tested is that there is no difference between the probabilities of an event occurring at any time
37
point for each population. According to the test results residence, FIGO stage, histological type,
histological grade, OCP use, hypertension and HIV status were a statistically not equal in
experiencing the death event, whereas Marital status, religion and presence of diabetics are
statistically the same in experiencing the event death. The STATA result has been presented in
table 2 below.
Table-2: Results of log rank test for the categorical variables in BLH, 2014-2017
Variable Chi-square Pr>Ch-square
Consequently, the significant variables for building a multivariate Cox model are age at first
diagnosis, place of residence, FIGO stage at diagnosis, histological type, histological grade, and
Oral contraceptive use, presence of hypertension co-morbidity and HIV status of the patients.
The hazard ratio, 95% CI and standard errors for each variable is given below in table 3.
38
Table-3 Results of univariate Cox analysis, BLH, 2014-2017.
Variables HR SE p-value 95% CI
Lower Upper
Age 1.024 0.0094 0.009 1.006 1.042
Religion 0.888 0.160 0.511 0.624 1.265
Residence 2.159 0.512 0.001 1.357 3.435
Parity 0.971 0.042 0.494 0.893 1.056
FIGO stage 1.519 0.212 0.003 1.155 1.997
Histological type 0.609 0.117 0.010 0.417 0.888
Histological grade 0.725 0.101 0.020 0.553 0.951
Oral contraceptive use 1.792 0.399 0.009 1.157 2.772
Presence of diabetics 1.348 0.502 0.422 0.649 2.799
Presence of hypertension 2.223 0.594 0.003 1.317 3.754
HIV status 0.337 0.099 0.022 0.562 0.956
The significant variables at 5% level were considered in the model. Among the predicator
variables considered for building multivariate Cox, the forward stepwise procedure picked up six
variables age at diagnosis, place of residence, FIGO stage, histological type, histological grade
and presence of hypertension. The multivariate Cox model based on this significant variable was
summarized in table 4 below.
39
Age 0.028 1.028 0.009 0.003 1.009 1.048
Residence
1 0.706 2.027 0.490 0.003 1.262 3.256
FIGO stage
1 0.478 1.614 1.044 0.460 0.454 5.734
2 0.979 2.661 1.617 0.107 0.808 8.761
3 1.410 4.097 2.545 0.023 1.213 13.840
Histological type
1 -0.639 0.527 0.164 0.040 0.286 0.972
2 -0.768 0.464 0.201 0.075 0.199 1.081
Histological grade
1 -0.664 0.515 0.136 0.062 0.306 0.865
2 -0.810 0.445 0.127 0.005 0.253 0.779
1.hypertension 0.804 2.236 0.615 0.003 1.304 3.835
Adequacy of the fitted model that is the assumptions of proportional hazards and the goodness of
fit were assessed. We used the schoenfeld residuals to test the PH assumptions. The correlation
(ρ) between schoenfeld residuals and survival time for each covariate was presented in table 5
below.
Table-5: Test of proportional hazards assumption based on schoenfeld residuals BLH,
2014-2017
Variable Rho(ρ) Chi2 DF p-value
40
The results from table 5 indicate that all variables satisfied the PH assumption as the correlation
between schoenfeld residuals and survival time is not significant at 0.05 levels. The plot of
scaled schoenfeld residuals versus analysis time were also checks the PH assumption. The result
of plot of scaled schoenfeld residuals versus survival time in the annex 1 are more or less random
and LOESS smoothed curves have basically zero slope which is an indication of no evidence of
non proportionality.
The proportional hazard assumptions was also tested based on the (1og (-log (survival))) plot
versus log (survival time), which is called a log-cumulative hazard plot. The plot of (1og (-log
(survival))) plot versus log (survival time) were used to check the PH assumption for all the
categorical variables included in the fitted model (annex 1). The graphs for each of the
categorical variables display lines that appeared to be roughly parallel for place of residence and
presence of hypertension, thus proportional hazard assumption was met. There was an interaction
between FIGO stage, histological type and histological grade with time indicating possible
violation of PH assumption. However the overall Schoenfeld global test of the full model
satisfies the PH assumption (chi2 (6) = 3.73, Prob>chi2=0.713) as presented in table 4.5 above.
Plot of the cox-snell residuals was applied to test the overall fit of the model. In this method cox-
snell residuals were plotted against the cumulative hazard of cox-snell residuals as shown in
figure 1. The figure reveals that the overall fit of the Cox model is good. However there is little
evidence of a systematic deviation from the straight line at the right, thus the result of the graph
indicates the model fit the data well.
Figure-1: Cumulative hazard plot of the cox-snell residual for multivariate Cox PH model
BLH, 2014-2017
2
1.5
1
.5
0
0 .5 1 1.5 2
Cox-Snell residual
H Cox-Snell residual
41
4.3. Parametric proportional hazard model
Univariate PH model was applied in a similar manner as applied in a Cox PH model for
Exponential, Weibull and Log-normal PH models. The results of the univariate parametric PH
models are presented in table 6. In the three models variables significant at 5% level in the
univariate analysis were taken as candidate variables for their multivariable analysis.
Table-6: Results of univariate Weibull PH, Exponential and Log- normal PH models BLH,
2014-2017
Covariates Weibull PH Exponential PH Log-normal
HR SE P-value HR SE P-value HR SE P-value
Age 1.02 0.009 0.009 1.03 0.01 0.004 0.04 0.01 0.014
Residence 2.20 0.52 0.001 2.14 0.51 0.001 0.45 0.13 0.001
Religion 0.90 0.16 0.568 0.88 0.16 0.475 0.06 0.104 0.546
Parity 0.97 0.04 0.417 0.98 0.04 0.657 0.01 0.024 0.586
FIGO Stage 1.49 0.21 0.004 1.52 0.21 0.002 0.43 0.078 0.081
Histological type 0.61 0.12 0.011 0.59 0.01 0.008 0.28 0.100 0.003
Histological grade 0.73 0.09 0.020 0.68 0.09 0.006 0.17 0.079 0.031
OCP use 1.82 0.41 0.007 1.75 0.39 0.012 0.27 0.134 0.041
Diabetics 1.33 0.49 0.446 1.37 0.51 0.399 0.18 0.23 0.433
Hypertension 2.19 0.58 0.003 2.11 0.56 0.005 0.53 0.17 0.001
HIV status 0.74 0.09 0.024 0.73 0.18 0.019 0.18 0.08 0.016
From table 6 the Exponential PH, Weibull PH and log normal PH model picked up the same
variables namely age, residence, FIGO stage, histological type, histological grade and presence
of hypertension (HTN) as selected by Cox PH model. The hazard ratio and the corresponding
95% CI with standard error for the given models are given in table 7.
More or less the given models had the same HR with almost identical standard errors in
estimating the significant variables at 95% CI.
42
HR SE P-value HR SE P-value coef SE P-value
Age 1.03 0.01 0.003 1.03 0.01 0.004 -0.02 0.005 0.017
Residence 2.04 0.49 0.008 1.89 0.45 0.007 -0.39 0.13 0.003
FIGO stage 1.57 0.22 0.001 1.53 0.21 0.002 -0.22 0.08 0.006
Histological type 0.64 0.13 0.019 0.63 0.12 0.017 0.25 0.10 0.011
Histological grade 0.65 0.06 0.002 0.63 0.18 0.001 0.23 0.08 0.005
Presence of HTN 2.15 0.59 0.005 2.03 0.55 0.009 -0.49 0.17 0.003
The three parametric survival model Weibull, Exponential and Log-normal models HR, SE and
P-value are obtained in the above.
FIGO stage: The hazard ratio in Table 7 shows that among cervical cancer patients the
instantaneous risk of death for FIGO stage three is 1.57 times the instantaneous risk of death
for FIGO stage one, keeping all other covariates at some constant level. Similarly, among
cervical cancer patients, the instantaneous risk of death for FIGO stage four is 2.01 times the
instantaneous risk of death for FIGO stage of the reference group, keeping all other covariates at
some constant level.
Histological type: Among cervical cancer patients, the instantaneous risk of death for
adenocarcinoma (non keratinizing) status are 0.64 times the instantaneous risk of death for
squamous cell carcinoma, after keeping all other covariates at some constant level respectively.
Histological grade: Among cervical cancer patients, the instantaneous risk of death for
moderately differentiated grade 0.65 times the instantaneous risk of well differentiated grade,
after keeping all other covariates at some constant level.
43
Presence of HTN (hypertension), The instantaneous risk of death for patients with HTN have
2.15 greater risk of instantaneous death than those patients without HTN, after keeping all other
covariates at some constant level.
44
measure the goodness of statistical models’ fitness, and the smaller it is, the better it is. The
smaller the AIC is, the more efficacious of the model. The result of AIC is given in table 8
below.
Table-8, Comparison based on the Results of the Akaike Information Criterion (AIC) b/n
the Cox Proportional Hazard and Parametric proportional Models BLH, 2014-2017.
Models AIC
Cox 314.3
Exponential 193.9
Weibull 190.4
Log-normal 140.3
Most cancer researchers tend to use Cox semi-parametric model rather than parametric models.
Therefore, parametric models such as exponential, Weibull and log-normal can be better choices
in such situation.
A major objective of this paper is to compare Cox semi-parametric and parametric survival
models in modeling the survival time of cervical cancer patients. So, in this study the results of
Cox semi-parametric model and parametric models were compared in modeling the survival time
of cervical cancer patients to assess these models, Akaike information criterion (AIC) were used.
Although most researchers in medical and cancer fields have made use of Cox semi-parametric
models in the survival time of cervical cancer patients, results of parametric models have often
been more reliable and have had less bias. As parametric models do not need proportional
hazards assumption (PH) in similar situations and they consider a specific statistical distribution
for time to the occurrence of the outcome, they have a better fitness. A parametric model was
also being credible alternatives to Cox semi-parametric model where proportional hazard
assumption is not made. In addition, fully parametric models may offer some advantages.
4.6. Discussion
A major objective of this paper is to investigate the comparative performance of Cox semi-
parametric and parametric survival models in modeling the survival time of women’s with
45
cervical cancer. So, in this study the results of Cox semi-parametric model and parametric
models were compared in modeling the survival time of cervical cancer patients. To assess these
models, Akaike information criterion (AIC) revealed that parametric models had better fitness.
This finding is consistent with the findings obtained from most studies carried out on patients
with gastric cancer (Orbe et al., 2012; Nardi and Schemper, 2013; Dehkordi, 2017;
Pourhoseingholi et al., 2017).
Based on asymptotic results, parametric models lead to more efficient parameter estimates than
Cox model. When empirical information is sufficient, parametric models can provide some
insights into the shape of the baseline hazard.
Most cancer researchers tend to use Cox semi-parametric model rather than parametric models in
modeling the cancer patients. A systematic review on cancer journals indicates that only 5% of
studies in which Cox model has been used for modeling cervical cancer cases, investigated the
required assumptions for this model. The absence of proportional hazards assumption causes the
estimations to be unreliable and biased. Moreover, studies conducted in this scope demonstrate
that either proportional hazards assumption is made or not, parametric models are more efficient
(Orbe et al., 20; Patel et al., 2016).
Therefore, parametric models such as exponential, Weibull, log-normal can be better choices in
such situations. Considering a particular statistical distribution for time to the occurrence of next
state and requiring no assumption of proportional hazards (PH), these models provide fitness for
data. A major objective of this paper is to investigate the comparative performance of Cox semi-
parametric and parametric survival models in modeling cervical cancer patients.
In addition, fully parametric models may offer some advantages. Based on the results parametric
models lead to more efficient parameter estimates than Cox model. With a decrease in sample
sizes, relative efficiencies may further change in favor of parametric models. When empirical
information is sufficient, parametric models can provide some insights into the shape of the
46
baseline hazard. So, in this study the results of Cox semi-parametric model and parametric
models were compared in modeling cervical cancer cases. To assess these models, Akaike
information criterion (AIC) mode estimates were was used.
This finding is consistent with the findings obtained from most studies carried out on patients of
cervical cancer (Orbe et al., 2012; Nardi and Schemper, 2013; Dehkordi, 2016; Pourhoseingholi
et al., 2016).
47
CHAPTER FIVE
5.1. Conclusion
This was a three-year (2014-2017) retrospective cohort study based on 518 cervical cancer
patients in Black Lion Hospital Addis Ababa, Ethiopia. The purpose of the study was to identify
the best-fit parametric survival regression model from the pool of existing parametric survival
models (exponential, Weibull, Log-Logistic, and log normal), and to determine factors
associated with the survival time of cervical cancer patients in Black Lion Hospital Addis Ababa,
Ethiopia. It has been found that 16% of the considered patients were died and the remaining 84%
were censored at the end of the study.
Although most researchers in medical and cancer fields have made use of Cox semi-parametric
model into account, based on the findings of this research results of parametric models have
often been more reliable and have had less bias. As parametric models do not need proportional
hazards assumption (PH) in similar situations and they consider a specific statistical distribution
for time to the occurrence of next state, they have a better fitness.
Parametric models were also being credible alternatives to Cox semi-parametric model while
proportional hazard assumption is not made. In addition, fully parametric models may offer some
advantages. Based on the results parametric models lead to more efficient parameter estimates
than Cox model. With a decrease in sample sizes, relative efficiencies may further change in
favor of parametric models. When empirical information is sufficient, parametric models can
provide some insights into the shape of the baseline hazard.
48
5.2. RECOMMENDATIONS
Based on the finding of this study the following recommendations are given.
Survival time of cervical cancer through regular programs of women and prompt
comprehensive treatment should be taken up to improve the overall survival of the
patients.
Improved the survival of patients were an integral part of controlling cervical cancer.
This can be done by having health education on cervical cancer incorporated in the
teaching curriculums by the Ministry of education just like it has been done for HIV /
AIDS. Carrying out regular screening programs and community mobilization activities
among other channels can be used to create awareness.
Information on the availability of the HPV vaccine is some health facilities is unknown to
many Ethiopians. Therefore awareness on the survival of cervical cancer will reduce the
high mortality of patients with cancer of the cervix among women. Government
intervention to the increase the survival and to decrease the burden of the cervical cancer
treatment on the patients is very necessary.
Decentralizing cancer treatment by the national government is an important intervention.
This will reduce the number of patients who have to wait before they undergo especially
curative treatment for those diagnosed at advanced stages because as a result of long
queues, their conditions are worsened.
Further research on the survival time and the effect of treatment on the disease should be
carried out to give more insight into the survival time of the disease and disease
management.
49
6. REFERENCE
AALEN, O., BORGAN, O. & GJESSING, H. 2008. Survival and event history analysis: a
process point of view, Springer Science & Business Media.
ABATE, S. M. 2016. Trends of Cervical Cancer in Ethiopia. open access journal, 1, 2.
ACS 2011. American Cancer Society: Cancer Facts & Figures. (2011) Atlanta: American Cancer
Society journal of american cancer society (2011) atlanta, 12, 1.
ACS, A. C. S. 2016. Cancer Facts & Figures 2016. journal of american cancer society, 4.
AKAIKE, H. 1974. A new look at the statistical model identification. IEEE transactions on
automatic control, 19, 716-723.
ALEMAYEHU, Y. 2008. Cancer: a threat to Ethiopia than HIV/AIDS, TB and malaria. Walta
Information Center, Thursday, 23.
ANDERSEN, P. K. & GILL, R. D. 1982. Cox's regression model for counting processes: a large
sample study. The annals of statistics, 1100-1120.
ANORLU, R. I. 2008. Cervical cancer: the sub-Saharan African perspective. Reproductive
health matters, 16, 41-49.
BAILIE, R., SELVEY, C., BOURNE, D. & BRADSHAW, D. 1996. Trends in cervical cancer
mortality in South Africa. International journal of epidemiology, 25, 488-493.
BAYO, S., BOSCH, F. X., DE SANJOSÉ, S., MUÑOZ, N., COMBITA, A. L., COURSAGET,
P., DIAZ, M., DOLO, A., VAN DEN BRULE, A. J. & MEIJER, C. J. 2002. Risk factors
of invasive cervical cancer in Mali. International journal of epidemiology, 31, 202-209.
BIEWENGA, P., VAN DER VELDEN, J., MOL, B. W. J., STALPERS, L. J., SCHILTHUIS,
M. S., VAN DER STEEG, J. W., BURGER, M. P. & BUIST, M. R. 2011. Prognostic
model for survival in patients with early stage cervical cancer. Cancer, 117, 768-776.
BLAKELY, R. J. & COX, A. 1972. Evidence for short geomagnetic polarity intervals in the
early Cenozoic. Journal of Geophysical Research, 77, 7065-7072.
BRESLOW, N. 1974. Covariance analysis of censored survival data. Biometrics, 89-99.
BRUNO, R. M. 1994. Cervical cancer survival data analysis. thesis, 2, 46.
CARMO, C. C. D. & LUIZ, R. R. 2011. Survival of a cohort of women with cervical cancer
diagnosed in a Brazilian cancer center. Revista de Saúde Pública, 45, 661-667.
CHEN, R.-J., LIN, Y.-H., CHEN, C.-A., HUANG, S.-C., CHOW, S.-N. & HSIEH, C.-Y. 1999.
Influence of histologic type and age on survival rates for invasive cervical carcinoma in
Taiwan. Gynecologic oncology, 73, 184-190.
CHOKUNONGA, E., RAMANAKUMAR, A. V., NYAKABAU, A. M., BOROK, M. Z.,
CHIRENJE, Z. M., SANKILA, R. & PARKIN, D. M. 2004. Survival of cervix cancer
patients in Harare, Zimbabwe, 1995–1997. International journal of cancer, 109, 274-277.
COX, D. R. & SNELL, E. J. 1968. A general definition of residuals. Journal of the Royal
Statistical Society. Series B (Methodological), 248-275.
DE SANJOSÉ, S., SERRANO, B., CASTELLSAGUÉ, X., BROTONS, M., MUÑOZ, J.,
BRUNI, L. & BOSCH, F. 2012. Human papillomavirus (HPV) and related cancers in the
50
Global Alliance for Vaccines and Immunization (GAVI) countries. A WHO/ICO HPV
Information Centre Report. Vaccine, 30, D1-83.
DENNY, L. & ANORLU, R. 2012. Cervical cancer in Africa. Cancer Epidemiology and
Prevention Biomarkers, 21, 1434-1438.
DESALEGNE, A. 2011. Cost and Predictors of Cost of Cervical Cancer Treatment in Patients
Attending Tikur Anbessa Hospital; Addis Ababa, Ethiopia. Addis Ababa University.
DHIMAN, A., HIREMATH, S., PATHAK, M. & MISHRA, D. 2014. A review article on
anticancerous drugs in ayurveda and screened anticancer activity of medicinal plants.
INTERNATIONAL JOURNAL, 2.
FINOCCHARIO-KESSLER, S., WEXLER, C., MALOBA, M., MABACHI, N., NDIKUM-
MOFFOR, F. & BUKUSI, E. 2016. Cervical cancer prevention and treatment research in
Africa: a systematic review from a public health perspective. BMC women's health, 16,
29.
FORMAN, D., DE MARTEL, C., LACEY, C. J., SOERJOMATARAM, I., LORTET-
TIEULENT, J., BRUNI, L., VIGNAT, J., FERLAY, J., BRAY, F. & PLUMMER, M.
2012. Global burden of human papillomavirus and related diseases. Vaccine, 30, F12-
F23.
FOROUZANFAR, M. H., AFSHIN, A., ALEXANDER, L. T., ANDERSON, H. R., BHUTTA,
Z. A., BIRYUKOV, S., BRAUER, M., BURNETT, R., CERCY, K. & CHARLSON, F.
J. 2016. Global, regional, and national comparative risk assessment of 79 behavioural,
environmental and occupational, and metabolic risks or clusters of risks, 1990-2015.
Lancet.
GLOBOCAN, I. 2012. Cervical Cancer Incidence and Mortality Worldwide in 2008.
HEALTH, N. I. O. 2015. National Cancer Institute: PDQ Cervical Cancer Prevention. Bethesda,
MD: National Cancer Institute. Date last modified 12/17/2015. Accessed 05/20.
HOSMER, D. W. & LEMESHOW, S. 1989. Applied regression analysis. New York, John
Willey.
IBRAHIM, A., RASCH, V., PUKKALA, E. & ARO, A. R. 2011. Cervical cancer risk factors
and feasibility of visual inspection with acetic acid screening in Sudan. Int J Womens
Health, 3, 117-122.
ICO 2016. Human Papillomavirus and Related Diseases Report in ETHIOPIA:
www.hpvcentre.netJournal of ICO HPV Information Centre, 1, 7.
INTARAPHET, S., KASATPIBAL, N., SIRIAUNKGUL, S., SOGAARD, M.,
PATUMANOND, J., KHUNAMORNPONG, S., CHANDACHAM, A. & SUPRASERT,
P. 2013. Prognostic impact of histology in patients with cervical squamous cell
carcinoma, adenocarcinoma and small cell neuroendocrine carcinoma. Asian Pacific
Journal of Cancer Prevention, 14, 5355-5360.
JABEEN, S., ISLAM, M. J., TALUKDER, M. H., NURUNNABI, A. S. M. & HAQUE, M.
2015. Pathoepidemiology of Cervical Cancer in National Institute of Cancer Research
and Hospital of Bangladesh. Journal of Dhaka Medical College, 23, 203-210.
51
JUHAN, N., RAZAK, N. A., ZUBAIRI, Y. Z., KHATTAK, M. N. & NAING, N. N. 2013.
Survey of Patients with Cervical Cancer in Hospital UniversitiSains Malaysia: Survival
Data Analysis with Time-Dependent Covariate. Iranian journal of public health, 42, 980.
KAKU, M., MATHEW, A. & RAJAN, B. 2008. Impact of socio-economic factors in delayed
reporting and late-stage presentation among patients with cervix cancer in a major cancer
hospital in South India. Asian Pac J Cancer Prev, 9, 589-594.
KAPLAN, E. L. & MEIER, P. 1958. Nonparametric estimation from incomplete observations.
Journal of the American statistical association, 53, 457-481.
KAVERAPPA, V. B., BORALINGAIAH, P., KULKARNI, P. & MANJUNATH, R. 2015a.
Determinants of Survival among Patients with Cervical Cancer: A Hospital Based Study.
infection, 2, 5.
KAVERAPPA, V. B., PRAKASH, B., KULKARNI, P. & RENUKA, M. 2015b.
Sociodemographic profile of patients with cervical cancer in a tertiary-care cancer
hospital in Mysuru, Karnataka. International Journal of Medical Science and Public
Health, 4, 1187-1190.
KLEINBAUM, D. G. & KLEIN, M. 2012. Parametric survival models. Survival analysis.
Springer.
KUMARI, K. G., SUDHAKAR, G., RAMESH, M., KALPANA, V. & PADDAIAH, G. 2010.
Prognostic factors in cervical cancer: a hospital-based retrospective study from
Visakhapatnam City, Andhra Pradesh. J Life Sci, 2, 99-105.
LEE, E. T. & WANG, J. 2003. Statistical methods for survival data analysis, John Wiley &
Sons.
LOUIE, K., DE SANJOSE, S., DIAZ, M., CASTELLSAGUE, X., HERRERO, R., MEIJER, C.,
SHAH, K., FRANCESCHI, S., MUNOZ, N. & BOSCH, F. 2009. Early age at first sexual
intercourse and early pregnancy are risk factors for cervical cancer in developing
countries. British journal of cancer, 100, 1191-1197.
MABUCHI, S., ISOHASHI, F., YOSHIOKA, Y., TEMMA, K., TAKEDA, T., YAMAMOTO,
T., ENOMOTO, T., MORISHIGE, K., INOUE, T. & KIMURA, T. 2010. Prognostic
factors for survival in patients with recurrent cervical cancer previously treated with
radiotherapy. International Journal of Gynecological Cancer, 20, 834-840.
MASCARELLO, K. C., ZANDONADE, E. & AMORIM, M. H. C. 2013. Survival analysis of
women with cervical cancer treated at a referral hospital for oncology in Espírito Santo
State, Brazil, 2000-2005. Cadernos de Saúde Pública, 29, 823-831.
MCGUIRE, S. 2015. World Cancer Report 2014. Geneva, Switzerland: World Health
Organization, International Agency for Research on Cancer. WHO Press.
MITIKU, I. & TEFERA, F. 2016. Knowledge about Cervical Cancer and Associated Factors
among 15-49 Year Old Women in Dessie Town, Northeast Ethiopia. PloS one, 11,
e0163136.
MOGHIMI-DEHKORDI, B., SAFAEE, A., POURHOSEINGHOLI, M. A., FATEMI, R.,
TABEIE, Z. & ZALI, M. R. 2008. Statistical comparison of survival models for analysis
of cancer data. Asian Pac J Cancer Prev, 9, 417-420.
52
MUHAMAD, N. A., KAMALUDDIN, M. A., ADON, M. Y., NOH, M. A., BAKHTIAR, M. F.,
IBRAHIM TAMIM, N., MAHMUD, S. H. & ARIS, T. 2015. Survival rates of cervical
cancer patients in Malaysia. Asian Pacific journal of cancer prevention: APJCP, 16,
3067-72.
MUTAI, C., MUGO, C. W. & KHAEMBA, E. N. 2013. The Survival Of Patients With Cancer
Of The Cervix In Nairobi, Kenya.
NARTTHANARUNG, A., THANAPPRAPASR, K., UDOMSUBPAYAKUL, U. &
THANAPPRAPASR, D. 2014. Age and survival of cervical cancer patients with bone
metastasis. Asian Pac J Cancer Prev, 15, 8401-4.
NELSON, S., KIM, J., WILSON, F. A., SOLIMAN, A. S., NGOMA, T., KAHESA, C. &
MWAISELAGE, J. 2016. Cost-Effectiveness of Screening and Treatment for Cervical
Cancer in Tanzania: Implications for other Sub-Saharan African Countries. Value in
health regional issues, 10, 1-6.
NESRIN, R. & KILIC, D. 2011. Risk Factors for Cervical Cancer: Results from a Hospital-
Based Case-Control Study. International Journal of Hematology and Oncology, 26, 153-
159.
NURANNA, L., PRASTASARI, R. & SUTRISNA, B. 2014. Survival of cervical cancer patients
and its prognostic factors at Cipto Mangunkusumo Hospital, Jakarta. Medical Journal of
Indonesia, 23, 163.
ORGANIZATION, W. H. 2015. Cancer Fact sheet N 297. Updated February 2015.
PAPILLOMAVIRUS, H. 2014. Related Cancers, Fact Sheet 2016. ICO Information Centre on
HPV and Cancer (October 7, 2016).
PECORELLI, S., ZIGLIANI, L. & ODICINO, F. 2009. Revised FIGO staging for carcinoma of
the cervix. International Journal of Gynecology & Obstetrics, 105, 107-108.
PFAENDLER, K. S. & TEWARI, K. S. 2016. Changing paradigms in the systemic treatment of
advanced cervical cancer. American journal of obstetrics and gynecology, 214, 22-30.
PLUMMER, M., DE MARTEL, C., VIGNAT, J., FERLAY, J., BRAY, F. & FRANCESCHI, S.
2016. Global burden of cancers attributable to infections in 2012: a synthetic analysis.
The Lancet Global Health, 4, e609-e616.
POMROS, P., SRIAMPORN, S., TANGVORAPHONKCHAI, V., KAMSA-ARD, S. &
POOMPHAKWAEN, K. 2007. Factors affecting survival of cervical cancer patients
treated at the radiation unit of Srinagarind Hospital, Khon Kaen University, Thailand.
Asian Pacific Journal of Cancer Prevention, 8, 297.
POURHOSEINGHOLI, M. A., HAJIZADEH, E., MOGHIMI DEHKORDI, B., SAFAEE, A.,
ABADI, A. & ZALI, M. R. 2007. Comparing Cox regression and parametric models for
survival of patients with gastric carcinoma. Asian Pacific Journal of Cancer Prevention,
8, 412.
ROBINS, J., BRESLOW, N. & GREENLAND, S. 1986. Estimators of the Mantel-Haenszel
variance consistent in both sparse data and large-strata limiting models. Biometrics, 311-
323.
53
RUPPERT, D., WAND, M. P. & CARROLL, R. J. 2003. Semiparametric regression, Cambridge
university press.
SALEM, M. A. 2015. Treatment Outcomes and Prognostic Factors of Cervical Cancer at South
Egypt Cancer Institute. Journal of Cancer Prevention & Current Research 2, 2.
SANKARANARAYANAN, R., NAIR, M. K., JAYAPRAKASH, P., STANLEY, G.,
VARGHESE, C., RAMADAS, V., PADMAKUMARY, G. & PADMANABHAN, T.
1995. Cervical cancer in Kerala: a hospital registry-based study on survival and
prognostic factors. British journal of cancer, 72, 1039.
SCHOENFELD, D. 1982. Partial residuals for the proportional hazards regression model.
Biometrika, 239-241.
SHOWALTER, T. N., CAMACHO, F., CANTRELL, L. A. & ANDERSON, R. T. 2016.
Determinants of quality care and mortality for patients with locally advanced cervical
cancer in Virginia. Medicine, 95.
TADESSE, S. K. 2016. Preventive Mechanisms and Treatment of Cervical Cancer in Ethiopia.
Cervical Cancer, an open access journal, 1, 1.
TARNEY, C. M. & HAN, J. 2014. Postcoital bleeding: a review on etiology, diagnosis, and
management. Obstetrics and gynecology international, 2014.
TEAME, H. 2016. Factors Associated With Cervical Precancerous Lesion among Women
Screened for Cervical Cancer in Addis Ababa, Ethiopia 2016. AAU, 2016.
TEWARI, K. S., SILL, M. W., LONG III, H. J., PENSON, R. T., HUANG, H.,
RAMONDETTA, L. M., LANDRUM, L. M., OAKNIN, A., REID, T. J. & LEITAO, M.
M. 2014. Improved survival with bevacizumab in advanced cervical cancer. New
England Journal of Medicine, 370, 734-743.
THERNEAU, T. M., GRAMBSCH, P. M. & FLEMING, T. R. 1990. Martingale-based residuals
for survival models. Biometrika, 147-160.
VISHMA, B., PRAKASH, B. & PRAVEEN KULKARNI, R. M. 2016. Survival and prognostic
factors for cervical cancer: a hospital based study in Mysuru, India. International
Journal, 3, 218.
WHO 2011. WHO/ICO information centre on Human papilloma virus (HPV) and cervical
cancer. Available from: www. who. int/ hpvcentre.
WHO 2012. Cancers of the Cervix. World Health Organization, Geneva. http://www. who.
int/reproductivehealth/topics/cancers/en/index. html (accessed August 4, 2014).
54
ANNEX 1: plot of log (-log (survival) versus log survival time for categorical variables in
the fitted model
6
6
4
4
-ln[-ln(Survival Probability)]
-ln[-ln(Survival Probability)]
2
2
0
0
1 1.5 2 2.5 3 3.5
-2
0 10 20 30 40
analysis time
55
Kaplan-Meier survival estimates
1.00
1.00
0.75
0.75
0.50
0.50
0.25
0.25
0.00
0 10 20 30 40
0.00
analysis time
0 10 20 30 40
grade = well differentiated grade = moderately differentiated analysis time
grade = poorly differentiated OCP = no OCP = yes
0 10 20 30 40
analysis time
56
57
58