StudentShare
Contact Us
Sign In / Sign Up for FREE
Search
Go to advanced search...
Free

Regression analysis - Statistics Project Example

Cite this document
Summary
Where p is the probability that someone will be tested with prostate cancer, k is the coefficient of regression, and v is the most significant variable in determining the probability p. Outliers in this dataset include cancer, inv and cap since they have fixed and immeasurable…
Download full paper File format: .doc, available for editing
GRAB THE BEST PAPER97% of users find it useful
Regression analysis project
Read Text Preview

Extract of sample "Regression analysis"

Statistics Project: Statistics Regression Analysis Project PROBLEM 2 Model Specification The model to be used in this test is Logistic regression with the following model:P = kvWhere p is the probability that someone will be tested with prostate cancer, k is the coefficient of regression, and v is the most significant variable in determining the probability p. Outliers in this dataset include cancer, inv and cap since they have fixed and immeasurable quantities compared to the rest of the variables.

Transformation and important VariablesThe necessary transformation in this regression process is the binomial transformation using “mylogit” function. The most important variable in this case is c.vol, because the estimated volume of prostate cancer is the greatest determinant of whether a patient will test negative or positive (Howell, 2010). The best model, therefore, uses c to represent c.vol, in the regression model, which now becomes: P = kcThe test is accurate since it was done with a confidence interval of 97.5%.OutputThe code for this regression is found in the Appendix A section, while the output is found in Appendix B section.

The coefficient of logistic regression in this analysis for the c.vol is 0.80404, a highly positive coefficient, giving a probability (pr = 0.01539) that the patient will test positive. The second variable that indicates a positive test is psa, with a coefficient of 0.00226 a low positive coefficient, with a probability of 0.03847. The rest of the variables have negative association with the test for the hypothesis (Long, 1997).PredictionThis test estimates the cancer diagnosis for someone with 10 psa, 5 c.

vol, 40g for weight, age 67, with 2.5 benign, with no seminal vesicle invasion, and with 0.5 cm cap. The test is done through the same model and the results are found in Appendix C section. The coefficient of association is 1.46414, with a probability of 0.160296, hence the patient tests positive with prostate cancer.PROBLEM 3Model SpecificationThe model to be used in this test is Logistic regression with the following model:P = CvWhere p is the probability that gender or treatment has a significant effect on blood calcium level, C is the coefficient of regression, and v is the most significant variable in determining the probability p.

there is no outlier in this dataset.Transformation and important VariablesThe necessary transformation in this regression process is the binomial transformation using “mylogit” function. The two variables gender and treatment are both significant in determining the results of the hypothesis test (Hosmer & Lemeshow, 2000). The best model, therefore, combines the two variables in the regression as follows:P = C * g * T where C is the coefficient of regression, g is the gender and T is the Test.

The test is accurate since it was done with a confidence interval of 97.5%.Output and InterpretationThe code for this regression is found in the Appendix D section, while the output is found in Appendix E section. The coefficient of logistic regression in this analysis is 0.0206, a low positive coefficient, giving a probability (pr = 0.0247) that the Treatment significantly affects the blood calcium level. The second variable that indicates effect is the gender G, with a coefficient of 0.7004 a highly positive coefficient, with a probability of 0.

0139 that it will affect the blood calcium level.PredictionThe prediction of blood calcium level in this test is done with Gender as Male and treatment as T2. The results are shown in appendix F. The treatment T2 gives a coefficient of 0.00348 and a probability of 0.03744. Gender “Male” gives a coefficient of 0.0156 and probability of 0.414. The two variables both give positive coefficient and significant probabilities; hence the blood calcium level is calculated as follows:C = p / (g * T) C = 0.

22572 / (1 * 0.00954) = 23.6603ReferencesHosmer, D. & Lemeshow, S. (2000). Applied Logistic Regression (Second Edition). New York: John Wiley & Sons, Inc.Long, J. S. (1997). Regression Models for Categorical and Limited Dependent Variables. Thousand Oaks, CA: Sage PublicationsHowell, D. C. (2010). Statistical Methods for Psychology, 7th ed. Belmont, CA; Thomson WadsworthAPPENDIXAppendix Adf

Read More
Cite this document
  • APA
  • MLA
  • CHICAGO
(“Regression analysis project Statistics Example | Topics and Well Written Essays - 500 words”, n.d.)
Retrieved from https://studentshare.org/statistics/1697627-regression-analysis-project
(Regression Analysis Project Statistics Example | Topics and Well Written Essays - 500 Words)
https://studentshare.org/statistics/1697627-regression-analysis-project.
“Regression Analysis Project Statistics Example | Topics and Well Written Essays - 500 Words”, n.d. https://studentshare.org/statistics/1697627-regression-analysis-project.
  • Cited: 0 times

CHECK THESE SAMPLES OF Regression analysis project

ARCH modeling: forecasting the return in the UK stock market

This study “ARCH modeling: forecasting the return in the UK stock market” would help to develop an investing strategy based on the results that would bring about significant positive returns.... Empirical researches have proved the predictive ability of ARCH models.... hellip; The author of the paper states that for the purposes of economic statistics and econometric calculations a wide range of tools and theories are available....
55 Pages (13750 words) Dissertation

Tax Income in the United State and the United Kingdom

23 Pages (5750 words) Dissertation

Multiple Regressions Empirical Project

To make this a success, secondary data downloadable from http://www.... swego.... du/~kane/econometrics was used.... From the results it was found out that a billion… Further, a billion dollar increase in direct foreign investment holding consumption and net exports constant leads to 0.... 8 billion dollar decrease in A billion dollar increase in net exports holding consumption and direct foreign investments constant leads to 0....
6 Pages (1500 words) Essay

The Effects of The Financial Crisis of 2008 on the UK Economic Growth

The present project seeks to bring forth the impacts of the financial crisis on the economic growth of United Kingdom.... The project seeks to bring forth the evaluation of the financial crisis of 2008 on the economic growth of United Kingdom.... hellip; This research paper includes analysis of previous research findings of authors and practitioners, government survey results on different parameters such as wage rates, interest rates, unemployment rates and growth trends etc....
6 Pages (1500 words) Dissertation

The Great Depression vs The Great Recession

Finally this paper shall provide a discussion of the major points raised by analysis of both phenomena.... The great depression that began 1929 and led to the stock market crash and, on the other hand, the great recession that lead to the global crisis of that began in 2007 were the… The breadth and depth of the both crises and in particular the suffering that resulted from the great depression is legendary....
16 Pages (4000 words) Essay

Mutual Funds Pre and Post Global Financial Crisis

Mutual funds are investment vehicles made up of pools of funds collected from investors for the purpose of making investments in securities such as equity, fixed income, commodities, derivatives and other similar investment instruments.... These funds are managed by a professional… anager who makes investments using the fund's capital in an attempt to produce capital gains and income, and in a manner consistent to the investment objectives stated within the fund's prospectus. ...
26 Pages (6500 words) Essay

Analysis of the Determinants of Electricity Demand in Libya

The paper "analysis of the Determinants of Electricity Demand in Libya" examines the factors affecting the demand for electricity in Libya during the period (1980-2010) focusing on the use of ordinary least square techniques in constructing the empirical model of the demand function for electricity....
11 Pages (2750 words) Research Proposal

Multiple Regression Analysis: Empirical Project

This study "Multiple regression analysis: Empirical Project" is meant to determine the impact of increasing consumption expenditures, investments, disposable income and exports on the GDP growth.... urther, simple regression models of the nature GDP = β0 + β1ya (where you are the IV) were estimated....
8 Pages (2000 words) Essay
sponsored ads
We use cookies to create the best experience for you. Keep on browsing if you are OK with that, or find out how to manage cookies.
Contact Us