
Data Analysis - Statistics Project Example


Summary

This paper analyzes Gross Domestic Product (GDP) data. According to the text, different sources of data exist on subjects, and even though data creates awareness and informs decisions, only reliable data should be used. …




Subject: Statistics
Type: Statistics Project
Level: Undergraduate
Pages: 6 (1500 words)
Downloads: 3
Author: delores58

Extract of sample "Data Analysis"

Data analysis (June 17)

Different sources of data exist on subjects, and even though data creates awareness and informs decisions, only reliable data should be used. This creates a need to verify data sources, and checking the consistency of data features between sources is one approach to establishing their reliability. Data on Gross Domestic Product (GDP) per capita for European Union countries is investigated to establish reliability between the World Bank and the Central Intelligence Agency (US) as information sources. The average GDP per capita (2013) of the European Union is computed from the World Bank data and compared with the data for individual countries reported by the Central Intelligence Agency (based on purchasing power parity).

Data from the World Bank shows that the European Union had a GDP of $17.96 trillion and a population of 507.0 million in 2013 (The World Bank, 2015). GDP per capita for the region can therefore be computed as follows:

GDP per capita = $17.96 trillion / 507.0 million = $35424.06

Hypothesis

H0: µCIA = 35424.06; there is no significant difference between the two sources, and the sources are reliable.
HA: µCIA ≠ 35424.06; a significant difference exists, and one of the sources is not reliable.
Here µ represents the mean.

The data are on a ratio scale and represent the average output of a country per person per year. The data were collected through observation from existing sources, the World Bank and the Central Intelligence Agency (US), with European Union countries as the population. The data refer to the year 2013 and may not represent the countries' and the region's current GDP per capita.
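The regional GDP per capita that serves as the test value in the hypotheses can be reproduced with a few lines of Python. This is only an illustrative sketch: the variable names are assumptions, and the inputs are the rounded World Bank figures quoted in the text.

```python
# World Bank figures for the European Union in 2013, as quoted in the text.
eu_gdp_usd = 17.96e12      # $17.96 trillion (rounded figure from the text)
eu_population = 507.0e6    # 507.0 million people (rounded figure from the text)

# GDP per capita is total output divided by population.
gdp_per_capita = eu_gdp_usd / eu_population

print(f"GDP per capita: ${gdp_per_capita:.2f}")  # prints "GDP per capita: $35424.06"
```

Because both inputs are rounded, the result is best read as an approximate regional benchmark rather than an exact figure.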
The following table shows the data set for 25 European Union countries.

Table 1: Sample countries and their GDPs (GDP per capita in US$, purchasing power parity)

Country      GDP per capita   Country          GDP per capita   Country           GDP per capita
Austria      45100            Czech Republic   27800            Germany           44200
Belgium      41400            Denmark          43800            Greece            25500
Bulgaria     16800            Estonia          26500            Hungary           23600
Croatia      20600            Finland          40700            Ireland           45400
Cyprus       29200            France           40500            Italy             34700
Latvia       23200            Netherlands      47200            Slovak Republic   27100
Lithuania    25800            Poland           23700            Spain             32500
Luxembourg   91900            Portugal         26100            Malta             31100
Romania      17700

(Central Intelligence Agency, n.d.)

Statistics

The following table shows the statistics, retrieved from Appendixes A, B, and C, which are Excel outputs.

Table 2: Statistics

Statistic       Value          Statistic            Value
Sample size     25             Mode                 16800 (multiple modes exist; the smallest is shown)
Minimum value   16800          Mid range            54350 {(minimum + maximum)/2}
Maximum value   91900          Range                75100
∑x²             34614970000    Standard deviation   15237.008
Mean            34084          Variance             232166400
Median          29200          Q1 and Q3            24600 and 42600 respectively

Graphs

The following graphs describe the data.

Graph 1: Frequency table

GDPpercapita   Frequency   Percent   Valid Percent   Cumulative Percent
16800.00       1           4.0       4.0             4.0
17700.00       1           4.0       4.0             8.0
20600.00       1           4.0       4.0             12.0
23200.00       1           4.0       4.0             16.0
23600.00       1           4.0       4.0             20.0
23700.00       1           4.0       4.0             24.0
25500.00       1           4.0       4.0             28.0
25800.00       1           4.0       4.0             32.0
26100.00       1           4.0       4.0             36.0
26500.00       1           4.0       4.0             40.0
27100.00       1           4.0       4.0             44.0
27800.00       1           4.0       4.0             48.0
29200.00       1           4.0       4.0             52.0
31100.00       1           4.0       4.0             56.0
32500.00       1           4.0       4.0             60.0
34700.00       1           4.0       4.0             64.0
40500.00       1           4.0       4.0             68.0
40700.00       1           4.0       4.0             72.0
41400.00       1           4.0       4.0             76.0
43800.00       1           4.0       4.0             80.0
44200.00       1           4.0       4.0             84.0
45100.00       1           4.0       4.0             88.0
45400.00       1           4.0       4.0             92.0
47200.00       1           4.0       4.0             96.0
91900.00       1           4.0       4.0             100.0
Total          25          100.0     100.0

Graph 2: Histogram

Graph 3: Stem and leaf plot

GDPpercapita Stem-and-Leaf Plot

Frequency   Stem & Leaf
2.00        1 . 67
4.00        2 . 0333
7.00        2 . 5566779
3.00        3 . 124
 .00        3 .
5.00        4 . 00134
3.00        4 . 557
1.00        Extremes (>=91900)

Stem width: 10000.00
Each leaf: 1 case(s)

Graph 4: Box plot

Hypothesis results

The following table shows the results of the hypothesis test.

One-Sample Test (Test Value = 35424.06), GDPpercapita

t                                                  -.440
df                                                 24
Sig. (2-tailed)                                    .664
Mean Difference                                    -1340.06000
95% Confidence Interval of the Difference, Lower   -7629.5876
95% Confidence Interval of the Difference, Upper   4949.4676

The test is not significant (t = -0.44, p = 0.664 > 0.05). The null hypothesis is therefore not rejected: the sample mean does not differ significantly from the average GDP per capita for the region. The two data sources are therefore consistent, which supports their reliability.

Conclusion

The distribution of the countries' GDP per capita is positively skewed and has an outlier in the GDP per capita of Luxembourg, which was $91900. The countries' GDP per capita also differed widely, as the high values of the range ($75100) and the standard deviation (15237) show. A high concentration of countries occurred in the second and fourth stems. The mean of the countries' GDP per capita based on CIA data (34084) is statistically the same as the regional value that the World Bank reported for the same year.

References

Central Intelligence Agency. (n.d.). The World Factbook. Central Intelligence Agency. Retrieved from: https://www.cia.gov/library/publications/the-world-factbook/.
The World Bank. (2015). European Union. The World Bank. Retrieved from: http://data.worldbank.org/region/EUU.

Appendix A: Summary

Case Processing Summary

               Valid N   Valid Percent   Missing N   Missing Percent   Total N   Total Percent
GDPpercapita   25        100.0%          0           0.0%              25        100.0%

Appendix B: Descriptive statistics

Descriptives (GDPpercapita)

Statistic                                       Value            Std. Error
Mean                                            34084.0000       3047.40152
95% Confidence Interval for Mean, Lower Bound   27794.4724
95% Confidence Interval for Mean, Upper Bound   40373.5276
5% Trimmed Mean                                 32318.8889
Median                                          29200.0000
Variance                                        232166400.000
Std. Deviation                                  15237.00758
Minimum                                         16800.00
Maximum                                         91900.00
Range                                           75100.00
Interquartile Range                             18000.00
Skewness                                        2.340            .464
Kurtosis                                        8.019            .902

Appendix C: Percentiles

Percentiles (GDPpercapita)

Percentile   Weighted Average (Definition 1)   Tukey's Hinges
5            17070.0000
10           19440.0000
25           24600.0000                        25500.0000
50           29200.0000                        29200.0000
75           42600.0000                        41400.0000
90           46120.0000
95           78490.0000

Appendix D: Mode

Statistics (GDPpercapita)

N Valid     25
N Missing   0
Mode        16800.00 (a)

a. Multiple modes exist. The smallest value is shown.

Appendix D: X2

Statistics (sqrGDPpercapita)

N Valid     25
N Missing   0
Sum         34614970000.00
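The descriptive statistics and the one-sample t test reported in this extract can be re-derived from the Table 1 values. The sketch below uses only Python's standard library and is not the software that produced the original output. The variable names are illustrative. The critical value 2.0639 (two-tailed, 5% level, 24 degrees of freedom) is taken from a standard t table and is an assumption, not a figure from the source. The "exclusive" quantile method is chosen because it reproduces the weighted-average percentiles in Appendix C.

```python
import math
import statistics

# CIA World Factbook GDP per capita (PPP) for the 25 countries in Table 1.
gdp = [45100, 27800, 44200, 41400, 43800, 25500, 16800, 26500, 23600,
       20600, 40700, 45400, 29200, 40500, 34700, 23200, 47200, 27100,
       25800, 23700, 32500, 91900, 26100, 31100, 17700]

# Descriptive statistics.
n = len(gdp)
mean = statistics.mean(gdp)
sd = statistics.stdev(gdp)            # sample standard deviation (n - 1 divisor)
median = statistics.median(gdp)
q1, _, q3 = statistics.quantiles(gdp, n=4, method="exclusive")
value_range = max(gdp) - min(gdp)
mid_range = (min(gdp) + max(gdp)) / 2

# One-sample t test against the World Bank regional value.
mu0 = 35424.06                        # test value from the hypotheses
se = sd / math.sqrt(n)                # standard error of the mean
t_stat = (mean - mu0) / se

# Assumed critical value from a t table: two-tailed, alpha = 0.05, df = 24.
T_CRIT = 2.0639
reject_h0 = abs(t_stat) > T_CRIT

# Approximate 95% confidence interval for the mean difference.
ci_low = (mean - mu0) - T_CRIT * se
ci_high = (mean - mu0) + T_CRIT * se

print(f"n={n}, mean={mean}, sd={sd:.3f}, median={median}")
print(f"Q1={q1}, Q3={q3}, range={value_range}, mid range={mid_range}")
print(f"t={t_stat:.2f}, reject H0: {reject_h0}")
print(f"95% CI of difference: ({ci_low:.2f}, {ci_high:.2f})")
```

Comparing |t| with a tabulated critical value leads to the same decision as the reported p-value (0.664 > 0.05) without needing a t-distribution routine. An exact p-value would require a statistics package.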
