Our website is a unique platform where students can share their papers in a matter of giving an example of the work to be done. If you find papers
matching your topic, you may use them only as an example of work. This is 100% legal. You may not submit downloaded papers as your own, that is cheating. Also you
should remember, that this work was alredy submitted once by a student who originally wrote it.
Input Data Analysis
[Student Name]
[Course Title]
[Instructor Name]
[Date]
Table of Contents
Introduction 4
Matlab implementation 4
Conclusion 10
Works cited 11
Appendix 12
List of figures
Figure 1: q-q plot 5
Figure 2: normal distribution curve 6
Figure 3: scatter diagram 7
Figure 4: comparison of theoretical and function 9
Introduction
Matlab can be used to perform many functions in mathematics and engineering. It can be used to calculate the mean, standard deviation as well as carry out tests of normality. This paper is going to carry out a number of functions that average of data, standard deviation and chi-square test. Various plots will be made while carrying out those functions.
Matlab implementation
In order to open the file saved as data.csv matlab function the csvread (`filename`) is opened and saved in the file called Implementation. The data file was in the desktop so it was opened as
m=csvread('D:\Documents and Settings\Administrator\desktop\Data.csv');
When the data was saved it was used to calculate the mean and standard deviation using the following functions
k1=sum(m)/50
s1 = 9.8365
This was verified using the inbuilt functions
s=std(m);
s = 0.8143
>> k=mean(m)
k = 9.8365
From the results obtained using the own custom functions, it could be suggested that the standard deviations and mean are close to the mean value and standard deviation obtained functions built into MATLAB. The reason for using functions built into MATLAB is that it improves and verify the findings of the statistical values. The standard deviations are 0.8143 on either side of the mean and mean is 9.8365. Furthermore, by observing the values of the variable provided it could be concluded that the data is normally distributed as majority of the values lie within two standard deviations of the mean value.
This is further tested by plotting q-q plot which a diagram, data quartiles and standard normal quartiles are two variables to be displayed in the form of a collection of numerous points, where each point is comprised of the values of two variables. A q-q plot actually suggests the type of correlation between two variables. This is done using using the following function qqplot(m). the output is as follows;
Figure 1;
qqplot(m)
Figure 1: q-q plot
From the plot above it can be concluded that the data is normally distributed that is normal probability distribution. The results show that the histograms of data are bell-shaped normal distributions whereas that of data are not symetric as shown below;
Figure 2: normal distribution curve
Therefore, this probability distribution will be used to approximately the same number of elements of another data. This is done using the following Matlab command R=k+s.*randn(50,1) data generated is as follows
10.7143, 10.0874, 10.0805, 9.6759, 9.7172, 9.7525, 7.5572, 10.1567, 10.6428, 8.7798, 8.5971, 10.3422, 8.6089, 8.4689, 10.4789, 9.3037, 10.8502, 8.7842, 9.3362, 10.0333, 10.2838, 10.2173, 9.9924, 9.6493, 9.3648, 10.2277, 9.5215, 10.1798, 10.7222, 8.0048 , 11.3059, 9.3218, 10.9085, 11.1000, 8.6403, 9.9806, 12.6591, 9.6617, 10.2325, 10.1059, 10.8689, 10.7245, 9.0658, 9.4794, 10.1160, 9.7889, 11.9007, 10.1937, 10.1927, 9.1543
Bearing in mind the two sets of data that is data provided and data generated we can can carry out complete a regression analysis as well as determine the equation of the line using own functions;
format long
b1=x/y; this will provide x coefficient that is required in the equation.
ycal=b1*x;
figure(2)
scatter(x,y)
hold on
plot(x,ycal)
xlabel('data');
ylabel('generated data')
y= -0.15x + 11.42
Comparing this value (1) with the value provide generated by inbuilt matlab function, it can be observed that there is a difference between both values. From the matlab function, the value of y=0.843x+9.8365 less than the value obtained using the regression equation created. This difference can be explained on the basis of the ability of the regression to predict the variations assessed by the value of coefficient of determination to predict the total variations in the values of variables. The plot is shown below;
Figure 3: scatter diagram
In sample testing has suggested that the regressions pertaining to data variability results in attractive outcomes in terms of the downbeat relationships amongst mean and unexpected standard deviation. The predictions relating to presentation of such provisional and changing variation models have been related to the more recent asymmetrical models The findings reveal that the regression model that is characteristic of fat tailed density tends to improve over all estimation in the measurement of conditional variation.
Additional objectives
Chi-Square function in MATLAB
The function in Matlab is as follows;
In built function which is h=chi2gof(x) produces the same results as the function created. The answer is 0. This means the value calculated is the sa as the value from the other function at 95% level of confidence. This is clear that the two functions arrive at the same answer.
Figure 4: comparison of theoretical and function
Chi-Square analysis of the data
This will look at the best fit line, comparison of our best fit line and theoretical model and the goodness of that fit, and
In our case we will use the chi-square goodness-of-fit test to test data whether it is it is normally distributed. Therefore the data null hypotheses are
H0: the variable is normally distributed.
H1: the variable is not normally distributed.
Then at this stage we set . This is significant level which we will use to test the data.
Reject H0 if Fcalc < F crit
Do not reject H0 if Fcalc ≥ F crit
In analysis data we first determine Degree of Freedom (df) = n-1, where n= the number of classifications of data, therefore, n=1. The Chi Square demonstrates the expected for the data is .0.
Critical value at 0.05 or 95% level of significance
Reject H0 if Fcalc < F crit (0)
Do not reject H0 if Fcalc ≥ F crit (0)
Comparing
Since F calc =F crit (0 =0), we accept H0.
Then we calculate the value. The calculated value is 0 while from the table the value is 0.
The C.V. with .f. = 0 and is 0, so the null hypothesis is not rejected. There is sufficient evidence from the sample, which suggest that the data is normally distributed. It means that we accept null hypothesis and reject alternative hypothesis at the 5% level of significance.
Conclusion
It is felt that the test used in this paper proves to be an efficient determinant of for the normality of the data. Firstly, it is well recognized that the estimate pertaining to the standard normal distribution numbers is more specific with the enhancement of sampling frequencies. The findings of the data reveal mean is 9.8365 and standard deviation is 0.365. The study also made use of regression to determine skewness, and variance data. This reflects that for all these variables the normality has been violated
Works cited
Mathworks. Linear Regression. 30 Jan. 2010. web 12April 2016 (http://www.mathworks.com/help/matlab/data_analysis/linear-regression.html)
Gonzalez RC & Woods, RE, 2008, ‘Digital Image Processing, Instructor's Manual’ New York: Prentice Hall.
Gonzalez, RC, Woods, RE and Eddins, SL, 2009, Digital Image Processing Using MatLab’, New York: Prentice Hall.
Appendix
Read
More
Then in-depth analysis was conducted to find out the main problem in the production line and a solution was devised based on lean manufacturing methods.... In order to improve the production, data was collected from the company's existing processes and management information systems....
MULTIVARIATE analysis TECHNIQUE ASSESMENT Submitted to: [Name of Instructor] Submitted by: [Name of Student] Submitted on: [June 20, 2011] INTRODUCTION This report is aimed to present a review and critical analysis of the multivariate analysis techniques used in the three selected research papers.... The expected learning outcome is the theoretical knowledge of various multivariate analysis techniques and their practical applications....
This study declared that the pattern of results given in Hanushek's vote count is not consistent with the null hypothesis of no effect in every study and thus making a type II error in his statistical analysis due to the low power associated with vote counting.... The only setback of his work is that the production function is not known and must be inferred from data on students and their schools.... Name: Course: Instructor: Institution: Date: Economics of Education Question 1 Hanushek study In his study he explains that the most useful approach to finding the determinants of students' achievement is the production-function approach, or else input-output to establish how educational inputs can be transformed into educational outcomes....
As the paper "Data Envelopment analysis in Health Care" outlines, presently, DEA is still broadly studied and being used in a lot of areas and fields.... Since data envelopment analysis was initially brought in 1978, the just authoritative technique has been advanced and utilized to evaluate the comparative efficiencies of multiple-response multiple-output decision-making units.... DATA ENVELOPMENT analysis IN HEALTH CARE By Presented to DATA ENVELOPMENT analysis IN HEALTH CARE I....
CPU recognizes these input data properly and process it properly.... Raw data is given to the CPU whereas.... In other words, raw data is given through inputs while finished data is obtained through outputs from a computer system.... Various equipment are used for inputting data and to obtain the outputs from a computer system.... Raw data is given to the CPU whereas finished data is delivered by the CPU....
he development of surveys in collection of input data involves four main stages.... It is through the results and analysis that the project managers will be able to understand what will give the best result to what is expected by the customers/users.... ampling is delineated as a process that is used in the statistical analysis whereby the predetermined number of various organizations will be picked from the main population.... The methodology that is used by various researchers in sampling from a larger population mainly depends on the specific type of analysis....
The Data Envelopment analysis model has been in discussion by relating itself to many managerial contexts.... In the field of health care, there have been many other additions for the applications of Data Envelopment analysis that have used the various measure for measuring the output.... Olesen and Petersen (1995) have studied and conducted the applications of Data Envelopment analysis in primary schools.... According to the results obtained and achieved and also according to the laws of the health care systems in Australia, it has been suggested how the Data Envelopment analysis can be applied in the GP clinics and what are the possible outcomes measures of the system....
This case study "Systems analysis Designs" presents a webpage that will be login by the students.... Running Head: SYSTEMS analysis DESIGNS
... (2006), Systems analysis Designs, San tiego: J.... The system will check the user name and password to see if they match the data stored onto their database.... The validation will verify that the user has entered valid data onto the different fields.... If all inputs are valid, the related reservation data will be saved onto the apartment database.
...
10 Pages(2500 words)Case Study
sponsored ads
Save Your Time for More Important Things
Let us write or edit the assignment on your topic
"Input Data Analysis"
with a personal 20% discount.