StudentShare
Contact Us
Sign In / Sign Up for FREE
Search
Go to advanced search...
Free

Data Mining - Lab Report Example

Cite this document
Summary
The paper "Data Mining" tells us about the process of sorting through large data sets to identify patterns and relationships that can help solve business problems through data analysis. Data mining techniques and tools enable enterprises to predict future trends and make more-informed business decisions…
Download full paper File format: .doc, available for editing
GRAB THE BEST PAPER99% of users find it useful
Data Mining
Read Text Preview

Extract of sample "Data Mining"

The survey aimed at developing information on students’ backgrounds for informing teaching practices. Gender, number of previous data science courses by a student, students’ self-assessed data mining efficiency, future career goals, geo-location, and preference for a one-by-one virtual meeting were the study’s variables. Data issues with the collected data, cleaning, and analysis results are discussed. SPSS software was used for analysis.

Data issues and cleaning

Missing data was the most prevalent issue in the data set (Tan, Steinbach, & Kumar, 2006). All data for one participant (ID+ R_wZTAo2AjoAUTWvf) were missing. In addition, data on the number of science-related courses that a student had taken and data on years of professional experience that a student had prior to the course were missing for some of the participants. In addition, data on the expected salary for the first job had unrealistically low values and required cleaning. Means were used to clean data on a previous number of science courses and professional experience and expected salary while the mode was used to clean ordinal data.

The following table summarizes descriptive statistics of the numeric scale variables.

Table 1: Descriptive statistics

 

Descriptive Statistics

 

N

Minimum

Maximum

Mean

Std. Deviation

Skewness

Statistic

Statistic

Statistic

Statistic

Statistic

Statistic

Std. Error

previous data science related courses

23

.00

4.00

2.8226

1.05466

-1.098

.481

previous years of professional experience in dara areas

23

.00

21.00

3.6478

4.17722

3.425

.481

expected first salary after graduation

23

29795.78

145000.00

46605.9435

32496.99730

2.341

.481

Valid N (listwise)

23

 

 

 

 

 

 

 

The three data sets are skewed (p> 0.05) and this means that the media is the best descriptive statistics. The following table shows the statistics.

Table 2: Median for the numeric variables

Statistics

 

previous data science related courses

previous years of professional experience in data areas

expected first salary after graduation

N

Valid

23

23

23

Missing

0

0

0

Mean

2.8226

3.6478

46605.9435

Median

3.0000

3.6500

29795.7800

Mode

2.82a

3.65

29795.78

a. Multiple modes exist. The smallest value is shown

 

A majority of the students, therefore, had undertaken about three science-related courses and had about 3.65 years of professional experience in data areas. The students expected a first salary of about $ 29795.78.

A majority of the students (60.9 percent) were fair in data mining efficiency while only 8.7 were good. Only 21.7 percent had much confidence in becoming data analysts after graduation while 56.5 percent were not sure of their positions. Most of the students lived away from campus with 34.8 percent being within driving distance while 52.2 percent lived far away, though within the United States. Most of the students preferred a one-by-one virtual meeting. The following histograms illustrate the distributions.

Graph 1: Data mining efficiency

Graph 2: Interest in becoming a data analyst after graduation

Graph 3: Distance from campus

Graph 4: Preference for a one-by-one virtual meeting

The following table shows significant correlations, based on results in Appendix A.

Table 3: Significant correlations

Previous data science related courses

Previous years of experience in data

0.448

Previous years of experience in data

Expected first salary

0.494

Efficiency

Interest in data analysis

0.489

 

Correlation between expected salary and level of efficiency identifiny the role of expected salary on motivating students into the subject.

Summary

Majoforrity of the students have sufficient background knowledge in data mining, having done many related courses. They however lack experience in data mining and report average efficiency. Their level of motivation into the data analysis profession is low, their locations are far from the campus, and they prefer one-by-one virtual meetings. A one-on-one approach to learning that focuses on technology for online study is therefore recommended.

 

Read More
Cite this document
  • APA
  • MLA
  • CHICAGO
(“Data Mining Lab Report Example | Topics and Well Written Essays - 500 words”, n.d.)
Data Mining Lab Report Example | Topics and Well Written Essays - 500 words. Retrieved from https://studentshare.org/information-technology/1698533-data-mining
(Data Mining Lab Report Example | Topics and Well Written Essays - 500 Words)
Data Mining Lab Report Example | Topics and Well Written Essays - 500 Words. https://studentshare.org/information-technology/1698533-data-mining.
“Data Mining Lab Report Example | Topics and Well Written Essays - 500 Words”, n.d. https://studentshare.org/information-technology/1698533-data-mining.
  • Cited: 0 times

CHECK THESE SAMPLES OF Data Mining

Data Mining In Tracking Customer Behavior Patterns

The intention of the following essay is to justify the use of Data Mining aimed at gathering consumer behavior information in business.... Furthermore, the essay "Data Mining In Tracking Customer Behavior Patterns" would discuss the principles of Data Mining and address some of the common issues.... hellip; Data Mining is a key technology development in the sphere of data extraction.... Data Mining helps in providing predictive information allowing the manager to be more proactive....
6 Pages (1500 words) Essay

Data Mining and Web Personalization

This paper ''Data Mining and Web Personalization'' tells that Web personalization is a business solution that is becoming more involved as technology advances and consumers can be more selective in how they pursue business relationships....             Data Mining as used in web personalization allows business owners to concentrate on solving many problems and making e-commerce more of a viable business option....
8 Pages (2000 words) Essay

Data Mining as the Process

The focus of this paper "Data Mining as the Process" is on Data Mining, the process used by the firms to extract underlying information stored in the vast amount of data they have about their customers.... hellip; Data Mining can be oriented to for three paradigms: discovery, predictive and forensic.... Irrespective of the technique, Data Mining can be broadly carried out in three steps in generic terms: classification (applied to group data based on set rules), association (the relation between objects within the group is identified) and sequence analysis (the sequence in which a data repeats itself is identified)....
1 Pages (250 words) Essay

High Level ETL and Data Mining Requirements

The prices of ETL High Level ETL and Data Mining Requirements Introduction A Data Mining and ETL methodologies seek to organize the pattern discovery process in the data warehouse of an organization.... ?Data Mining.... ?Geographic Data Mining and knowledge discovery.... ata mining It is the practice of analyzing information from different dimensions and simplifying it in meaningful information (Clifton, 2010)....
2 Pages (500 words) Research Paper

Exploratory Data Mining in Behavioral Research

The case study "Exploratory Data Mining in Behavioral Research" states that the method is used when information about a situation is not available.... nbsp;… It helps to decide the best design for future research, how to collect data and the subjects to deal with.... Interviews are conducted and data collected.... From the data theories and hypotheses are developed by observing patterns that emerge....
1 Pages (250 words) Case Study

Data Mining and Big Data

However, confusion reigns with many regarding Big Data is an over-hyped buzzword for Data Mining”, I agree with it. Data Mining entails the process of… On the other hand, the term, ‘Big data' describes a massive structured and unstructured data volume, which is so complex to process using the common or traditional 2 May Do you agree with this ment?... However, confusion reigns with many regarding Big Data is an over-hyped buzzword for Data Mining”, I agree with it....
2 Pages (500 words) Assignment

Efficient Data Mining Classification Technique

This research project “Efficient Data Mining Classification Technique” aims to study the shortcomings of existing novel class detection, data reduction, and class balancing Data Mining techniques in terms of their accuracy and efficiency.... Classification, clustering, and aggregation are some of the Data Mining hot topics that are of extreme value in all engineering and scientific areas, such as, biological, physical and bio-medical sciences....
8 Pages (2000 words) Essay

Patient access, data mining and PHI

Another problem is the encryption of the data and Patient Access, Data Mining and PHI Patient Access, Data Mining and PHI Health information technology is one of the best initiatives to improve quality and efficiency in the healthcare system.... Another problem is the encryption of the data and security of the patients' data in the system.... However, the primary concern about the system is its ability to support use by people at different levels including patients without compromising the confidentiality of the patients' data....
1 Pages (250 words) Essay
sponsored ads
We use cookies to create the best experience for you. Keep on browsing if you are OK with that, or find out how to manage cookies.
Contact Us