Waikato Environment for Knowledge Analysis - Essay Example


Weka Programme

Introduction

Weka is a suite of machine learning software: a collection of algorithms written in Java and used for data mining tasks. These algorithms can be applied either directly to a dataset or called from the user's own Java code. The software contains tools for data pre-processing, classification, clustering, regression, association rules, and visualization. It can also be used to develop new machine learning schemes. Weka was developed at the University of Waikato in New Zealand, and its name stands for Waikato Environment for Knowledge Analysis. It shares that name with the weka, a New Zealand bird known for its inquisitive nature. It is free software issued under the GNU General Public License.

Weka allows businesses to extract useful information from large databases. This is the process of data mining, a field that analyzes large sets of data to discover patterns and that covers methods for managing, processing, and drawing inferences from the data. Weka offers businesses a collection of learning tools and schemes that may be used for data mining (Witten, 2011, p. 132).

Four applications are accessible in the program: Explorer, Experimenter, KnowledgeFlow, and Simple CLI. The first section of the software opens the database or dataset and lets the user edit it as desired. These edits include filtering the data contents, changing attributes, and visualizing the result in a bar chart. The available data can also be classified according to a predefined set of rules, and the software can perform a complete cost-benefit analysis that automatically displays the threshold curve and the cost matrix (Witten, 2011, p. 132). The program also has tools for data clustering, attribute evaluation, and association rules.
In addition, businesses use Weka for data plotting, which lets the user view and analyze point graphs individually for each possible attribute combination. Businesses also use Weka because it is suitable for developing new machine learning schemes. The user configures an experiment by choosing its type, classification or regression, and then selects the desired dataset and algorithm, after which the experiment is ready to run. The results can be saved in CSV or ARFF format, or to a database through JDBC. A data file can also be analyzed or tested in the program, which lets the user choose the comparison field and the sorting criteria on which the test is based. Weka is used in businesses because it is easy-to-use software designed for those familiar with the procedures of data mining and database analysis (Hill, 2012, p. 538).

Data mining is widely used in the tech industry, where businesses generate thousands of data points about their users and look for ways to turn the collected information into increased revenue. Data mining is a collective term for several techniques that work together to turn raw data into something meaningful. Applying data mining produces models and rules that help to predict future behaviour, improve the business, and explain things that were not understandable at first (Witten, 2011, p. 132). The process also helps uncover things that are new and that the user did not previously know. Weka provides data mining software that an average person starting out in data mining can use. Other commercial software products exist as well, but becoming an expert in data mining is very difficult. Weka provides an easier approach and guides the user through the initial problems.
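For readers who have not seen the ARFF format mentioned above, the following is a minimal sketch of what such a file looks like, written in plain Python rather than through Weka itself. The helper name `to_arff`, the relation name, and the toy weather rows are all assumptions made for illustration; the sketch covers only numeric and nominal attributes and does not handle quoting of names or values that contain spaces.

```python
def to_arff(relation, attributes, rows):
    """Render a small table as ARFF text (illustrative helper, not a Weka API).

    attributes: list of (name, kind) pairs, where kind is the string
    "numeric" or a list of nominal labels such as ["yes", "no"].
    rows: list of tuples, one value per attribute, in attribute order.
    """
    lines = ["@relation " + relation, ""]
    for name, kind in attributes:
        if isinstance(kind, (list, tuple)):
            # Nominal attribute: labels are listed inside curly braces.
            lines.append("@attribute " + name + " {" + ",".join(kind) + "}")
        else:
            # Numeric (or other simple) attribute type given as a keyword.
            lines.append("@attribute " + name + " " + kind)
    lines += ["", "@data"]
    for row in rows:
        # Each instance is one comma-separated line.
        lines.append(",".join(str(value) for value in row))
    return "\n".join(lines) + "\n"


# Toy data, assumed for illustration only.
attributes = [
    ("temperature", "numeric"),
    ("humidity", "numeric"),
    ("play", ["yes", "no"]),  # last attribute, typically used as the class
]
rows = [(85, 85, "no"), (83, 86, "no"), (70, 96, "yes")]

arff_text = to_arff("weather", attributes, rows)
print(arff_text)
```

The header declares each attribute and its type, and the `@data` section then lists one instance per line. Placing the class attribute last matches the convention that classification uses the last attribute unless the user chooses another.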
Weka provides an open-source tool with which businesses can mine their own data, learn about their clients, and apply the results to improve business performance (Witten, 2011, p. 132). When using Weka, the user can apply different algorithms to the dataset to produce a representation of the information that makes observation easier. Identifying the options that will give the best output may be difficult. The best approach is to try the available choices independently and see which one produces something closest to the desired results (Hill, 2012, p. 538).

The Classify tab offers several options to select from. The test option gives the user a choice of four test modes to apply to the dataset: use training set, supplied test set, cross-validation, and percentage split (Kaluza, 2013, p. 28). By default, classification in Weka trains on the dataset and produces output classified on the basis of the last attribute in the dataset. To use a different attribute as the class, the user must select it from the options menu before testing (Kaluza, 2013, p. 28).

The Cluster tab opens the process that identifies commonalities, or clusters, within the dataset and produces information that the user can analyze. Some of the options in the cluster window are similar to those in the Classify tab (Kaluza, 2013, p. 28). The Cluster tab provides a cluster evaluation option that compares the clusters with a pre-assigned class in the data. In cluster mode, the user can ignore some of the dataset attributes. This is useful when specific attributes cause results to fall out of range (Kaluza, 2013, p. 28). The selected attributes are the ones used in the calculation.
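Two of the test modes named above, percentage split and cross-validation, can be sketched in a few lines of plain Python. This is only an illustration of how the instances are divided, not Weka's own implementation; the function names, the seed, and the default values of 66 percent and 10 folds are assumptions chosen for the example.

```python
import random


def percentage_split(n_instances, train_percent=66, seed=1):
    """Shuffle instance indices and split them into train and test parts."""
    indices = list(range(n_instances))
    random.Random(seed).shuffle(indices)
    cut = round(n_instances * train_percent / 100)
    return indices[:cut], indices[cut:]


def k_fold_splits(n_instances, k=10, seed=1):
    """Yield (train, test) index lists for k-fold cross-validation.

    Every instance appears in exactly one test fold, and in the
    training set of all the other folds.
    """
    indices = list(range(n_instances))
    random.Random(seed).shuffle(indices)
    # Deal the shuffled indices into k folds of near-equal size.
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


train, test = percentage_split(100)
print(len(train), len(test))

for fold_number, (fold_train, fold_test) in enumerate(k_fold_splits(25, k=5), 1):
    print(fold_number, len(fold_train), len(fold_test))
```

A percentage split evaluates the model once on the held-out part, while cross-validation repeats training and testing once per fold so that every instance is used for testing exactly once. The "use training set" mode corresponds to testing on the same indices used for training, which tends to give optimistic results.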
By default, the software uses all available attributes for evaluation. A user who wants to exclude some attributes has to deselect them from the list (Kaluza, 2013, p. 28). Some attributes take different forms, such as alphanumeric data. For the desired calculation, the software automatically selects the attributes and decides which ones fit best (Hill, 2012, p. 538). Thus, many businesses in the tech industry have started using this software, as it is simple enough for beginners and helps generate revenue through data mining.

References

Hill, T. 2012. Practical Text Mining and Statistical Analysis for Non-Structured Text Data Applications. London: Academic Press.
Kaluza, B. 2013. Instant Weka How-to. London: Packt Publishing Ltd.
Witten, I. 2011. Data Mining: Practical Machine Learning Tools and Techniques. NY: Elsevier.
