Loading and Understanding the Data
NOTE:- The sample data used in this exercise is the"bank data" available in CSV format.You can find the data from here ⮞ bank-data .
The data contained the following fields.
In this example,we load the data set into WEKA and perform a series of operations using Weka's preprocessing filters in upcoming blogs. A point to be noted here is that all these operations can be performed from the command line,we use the GUI interface for Weka Explorer.
The data contained the following fields.
In this example,we load the data set into WEKA and perform a series of operations using Weka's preprocessing filters in upcoming blogs. A point to be noted here is that all these operations can be performed from the command line,we use the GUI interface for Weka Explorer.
- Initially in the preprocessing tab click "open" and search to the directory containing the data file (.csv or .arff). In this case open the above data file.
- Once the data is loaded,Weka will recognise the attributes and during the scan of data the software will compute some of the basics statistics on each attribute.
(in image below i have marked the number to each particular panel ) - Listed the detail of each of the panel numbering from 1-41- indicates the names of the base relation (or table) and the current working relation (which are same initially)2- it shows the list of recognised attributes in the uploaded data set.🠆 clicking on any attribute in Clicking on any attribute in the left panel will show the basic statistics on that attribute. For categorical attributes, the frequency for each attribute value is shown, while for continuous attributes we can obtain min, max, mean, standard deviation, etc
3- panel no.3 shows the output result in graphical representation of the selected attribute by you (in the image the attribute selected is "INCOME").You can Visualise the attributes based on selected class. One way is to visualise selected attribute based on class selected in the ‘Class’ pull-down window, or visualise all attributes by clicking on ‘Visualize All’ button.
4- It shows the results of selected attribute (here it is "income") in the form of min, max, mean, standard deviation.
Thank you!!
Comments
Post a Comment