Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
decision-tree.ipynb		decision-tree.ipynb

README.md

Decision tree

Goals

To implement the algorithm of building the decision tree.
Search for the optimal height of the tree.
To implement the algorithm of building a forest of deciduous trees.
Results analysis.

Data Sets

Use these data sets to test your classifier. Each dataset is pre-divided into a training and test sample. The class label is the last number in each line. For convenience, they are also available in .txt format.

Task

For each dataset, determine the optimal height of the decision tree for the accuracy of the classification on the test set.

Select two datasets: the minimum and maximum optimal height datasets. For these two datasets, plot the height dependency of the accuracy classification on the training set and the test set.

In this lab work, you are allowed to use sklearn.tree.DecisionTreeClassifier. If you use this implementation, in addition to the tree height, you need to configure hyperparameters and splitter (see documentation).

For each dataset, build a forest of deciduous trees without height limitation (i.e. without pruning) and determine the accuracy of classification on the training and check set.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

decision-tree

decision-tree

README.md

Decision tree

Goals

Data Sets

Task

Files

decision-tree

Directory actions

More options

Directory actions

More options

Latest commit

History

decision-tree

Folders and files

parent directory

README.md

Decision tree

Goals

Data Sets

Task