November 15, 2023

  1. After analysing the “food establishment inspections” dataset, it was determined that the decision tree technique would work best because it can be used for both regression and classification, can handle both numerical and categorical data, implicitly perform feature selection, robust to outliers and missing values.
  2. They use a structure akin to a tree to represent decisions and their possible outcomes.
  3. The nodes in the tree stand for features, the branches for decision rules, and the leaves for the result or goal variable.
  4. Decision trees determine the optimal feature to split the dataset at each node based on a variety of parameters, such as information gain, entropy, and Gini index.
  5. The algorithm recursively splits the dataset based on the selected features until a stopping criterion is

Leave a Reply

Your email address will not be published. Required fields are marked *