Question 1

What is the algorithm pattern for Decision Tree?

Accepted Answer

Recursive Binary Splitting on Best Feature Threshold: at each node, a Decision Tree greedily selects the split (feature + threshold) that most reduces impurity (Gini or entropy), then repeats the same procedure on each resulting child node.

Question 2

How do you solve Decision Tree step by step?

Accepted Answer

1. Compute the Gini impurity of the current node: 1 - Σ(p_i²), where p_i is the fraction of samples belonging to class i.
2. For every candidate split (feature ≤ threshold), compute the weighted Gini of the two child nodes, weighting each child by its share of the samples.
3. Choose the split with the lowest weighted Gini, which is the split with the largest impurity reduction. (When entropy is used instead of Gini, this reduction is called information gain.)
4. Partition the data into left (≤ threshold) and right (> threshold) subsets.
5. Recurse on each subset until a stopping condition is met (pure leaf, max depth, min samples).
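The steps above can be sketched in plain Python. This is a minimal illustration, not a reference implementation: the function names (`gini`, `best_split`, `build_tree`, `predict`), the dictionary-based tree layout, and the choice of midpoints between consecutive distinct values as candidate thresholds are all assumptions made for the example.

```python
from collections import Counter

def gini(labels):
    """Gini impurity: 1 - sum(p_i^2) over the class fractions p_i."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(rows, labels):
    """Return (feature_index, threshold, weighted_gini) for the best split, or None."""
    n = len(rows)
    best = None
    for f in range(len(rows[0])):
        values = sorted({r[f] for r in rows})
        # Assumed candidate thresholds: midpoints between consecutive distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            left = [y for r, y in zip(rows, labels) if r[f] <= t]
            right = [y for r, y in zip(rows, labels) if r[f] > t]
            w = (len(left) * gini(left) + len(right) * gini(right)) / n
            if best is None or w < best[2]:
                best = (f, t, w)
    return best

def build_tree(rows, labels, depth=0, max_depth=3, min_samples=2):
    """Recurse until the node is pure, too deep, too small, or cannot be split."""
    majority = Counter(labels).most_common(1)[0][0]
    if gini(labels) == 0.0 or depth >= max_depth or len(labels) < min_samples:
        return {"leaf": majority}
    split = best_split(rows, labels)
    if split is None:  # every feature has a single distinct value
        return {"leaf": majority}
    f, t, _ = split
    left = [i for i, r in enumerate(rows) if r[f] <= t]
    right = [i for i, r in enumerate(rows) if r[f] > t]
    return {
        "feature": f,
        "threshold": t,
        "left": build_tree([rows[i] for i in left], [labels[i] for i in left],
                           depth + 1, max_depth, min_samples),
        "right": build_tree([rows[i] for i in right], [labels[i] for i in right],
                            depth + 1, max_depth, min_samples),
    }

def predict(tree, row):
    """Follow the <= / > tests from the root down to a leaf."""
    while "leaf" not in tree:
        branch = "left" if row[tree["feature"]] <= tree["threshold"] else "right"
        tree = tree[branch]
    return tree["leaf"]

# Toy data: one feature, two classes that separate cleanly at 2.5.
rows = [[1.0], [2.0], [3.0], [4.0]]
labels = ["a", "a", "b", "b"]
tree = build_tree(rows, labels)
```

On this toy data the root node has Gini 0.5, and the split at threshold 2.5 yields two pure children, so the recursion stops after one level. The exhaustive scan re-partitions the data for every candidate threshold, which keeps the sketch short but is slower than the sorted-scan approach that production libraries typically use.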

Question 3

What are common mistakes when solving Decision Tree?

Accepted Answer

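Gini impurity of a pure node = 0 (best possible). A node with a 50/50 class mix has Gini = 0.5, which is the worst case only for two classes; with k classes the maximum is 1 - 1/k, reached when all classes are equally frequent. Decision trees overfit easily, so use max_depth or min_samples_leaf to regularize. The greedy split is locally optimal but may not be globally optimal.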
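The boundary values and the greedy-split caveat can be checked numerically. The sketch below is illustrative only: the helper names `gini_from_counts` and `weighted_gini` are hypothetical, and the XOR-style class counts are a hand-picked example of a case where no single split lowers impurity even though two levels of splits would separate the classes perfectly.

```python
def gini_from_counts(counts):
    """Gini impurity from per-class sample counts: 1 - sum(p_i^2)."""
    n = sum(counts)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in counts)

def weighted_gini(left_counts, right_counts):
    """Sample-weighted average of the two children's Gini impurities."""
    n_left, n_right = sum(left_counts), sum(right_counts)
    total = n_left + n_right
    return (n_left * gini_from_counts(left_counts)
            + n_right * gini_from_counts(right_counts)) / total

pure = gini_from_counts([10, 0])            # single class present
even_two = gini_from_counts([5, 5])         # two classes, equally frequent
even_four = gini_from_counts([3, 3, 3, 3])  # four classes, equally frequent

# XOR-style data: (0,0)->0, (0,1)->1, (1,0)->1, (1,1)->0.
# Splitting on either feature at 0.5 leaves one sample of each class per side.
xor_parent = gini_from_counts([2, 2])
xor_after_first_split = weighted_gini([1, 1], [1, 1])
```

Here the first split on the XOR data gives no impurity reduction at all, so a purely greedy criterion sees every split as equally useless, even though a depth-2 tree classifies all four points correctly.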
