Random forest Algorithm using machine learnig.pptx

enamulmd427 44 views 13 slides Aug 28, 2024
Slide 1
Slide 1 of 13
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13

About This Presentation

This ppt is about random forest algorithm


Slide Content

Random Forests Algorithm By: Md Enamullah RA2332241030042 21-02-2024

Outline What is Random Forests? Algorithm How does Algorithm wor k? Case Study Diagram Advantage Disadvantage

What is Random Forests An ensemble classifier using many decision tree models. Can be used for classification or Regression. Accuracy and variable importanceinformation is provided with the results.

The Algorithm Let the number of training cases be N, and the number ofvariables in the classifier be M . The number m of input variables are used to determine thedecision at a node of the tree; m should be much less than M . Choose a training set for this tree by choosing N times withreplacement from all N available training cases. Use the rest of thecases to estimate the error of the tree, by predicting their classes . For each node of the tree, randomly choose m variables on whichto base the decision at that node. Cálculate the best split based onthese m variables in the training set . Each tree is fully grown and not pruned.

How does Random Forests Algorithm work

Case Study

Bagging at Training time Training set N subsets (with replacement)

Bagging at inference time A test sample 75% confidence

Random Subspace Method at training time

Random Subspace Method at inference time A test sample 66% confidence

Advantage It produces a highly accurate classifier and learning is fast It is capable of handling large datasets with high dimensionality . It can handle thousands of input variables without variable deletion . It enhances the accuracy of the model and prevents the overfitting issue . It offers an experimental method for detecting variable interactions .

Disadvantage Although random forest can be used for both classification and regression tasks, it is not more suitable for Regression tasks.

THANK YOU…
Tags