Bagging Algorithm

A Simple Explanation - By Varsha Saini

Bagging, also known as Bootstrap Aggregation, is an ensemble technique that trains multiple base models, typically Decision Trees, and combines their outputs to improve overall performance. An example of a bagging algorithm is Random Forest.

Decision Trees suffer from a major problem: Overfitting. A bagging algorithm such as Random Forest resolves this by training multiple Decision Trees on the same problem and aggregating their outputs, which reduces variance and thereby reduces Overfitting.

  • Every Decision Tree in a Bagging algorithm is given equal importance.
  • Each model is built independently, i.e. one model has no effect on another.
  • Bagging algorithms reduce variance, not bias, which makes them suitable for high-variance, low-bias problems, as the comparison below illustrates.
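
To make the variance reduction concrete, here is a minimal sketch comparing a single Decision Tree with a Random Forest using scikit-learn. The library choice, the synthetic dataset, and the hyperparameters are illustrative assumptions, not part of this article.

```python
# Illustrative comparison: one overfit tree vs. a bagged ensemble of trees.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier

# A toy dataset; the sizes here are arbitrary choices for the demo.
X, y = make_classification(n_samples=1000, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# A single, fully grown tree tends to overfit the training data.
tree = DecisionTreeClassifier(random_state=42).fit(X_train, y_train)

# Averaging many trees trained on bootstrap samples reduces variance.
forest = RandomForestClassifier(n_estimators=100, random_state=42).fit(X_train, y_train)

print("Tree   train/test:", tree.score(X_train, y_train), tree.score(X_test, y_test))
print("Forest train/test:", forest.score(X_train, y_train), forest.score(X_test, y_test))
```

Typically the single tree fits the training data almost perfectly but generalizes worse, while the forest keeps its train and test scores closer together.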

Bootstrapping

Bootstrapping is the process of randomly selecting data with replacement, i.e. a data point that has already been selected can be selected again.
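
As a quick illustration, the sketch below draws a bootstrap sample with NumPy (an assumed library choice); note that the same index can appear more than once.

```python
# Bootstrapping: sampling indices with replacement, so rows can repeat.
import numpy as np

rng = np.random.default_rng(seed=0)
data = np.arange(10)  # a toy "dataset" of 10 rows (illustrative)

# Draw a bootstrap sample the same size as the original data.
indices = rng.choice(len(data), size=len(data), replace=True)
bootstrap_sample = data[indices]

print(bootstrap_sample)         # repeated values are expected
print(np.unique(indices).size)  # typically ~63% of distinct rows appear
```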


Steps to Implement Bagging

  1. Decide the number of Decision Trees.
  2. For every Decision Tree,
    1. Select samples with replacement (a bootstrap sample).
    2. Select a random subset of features.
  3. Train all the models independently of each other.
  4. Aggregate the outputs from all the models by voting or averaging (a sketch of these steps follows below).
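
Below is a minimal sketch of these four steps, assuming scikit-learn Decision Trees as the base models, majority voting for aggregation, and non-negative integer class labels; helper names like bagging_fit are hypothetical.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def bagging_fit(X, y, n_trees=25, seed=0):
    """Step 1: decide the number of trees. Steps 2-3: train each tree
    independently on a bootstrap sample of rows and a random subset
    of feature columns."""
    rng = np.random.default_rng(seed)
    n_rows, n_features = X.shape
    k = max(1, int(np.sqrt(n_features)))  # illustrative feature-subset size
    models = []
    for _ in range(n_trees):
        rows = rng.choice(n_rows, size=n_rows, replace=True)   # bootstrap sample
        cols = rng.choice(n_features, size=k, replace=False)   # feature subset
        tree = DecisionTreeClassifier().fit(X[rows][:, cols], y[rows])
        models.append((tree, cols))
    return models

def bagging_predict(models, X):
    """Step 4: aggregate by majority vote across all trees."""
    votes = np.stack([tree.predict(X[:, cols]) for tree, cols in models])
    # For each sample, pick the most frequent predicted class
    # (assumes class labels are non-negative integers).
    return np.apply_along_axis(
        lambda v: np.bincount(v.astype(int)).argmax(), 0, votes)
```

In practice, scikit-learn packages these same steps as BaggingClassifier; the manual version above is only meant to mirror the list.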

Advantages of Bagging

  1. Many weak learners are combined into a strong learner.
  2. Reduces the Overfitting problem by lowering variance.

Disadvantages of Bagging

  1. Computationally expensive, since many models have to be trained.
  2. Because bagging reduces variance rather than bias, the final output can still be biased if the base models themselves are biased.