
Machine Learning Algorithms Cheat Sheet

1. Linear Regression

**Overview**: Linear Regression is a linear approach to modeling the relationship between a dependent variable and one or more independent variables.

**Key Hyperparameters**:

- `fit_intercept`: Whether to calculate the intercept for the model. Default is `True`.

- `normalize`: Deprecated in scikit-learn 1.0 and removed in 1.2; standardize features with `StandardScaler` in a pipeline instead (see the sketch after the example code).

**Example Code**:

```python
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error

# Example data (replace with your own feature matrix X and target vector y)
X = ...
y = ...

# Split data into train and test sets (80/20)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Model initialization (the normalize argument was removed in scikit-learn 1.2)
lr = LinearRegression(fit_intercept=True)

# Model fitting
lr.fit(X_train, y_train)

# Predictions
y_pred = lr.predict(X_test)

# Evaluation
mse = mean_squared_error(y_test, y_pred)
print(f'Mean Squared Error: {mse}')
```
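
Since `normalize` is gone from recent scikit-learn releases, a minimal sketch of the recommended replacement, standardizing features with `StandardScaler` inside a pipeline (reusing the `X_train`, `X_test`, and `y_train` from the example above):

```python
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LinearRegression

# Standardize features, then fit the regression, as a single estimator
pipe = make_pipeline(StandardScaler(), LinearRegression())
pipe.fit(X_train, y_train)
y_pred = pipe.predict(X_test)
```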

2. Logistic Regression

**Overview**: Logistic Regression is used for binary classification problems. It models the probability of a binary outcome using a logistic function.
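
Concretely, the logistic (sigmoid) function maps any real-valued score to a probability in (0, 1); a minimal NumPy illustration:

```python
import numpy as np

def sigmoid(z):
    # Squashes the linear score z into a probability between 0 and 1
    return 1.0 / (1.0 + np.exp(-z))

print(sigmoid(0.0))  # 0.5, the decision boundary
```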

**Key Hyperparameters**:

- `penalty`: Specifies the norm of the regularization penalty (`'l1'`, `'l2'`, `'elasticnet'`, or `None`; the string `'none'` was removed in scikit-learn 1.4).

- `C`: Inverse of regularization strength; smaller values specify stronger regularization.

- `solver`: Algorithm to use in the optimization problem (`'newton-cg'`, `'lbfgs'`, `'liblinear'`, `'sag'`, `'saga'`).

**Example Code**:

```python
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

# Example data (replace with your own feature matrix X and target vector y)
X = ...
y = ...

# Split data into train and test sets (80/20)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Model initialization
log_reg = LogisticRegression(penalty='l2', C=1.0, solver='lbfgs', max_iter=1000)

# Model fitting
log_reg.fit(X_train, y_train)

# Predictions
y_pred = log_reg.predict(X_test)

# Evaluation
accuracy = accuracy_score(y_test, y_pred)
print(f'Accuracy: {accuracy}')
```
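
Because the model estimates probabilities, `predict_proba` is often more informative than hard labels; a short follow-up using the fitted `log_reg` from above:

```python
# Column 1 holds the probability of the positive class for each test sample
proba = log_reg.predict_proba(X_test)[:, 1]
print(proba[:5])
```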

3. Decision Tree

**Overview**: Decision Tree is a non-parametric supervised learning method used for classification and regression.

**Key Hyperparameters**:

- `criterion`: The function to measure the quality of a split (`'gini'` for Gini impurity, `'entropy'` for information gain).

- `max_depth`: The maximum depth of the tree.

- `min_samples_split`: The minimum number of samples required to split an internal node.

- `min_samples_leaf`: The minimum number of samples required to be at a leaf node.

**Example Code**:

```python
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

# Example data (replace with your own feature matrix X and target vector y)
X = ...
y = ...

# Split data into train and test sets (80/20)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Model initialization
dt = DecisionTreeClassifier(criterion='gini', max_depth=None, min_samples_split=2,
                            min_samples_leaf=1)

# Model fitting
dt.fit(X_train, y_train)

# Predictions
y_pred = dt.predict(X_test)

# Evaluation
accuracy = accuracy_score(y_test, y_pred)
print(f'Accuracy: {accuracy}')
```
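
A key advantage of decision trees is interpretability; a minimal sketch of printing the learned rules for the fitted `dt` from above with `export_text` (the feature names here are hypothetical placeholders):

```python
from sklearn.tree import export_text

# Dump the tree's decision rules as indented plain text
rules = export_text(dt, feature_names=['feature_0', 'feature_1'])
print(rules)
```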

4. Random Forest

**Overview**: Random Forest is an ensemble method that combines multiple decision trees to improve classification or regression results.

**Key Hyperparameters**:

- `n_estimators`: The number of trees in the forest.

- `criterion`: The function to measure the quality of a split (`'gini'`, `'entropy'`).

- `max_depth`: The maximum depth of the tree.

- `min_samples_split`: The minimum number of samples required to split an internal node.

- `min_samples_leaf`: The minimum number of samples required to be at a leaf node.

**Example Code**:

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

# Example data (replace with your own feature matrix X and target vector y)
X = ...
y = ...

# Split data into train and test sets (80/20)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Model initialization
rf = RandomForestClassifier(n_estimators=100, criterion='gini', max_depth=None,
                            min_samples_split=2, min_samples_leaf=1)

# Model fitting
rf.fit(X_train, y_train)

# Predictions
y_pred = rf.predict(X_test)

# Evaluation
accuracy = accuracy_score(y_test, y_pred)
print(f'Accuracy: {accuracy}')
```
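
A fitted forest also exposes impurity-based feature importances, handy for a quick sanity check; a short follow-up on the fitted `rf` from above:

```python
import numpy as np

# One importance score per feature; the scores sum to 1
importances = rf.feature_importances_
for idx in np.argsort(importances)[::-1]:
    print(f'feature {idx}: {importances[idx]:.3f}')
```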

5. AdaBoost

**Overview**: AdaBoost is an ensemble method that combines multiple weak classifiers to create a strong classifier.

**Key Hyperparameters**:

- `n_estimators`: The maximum number of estimators at which boosting is terminated.

- `learning_rate`: Weight applied to each classifier at each boosting iteration.

- `estimator`: The base estimator from which the boosted ensemble is built (e.g., `DecisionTreeClassifier`). Called `base_estimator` before scikit-learn 1.2; the old name was removed in 1.4.

**Example Code**:

```python
from sklearn.ensemble import AdaBoostClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

# Example data (replace with your own feature matrix X and target vector y)
X = ...
y = ...

# Split data into train and test sets (80/20)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Model initialization (base_estimator was renamed to estimator in scikit-learn 1.2)
ada = AdaBoostClassifier(estimator=DecisionTreeClassifier(max_depth=1), n_estimators=50,
                         learning_rate=1.0)

# Model fitting
ada.fit(X_train, y_train)

# Predictions
y_pred = ada.predict(X_test)

# Evaluation
accuracy = accuracy_score(y_test, y_pred)
print(f'Accuracy: {accuracy}')
```
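
Boosting builds the ensemble one weak learner at a time, which `staged_predict` lets you observe; a minimal sketch tracking test accuracy per boosting round on the fitted `ada` from above:

```python
from sklearn.metrics import accuracy_score

# Accuracy after each boosting iteration
for i, y_stage in enumerate(ada.staged_predict(X_test), start=1):
    print(f'round {i}: {accuracy_score(y_test, y_stage):.3f}')
```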

6. K-Nearest Neighbors (KNN)

**Overview**: KNN is a non-parametric method used for classification and regression by finding the k most similar instances in the training data.

**Key Hyperparameters**:

- `n_neighbors`: Number of neighbors to use.

- `weights`: Weight function used in prediction (`'uniform'`, `'distance'`).

- `algorithm`: Algorithm used to compute the nearest neighbors (`'auto'`, `'ball_tree'`, `'kd_tree'`, `'brute'`).

**Example Code**:

```python
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

# Example data (replace with your own feature matrix X and target vector y)
X = ...
y = ...

# Split data into train and test sets (80/20)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Model initialization
knn = KNeighborsClassifier(n_neighbors=5, weights='uniform', algorithm='auto')

# Model fitting
knn.fit(X_train, y_train)

# Predictions
y_pred = knn.predict(X_test)

# Evaluation
accuracy = accuracy_score(y_test, y_pred)
print(f'Accuracy: {accuracy}')
```
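
Because KNN relies on distances, features on larger scales can dominate the neighbor search; a minimal sketch of the common remedy, scaling inside a pipeline (reusing the split data from above):

```python
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.neighbors import KNeighborsClassifier

# Standardize features so each contributes comparably to the distance metric
knn_pipe = make_pipeline(StandardScaler(), KNeighborsClassifier(n_neighbors=5))
knn_pipe.fit(X_train, y_train)
print(knn_pipe.score(X_test, y_test))
```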
