Naive Bayes Theorem | DiamondClover

Edit me

Bayes Theorem

$P(A|B) = \frac{P(B|A)*P(A)}{P(B)}$

Probabilty that A occurs given B has occured P(A | B)

The Bayes Theorem is a method of finding what the proababilty is of something ocuuring (A) given that something else has just occured (B).

Two machines produce wrenches. We know that:

Machine 1 : Produces 30 Wrenches Per Hour
Machine 2 : Produces 20 Wrenches Per Hour
1% of all wrenches are defective, where 50% of the wrenches that are defective are from machine 1 and 50% from machine 2.

Question: What is the probabilty that a part produced by machine 2 is defective?

Answer:

$P(D | M2) = \frac{P(M2 | D)*P(D)}{P(M2)} = \frac{ 0.5 * 0.01}{0.4} = 0.0125$

The implementation for a NAive Bayes theorem will be a supervised machine learning algorithm used to classify data with previous known classes.

Bayes Theorem Example

How it Works:

Apply Bayes Theorem in this example we’ll use it to find the proabalilty that this person walks based on his features (the specific age and salary of that data point). Calculate the probability of each of the components of BAyes Theorem.
Apply Theorem again, in this case it would be to find the probability that the new dataset Drives based on its features (age and salary).
Compare the two and assign a class to the dataset.

Bayes Theorem Example

Bayes Theorem Step 1
- Prior Probabilty Calculation
- Marginal Likehood : Select a radius around the data point. P(X) is the probabilty of any given point to fall within that selected radius.
- Likelihood: Same radius. P(X | Walks) : Probability of the data point would be in this circle given that that datapoint walks.
- Calculate Bayes Theorem
Bayes Theorem Step 2 (repeat)
Since in this case the probability of the data point walking is greater than driving we can say that the data point is assigned to walking

Use the classification template to set everything up.

Create the classifier and set it up

from sklearn.naive_bayes import GaussianNB
classifier = GaussianNB()
classifier.fit(X_train, y_train)

run code…