Support Vector Machines- I

Support Vector Machine (SVM):

It is a supervised Machine Learning Algorithm. It is used to Classification and Regression model.

What is SVM

SVM is a Machine Learning algorithm. It creates the best line or decision boundary (Hyper plane) that can separate out n-dimensional space into classes. The decision boundary is used to classify new data.

SVM is a type of supervised Learning Algorithm.
It requires labeled data (Input and output both) for training.
It is used to solve classification and regression problems.

When should we use a Support Vector Machine?

Support vector machine is used two classify two-dimensional and multidimensional data and also regression model.

Decision Boundary: Decision boundary (Line or Plane) plays important role in the classification.

Figure1: Hyperplane and line Classify the data

To classify blue and red dots Support vector creates a decision boundary between these two data sets.

If it is multidimensional data, plane is used and if it is two-dimensional data line is used. This line or a plane is called Hyperplane as shown in figure1

Following are the important concepts in SVM

Define following concept.

1. Linearly and non linear SVM

2. Hyper Plane.

3. Distance Margin

4. Support Vectors

1. Types of SVM

SVM can be of two types:

Linear SVM: Linear SVM is used for linearly separable data, which means if a dataset can be classified into two classes by using a single straight line, then such data is termed as linearly separable data, and classifier is used called as Linear SVM classifier. Figure 2 is an example of Linear SVM.
Figure 2: Linear Classification

Non-linear SVM: Non-Linear SVM is used to classify non-linearly separated data, which means if a datasets not be classified by using a straight line, then such data is termed as non-linear data
Figure 3: Non Linear SVM

Figure 3 is an example of Non Linear SVM. In figure 3.It is not possible to draw a line to separate blue and yellow spot. Plane is used to classify data. To ceat plane, need to convert into a multidimensional plot using kernal. Plane is created using kerne to classify data

Figure 4: Plane to separate non linear data

Difference between Linear SVM and Non Linear SVM

Linear SVM	Non-Linear SVM
Linear line will separate the data and classify	Linear line will not separate the data
Data is classified with the help of hyperplane.	Kernels are used to make non-separable data into separable data.
Data can be easily classified by drawing a straight line.	Need to map data from low dimensional space to high dimensional space for classification.

Summary of Linear SVM and Non-Linear SVM.

In Linear SVM Data set can be classified into two classes by using a single straight line,
In Non Linear SVM Data set cannot be classified into two classes by using a single straight line
In Non Linear classifier plane is used to classify the data.
Hyperplane(Line/Plane) is used to classify data in SVM
Kernels are used to classify non linear Data.
Kernels are used to map data from low dimensional space to high dimensional space for classification.
Kernels are used to make non-separable data into separable data in non linear classifier

1. What is Hyperplane?

A most important step to build a SVM Model is to create Hyperplane which has a maximum margin

Hyperplane:

Hyperplane is a decision plane or space that divides set of objects having different classes between the data points. There can be multiple lines/decision boundaries to seperate the classes in n-dimensional space, but best decision boundary need to find out that helps to classify the data points. This best boundary is known as the hyperplane of SVM.In figure 4 large margin is the best boundary.

Figure 5: Hyperplane
1. Margin Maximum margin hyperplane is a decision plane or space that divides set of objects having different classes between the data points. We always create a hyperplane that has a maximum margin, which means the maximum distance. It can be easily seen that line from figure 4 with a large margin is the one that best separates the two classes.
2. Support Vector- The data points on hyperplane boundary is called Support Vector. Figure 6 shows Hyper plane, Margin, Support Vector.
Figure 6: Hyperplane. Margin, Support Vector

To build SVM Model the most important part is to choose the hyperplane which is as far as possible from the data points of each category.

How Doeds SVM Works?

Step1: Collect data set of green and blue dot. It is supervised learning with labels green for green dot and blue for blue dot.

Step 2: Apply SVM .It creates hyperplane. It is two diamensional data Hyperplane is line which separate green and blue dots.

Step 3:Distance between positive hyperplane and Negative hyperplane should be maximum hen it is maximum margin hyperplane.Check Hyperplane is a maximum margin hyperplane.

Step 4: Find Support Vectors.

Step 5: Support vectors are used to classify the new points.

What are the advantages of supporting Vector Machine over other classification problem?

SVMs are extremely popular because of their ability to handle multiple continuous and categorical variables.

Disadvantages of Neural Network over SVM

1. Having many local minima

2. To find number of neuron and number of layers in the architecture.

3. Trial and error method to decide structure and hyperparameters to reach optimality of that NN

There are many linear classifiers (hyper planes) that separate the data. However the goals of SVM are separating the data with hyper plane and extend this to non-linear boundaries using kernel trick.

Advantages of SVM are

It is maximum margin classifier or hyper plane. It gives maximum separation between two class

· Text and Hypertext category

· Classification of Images

· Classification of satellite data.

· Recognition of hand written character

· Biological and other sciences to classify proteins with up to 90% of the compounds classified correctly.

Search This Blog

Machine Learning and Deep Learning

Support Vector Machines- I

Comments

Post a Comment

Popular posts from this blog

Linear Regression

Logistic Regression

k-Nearest Neighbors Algorithm