Image for post
Image for post

If you are a software engineer or a programmer you must have used StackOverflow at least once in your lifetime. But have you ever wondered how StackOverflow predicts the tags for a given question ? In this blog, I will discuss the StackOverflow tag predictor case study.

Contents

  1. Overview of Stack OverFlow Dataset.

Stack Overflow is the largest, most trusted online community for developers to learn, share their programming knowledge, and build their careers. …


Image for post
Image for post

Contents

  1. Overview of Dataset.

First We want to know What is Amazon Fine Food Review Analysis?

This dataset consists of reviews of fine foods from amazon. The data span a period of more than 10 years, including all ~500,000 reviews up to October 2012. Reviews include product and user information, ratings, and a plaintext review. We also have reviews from all other Amazon categories.

Amazon reviews are often the most publicly visible reviews of consumer products. As a frequent Amazon user, I was interested in examining the structure of a large database of Amazon reviews and visualizing this information so as to be a smarter consumer and reviewer. …


Image for post
Image for post

In this blog, we’ll try to understand one of the most important algorithms in machine learning i.e. Random Forest Algorithm. We will try to look at the things that make Random Forest so special and will try to implement it on a real-world dataset.

Contents

  1. What Are Ensembles?

What Are Ensembles?

Commonly, the individual model suffers from bias or variances and that’s why we need ensemble learning. …


Image for post
Image for post

Decision trees are a popular supervised learning method for a variety of reasons. The benefits of decision trees include that they can be used for both regression and classification, they are easy to interpret and they don’t require feature scaling. They have several flaws including being prone to overfitting.

Contents

  1. What are Decision Trees?.

What are decision trees?

Decision trees which are also modernly known as classification and regression trees (CART) were introduced by Leo Breiman to refer, Decision Tree algorithms. They have a supervised learning algorithm that has a pre-defined target variable & they are mostly used in non-linear decision making with a simple linear decision surface. In other words, they are adaptable for solving any kind of problem at hand (classification or regression). …


Image for post
Image for post

SVM is a supervised Machine Learning algorithm that is used in many classifications and regression problems. It still presents as one of the most used robust prediction methods that can be applied to many use cases involving classifications.

Contents

  1. Geometric Intuition Of Support Vector Machines.

Key Idea of SVM

Logistic Regression doesn’t care whether the instances are close to the decision boundary. Therefore, the decision boundary it picks may not be optimal. If a point is far from the decision boundary, we may be more confident in our predictions. …


Image for post
Image for post

Logistic regression is a classification algorithm used to assign observations to a discrete set of classes. There are lots of classification problems that are available, but the logistics regression is common and is a useful regression method for solving the binary classification problem.

There are lots of classification problems that are available, but the logistics regression is common and is a useful regression method for solving the binary classification problem.

Contents

  1. Geometric Intuition Of Logistic Regression

2. Regularization techniques to avoid Overfitting and Underfitting

3. Probabilistic interpretation of Logistic Regression

4. Loss Minimization Interpretation of Logistic Regression

5.Implementation of Logistic Regression Algorithm with Amazon Food…


Image for post
Image for post

The solution to an optimization problem can be done by selecting different methods. Moreover, the user can navigate on the surface or curve to establish an initial point and find the optimal or critical point, which can be observed on the plotted function.

Contents

1.Single Value Differentiation

2. Minima and Maxima

3. Gradient descent algorithm

4. Steps for Gradient descent algorithm

5. Types of Gradient Descent algorithms

6. Implementation of Stochastic Gradient Descent

For Optimization problems Differentiation is very important, Let’s see some maths,

Single Value Differentiation

Differentiation allows us to find rates of change. …


Image for post
Image for post

Contents

  1. Geometric Intuition for Linear Regression

2. Linear Regression using Loss-Minimization

3. Assumptions of Linear Regression

4. Implementation of the Linear Regression using Python

What is Regression?

Regression analysis is a form of predictive modeling technique that investigates the relationship between a dependent and independent variable.

Geometric Intuition for Linear Regression

Linear regression is perhaps one of the most well known and well-understood algorithms in statistics and machine learning. …


Image for post
Image for post

Naive Bayes is a statistical classification technique based on Bayes Theorem. It is one of the simplest supervised learning algorithms. Naive Bayes classifier is a fast, accurate, and reliable algorithm. Naive Bayes classifiers have high accuracy and speed on large datasets.

To understand the Naive Bayes algorithm first we want to know some basic concepts of probability.

Contents

  1. Probability

2. Conditional Probability

3. Independent Events

4. Mutually Exclusive Events

5. Bayes Theorem

6. Naive Bayes Algorithm

7. Toy Example using Naive Bayes

8. Naive Bayes Algorithm on Text data

9. Laplace (or) Additive Smoothing

10. Log-Probabilities and Numerical Stability

11. Bias-variance Trade…


Image for post
Image for post

First We want to know What is Amazon Fine Food Review Analysis?

This dataset consists of reviews of fine foods from amazon. The data span a period of more than 10 years, including all ~500,000 reviews up to October 2012. Reviews include product and user information, ratings, and a plaintext review. We also have reviews from all other Amazon categories.

Amazon reviews are often the most publicly visible reviews of consumer products. …

About

Sachin D N

Trained on Data Science and Machine Learning at @6benches

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store