If you are a software engineer or a programmer you must have used StackOverflow at least once in your lifetime. But have you ever wondered how StackOverflow predicts the tags for a given question ? In this blog, I will discuss the StackOverflow tag predictor case study.
Stack Overflow is the largest, most trusted online community for developers to learn, share their programming knowledge, and build their careers. …
This dataset consists of reviews of fine foods from amazon. The data span a period of more than 10 years, including all ~500,000 reviews up to October 2012. Reviews include product and user information, ratings, and a plaintext review. We also have reviews from all other Amazon categories.
Amazon reviews are often the most publicly visible reviews of consumer products. As a frequent Amazon user, I was interested in examining the structure of a large database of Amazon reviews and visualizing this information so as to be a smarter consumer and reviewer. …
In this blog, we’ll try to understand one of the most important algorithms in machine learning i.e. Random Forest Algorithm. We will try to look at the things that make Random Forest so special and will try to implement it on a real-world dataset.
Commonly, the individual model suffers from bias or variances and that’s why we need ensemble learning. …
Decision trees are a popular supervised learning method for a variety of reasons. The benefits of decision trees include that they can be used for both regression and classification, they are easy to interpret and they don’t require feature scaling. They have several flaws including being prone to overfitting.
Decision trees which are also modernly known as classification and regression trees (CART) were introduced by Leo Breiman to refer, Decision Tree algorithms. They have a supervised learning algorithm that has a pre-defined target variable & they are mostly used in non-linear decision making with a simple linear decision surface. In other words, they are adaptable for solving any kind of problem at hand (classification or regression). …
SVM is a supervised Machine Learning algorithm that is used in many classifications and regression problems. It still presents as one of the most used robust prediction methods that can be applied to many use cases involving classifications.
Logistic Regression doesn’t care whether the instances are close to the decision boundary. Therefore, the decision boundary it picks may not be optimal. If a point is far from the decision boundary, we may be more confident in our predictions. …
Logistic regression is a classification algorithm used to assign observations to a discrete set of classes. There are lots of classification problems that are available, but the logistics regression is common and is a useful regression method for solving the binary classification problem.
There are lots of classification problems that are available, but the logistics regression is common and is a useful regression method for solving the binary classification problem.
2. Regularization techniques to avoid Overfitting and Underfitting
3. Probabilistic interpretation of Logistic Regression
4. Loss Minimization Interpretation of Logistic Regression
5.Implementation of Logistic Regression Algorithm with Amazon Food…
The solution to an optimization problem can be done by selecting different methods. Moreover, the user can navigate on the surface or curve to establish an initial point and find the optimal or critical point, which can be observed on the plotted function.
1.Single Value Differentiation
2. Minima and Maxima
3. Gradient descent algorithm
4. Steps for Gradient descent algorithm
5. Types of Gradient Descent algorithms
6. Implementation of Stochastic Gradient Descent
For Optimization problems Differentiation is very important, Let’s see some maths,
Differentiation allows us to find rates of change. …
2. Linear Regression using Loss-Minimization
3. Assumptions of Linear Regression
4. Implementation of the Linear Regression using Python
Regression analysis is a form of predictive modeling technique that investigates the relationship between a dependent and independent variable.
Linear regression is perhaps one of the most well known and well-understood algorithms in statistics and machine learning. …
Naive Bayes is a statistical classification technique based on Bayes Theorem. It is one of the simplest supervised learning algorithms. Naive Bayes classifier is a fast, accurate, and reliable algorithm. Naive Bayes classifiers have high accuracy and speed on large datasets.
To understand the Naive Bayes algorithm first we want to know some basic concepts of probability.
2. Conditional Probability
3. Independent Events
4. Mutually Exclusive Events
5. Bayes Theorem
6. Naive Bayes Algorithm
7. Toy Example using Naive Bayes
8. Naive Bayes Algorithm on Text data
9. Laplace (or) Additive Smoothing
10. Log-Probabilities and Numerical Stability
11. Bias-variance Trade…
First We want to know What is Amazon Fine Food Review Analysis?
This dataset consists of reviews of fine foods from amazon. The data span a period of more than 10 years, including all ~500,000 reviews up to October 2012. Reviews include product and user information, ratings, and a plaintext review. We also have reviews from all other Amazon categories.
Amazon reviews are often the most publicly visible reviews of consumer products. …
About