This is a continuation of part 1.
In this post, I will discuss the following topics (in bold). If you are new to Naive Bayes or want a quick revision, please check my notes to get started.
1. Why do we assume in Naive Bayes that the features are conditionally independent?
2. Sklearn has GaussianNB, MultinomialNB, CategoricalNB, and BernoulliNB → Given data with categorical, numerical, and binary features, which model would you choose?
3. How do you implement Multinomial Naive Bayes from scratch for text data and match the results with Sklearn MultinomialNB?
4. How do you implement Categorical Naive Bayes from scratch…
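To make question 3 concrete, here is a minimal sketch of Multinomial Naive Bayes written from scratch in plain Python. It is not the implementation from the post itself: the toy corpus, the function names `train_multinomial_nb` and `predict`, and the choice of `alpha=1.0` (Laplace smoothing) are assumptions made for illustration. It relies on the conditional independence assumption from question 1, so the score of a document is the class log-prior plus the sum of per-word log-likelihoods.

```python
import math
from collections import Counter, defaultdict


def train_multinomial_nb(docs, labels, alpha=1.0):
    """Estimate class log-priors and smoothed per-word log-likelihoods."""
    # Vocabulary is every whitespace-separated token seen in training.
    vocab = sorted({word for doc in docs for word in doc.split()})
    class_doc_counts = Counter(labels)
    word_counts = defaultdict(Counter)
    for doc, label in zip(docs, labels):
        word_counts[label].update(doc.split())

    log_prior = {
        c: math.log(n / len(docs)) for c, n in class_doc_counts.items()
    }
    log_likelihood = {}
    for c in class_doc_counts:
        # Additive smoothing: add alpha to every word count in the class.
        total = sum(word_counts[c].values()) + alpha * len(vocab)
        log_likelihood[c] = {
            w: math.log((word_counts[c][w] + alpha) / total) for w in vocab
        }
    return log_prior, log_likelihood


def predict(doc, log_prior, log_likelihood):
    """Return the class with the highest log-posterior score."""
    scores = {}
    for c in log_prior:
        # Words never seen in training are skipped (an assumption here).
        scores[c] = log_prior[c] + sum(
            log_likelihood[c][w] for w in doc.split() if w in log_likelihood[c]
        )
    return max(scores, key=scores.get)


# Hypothetical toy data, only to exercise the functions above.
docs = ["free prize money", "free money now", "meeting at noon", "lunch meeting today"]
labels = ["spam", "spam", "ham", "ham"]
log_prior, log_likelihood = train_multinomial_nb(docs, labels)
prediction = predict("free money today", log_prior, log_likelihood)
print(prediction)
```

Working in log space avoids numerical underflow when a document has many words, since the product of many small probabilities becomes a sum of logs.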
Just as Naive Bayes makes the naive assumption that the features we give to the model are independent, I have also made several naive assumptions about how NB works.
A simple NLP dataset to quickly get hands-on with text cleaning, pre-processing, and training.
If you are a beginner in Machine Learning and want to work on a simple NLP dataset, I definitely recommend checking out this Kaggle challenge and trying to solve it on your own before reading further.
In this challenge, I got hands-on experience with cleaning HTML tags, analyzing histograms, building custom W2V and TFIDF-W2V vectorizers, dealing with ordinal target labels, generating n-grams, using SVD for dimensionality reduction, and deploying to a cloud server.
A basic understanding of LSTM and TensorFlow will suffice to go till the end of this post.
Encoder-decoder models are used in machine translation and conversational chatbots. They are also used in image captioning (given an image, briefly describe it), video captioning (given a video file, briefly describe the video), and LaTeX generation for mathematical expressions (given an image of a formula, generate the expression in LaTeX format).
In this post, we will learn the basics of sequence-to-sequence models and build a simple calculator application.
Sequence-to-sequence models are widely used in machine translation and image captioning. In our calculator sequence-to-sequence model, given an input sequence (‘10+21’), we will try to predict the output sequence (‘31’). We will limit our scope to three-digit addition, multiplication, subtraction, and division.
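As a rough illustration of what such training data could look like, the sketch below generates random input/output string pairs such as ('10+21', '31'). The names `make_pair` and `make_dataset` are hypothetical, and two choices are assumptions of this sketch rather than details from the post: operands are drawn from 0 to 999, and division results are rounded to two decimal places with a zero divisor excluded.

```python
import random

# Map each operator symbol to the function that computes its result.
OPS = {
    "+": lambda a, b: a + b,
    "-": lambda a, b: a - b,
    "*": lambda a, b: a * b,
    "/": lambda a, b: round(a / b, 2),  # assumption: keep two decimals
}


def make_pair(rng):
    """Return one (input_sequence, output_sequence) pair of strings."""
    a = rng.randint(0, 999)
    op = rng.choice(sorted(OPS))
    # Avoid division by zero by drawing the divisor from 1..999.
    low = 1 if op == "/" else 0
    b = rng.randint(low, 999)
    return f"{a}{op}{b}", str(OPS[op](a, b))


def make_dataset(n, seed=0):
    """Generate n pairs reproducibly from a seeded random generator."""
    rng = random.Random(seed)
    return [make_pair(rng) for _ in range(n)]


pairs = make_dataset(5)
for source, target in pairs:
    print(source, "->", target)
```

Both sides are kept as strings because a sequence-to-sequence model reads and emits the expression one character at a time.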
At the end of this section we will try to generate input output pairs as shown…
In middleware companies and startups, people expect data scientists to do everything from data pre-processing to putting models into production. So if you are a beginner-level ML engineer who wants to build a product, or someone interested in deploying a model, you are in the right place.
Before going further, note that I will be using Amazon AWS free EC2 services to launch an instance. You need a valid debit or credit card to create an AWS account.
Let’s get started!
1. Writing inference code for our model.
2. Creating a simple web API using Flask.
3. Containerizing the application with…
Trying to make machines smarter.