If you’re selling, promoting, and engaging customers to buy new services, you’ve certainly come across the concept of lead scoring. This term is a source of interest for marketing agencies that, engaging possible information about the client, look for those who will be interested in a specific product or service.
MLJAR's Blog
-
Lead Scoring
March 05, 2021 by Aleksandra Płońska Lead scoring
-
How does AutoML work?
March 04, 2021 by Piotr Płoński Automl
The AutoML stands for Automated Machine Learning. It builds a Machine Learning pipeline in an automated way. But how exactly it works? What is behind the scene? There are many proprietary AutoML systems, and we probably never get to know how they work. Luckily, the MLJAR AutoML is open-source. Its code is available at GitHub. In this article, we will look inside MLJAR AutoML to show how it works.
-
AutoML in the Notebook
March 04, 2021 by Piotr Płoński Automl Notebook
Python Notebooks provide interactive computing environment that is perfect for experimenting with data. The Notebooks are widely used by Data Scientists in data analysis and discovery tasks. Currently, there are many versions of Notebooks. The first, and the most used version is Jupyter Notebook. There are also many cloud-based Notebooks, like Kaggle Notebooks or CoCalc Notebooks.
-
Extract Rules from Decision Tree in 3 Ways with Scikit-Learn and Python
February 25, 2021 by Piotr Płoński Decision tree Scikit learn
The rules extraction from the Decision Tree can help with better understanding how samples propagate through the tree during the prediction. It can be needed if we want to implement a Decision Tree without Scikit-learn or different than Python language. Decision Trees are easy to move to any programming language because there are set of
if-else
statements. I’ve seen many examples of moving scikit-learn Decision Trees into C, C++, Java, or even SQL. -
Tensorflow vs Scikit-learn
October 01, 2020 by Piotr Płoński Tensorflow Scikitlearn Neuralnetwork
Have you ever wonder what is the difference between Tensorflow and Sckit-learn? Which one is better? Have you ever needed Tensorflow when you already use Scikit-learn?
-
PostgreSQL and Machine Learning
September 16, 2020 by Piotr Płoński Postgresql Automl Supervised
-
AutoML as easy as MLJar
September 12, 2020 by Jeff King Automl Supervised
If there has been an open-source library that has made me an avid machine learning practitioner and won the battle of the AutoMLs hands down it has to be MlJar. I simply can’t stop eulogizing this library because it has helped overcome my deficiency in the field of coding and programming but at the same time automating the predictive modeling flow with very little user involvement. I have taken it for a spin in a few Hackathons and am not overtly surprised to find it amongst the top performers. It saves a lot of time as you do not need Data Preprocessing and feature Engineering before feeding the dataset to the model.
-
Xgboost Feature Importance Computed in 3 Ways with Python
August 17, 2020 by Piotr Płoński Xgboost
Xgboost is a gradient boosting library. It provides parallel boosting trees algorithm that can solve Machine Learning tasks. It is available in many languages, like: C++, Java, Python, R, Julia, Scala. In this post, I will show you how to get feature importance from Xgboost model in Python. In this example, I will use
boston
dataset availabe inscikit-learn
pacakge (a regression task). -
How many trees in the Random Forest?
June 30, 2020 by Piotr Płoński Random forest
I have trained
3,600
Random Forest Classifiers (each with1,000
trees) on72
data sets (from OpenML-CC18 benchmark) to check how many trees should be used in the Random Forest. What I’ve found: -
How to visualize a single Decision Tree from the Random Forest in Scikit-Learn (Python)?
June 29, 2020 by Piotr Płoński Random forest
The Random Forest is an esemble of Decision Trees. A single Decision Tree can be easily visualized in several different ways. In this post I will show you, how to visualize a Decision Tree from the Random Forest.