THREE DATA SCIENCE PROJECTS FOR RFM ANALYSIS, K-MEANS CLUSTERING, AND MACHINE LEARNING BASED PREDICTION WITH PYTHON GUI

THREE DATA SCIENCE PROJECTS FOR RFM ANALYSIS, K-MEANS CLUSTERING, AND MACHINE LEARNING BASED PREDICTION WITH PYTHON GUI
Author :
Publisher : BALIGE PUBLISHING
Total Pages : 627
Release :
ISBN-10 :
ISBN-13 :
Rating : 4/5 ( Downloads)

Book Synopsis THREE DATA SCIENCE PROJECTS FOR RFM ANALYSIS, K-MEANS CLUSTERING, AND MACHINE LEARNING BASED PREDICTION WITH PYTHON GUI by : Vivian Siahaan

Download or read book THREE DATA SCIENCE PROJECTS FOR RFM ANALYSIS, K-MEANS CLUSTERING, AND MACHINE LEARNING BASED PREDICTION WITH PYTHON GUI written by Vivian Siahaan and published by BALIGE PUBLISHING. This book was released on 2022-05-11 with total page 627 pages. Available in PDF, EPUB and Kindle. Book excerpt: PROJECT 1: RFM ANALYSIS AND K-MEANS CLUSTERING: A CASE STUDY ANALYSIS, CLUSTERING, AND PREDICTION ON RETAIL STORE TRANSACTIONS WITH PYTHON GUI The dataset used in this project is the detailed data on sales of consumer goods obtained by ‘scanning’ the bar codes for individual products at electronic points of sale in a retail store. The dataset provides detailed information about quantities, characteristics and values of goods sold as well as their prices. The anonymized dataset includes 64.682 transactions of 5.242 SKU's sold to 22.625 customers during one year. Dataset Attributes are as follows: Date of Sales Transaction, Customer ID, Transaction ID, SKU Category ID, SKU ID, Quantity Sold, and Sales Amount (Unit price times quantity. For unit price, please divide Sales Amount by Quantity). This dataset can be analyzed with RFM analysis and can be clustered using K-Means algorithm. The machine learning models used in this project to predict clusters as target variable are K-Nearest Neighbor, Random Forest, Naive Bayes, Logistic Regression, Decision Tree, Support Vector Machine, LGBM, Gradient Boosting, XGB, and MLP. Finally, you will plot boundary decision, distribution of features, feature importance, cross validation score, and predicted values versus true values, confusion matrix, learning curve, performance of the model, scalability of the model, training loss, and training accuracy. PROJECT 2: DATA SCIENCE FOR GROCERIES MARKET ANALYSIS, CLUSTERING, AND PREDICTION WITH PYTHON GUI RFM analysis used in this project can be used as a marketing technique used to quantitatively rank and group customers based on the recency, frequency and monetary total of their recent transactions to identify the best customers and perform targeted marketing campaigns. The idea is to segment customers based on when their last purchase was, how often they've purchased in the past, and how much they've spent overall. Clustering, in this case K-Means algorithm, used in this project can be used to place similar customers into mutually exclusive groups; these groups are known as “segments” while the act of grouping is known as segmentation. Segmentation allows businesses to identify the different types and preferences of customers/markets they serve. This is crucial information to have to develop highly effective marketing, product, and business strategies. The dataset in this project has 38765 rows of the purchase orders of people from the grocery stores. These orders can be analyzed with RFM analysis and can be clustered using K-Means algorithm. The machine learning models used in this project to predict clusters as target variable are K-Nearest Neighbor, Random Forest, Naive Bayes, Logistic Regression, Decision Tree, Support Vector Machine, LGBM, Gradient Boosting, XGB, and MLP. Finally, you will plot boundary decision, distribution of features, feature importance, cross validation score, and predicted values versus true values, confusion matrix, learning curve, performance of the model, scalability of the model, training loss, and training accuracy. PROJECT 3: ONLINE RETAIL CLUSTERING AND PREDICTION USING MACHINE LEARNING WITH PYTHON GUI The dataset used in this project is a transnational dataset which contains all the transactions occurring between 01/12/2010 and 09/12/2011 for a UK-based and registered non-store online retail. The company mainly sells unique all-occasion gifts. Many customers of the company are wholesalers. You will be using the online retail transnational dataset to build a RFM clustering and choose the best set of customers which the company should target. In this project, you will perform Cohort analysis and RFM analysis. You will also perform clustering using K-Means to get 5 clusters. The machine learning models used in this project to predict clusters as target variable are K-Nearest Neighbor, Random Forest, Naive Bayes, Logistic Regression, Decision Tree, Support Vector Machine, LGBM, Gradient Boosting, XGB, and MLP. Finally, you will plot boundary decision, distribution of features, feature importance, cross validation score, and predicted values versus true values, confusion matrix, learning curve, performance of the model, scalability of the model, training loss, and training accuracy.


THREE DATA SCIENCE PROJECTS FOR RFM ANALYSIS, K-MEANS CLUSTERING, AND MACHINE LEARNING BASED PREDICTION WITH PYTHON GUI Related Books

THREE DATA SCIENCE PROJECTS FOR RFM ANALYSIS, K-MEANS CLUSTERING, AND MACHINE LEARNING BASED PREDICTION WITH PYTHON GUI
Language: en
Pages: 627
Authors: Vivian Siahaan
Categories: Computers
Type: BOOK - Published: 2022-05-11 - Publisher: BALIGE PUBLISHING

GET EBOOK

PROJECT 1: RFM ANALYSIS AND K-MEANS CLUSTERING: A CASE STUDY ANALYSIS, CLUSTERING, AND PREDICTION ON RETAIL STORE TRANSACTIONS WITH PYTHON GUI The dataset used
DATA SCIENCE FOR GROCERIES MARKET ANALYSIS, CLUSTERING, AND PREDICTION WITH PYTHON GUI
Language: en
Pages: 335
Authors: Vivian Siahaan
Categories: Computers
Type: BOOK - Published: 2022-05-03 - Publisher: BALIGE PUBLISHING

GET EBOOK

The objective of this data science project is to analyze and predict customer behavior in the groceries market using Python and create a graphical user interfac
Python Machine Learning
Language: en
Pages: 455
Authors: Sebastian Raschka
Categories: Computers
Type: BOOK - Published: 2015-09-23 - Publisher: Packt Publishing Ltd

GET EBOOK

Unlock deeper insights into Machine Leaning with this vital guide to cutting-edge predictive analytics About This Book Leverage Python's most powerful open-sour
Business and Consumer Analytics: New Ideas
Language: en
Pages: 1000
Authors: Pablo Moscato
Categories: Computers
Type: BOOK - Published: 2019-05-30 - Publisher: Springer

GET EBOOK

This two-volume handbook presents a collection of novel methodologies with applications and illustrative examples in the areas of data-driven computational soci
Big Data Analytics with Java
Language: en
Pages: 419
Authors: Rajat Mehta
Categories: Computers
Type: BOOK - Published: 2017-07-31 - Publisher: Packt Publishing Ltd

GET EBOOK

Learn the basics of analytics on big data using Java, machine learning and other big data tools About This Book Acquire real-world set of tools for building ent