Machine Learning with LightGBM and Python: A practitioner's guide to developing production-ready machine learning systems

Ebook, 512 pages, 3 hours

Author: Andrich van Wyk
ISBN: 9781800563056

About this ebook

Machine Learning with LightGBM and Python is a comprehensive guide that starts with the basics of machine learning and progresses to building scalable machine learning systems that are ready for release.
This book will get you acquainted with the high-performance gradient-boosting framework LightGBM and show you how it can be used to solve various machine learning problems and produce highly accurate, robust, and predictive solutions. Starting with simple machine learning models in scikit-learn, you'll explore the intricacies of gradient boosting machines and LightGBM. You'll be guided through various case studies to better understand the data science process and learn how to apply your skills to real-world problems. As you progress, you'll elevate your software engineering skills by learning how to build and integrate scalable machine learning pipelines that process data, train models, and deploy them to serve secure APIs using Python tools such as FastAPI.
By the end of this book, you'll be well equipped to use various state-of-the-art tools that will help you build production-ready systems, including FLAML for AutoML, PostgresML for operating ML pipelines using Postgres, high-performance distributed training and serving via Dask, and creating and running models in the cloud with AWS SageMaker.

Language: English

Publisher: Packt Publishing

Release date: Sep 29, 2023






Machine Learning with LightGBM and Python - Andrich van Wyk

Machine Learning with LightGBM and Python

All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing or its dealers and distributors, will be held liable for any damages caused or alleged to have been caused directly or indirectly by this book.

Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

Group Product Manager: Niranjan Naikwadi

Publishing Product Manager: Tejashwini R

Senior Editor: Gowri Rekha

Content Development Editor: Manikandan Kurup

Technical Editor: Kavyashree K S

Copy Editor: Safis Editing

Project Coordinator: Farheen Fathima

Proofreader: Safis Editing

Indexer: Subalakshmi Govindhan

Production Designer: Shyam Sundar Korumilli

Marketing Coordinator: Vinishka Kalra

First published: September 2023

Production reference: 1220923

Published by Packt Publishing Ltd.

Grosvenor House

11 St Paul’s Square

Birmingham

B3 1RB, UK.

ISBN: 978-1-80056-474-9

www.packtpub.com

Countless nights and weekends have been dedicated to completing this book, and I would like to thank my wife, Irene, for her eternal support, without which, nobody would be reading any of this. Further, I’m grateful to my daughter, Emily, for inspiring me to reach a little further.

– Andrich van Wyk

Contributors

About the author

Andrich van Wyk has 15 years of experience in machine learning R&D, building AI-driven solutions, and consulting in the AI domain. He also has broad experience as a software engineer and architect with over a decade of industry experience working on enterprise systems.

He graduated cum laude with an M.Sc. in Computer Science from the University of Pretoria, focusing on neural networks and evolutionary algorithms.

Andrich enjoys writing about machine learning engineering and the software industry at large. He currently resides in South Africa with his wife and daughter.

About the reviewers

Valentine Shkulov is a renowned visiting lecturer at a top tech university, where he seamlessly melds academia with real-world expertise as a distinguished Data Scientist in Fintech and E-commerce. His ingenuity in crafting ML-driven solutions has transformed businesses, from tech giants to budding startups. Valentine excels at introducing AI innovations and refining current systems, ensuring they profoundly influence vital business metrics. His passion for navigating product challenges has established him as a pioneer in leveraging ML to elevate businesses.

Above all, a heartfelt thanks to my spouse, the unwavering pillar of support in my remarkable journey.

Kayewan M Karanjia has over 7 years of experience in machine learning, artificial intelligence (AI), and data technologies, and brings a wealth of expertise to his current role at DrDoctor. Here, as a machine learning engineer, he is dedicated to implementing advanced machine learning models that have a direct impact on enhancing healthcare services and process optimization for the NHS. In the past, he has also worked with multiple MNCs such as Reliance Industries Limited, and implemented solutions for the government of India.

Table of Contents

Preface

Part 1: Gradient Boosting and LightGBM Fundamentals

Introducing Machine Learning

Technical requirements

What is machine learning?

Machine learning paradigms

Introducing models, datasets, and supervised learning

Models

Hyperparameters

Datasets

Overfitting and generalization

Supervised learning

Model performance metrics

A modeling example

Decision tree learning

Entropy and information gain

Building a decision tree using C4.5

Overfitting in decision trees

Building decision trees with scikit-learn

Decision tree hyperparameters

Summary

References

Ensemble Learning – Bagging and Boosting

Technical requirements

Ensemble learning

Bagging and random forests

Random forest

Gradient-boosted decision trees

Gradient descent

Gradient boosting

Gradient-boosted decision tree hyperparameters

Gradient boosting in scikit-learn

Advanced boosting algorithm – DART

Summary

References

An Overview of LightGBM in Python

Technical requirements

Introducing LightGBM

LightGBM optimizations

Hyperparameters

Limitations of LightGBM

Getting started with LightGBM in Python

LightGBM Python API

LightGBM scikit-learn API

Building LightGBM models

Cross-validation

Parameter optimization

Predicting student academic success

Summary

References

Comparing LightGBM, XGBoost, and Deep Learning

Technical requirements

An overview of XGBoost

Comparing XGBoost and LightGBM

Python XGBoost example

Deep learning and TabTransformers

What is deep learning?

Introducing TabTransformers

Comparing LightGBM, XGBoost, and TabTransformers

Predicting census income

Detecting credit card fraud

Summary

References

Part 2: Practical Machine Learning with LightGBM

LightGBM Parameter Optimization with Optuna

Technical requirements

Optuna and optimization algorithms

Introducing Optuna

Optimization algorithms

Pruning strategies

Optimizing LightGBM with Optuna

Advanced Optuna features

Summary

References

Solving Real-World Data Science Problems with LightGBM

Technical requirements

The data science life cycle

Defining the data science life cycle

Predicting wind turbine power generation with LightGBM

Problem definition

Data collection

Data preparation

EDA

Modeling

Model deployment

Communicating results

Classifying individual credit scores with LightGBM

Problem definition

Data collection

Data preparation

EDA

Modeling

Model deployment and results

Summary

References

AutoML with LightGBM and FLAML

Technical requirements

Automated machine learning

Automating feature engineering

Automating model selection and tuning

Risks of using AutoML systems

Introducing FLAML

Cost Frugal Optimization

BlendSearch

FLAML limitations

Case study – using FLAML with LightGBM

Feature engineering

FLAML AutoML

Zero-shot AutoML

Summary

References

Part 3: Production-ready Machine Learning with LightGBM

Machine Learning Pipelines and MLOps with LightGBM

Technical requirements

Introducing machine learning pipelines

Scikit-learn pipelines

Understanding MLOps

Deploying an ML pipeline for customer churn

Building an ML pipeline using scikit-learn

Building an ML API using FastAPI

Containerizing our API

Deploying LightGBM to Google Cloud

Summary

LightGBM MLOps with AWS SageMaker

Technical requirements

An introduction to AWS and SageMaker

AWS

SageMaker

SageMaker Clarify

Building a LightGBM ML pipeline with Amazon SageMaker

Setting up a SageMaker session

Preprocessing step

Model training and tuning

Evaluation, bias, and explainability

Deploying and monitoring the LightGBM model

Results

Summary

References

LightGBM Models with PostgresML

Technical requirements

Introducing PostgresML

Latency and round trips

Getting started with PostgresML

Training models

Deploying and prediction

PostgresML dashboard

Case study – customer churn with PostgresML

Data loading and preprocessing

Training and hyperparameter optimization

Predictions

Summary

References

Distributed and GPU-Based Learning with LightGBM

Technical requirements

Distributed learning with LightGBM and Dask

GPU training for LightGBM

Setting up LightGBM for the GPU

Running LightGBM on the GPU

Summary

References

Index

Other Books You May Enjoy

Preface

Welcome to Machine Learning with LightGBM and Python: A Practitioner’s Guide to Developing Production-Ready Machine Learning Systems. In this book, you’ll embark on a rich journey, taking you from the foundational principles of machine learning to the advanced realms of MLOps. The cornerstone of our exploration is LightGBM, a powerful and flexible gradient-boosting framework that can be harnessed for a wide range of machine-learning challenges.

This book is tailor-made for anyone passionate about transforming raw data into actionable insights using machine learning (ML). Whether you’re an ML novice eager to get your hands dirty or an experienced data scientist seeking to master the intricacies of LightGBM, there’s something in here for you.

The digital era has equipped us with a treasure trove of data. However, the challenges often lie in extracting meaningful insights from this data and deploying scalable, efficient, and reliable models in production environments. This book will guide you in overcoming these challenges. By diving into gradient boosting, the data science life cycle, and the nuances of production deployment, you will gain a comprehensive skill set to navigate the ever-evolving landscape of ML.

Each chapter is designed with practicality in mind. Real-world case studies interspersed with theoretical insights ensure your learning is grounded in tangible applications. Our focus on LightGBM, which sometimes gets overshadowed by more mainstream algorithms, provides a unique lens to appreciate and apply gradient boosting in various scenarios.

For those curious about what sets this book apart, it’s our pragmatic approach. We go beyond merely explaining algorithms or tools and prioritize hands-on applications, case studies, and real-world challenges, ensuring you’re not just reading about ML but also doing it.

As we progress through the chapters, remember that the world of ML is vast and constantly evolving. This book, while comprehensive, is a stepping stone in your lifelong journey of learning and exploration in the domain. As you navigate the world of LightGBM, data science, MLOps, and more, keep your mind open, your curiosity alive, and your hands ready to code.

Who this book is for

Machine Learning with LightGBM and Python: A Practitioner’s Guide to Developing Production-Ready Machine Learning Systems is tailored for a broad spectrum of readers passionate about harnessing data’s power through ML. The target audience for this book includes the following:

Beginners in ML: Individuals just stepping into the world of ML will find this book immensely beneficial. It starts with foundational ML principles and introduces them to gradient boosting using LightGBM, making it an excellent entry point for newcomers.

Experienced data scientists and ML practitioners: For those who are already familiar with the landscape of ML but want to deepen their knowledge of LightGBM and/or MLOps, this book offers advanced insights, techniques, and practical applications.

Software engineers and architects looking to learn more about data science: Software professionals keen on transitioning to data science or integrating ML into their applications will find this book valuable. The book approaches ML theoretically and practically, emphasizing hands-on coding and real-world applications.

MLOps engineers and DevOps professionals: Individuals working in the field of MLOps or those who wish to understand the deployment, scaling, and monitoring of ML models in production environments will benefit from the chapters dedicated to MLOps, pipelines, and deployment strategies.

Academicians and students: Faculty members teaching ML, data science, or related courses, as well as students pursuing these fields, will find this book to be both an informative textbook and a practical guide.

Knowledge of Python programming is necessary. Familiarity with Jupyter notebooks and Python environments is a bonus. No prior knowledge of ML is required.

In essence, anyone with a penchant for data, a background in Python programming, and an eagerness to explore the multifaceted world of ML using LightGBM will find this book a valuable addition to their repertoire.

What this book covers

Chapter 1, Introducing Machine Learning, starts our journey into ML, viewing it through the lens of software engineering. We will elucidate vital concepts central to the field, such as models, datasets, and the various learning paradigms, ensuring clarity with a hands-on example using decision trees.

Chapter 2, Ensemble Learning – Bagging and Boosting, delves into ensemble learning, focusing on bagging and boosting techniques applied to decision trees. We will explore algorithms such as random forests, gradient-boosted decision trees, and more advanced concepts such as Dropout meets Additive Regression Trees (DART).

Chapter 3, An Overview of LightGBM in Python, examines LightGBM, an advanced gradient-boosting framework with tree-based learners. Highlighting its unique innovations and enhancements to ensemble learning, we will guide you through its Python APIs. A comprehensive modeling example using LightGBM, enriched with advanced validation and optimization techniques, sets the stage for a deeper dive into data science and production ML systems.

Chapter 4, Comparing LightGBM, XGBoost, and Deep Learning, pits LightGBM against two prominent tabular data modeling methods – XGBoost and deep neural networks (DNNs), specifically TabTransformer. We will assess each method’s complexity, performance, and computational cost through evaluations of two datasets. The essence of this chapter is ascertaining LightGBM’s competitiveness in the broader ML landscape, rather than an in-depth study of XGBoost or DNNs.

Chapter 5, LightGBM Parameter Optimization with Optuna, focuses on the pivotal task of hyperparameter optimization, introducing the Optuna framework as a potent solution. Covering various optimization algorithms and strategies to prune the hyperparameter space, this chapter guides you through a hands-on example of refining LightGBM parameters using Optuna.

Chapter 6, Solving Real-World Data Science Problems with LightGBM, methodically breaks down the data science process, applying it to two distinct case studies – a regression and a classification problem. The chapter illuminates each step of the data science life cycle. You will experience hands-on modeling with LightGBM, paired with comprehensive theory. This chapter also serves as a blueprint for data science projects using LightGBM.

Chapter 7, AutoML with LightGBM and FLAML, delves into automated machine learning (AutoML), emphasizing its significance in simplifying and expediting data engineering and model development. We will introduce FLAML, a notable library that automates model selection and fine-tuning with efficient hyperparameter algorithms. Through a practical case study, you will witness FLAML’s synergy with LightGBM and the transformative Zero-Shot AutoML functionality, which renders the tuning process obsolete.

Chapter 8, Machine Learning Pipelines and MLOps with LightGBM, moves on from modeling intricacies to the world of production ML. It introduces you to ML pipelines, ensuring consistent data processing and model building, and ventures into MLOps, a fusion of DevOps and ML, which is vital to deploying resilient ML systems.

Chapter 9, LightGBM MLOps with AWS SageMaker, steers our journey toward Amazon SageMaker, Amazon Web Services’ comprehensive suite to craft and maintain ML solutions. We will deepen our understanding of ML pipelines by delving into advanced areas such as bias detection, explainability in models, and the nuances of automated, scalable deployments.

Chapter 10, LightGBM Models with PostgresML, introduces PostgresML, a distinct MLOps platform and a PostgreSQL database extension that facilitates ML model development and deployment directly via SQL. This approach, while contrasting with the scikit-learn programming style that we’ve embraced, showcases the benefits of database-level ML, particularly regarding data movement efficiencies and faster inferencing.

Chapter 11, Distributed and GPU-Based Learning with LightGBM, delves into the expansive realm of training LightGBM models, leveraging distributed computing clusters and GPUs. By harnessing distributed computing, you will understand how to substantially accelerate training workloads and manage datasets that exceed a single machine’s memory capacity.
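To make the central idea summarized for Chapter 2 more concrete, the following is a minimal sketch of gradient boosting for regression. It is not LightGBM's algorithm and not code from the book: it assumes a single numeric feature, squared-error loss, and one-split decision stumps as the weak learners. The function names (fit_stump, predict_stump, fit_gradient_boosting, predict) and the synthetic sine data are hypothetical, chosen only for illustration.

```python
import numpy as np


def fit_stump(x, residuals):
    """Return (threshold, left_value, right_value) minimizing squared error."""
    best = None
    # Candidate thresholds: every distinct value except the largest,
    # so that both sides of the split are non-empty.
    for threshold in np.unique(x)[:-1]:
        mask = x <= threshold
        left, right = residuals[mask], residuals[~mask]
        sse = ((left - left.mean()) ** 2).sum() + ((right - right.mean()) ** 2).sum()
        if best is None or sse < best[0]:
            best = (sse, threshold, left.mean(), right.mean())
    if best is None:
        raise ValueError("x needs at least two distinct values")
    return best[1:]


def predict_stump(stump, x):
    """Predict the left or right leaf value depending on the threshold."""
    threshold, left_value, right_value = stump
    return np.where(x <= threshold, left_value, right_value)


def fit_gradient_boosting(x, y, n_rounds=50, learning_rate=0.1):
    """Fit an additive ensemble of stumps, each trained on current residuals."""
    base = float(y.mean())  # initial constant prediction
    prediction = np.full_like(y, base, dtype=float)
    stumps = []
    for _ in range(n_rounds):
        # For squared-error loss, the negative gradient is the residual.
        residuals = y - prediction
        stump = fit_stump(x, residuals)
        prediction += learning_rate * predict_stump(stump, x)
        stumps.append(stump)
    return base, learning_rate, stumps


def predict(model, x):
    """Sum the base value and the scaled contribution of every stump."""
    base, learning_rate, stumps = model
    prediction = np.full(x.shape, base, dtype=float)
    for stump in stumps:
        prediction += learning_rate * predict_stump(stump, x)
    return prediction


# Synthetic data (an assumption for illustration): a noisy sine wave.
rng = np.random.default_rng(0)
x = rng.uniform(0.0, 6.0, size=200)
y = np.sin(x) + rng.normal(0.0, 0.1, size=200)

model = fit_gradient_boosting(x, y, n_rounds=50, learning_rate=0.1)
mse_baseline = float(np.mean((y - y.mean()) ** 2))
mse_boosted = float(np.mean((y - predict(model, x)) ** 2))
print(f"baseline MSE: {mse_baseline:.4f}, boosted MSE: {mse_boosted:.4f}")
```

Each round fits a new stump to what the ensemble still gets wrong and adds a damped version of it, so the training error shrinks step by step. Production frameworks such as LightGBM build on the same principle with full trees, many features, and considerable engineering for speed.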

To get the most out of this book

This book is written assuming that you have some knowledge of Python programming. None of the Python code is very complex, so even understanding the basics of Python should be enough to get you through most of the code examples.

Jupyter notebooks are used for the practical examples in all the chapters. Jupyter Notebook is an open source tool that allows you to create code notebooks that contain live code, visualizations, and markdown text. Tutorials to get started with Jupyter Notebook are available at https://realpython.com/jupyter-notebook-introduction/ and at https://plotly.com/python/ipython-notebook-tutorial/.

We recommend using Anaconda for Python environment management when setting up your own environment. Anaconda also bundles many data science packages, so you don’t have to install them individually. Anaconda can be downloaded from https://www.anaconda.com/download. Notably, the book is accompanied by a GitHub repository, which includes an Anaconda environment file, to create the environment required to run the code examples in this book.

If you are using the digital version of this book, we advise you to type the code yourself or access the code from the book’s GitHub repository (a link is available in the next section). Doing so will help you avoid any potential errors related to the copying and pasting of code.

Download the example code files

You can download the example code files for this book from GitHub at https://github.com/PacktPublishing/Practical-Machine-Learning-with-LightGBM-and-Python. If there’s an update to the code, it will be updated in the GitHub repository.

We also have other code bundles from our rich catalog of books and videos available at https://github.com/PacktPublishing/. Check them out!

Conventions used

There are several text conventions used throughout this book.

Code in text: Indicates code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles. Here is an example: The code is almost identical to our classification example – instead of a classifier, we use DecisionTreeRegressor as our model and calculate mean_absolute_error instead of the F1 score.

A block of code is set as follows:

import numpy as np
import pandas as pd
from matplotlib import pyplot as plt
import seaborn as sns
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error

When we wish to draw your attention to a particular part of a code block, the relevant lines or items are set in bold:

model = DecisionTreeRegressor(random_state=157, max_depth=3, min_samples_split=2)
model = model.fit(X_train, y_train)
mean_absolute_error(y_test, model.predict(X_test))

Any command-line input or output is written as follows:

conda create -n your_env_name python=3.9

Bold: Indicates a new term, an important word, or words you see on screen. For instance, words in menus or dialog boxes appear in bold. Here is an example: Therefore, data preparation and cleaning are essential parts of the machine-learning process.

Tips or important notes

Appear in blocks such as these.
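The DecisionTreeRegressor snippet shown in the conventions above is a fragment: it relies on imports and on X_train, y_train, X_test, and y_test being defined elsewhere. A self-contained sketch of how it might be run is given below. The synthetic dataset from make_regression and the 80/20 split are assumptions made purely for illustration; the book works with its own datasets.

```python
from sklearn.datasets import make_regression
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeRegressor

# Synthetic stand-in data (an assumption; not the data used in the book).
X, y = make_regression(n_samples=500, n_features=5, noise=10.0, random_state=157)

# Hold out 20% of the rows for evaluation.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=157
)

# Same model configuration as the fragment in the conventions section.
model = DecisionTreeRegressor(random_state=157, max_depth=3, min_samples_split=2)
model = model.fit(X_train, y_train)

# Mean absolute error on the held-out rows.
mae = mean_absolute_error(y_test, model.predict(X_test))
print(f"MAE on the held-out set: {mae:.2f}")
```

Fixing random_state in the data generator, the split, and the model keeps the run repeatable, which makes it easier to compare the effect of changing max_depth or min_samples_split.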

Get in touch

Feedback from our readers is always welcome.

General feedback: If you have questions about any aspect of this book, email us at [email protected] and mention the book title in the subject of your message.

Errata: Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you have found a mistake in this book, we would be grateful if you would report this to us. Please visit www.packtpub.com/support/errata and fill in the form.

Piracy: If you come across any illegal copies of our works in any form on the internet, we would be grateful if you would provide us with the location address or website name. Please get in touch with us at [email protected] with a link to the material.

If you are interested in becoming an author: If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, please visit authors.packtpub.com.

Share Your Thoughts

Once you’ve read Machine Learning with LightGBM and Python, we’d love to hear your thoughts! Please click here to go straight to the Amazon review page for this book and share your feedback.

Your review is important to us and the tech community and will help
