Détail de l'auteur
Auteur Michael WALKER |
Documents disponibles écrits par cet auteur (2)
Ajouter le résultat dans votre panier Faire une suggestion Affiner la recherche
Data Cleaning and Exploration with Machine Learning : Get to grips with machine learning techniques to achieve sparkling-clean data quickly / Michael WALKER / PACKT PUBLISHING (2022)
Titre : Data Cleaning and Exploration with Machine Learning : Get to grips with machine learning techniques to achieve sparkling-clean data quickly Type de document : e-book Auteurs : Michael WALKER Editeur : PACKT PUBLISHING Année de publication : 2022 ISBN/ISSN/EAN : 9781803241678 Note générale : copyrighted Langues : Anglais (eng) Résumé : Explore supercharged machine learning techniques to take care of your data laundry loadsKey FeaturesLearn how to prepare data for machine learning processesUnderstand which algorithms are based on prediction objectives and the properties of the dataExplore how to interpret and evaluate the results from machine learningBook DescriptionMany individuals who know how to run machine learning algorithms do not have a good sense of the statistical assumptions they make and how to match the properties of the data to the algorithm for the best results.As you start with this book, models are carefully chosen to help you grasp the underlying data, including in-feature importance and correlation, and the distribution of features and targets. The first two parts of the book introduce you to techniques for preparing data for ML algorithms, without being bashful about using some ML techniques for data cleaning, including anomaly detection and feature selection. The book then helps you apply that knowledge to a wide variety of ML tasks. You'll gain an understanding of popular supervised and unsupervised algorithms, how to prepare data for them, and how to evaluate them. Next, you'll build models and understand the relationships in your data, as well as perform cleaning and exploration tasks with that data. You'll make quick progress in studying the distribution of variables, identifying anomalies, and examining bivariate relationships, as you focus more on the accuracy of predictions in this book.By the end of this book, you'll be able to deal with complex data problems using unsupervised ML algorithms like principal component analysis and k-means clustering.What you will learnExplore essential data cleaning and exploration techniques to be used before running the most popular machine learning algorithmsUnderstand how to perform preprocessing and feature selection, and how to set up the data for testing and validationModel continuous targets with supervised learning algorithmsModel binary and multiclass targets with supervised learning algorithmsExecute clustering and dimension reduction with unsupervised learning algorithmsUnderstand how to use regression trees to model a continuous targetWho this book is forThis book is for professional data scientists, particularly those in the first few years of their career, or more experienced analysts who are relatively new to machine learning. Readers should have prior knowledge of concepts in statistics typically taught in an undergraduate introductory course as well as beginner-level experience in manipulating data programmatically. Nombre d'accès : Illimité En ligne : http://library.ez.neoma-bs.fr/login?url=https://www.scholarvox.com/book/88934025 Permalink : https://cataloguelibrary.neoma-bs.fr/index.php?lvl=notice_display&id=556156
Titre : Python Data Cleaning Cookbook Type de document : e-book Auteurs : Michael WALKER Editeur : PACKT PUBLISHING Année de publication : 2020 ISBN/ISSN/EAN : 9781800565661 Note générale : copyrighted Langues : Anglais (eng) Résumé : Discover how to describe your data in detail, identify data issues, and find out how to solve them using commonly used techniques and tips and tricksKey FeaturesGet well-versed with various data cleaning techniques to reveal key insightsManipulate data of different complexities to shape them into the right form as per your business needsClean, monitor, and validate large data volumes to diagnose problems before moving on to data analysisBook DescriptionGetting clean data to reveal insights is essential, as directly jumping into data analysis without proper data cleaning may lead to incorrect results. This book shows you tools and techniques that you can apply to clean and handle data with Python. You'll begin by getting familiar with the shape of data by using practices that can be deployed routinely with most data sources. Then, the book teaches you how to manipulate data to get it into a useful form. You'll also learn how to filter and summarize data to gain insights and better understand what makes sense and what does not, along with discovering how to operate on data to address the issues you've identified. Moving on, you'll perform key tasks, such as handling missing values, validating errors, removing duplicate data, monitoring high volumes of data, and handling outliers and invalid dates. Next, you'll cover recipes on using supervised learning and Naive Bayes analysis to identify unexpected values and classification errors, and generate visualizations for exploratory data analysis (EDA) to visualize unexpected values. Finally, you'll build functions and classes that you can reuse without modification when you have new data.By the end of this Python book, you'll be equipped with all the key skills that you need to clean data and diagnose problems within it.What you will learnFind out how to read and analyze data from a variety of sourcesProduce summaries of the attributes of data frames, columns, and rowsFilter data and select columns of interest that satisfy given criteriaAddress messy data issues, including working with dates and missing valuesImprove your productivity in Python pandas by using method chainingUse visualizations to gain additional insights and identify potential data issuesEnhance your ability to learn what is going on in your dataBuild user-defined functions and classes to automate data cleaningWho this book is forThis book is for anyone looking for ways to handle messy, duplicate, and poor data using different Python tools and techniques. The book takes a recipe-based approach to help you to learn how to clean and manage data. Working knowledge of Python programming is all you need to get the most out of the book. Nombre d'accès : Illimité En ligne : http://library.ez.neoma-bs.fr/login?url=https://www.scholarvox.com/book/88906473 Permalink : https://cataloguelibrary.neoma-bs.fr/index.php?lvl=notice_display&id=526603
LIBRARY - Campus Rouen
NEOMA Business School
pmb
-
59 Rue Taittinger, 51100 Reims
-
00 33 (0)3 26 77 46 15
Library Campus Reims
-
1 Rue du Maréchal Juin, BP 215
76825 Mont Saint Aignan cedex -
00 33 (0)2 32 82 58 26