What you’ll learn
-
Learn multiple techniques for missing data imputation
-
Transform categorical variables into numbers while capturing meaningful information
-
Learn how to deal with infrequent, rare and unseen categories
-
Transform skewed variables into Gaussian
-
Convert numerical variables into discrete
-
Remove outliers from your variables
-
Extract meaningful features from dates and time variables
-
Learn techniques used in organisations worldwide and in data competitions
-
Increase your repertoire of techniques to preprocess data and build more powerful machine learning models
Specifically, you will learn:
- How to impute your missing data
- How to encode your categorical variables
- How to transform your numerical variables so they meet ML model assumptions
- How to convert your numerical variables into discrete intervals
- How to remove outliers
- How to handle date and time variables
- How to work with different time zones
- How to handle mixed variables which contain strings and numbers
Who this course is for:
- Data Scientists who want to get started in pre-processing datasets to build machine learning models
- Data Scientists who want to learn more techniques for feature engineering for machine learning
- Data Scientist who want to improve their coding skills and best programming practices for feature engineering
- Software engineers, mathematicians and academics switching careers into data science
- Data Scientists who want to try different feature engineering techniques on data competitions
- Software engineers who want to learn how to use Scikit-learn and other open-source packages for feature engineering
Deal Score0
Disclosure: This post may contain affiliate links and we may get small commission if you make a purchase. Read more about Affiliate disclosure here.