ML Data Preprocessing

This project shows a simple way to prep data before using any machine learning model.
It’s meant to be easy to follow and learn from.

What it does

Here’s what the code does:

Handles missing values
Encodes categorical (non-numeric) data
Splits the dataset into training and testing sets
Scales numerical columns so big numbers don’t mess up the model

Dataset

The file Data.csv has:

Country (France, Spain, Germany)
Age
Salary
Purchased (Yes / No)

How to use it

Put Data.csv in the same folder as any of workflow file
Run the Python file
After running, you’ll get:
- x_train and x_test: prepped features
- y_train and y_test: encoded target labels

Notes

Some steps, like scaling, aren’t needed for certain models (like Decision Trees or Random Forest).
The code has comments explaining each step in a simple way.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Data.csv		Data.csv
ML_WorkFlow_AR_comments		ML_WorkFlow_AR_comments
ML_WorkFlow_EN_comments.py		ML_WorkFlow_EN_comments.py
README.md		README.md
README_AR.md		README_AR.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ML Data Preprocessing

What it does

Dataset

How to use it

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ML Data Preprocessing

What it does

Dataset

How to use it

Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages