Project information
- Category: Data Processing
- Project date: 10 April, 2024
- Download the dataset: Go to Kaggle
- Project repository: Git repository
- Project Documentation: See Document
- Project code: ETL Notebook
About this project
In this project, I created a notebook to process a large dataset. My main goal was to clean the dataset while doing as little damage as possible to data integrity. When you run the notebook, it swaps some values between columns and converts some values, such as age and birth year. The result is a CSV file named 'ultra_marathon_clean_dataset.csv'. For more information, you can read the report or watch the video below.
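As a rough illustration of the kind of cleaning described above, here is a minimal sketch using only the Python standard library. It is not the notebook's actual code: the column names (`age`, `birth_year`, `event_year`), the helper functions, and the rule for detecting swapped values are all assumptions made for the example, and the real dataset's headers and logic may differ.

```python
import csv
import io

# Hypothetical column names; the real dataset's headers may differ.
AGE, BIRTH_YEAR, EVENT_YEAR = "age", "birth_year", "event_year"


def to_int(value):
    """Return value as an int, or None if it is blank or not numeric."""
    try:
        return int(float(value))
    except (TypeError, ValueError, OverflowError):
        return None


def clean_row(row):
    """Swap misplaced age/birth-year values and fill in whichever is missing."""
    age = to_int(row.get(AGE))
    born = to_int(row.get(BIRTH_YEAR))
    event = to_int(row.get(EVENT_YEAR))

    # Assumed heuristic: a four-digit "age" next to a small "birth year"
    # means the two values were entered in each other's column.
    if age is not None and born is not None and age > 1900 and born < 150:
        age, born = born, age

    # Derive one field from the other when only one of them is present.
    if event is not None:
        if age is None and born is not None:
            age = event - born
        elif born is None and age is not None:
            born = event - age

    cleaned = dict(row)
    cleaned[AGE] = "" if age is None else str(age)
    cleaned[BIRTH_YEAR] = "" if born is None else str(born)
    return cleaned


def clean_csv(source, target):
    """Read rows from one open text file and write cleaned rows to another."""
    reader = csv.DictReader(source)
    writer = csv.DictWriter(target, fieldnames=reader.fieldnames)
    writer.writeheader()
    for row in reader:
        writer.writerow(clean_row(row))


# Usage with in-memory text; files opened with newline="" work the same way.
source = io.StringIO("age,birth_year,event_year\n1980,40,2020\n,1985,2018\n")
target = io.StringIO()
clean_csv(source, target)
print(target.getvalue())
```

To write a real file instead, `clean_csv` could be called with two handles from `open(..., newline="")`, the second pointing at 'ultra_marathon_clean_dataset.csv'. Rows that do not match either rule pass through unchanged, which keeps the cleaning conservative.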