A step-by-step guide to reproduce experiments and get a submission.
- Install Python 3.8+ and create a virtualenv: python -m venv .venv source .venv/bin/activate pip install -r requirements.txt
- Download Kaggle dataset and place files: data/raw/train.csv data/raw/test.csv
-
Run: python src/preprocess.py
--train data/raw/train.csv
--test data/raw/test.csv
--out data/processed/ -
What it does:
- Parses datetimes, fills missing values
- Adds features: hour, dow, haversine_distance, bearing
- Saves parquet files in
data/processed/
notebooks/— walkthroughssrc/preprocess.py— transformation pipelinesrc/train.py— training configs and CV logic