End-to-End ML-Powered Sarajevo Real Estate Application by musss2003 · Pull Request #10 · EmreArapcicUevak/EE418-Introduction-to-Machine-Learning-Project

musss2003 · 2025-12-25T10:18:48Z

Home Page – Price Prediction

Added a home screen where users can select a feature set (location, size, rooms, condition, etc.).

The app uses trained ML regression models to predict the expected market price of a property.

Predictions are returned instantly and are based on cleaned, preprocessed historical data.

All ML models are loaded once at application startup and reused across requests to ensure fast response times and improved stability.

Prediction History

Implemented prediction history tracking, allowing users to review previously generated price predictions.

Enables comparison and transparency of model outputs over time.

Listings & Deal Score Evaluation

Integrated real estate listings enriched with predicted prices.

Each listing is assigned a Deal Score (0–100) that reflects how favorable the asking price is compared to the predicted market value.

Deal Score Logic

def deal_score_from_diff(diff_pct: float, min_diff: float, max_diff: float) -> int:
    """
    Convert relative price difference to a deal score in range [0, 100].

    Negative diff -> cheaper than expected -> higher score
    Positive diff -> overpriced -> lower score
    """
    # A degenerate range carries no information, so return a neutral score
    if max_diff == min_diff:
        return 50

    # Clamp the difference to the observed range before scaling
    diff_pct = max(min_diff, min(max_diff, diff_pct))
    score = (max_diff - diff_pct) / (max_diff - min_diff) * 100
    return int(round(max(0, min(100, score))))

Lower-than-expected prices result in higher scores.

Overpriced listings receive lower scores.

Scores are normalized using observed market ranges to ensure consistency.
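
As a rough illustration of the scoring described above, here is a minimal sketch that applies the function to a few made-up listings. The `relative_diff_pct` helper, the sample prices, and the choice to take `min_diff` and `max_diff` from the listings themselves are assumptions made for this example, not necessarily how the backend derives them.

```python
def deal_score_from_diff(diff_pct: float, min_diff: float, max_diff: float) -> int:
    """Map a relative price difference to a deal score in [0, 100]."""
    if max_diff == min_diff:
        return 50
    diff_pct = max(min_diff, min(max_diff, diff_pct))
    score = (max_diff - diff_pct) / (max_diff - min_diff) * 100
    return int(round(max(0, min(100, score))))


def relative_diff_pct(asking_price: float, predicted_price: float) -> float:
    """Assumed definition: percentage by which the asking price exceeds the prediction."""
    return (asking_price - predicted_price) / predicted_price * 100


# Hypothetical listings as (asking price, predicted price) pairs
listings = [(170_000, 200_000), (200_000, 200_000), (230_000, 200_000)]

diffs = [relative_diff_pct(asking, predicted) for asking, predicted in listings]
min_diff, max_diff = min(diffs), max(diffs)  # observed range across the listings
scores = [deal_score_from_diff(d, min_diff, max_diff) for d in diffs]
```

With these inputs the cheapest listing relative to its prediction gets the top score and the most overpriced one gets the bottom score, because the range is taken from the same three listings.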

Favorites Management

Added the ability to add and remove listings from favorites.

Favorites are persisted and can be revisited later by the user.

Improves usability and supports shortlisting of interesting properties.

Statistics & Market Insights

Implemented a statistics section based on cleaned and preprocessed data stored in the database.

Includes:

Price distribution and averages
Price per square meter
Municipality and condition distributions
Accessibility metrics (distance to POIs)

All statistics are computed from validated, normalized data produced during preprocessing.
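
To make the price-per-square-meter statistic concrete, the following sketch groups a handful of rows by municipality and averages price per square meter. The rows, field names, and the `price_per_m2_by_municipality` helper are hypothetical; the actual statistics are computed from the database and may be aggregated differently.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical cleaned rows; the field names are illustrative only
rows = [
    {"municipality": "Centar", "price": 300_000, "square_m2": 75},
    {"municipality": "Centar", "price": 200_000, "square_m2": 50},
    {"municipality": "Novi Grad", "price": 150_000, "square_m2": 60},
]


def price_per_m2_by_municipality(rows):
    """Average price per square meter for each municipality."""
    grouped = defaultdict(list)
    for row in rows:
        if row["square_m2"] > 0:  # skip rows that would divide by zero
            grouped[row["municipality"]].append(row["price"] / row["square_m2"])
    return {name: mean(values) for name, values in grouped.items()}


stats = price_per_m2_by_municipality(rows)
```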

ML & Data Integration Improvements

ML models are loaded once on first application run and reused across the app lifecycle.

POI (Points of Interest) data is also cached and reused.
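
One common way to get this load-once behavior in Python is to memoize the loader functions. The sketch below uses `functools.lru_cache` with stand-in loaders; the function names, the `"sale"`/`"rent"` keys, and the returned dictionaries are placeholders and do not reflect the actual backend code.

```python
from functools import lru_cache

# Counters exist only to show that the expensive loaders run once
LOAD_CALLS = {"model": 0, "poi": 0}


def _load_model_from_disk(kind: str) -> dict:
    """Stand-in for an expensive model deserialization step (hypothetical)."""
    LOAD_CALLS["model"] += 1
    return {"kind": kind}


@lru_cache(maxsize=None)
def get_model(kind: str) -> dict:
    """Return the model for a given kind, loading it only on first use."""
    return _load_model_from_disk(kind)


@lru_cache(maxsize=1)
def get_poi_data() -> dict:
    """Return POI data, reading it only on the first call (hypothetical content)."""
    LOAD_CALLS["poi"] += 1
    return {"schools": [], "parks": []}
```

Repeated calls such as `get_model("sale")` return the same cached object, so request handlers can call the getter freely without reloading anything.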


All code errors have been successfully resolved:

Renamed use-prediction-history.ts to use-prediction-history.tsx - The file contained JSX code but had a .ts extension
Fixed import paths - Updated all imports to use the correct kebab-case filenames: usePredictionHistory → use-prediction-history, useColorScheme → use-color-scheme, Colors → imported from theme.ts
Created missing TabBarIcon component - Added the component that was being imported but didn't exist
Fixed TypeScript return type - Added proper React import and return type annotation for PredictionHistoryProvider

Supabase client with AsyncStorage for persistent sessions
AuthContext provider with global auth state
API service with automatic JWT token injection
Sign in/up screens with modern gradient UI

Updated Screens:

Profile Tab - Shows user info, menu items, and sign out button (or guest features for non-authenticated users)
Predict Screen - Now uses makePrediction() from API service with auth headers, displays user email in header
History Screen - Fetches user-specific predictions from backend for authenticated users, falls back to local storage for guests


musss2003 · 2025-12-25T10:20:45Z

Question regarding rentals model input features

I wanted to double-check one thing before we merge.

In the backend, feature construction for predictions currently includes the condition field:

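import geopandas as gpd
import pandas as pd
from shapely.geometry import Point


def build_features_from_request(req, poi_data):
    # Project the WGS84 request coordinates to UTM zone 34N so distances are in meters
    point = Point(req.longitude, req.latitude)
    point = gpd.GeoSeries([point], crs="EPSG:4326").to_crs(epsg=32634).iloc[0]

    features = {
        "condition": req.condition,
        "rooms": req.rooms,
        "square_m2": req.square_m2,
        "equipment": req.equipment,
        "level": req.level,
        "heating": req.heating,
    }

    # Distance to the nearest POI of each type
    # (assumes each POI GeoDataFrame is already in EPSG:32634)
    for poi_name, pois in poi_data.items():
        col = f"closest_{poi_name}_m"
        idx = pois.sindex.nearest(point, 1)[1][0]
        features[col] = point.distance(pois.geometry.iloc[idx])

    return pd.DataFrame([features])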

For rentals, the frontend will not send condition (as agreed earlier), so it will be empty on the backend.

Can you confirm whether condition was dropped from the rentals model during training, or whether the model expects it to be present?

I just want to make sure this won’t affect rental predictions once this PR is merged, since after merging I plan to deploy the backend to AWS EC2.

Thanks!
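
One way to surface this kind of mismatch before deployment is to compare the request features against the columns the model was trained on. The sketch below is only an illustration: the column list and the sample request are hypothetical. For a scikit-learn estimator fitted on a DataFrame, the trained column names are available as `feature_names_in_`, but whether the project's models expose that attribute is an assumption.

```python
def missing_or_empty_features(expected, features):
    """Return the expected columns that are absent or None in the request features."""
    return [name for name in expected if features.get(name) is None]


# Hypothetical column list; in practice it could be read from the trained model
expected_columns = ["condition", "rooms", "square_m2", "equipment", "level", "heating"]

# Hypothetical rental request where the frontend did not send a condition
rental_request = {
    "condition": None,
    "rooms": 2,
    "square_m2": 54.0,
    "equipment": "furnished",
    "level": 3,
    "heating": "central",
}

problems = missing_or_empty_features(expected_columns, rental_request)
```

A non-empty result could be turned into a validation error before the request reaches the model.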

backend/app/ml/poi/poi_loader.py

backend/app/services/model.py

backend/scripts/backfill_predictions.py

EmreArapcicUevak · 2025-12-25T12:35:03Z


We haven't agreed on rentals dropping all of the conditions. In the feature analysis notebook, rentals just cannot have "in construction" or "needs renovation" as their condition, since logically you cannot rent an apartment that isn't built or one that needs repairs. The features for rentals and sales are the same; only the domain differs for some of them.

musss2003 · 2025-12-25T13:56:32Z

Here is the core logic for our model, where I differentiated ConditionType for Sale and Rent. Maybe you can take another look at this file: backend/app/api/predict.py
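
For readers without access to that file, the idea of a shared feature with a narrower domain for rentals can be sketched roughly as follows. This is not the content of backend/app/api/predict.py: the `Condition` enum, the `allowed_conditions` helper, and the "new" and "renovated" values are hypothetical, and the spelling of the two excluded values is illustrative.

```python
from enum import Enum


class Condition(str, Enum):
    NEW = "new"                          # hypothetical value
    RENOVATED = "renovated"              # hypothetical value
    IN_CONSTRUCTION = "in construction"  # excluded for rentals per the discussion
    NEEDS_RENOVATION = "needs renovation"  # excluded for rentals per the discussion


# Conditions that make no sense for a rental listing
RENT_EXCLUDED = {Condition.IN_CONSTRUCTION, Condition.NEEDS_RENOVATION}


def allowed_conditions(listing_type: str) -> set:
    """Return the set of valid conditions for 'sale' or 'rent' listings."""
    all_conditions = set(Condition)
    if listing_type == "rent":
        return all_conditions - RENT_EXCLUDED
    if listing_type == "sale":
        return all_conditions
    raise ValueError(f"unknown listing type: {listing_type!r}")
```

Under this reading, a rental request would still send a condition, but validation would reject the two excluded values.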

EmreArapcicUevak

Please just close the pending reviews with a reference to the commit and the changes that were made to the target file.

MachineLearning/notebooks/rents_ML.ipynb

backend/__pycache__/main.cpython-314.pyc

backend/app/api/predict.py

backend/app/ml/features/build_features.py

backend/app/main.py

backend/requirements.txt

notebooks/feature_analysis.ipynb

notebooks/scraping_flats_nekretnine.ipynb

frontend/app.plugin.js

EmreArapcicUevak

Most of these changes fixed what I wanted. The only thing I really want changed is for the flats.csv file to be removed. The other changes are optional. Good job, Mustafa :)

backend/app/ml/features/build_features.py

backend/app/services/price_prediction.py

backend/scripts/data/flats.csv


musss2003 and others added 30 commits October 28, 2025 18:34

Commit snapshot to mustafa_branch

a8b0934

Added notebook with explanations by which data is scraped from OLX

0292f4c

Scapped data on Nekretnine.ba, doing analysis

ed55b78

Further exploration OLX dataset

bd48c1b

Municipality engineering, taken from title, address or description

13ab461

update .gitignore

5b98361

Update requirements.txt

964d1cc

Add unwanted latex files to the .gitignore

37c5765

Start work on the project pitch

a568af5

Add title section and table of content

37c202a

Finish work on the introduction section

5e277b0

Finish work on the feature analysis notebook

41bf203

Add needed images

df45629

Add new images and finish work on the pdf

dc746a0

Finish work on the presentation

aaa6ee2

Do some final changes to have nicer plots

11fa9df

Fix up the bar plots

0d3a025

Updates on extracting longitude and latitude

8114b98

Merge pull request #1 from EmreArapcicUevak/project_pitch

5a9c4e6

Written the project pitch related to our data and added a notebook that does feature analysis

Add multithreaded coordinate extraction for OLX listings

c5ac3fd

Remove CSV datasets from git tracking

6a7611e

Research in dataset of web platfrom Nekretnine.ba

31eaa33

Initial commit

2a852b7

Generated by create-expo-app 3.5.3.

Added initial page

f262fee

Revise README for Price Predictor Mobile App

9be6b8a

Updated README to reflect project details and features.

First version of the initial app

a12cb4e

Updated dir structure

86a0990

Added backend files

ecb6dec

musss2003 added 12 commits December 22, 2025 14:52

Preparing for deployment

0c8d3e6

Prepared for deployment

c3ef5eb

Added ALLOWED_ORIGINS, improved predict function, improved logging

818b6d8

Improved model service function, added mult stage build

e5129fa

Merge pull request #2 from musss2003/mustafa_branch

d1c5d2e

Dockerized FastAPI backend with ML runtime & Nginx reverse proxy

Merge from main into my branch

244f51b

Merge branch 'mustafa_branch' of https://github.com/EmreArapcicUevak/…

b71931a

…EE418-Introduction-to-Machine-Learning-Project into mustafa_branch

Preparing app for linking with the ML model

5eb94d8

Listings with correct dealScore is working

150f602

Fixed exploring listings and CRUD favorites

ae893a4

Works history of predictions and properly predicting price

99aded5

Finished working statistics page with all features

83298fa

musss2003 requested a review from EmreArapcicUevak December 25, 2025 12:16

EmreArapcicUevak requested changes Dec 25, 2025

backend/app/ml/poi/poi_loader.py Outdated

backend/app/services/model.py Outdated

backend/scripts/backfill_predictions.py Outdated

backend/scripts/backfill_predictions.py

FIxed suggested improvements

19947fa

EmreArapcicUevak self-assigned this Dec 25, 2025

EmreArapcicUevak requested changes Dec 25, 2025

backend/requirements.txt

notebooks/feature_analysis.ipynb Outdated

notebooks/scraping_flats_nekretnine.ipynb

EmreArapcicUevak requested changes Dec 25, 2025

frontend/app.plugin.js Outdated

musss2003 added 5 commits December 26, 2025 11:32

Fixed based on PR suggestions

a2917db

Improvement of backend code design

c322645

Restored notebook from main branch

9783faa

Fixed tsconfig.json

523c474

Moved predict_price within service function to serve as helper

6353613

musss2003 requested a review from EmreArapcicUevak December 27, 2025 14:41

EmreArapcicUevak requested changes Dec 27, 2025

backend/app/ml/features/build_features.py

backend/app/services/price_prediction.py

backend/scripts/data/flats.csv Outdated

Removing flats.csv and decoupling getting po_data and models to speci…

a9de964

…fic function so predict api does not know anything about that

2 participants