Skip to content
View Lua-Matlab-Python-R-J2EE's full-sized avatar
💭
Looking for DS/ML/AI roles
💭
Looking for DS/ML/AI roles

Block or report Lua-Matlab-Python-R-J2EE

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

About me:

  • Data Scientist (DS), Machine Learning (ML) and Artificial Intelligence (AI) practitioner with a background in scientific computing and biomedical research, and 26 peer-reviewed publications (18 first-author) spanning healthcare, clinical imaging, pharmaceuticals, and medical data analysis.
  • I build end-to-end AI/ML pipelines (from exploratory analysis to production deployment) bridging rigorous scientific methodology (cross-validation, Bland-Altman analysis, small-sample inference) with practical ML/AI engineering (Streamlit apps, CI/CD, FastAPI).
  • My project experience spans clinical imaging, biometrics, healthcare analytics, banking, and finance. Currently expanding into neural networks and deep learning with a focus on applied computer vision.

What I'm looking for:

  • DS/ML/AI roles in the UK where I can apply statistical modeling, build production ML/AI systems, and grow my MLOps expertise, particularly in finance, consulting, healthcare, life sciences or any domain where rigorous data methodology matters.

Skills

  • Core: Python (PyTorch, Scikit-learn, XGBoost, MLflow, Optuna, Pandas, NumPy, Pytest), SQL, R
  • ML/Visualization: Seaborn, Matplotlib, Statsmodels
  • Deployment: Streamlit, FastAPI, Docker, AWS (Bedrock, SageMaker)
  • Data: MySQL, CSV, Medical Imaging (DICOM, NIfTI)

How to read this profile:

  • The pinned projects demonstrate practical data science work, including real-world datasets, model development, and deployment. Additional repositories contain EDA and learning exercises.

Selected technical posts


Completed projects

  • Health Insurance Premium Prediction Python 3.10 Streamlit

  • Gait Analysis in Python
    ML analysis of gait biometrics across nine experimental pipelines, showcasing skills in data preprocessing, modeling, cross-validation, oversampling, clustering, and rigorous evaluation. Designed to highlight practical ML abilities, strong methodology, and clear reasoning in small-data, high-dimensional settings.

  • Expense Tracking System in Python
    A comprehensive expense management full-stack data application built with API design with FastAPI backend and Streamlit frontend, featuring real-time analytics and MySQL database integration for efficient personal finance tracking.

  • EDA in Banking Domain in Python
    Data analysis for an imaginary bank (using 50,000 records) to design and launch a competitive credit card product that aligns with market demands and customer preferences while minimizing failure risk.

  • EDA in Hospitality Domain in Python
    Data analysis for an imaginary hotel chain to uncover insights and recommend strategies for growth.

  • Movies Project in SQL
    A comprehensive SQL reference guide with practical examples covering fundamental to advanced SQL queries. All examples use a movies database schema for real-world learning.

  • Lean Body Mass Estimation in R
    Statistical Analysis: Comparison of ten predictive statistical models for estimating lean body mass against dual-energy X-ray absorptiometry (DXA) in older patients using correlation, Bland-Altman plots, and hypothesis testing.

  • DCE-MRI Tool in MATLAB
    Scientific Computing: General utility functions written in MATLAB/Octave as part of a software toolkit for analyzing 4-dimensional (4D) dynamic contrast-enhanced magnetic resonance imaging (dce-mri) data.


Completed github skills

Completed github skills

  • Hello GitHub Actions
    Learned the basics of GitHub Actions, including how to automate workflows directly from your repository using YAML configuration files.

  • Test with Actions 2
    Practiced configuring and running advanced CI workflows using GitHub Actions, focusing on automated testing and continuous integration best practices.

  • Publish Packages
    Practiced GitHub Actions to publish my project to a Docker image.

  • Your First Extension for GitHub Copilot
    Built and published a custom extension for GitHub Copilot, extending its coding capabilities to fit specific development needs.

  • Getting Started with GitHub Copilot
    Explored GitHub Copilot’s AI-powered code completions, learning how to boost productivity and write code faster.

  • Introduction to GitHub
    Covered GitHub essentials: creating repositories, managing files, and collaborating with others on code projects.

  • Communicate Using Markdown
    Mastered Markdown syntax to create well-formatted README files, documentation, and collaborative notes.

  • GitHub Pages
    Learned to publish and customize personal or project websites directly from GitHub repositories using GitHub Pages.

  • Review Pull Requests
    Practiced code review workflows, including providing feedback on pull requests and collaborating with team members to improve code quality.

  • Resolve Merge Conflicts
    Learned how to identify, understand, and resolve merge conflicts when working in collaborative repositories.

  • Release Based Workflow
    Explored advanced branching and release management strategies to ship project updates in a controlled and organized manner.

  • Connect the Dots
    Developed skills in linking issues, pull requests, and commits to streamline project management and maintain clear development history.

  • Code with Codespaces
    Learned to set up and use GitHub Codespaces for cloud-based development, enabling instant coding environments in the browser.

  • Introduction to Repository Management
    Gained foundational knowledge in managing repository settings, access controls, and collaboration features for effective project organization.

Pinned Loading

  1. ml-based-premium-prediction ml-based-premium-prediction Public

    Machine learning based healthcare premium prediction

    Jupyter Notebook

  2. Expense-Tracking-System Expense-Tracking-System Public

    Python

  3. EDA-Banking-Domain EDA-Banking-Domain Public

    Exploratory Data Analytics (EDA) in Banking Domain

    Python

  4. EDA-Hospitality-Domain EDA-Hospitality-Domain Public

    Exploratory Data Analytics (EDA) in Hospitality Domain

    Python

  5. gait_analysis gait_analysis Public

    machine learning based supervised and un-supervised gait analysis

    Jupyter Notebook

  6. LBM-R LBM-R Public

    Comparison of ten predictive equations for estimating lean body mass with dual-energy X-ray absorptiometry in older patients

    R