My name's Josh! As a senior data scientist, I work on full stack data science pipelines. My passions include:
- Learning always
- Using data well
- Supporting open source
You'll find a variety of projects in my Github repos. Here are some highlights:
- shinyfilters: Create shiny inputs from vectors data.frames, or any R object
- r-in-aws: Deploying an R package on AWS lambda, comprehensively tested, using Terraform for infra
- statistical-rethinking-notes: Notes and code from McElreath's Statistical Rethinking
- accelerated-cpp: C++ programming exercises and implementations
- jx11: Building a digital synthesizer using the JUCE framework in C++
- julia-transformers: Exploring transformer models implementation in Julia
- rstuff: My package for everything R
Most of my work is for a paycheck and not on my personal Github. This is what I generally do:
- R Development: Author and maintainer of R packages, both proprietary and open source
- Data Science: Full-stack data science solutions from ETL to deployment using R, SQL, Python, Stan, ...
- Programming Languages: R, Python, SQL
- Areas of Focus: Statistical Modeling, Package Development, Data Pipeline Architecture


