victoriacheng15/personal-reading-analytics

📚 Personal Reading Analytics

A self-built, fully automated data pipeline that transforms raw reading data into actionable insights and interactive visualizations with zero infrastructure. It runs entirely on GitHub Actions under CI/CD governance and uses MongoDB event sourcing for auditability.

Beyond standard charts, it performs an AI Delta Analysis to generate a qualitative weekly narrative, answering three specific questions:

  • Velocity: Are you reading faster or slower than usual?
  • Backlog Health: Are you clearing old debt (>1 year) or just adding new noise?
  • Chronology: Which specific years of content are you focusing on right now?

🔗 Live Analytics

👉 See Live Analytics


🌿 Engineering Principles

This project is built to reflect how I believe small, personal tools should work:

  • Zero infrastructure → No servers or hosting costs. Runs entirely on GitHub (Actions + Pages).
  • Fully automated → Scheduled GitHub Actions keep data fresh, with CI/CD governance for human-in-the-loop review before merging.
  • Observability first → Uses an Event Sourcing pattern (MongoDB) to decouple extraction from analytics, ensuring full auditability and health monitoring.
  • Cost-effective → Uses only free tiers (GitHub, Google Sheets API, MongoDB Atlas), proving powerful automation doesn’t require budget.
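The event-sourcing idea can be illustrated with a minimal sketch: the pipeline appends immutable events rather than mutating a single "current state" record, and current state is recovered by replaying the log. The event fields and types below are illustrative assumptions, not the project's actual MongoDB schema.

```python
from dataclasses import dataclass
from datetime import datetime, timezone

@dataclass(frozen=True)
class ArticleEvent:
    article_id: str
    event_type: str      # illustrative types: "discovered" or "read"
    occurred_at: datetime

def replay(events):
    """Fold the event log into the current read/unread state per article."""
    state = {}
    for e in sorted(events, key=lambda e: e.occurred_at):
        if e.event_type == "discovered":
            state.setdefault(e.article_id, "unread")
        elif e.event_type == "read":
            state[e.article_id] = "read"
    return state

log = [
    ArticleEvent("a1", "discovered", datetime(2024, 1, 1, tzinfo=timezone.utc)),
    ArticleEvent("a1", "read", datetime(2024, 2, 1, tzinfo=timezone.utc)),
    ArticleEvent("a2", "discovered", datetime(2024, 3, 1, tzinfo=timezone.utc)),
]
print(replay(log))  # {'a1': 'read', 'a2': 'unread'}
```

Because events are never overwritten, the extraction step and the analytics step can evolve independently, and any past state can be reconstructed for auditing.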

📚 Documentation

For all project documentation, including architectural diagrams, operational guides, and detailed schema specifications, please visit the Project Documentation. This central hub includes details on the external Observability Hub, which processes events from this pipeline (MongoDB to PostgreSQL) for Grafana visualization. Note that the Grafana instance itself is not publicly exposed.


🛠 Tech Stack

Go · Python · Google Sheets API · MongoDB · Google Gemini · Docker · GitHub Actions


📊 What It Shows

Key Metrics Section:

  • Total articles: Count of articles tracked across all currently supported sources
  • Read rate: Percentage of articles completed with visual highlighting
  • AI Delta Analysis: Multi-dimensional analysis of reading Velocity (pace), Backlog Health (clearing old debt vs. new noise), and Chronology (era of content focus) to provide narrative context beyond raw numbers.
  • Historical Archive: A permanent record of past weekly snapshots, accessible via a context-aware selector to track progress over time.
  • Reading statistics: Read count, unread count, and average articles per month
  • Highlight badges: Top read rate source, most unread source, current month's read articles
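The three dimensions of the AI Delta Analysis can be sketched as a comparison between two weekly snapshots. The snapshot keys here are illustrative assumptions; in the real pipeline the narrative itself is generated by Gemini, with logic along these lines providing the inputs.

```python
def delta_analysis(prev, curr):
    """Compare two weekly snapshots (dicts with illustrative keys)."""
    velocity = curr["read_this_week"] - prev["read_this_week"]
    backlog = curr["unread_over_1y"] - prev["unread_over_1y"]
    # Chronology: which publication year received the most reads this week.
    focus_year = max(curr["reads_by_year"], key=curr["reads_by_year"].get)
    return {
        "velocity": "faster" if velocity > 0 else "slower" if velocity < 0 else "steady",
        "backlog_health": "clearing" if backlog < 0 else "growing",
        "focus_year": focus_year,
    }

prev = {"read_this_week": 5, "unread_over_1y": 40, "reads_by_year": {2023: 3, 2024: 2}}
curr = {"read_this_week": 8, "unread_over_1y": 37, "reads_by_year": {2021: 5, 2024: 3}}
print(delta_analysis(prev, curr))
# {'velocity': 'faster', 'backlog_health': 'clearing', 'focus_year': 2021}
```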

7 Interactive Visualizations (Chart.js):

  1. Year Breakdown: Bar chart showing article distribution by publication year
  2. Read/Unread by Year: Stacked bar chart with reading progress across years
  3. Monthly Breakdown: Toggle between total articles (line chart) and by-source distribution (stacked bar)
  4. Read/Unread by Month: Seasonal reading patterns across all months
  5. Read/Unread by Source: Horizontal stacked bars comparing progress per provider
  6. Unread Age Distribution: Age buckets (<1 month, 1-3 months, 3-6 months, 6-12 months, >1 year)
  7. Unread by Year: Identifies which years have the most unread backlog
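The age buckets in chart 6 amount to a simple threshold function. This is a sketch under the assumption that bucket boundaries fall at 30/90/180/365 days; the actual cutoffs may differ.

```python
from datetime import date

def age_bucket(published: date, today: date) -> str:
    """Assign an unread article to one of the chart's age buckets."""
    days = (today - published).days
    if days < 30:
        return "<1 month"
    if days < 90:
        return "1-3 months"
    if days < 180:
        return "3-6 months"
    if days < 365:
        return "6-12 months"
    return ">1 year"

print(age_bucket(date(2024, 1, 1), date(2025, 6, 1)))  # >1 year
```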

Source Analytics:

  • Per-source statistics with read/unread split and read percentages
  • Substack per-author average calculation (total articles ÷ author count)
  • Top 3 oldest unread articles with clickable links, dates, and age calculations
  • Source metadata showing when each provider was added to tracking
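The per-source statistics above reduce to a small aggregation: a read/unread split, a read rate, and (for Substack) total articles ÷ author count. The tuple shape below is an illustrative assumption, not the project's real schema.

```python
from datetime import date

def source_stats(articles, author_count=None):
    """Per-source read/unread split, read rate, and optional per-author average.

    `articles` is a list of (read: bool, published: date) tuples.
    """
    read = sum(1 for is_read, _ in articles if is_read)
    stats = {
        "read": read,
        "unread": len(articles) - read,
        "read_rate": round(100 * read / len(articles), 1) if articles else 0.0,
    }
    if author_count:
        # Substack-style average: total articles divided by author count.
        stats["avg_per_author"] = round(len(articles) / author_count, 1)
    return stats

articles = [(True, date(2023, 5, 1)), (False, date(2022, 1, 1)), (True, date(2024, 3, 1))]
print(source_stats(articles, author_count=2))
# {'read': 2, 'unread': 1, 'read_rate': 66.7, 'avg_per_author': 1.5}
```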

📖 How This Project Evolved

Learn about the journey of this project: from local-only execution, to Docker containerization, to automated GitHub Actions workflows.


🚀 Ready to Explore?

Don't just take my word for it: explore the real data.

👉 Launch Personal Reading Analytics
