Case 1 · Annual Telecom API Peak Load Analysis (Hadoop / Hive)
48-hour annual API peak-load analysis for a 42M-record/day telecom system. 15.3B yearly logs made Oracle infeasible — resolved via rapid Hadoop/Hive cluster cloning and parallel ETL.
Engineering & Data Projects — ETL, performance tuning, and monitoring at scale.
This section highlights end-to-end engineering and analytics projects including ETL workflows, SQL optimization, Hadoop/Hive pipelines, Power BI DAX modelling, and large-scale data processing. Each project demonstrates real-world problem solving, optimization, and scalable design across telecom and civic datasets.
48-hour annual API peak-load analysis for a 42M-record/day telecom system. 15.3B yearly logs made Oracle infeasible — resolved via rapid Hadoop/Hive cluster cloning and parallel ETL.
From monthly full-scan to daily partitioned checks; ~30× scan reduction.
Sub-second lookup without ES — index + SQL tuning.
120 hosts, shell orchestration, proactive incident catch.
6M users cutover — resilient ETL and data integrity checks.
Detect & auto-resolve issues before customer calls.
From T−2 to T−1, ensuring on-time billing.
Demographic and engagement analytics of youth services in Calgary — exploring equity and participation through data visualization.
Browse engineering projects alongside city dashboards in Civic & Social Analytics, Public Health Analytics, Crime & Disorder Insights, Housing Market Insights, and Economy & Energy. Read long-form analysis in the Blog or learn more about my background on the About page.