Caleb Akinwande
Senior Data Engineer · Kafka + Spark + Delta + Databricks · 8B events/day
◆Summary◆
Senior data engineer with seven years across SaaS + fintech. Owns the merchant-events ingestion pipeline at a Series C SaaS — Kafka + Spark Structured Streaming + Delta Lake on Databricks, 8B events/day with p99 SLA under 4 minutes from event-write to gold-table availability. Cut Snowflake compute spend $480k/yr (-32%). Two merged PRs to dbt-core; DataCouncil 2024 speaker.
◆Skills◆
Warehouse + Processing
Databricks + Delta Lake·Snowflake·Spark Structured Streaming·dbt (220 models)
Streaming + Orchestration
Kafka + Schema Registry·Debezium (CDC)·Airflow·Dagster
Quality + Languages
Great Expectations + dbt tests·OpenLineage + Marquez·Python (PySpark, Polars)·Scala (Spark)
◆Experience◆
Senior Data Engineer
Q
Quill · Remote (Lagos)
May 2022—Present
- Built the merchant-events ingestion pipeline (Kafka → Spark Structured Streaming → Delta on Databricks); SLA p99 stays under 4 min from event-write to gold-table availability across an 8B-event/day workload.
- Cut Snowflake compute spend $480k/yr (-32%) through partition pruning, materializing the 12 most-queried dashboards as incremental dbt models, and right-sizing the warehouse for off-hours queries.
- Built 38 dbt tests across critical gold-tables (freshness, uniqueness, referential integrity, business-rule); freshness violations dropped 84%; ownership documented per test with on-call rotation.
- Authored the org's data-contract framework (Protobuf + dbt-source freshness + Great Expectations); 24 critical contracts published; upstream-breaking changes caught at PR-time.
- Owns 220 dbt models across 4 marts (finance, growth, product, ops); model-execution time fell 38% via incremental materialization + clustering keys.
Data Engineer
P
Paystack · Lagos, NG
Sep 2019—Apr 2022
- Migrated the data warehouse from Redshift to Snowflake over 4 months; 380 tables migrated; query latency p95 12s → 1.8s; concurrent-user capacity tripled.
- Built the CDC ingestion from PostgreSQL to lakehouse (Debezium + Kafka + Spark merge); 14 source tables onboarded; freshness SLA 1h → 90s.
- Built the data-lineage observability layer (OpenLineage + Marquez); 38 weekly active analysts using the catalog.
Analytics Engineer
F
Flutterwave · Lagos, NG
Jul 2017—Aug 2019
- Migrated 80 ad-hoc analytics queries into dbt models; reduced data-team turnaround time on metric requests from 5 days to under 1 day.
◆Open Source & Speaking◆
dbt-labs/dbt-core
Contributor (2 merged PRs)Two merged PRs to dbt-core — one extended the on-run-end hook for incremental-model freshness reporting; one closed a partial-failure-recovery bug in incremental materialization. Plus: DataCouncil 2024 speaker — 'Data contracts at 8B events/day.'
Pythondbt
◆Education◆
BSc in Computer Science
University of Lagos
Sep 2013—Jun 2017
Senior
7 years data. Owns 8B-event/day pipeline. Kafka + Spark + Delta + Databricks.
Use this template