A Data Engineer based in Islamabad, Pakistan, focused on building cloud-native data pipelines and scalable data infrastructure on AWS.
I graduated with a BSc in Information Technology (University of Bedfordshire, 2024) and have been hands-on building real-world data engineering projects ever since.
Currently preparing for the AWS Certified Data Engineer – Associate (DEA-C01).
Languages: Python · SQL
Big Data: PySpark · Apache Spark · Databricks
AWS: S3 · Glue · Athena · Lambda · Kinesis · Lake Formation · CloudWatch · SNS · Redshift
Visualisation: Tableau · Amazon QuickSight
Other: Medallion Architecture · ETL/ELT · Real-Time Streaming · Data Lakes
| Project | Description | Stack |
|---|---|---|
| Crypto Realtime Pipeline | Real-time crypto price streaming — CoinGecko → Kinesis → Lambda → S3 → Athena → Tableau | Python, AWS Kinesis, Lambda, Athena |
| Healthcare Data Lake – AWS | HIPAA-compliant data lake using Medallion Architecture with PySpark ETL and Lake Formation governance | PySpark, AWS Glue, S3, Lake Formation |
| E-Commerce Realtime Analytics | Real-time e-commerce analytics pipeline with live Tableau dashboarding | Python, Kinesis, Lambda, Firehose, Tableau |