Open to Work

Chinedu
Agwunobi

Data Engineer Cloud Architect Data Scientist

12x Certified (AWS · Azure · GCP · Databricks)  |  MSc Data Science, Teesside University  |  London, UK

I build scalable data pipelines, cloud-native architectures, and ML-powered solutions that turn raw data into real business value. Open to Data Engineering, Cloud Architecture, and Data Science opportunities.

12Certifications
2+Years Exp
23Projects
Chinedu Agwunobi
AWS
Azure
Spark
Python

The Story Behind the Data

Data Engineer

Designing and optimising end-to-end data pipelines using Apache Spark, Kafka, Airflow, and dbt. Migrating legacy systems to scalable cloud platforms on AWS, Azure, and GCP with robust data governance practices.

Cloud Architect

Designing highly available, cost-optimised cloud architectures aligned with the AWS Well-Architected Framework. Expertise in IaC with Terraform and CloudFormation across multi-cloud environments.

Data Scientist

Applying advanced ML techniques including clustering, NLP, and deep learning with PyTorch to deliver actionable insights. MSc Data Science at Teesside University with hands-on industry experience at Think Pacific.

Experience & Education

Data Engineer

Reliance Infosystems
Mar 2021 – Jul 2022
  • Designed, built, and maintained cloud-based ETL/ELT pipelines using AWS Glue, Lambda, and Azure Data Factory, delivering scalable data integration solutions across the full development lifecycle.
  • Automated data validation, profiling, and cleansing workflows, reducing manual intervention by 20 hours per week and strengthening data quality governance.
  • Led a cloud migration project achieving a 15% reduction in infrastructure costs while improving security and architectural standards.
  • Led peer code reviews and pipeline design reviews, maintaining high standards of quality within an Agile delivery environment.
  • Collaborated with analysts, business stakeholders, and technical teams to ensure data solutions met user requirements and aligned to organisational goals.
AWS GlueAWS LambdaAzure Data FactoryETL/ELTData GovernanceAgile

Data Engineer Intern

Reliance Infosystems
Aug 2020 – Feb 2021
  • Built and optimised big-data pipelines in Azure Databricks, improving model performance and enabling more accurate segmentation with a 25% improvement in accuracy.
  • Improved SQL query logic and efficiency, reducing execution times by 30% and improving reporting responsiveness.
  • Supported data ingestion and modelling tasks for analytics teams, ensuring accuracy and consistency across datasets.
Azure DatabricksSQLBig DataData ModellingAnalytics

Data Science Intern

Think Pacific
Sep 2023 – Dec 2023
  • Applied clustering techniques including K-Means, DBSCAN, and Hierarchical Clustering to analyse the Mall Customer Dataset in R-Studio, identifying distinct customer segments to support targeted marketing strategies.
  • Developed Python-based data preprocessing pipelines, reducing missing data issues by 30% and improving downstream modelling outcomes.
  • Implemented effective data visualisation techniques to communicate insights derived from complex datasets to non-technical stakeholders.
  • Created operational dashboards in Power BI, enabling evidence-based decision-making across teams.
  • Conducted extensive research into cutting-edge data science techniques, applying findings to improve analytical methodologies.
  • Collaborated with cross-functional stakeholders to interpret analytical outcomes and improve data visibility across the organisation.
PythonR-StudioK-MeansDBSCANPower BIData VisualisationClustering

MSc Data Science

Teesside University
2022 – 2024

Advanced postgraduate study covering machine learning, big data analytics, statistical modelling, NLP, and deep learning. Dissertation: "Empowering Market Forecasting and Investment Decisions with Advanced Sentiment Analysis Techniques."

Machine LearningBig DataNLPDeep LearningStatistics

Skills & Technologies

Apache Spark Apache Kafka Apache Airflow dbt Python SQL / NoSQL Snowflake Delta Lake Apache Hive ETL / ELT Pipelines Data Governance Git / CI/CD AWS Glue AWS Lambda Amazon S3 Amazon RDS Amazon Redshift Amazon Athena Amazon Kinesis AWS Step Functions Azure Data Factory Azure Synapse Analytics Microsoft Fabric Databricks
AWS (12 Services) Microsoft Azure Google Cloud Platform Terraform CloudFormation Docker Kubernetes GitHub Actions IAM / Security VPC / Networking CloudWatch EC2 / Lambda
Machine Learning PyTorch Scikit-learn Pandas / NumPy Computer Vision Transfer Learning NLP / Sentiment Analysis Clustering / Segmentation AutoGluon / AutoML Amazon SageMaker Power BI / QuickSight Jupyter Notebooks Time Series Statistical Analysis R Databricks ML AWS Step Functions
Agentic AI Workflows Prompt Engineering AI-Assisted Development Large Language Models Multi-Agent Systems Kiro IDE AI Automation RAG / Vector Search LangChain Concepts Responsible AI

Featured Projects

Real-world projects spanning data engineering, cloud architecture, data science, and agentic AI

12x Cloud Certified

Validated expertise across AWS, Azure, GCP, and Databricks

Amazon Web Services

Three AWS certifications validating cloud architecture, data engineering, and foundational cloud knowledge.

AWS Certified Solutions Architect Associate AWS Certified Data Engineer AWS Certified Cloud Practitioner

Microsoft Azure

Six Azure certifications spanning administration, architecture, data engineering, AI, and business intelligence.

AZ-104: Azure Administrator Associate AZ-305: Solutions Architect Expert DP-700: Fabric Data Engineer Associate DP-900: Azure Data Fundamentals AI-900: Azure AI Fundamentals AZ-900: Azure Fundamentals

Google Cloud & IT

GCP cloud architecture certification and Google IT Support Specialisation demonstrating broad technical foundations.

Google Cloud Associate Solutions Architect Google IT Support Specialisation

Databricks

Databricks certification validating expertise in Apache Spark, Delta Lake, and lakehouse data engineering.

Databricks Certified Associate Data Engineer

WorldQuant University

Eight end-to-end applied data science projects completed with 90%+ on each assessment, covering ML, time series, NLP, A/B testing, and API design across global datasets.

BeSA — Agentic AI on AWS

Fundamentals of AI Agents and AWS Agentic AI services. Covers LLMs, MLOps, NLP, system design, and the Solutions Architect mindset for the AI era. Skills: Python, AWS, Azure, GCP, Docker, APIs, LLMs, MLOps, NLP, System Design.

Let's Build Something Together

Open to Data Engineer, Cloud Architect, and Data Scientist roles. Based in London, open to remote.

Location

London, United Kingdom

Seeking Roles In:

Data Engineer Cloud Architect Data Scientist