Hello, I'm

Emad Abuzeid Alvarez

Tech Lead & Senior Data Engineer · Google Cloud Architect & ML Engineer

Madrid, Spain · emad.abuzeid95@gmail.com · +34 636 789 982

Motivated professional with 5+ years of combined technical and business experience. Proven track record in architecture, technical leadership, and infrastructure optimization as a data engineer. Passionate about data-driven decision-making, business intelligence, and digital transformation.

About Me

I'm a Tech Lead and Senior Data Engineer at Telefónica's Chief Data Officer (CDO) unit in Madrid, Spain. With 5+ years of combined technical and business experience, I have a proven track record in architecture, technical leadership, and infrastructure optimization as a data engineer.

I hold three Google Professional certifications — Cloud Architect, Data Engineer, and Machine Learning Engineer — and I believe that programming is not an art: it's an engineering discipline that can be taught, learned, and continuously improved. That's why I created "Programming is not an art", a free course to help non-technical people break into tech.

I'm passionate about data-driven decision-making, business intelligence, and the digital transformation of organizations. When I'm not building data platforms or mentoring teams, I create educational content that teaches newcomers to approach software with an engineering mindset.

Experience

Tech Lead & Senior Data Engineer at CDO

Jun 2022 — Present

Telefónica

Leading data engineering teams and architecting scalable, cost-effective data platforms on Google Cloud for Telefónica's Chief Data Officer unit. Designing and modeling business intelligence data lake/warehouse architecture while implementing governance and security policies.

  • Led and mentored two data engineering teams (6 people total), boosting feature delivery by 60% and reducing project timelines by 30%
  • Designed and modeled a scalable and cost-effective BI data lake/warehouse architecture on Google Cloud, implementing data governance and security policies
  • Optimized and migrated six Cloud SQL databases, improving performance by 40% and reducing maintenance time by 25%
  • Refactored the data processing framework, resulting in a 70% average performance improvement
  • Built a framework for real-time data pipelines with Spark, Kafka, and Hadoop, reducing costs by 65%
  • Implemented a local execution environment for ETL frameworks, improving project agility by 80% and reducing implementation time by 40%
  • Collaborated with multiple teams across Telefónica IoT and the Video Platform to keep cross-functional work aligned
  • Saved over €80k on infrastructure in one year
Python · Apache Spark · Apache Kafka · Hadoop · Google Cloud Platform · BigQuery · Cloud SQL · Terraform · Airflow · Dataflow · Looker

Data Architecture and Governance for BBVA

Mar 2021 — Jun 2022

Bluetab (IBM Company)

Designed and governed data architecture solutions for BBVA, creating Scala/Spark frameworks and monitoring systems that supported 40 data engineers and 10 teams.

  • Supported 40 data engineers with expert solution design, reducing project delivery time by 30%
  • Created and maintained Scala/Spark-based frameworks, enabling 10 teams to improve productivity by 25%
  • Collaborated with managers, data scientists, analysts, and engineers to translate business needs into architecture solutions, improving project delivery by 15%
  • Optimized Scala/Spark processes, reducing process time by 30% across multiple workflows
  • Designed and developed a data lifecycle system on an on-premises cloud, achieving 30% savings in storage costs for area projects
  • Conducted code reviews and supervised 40 developers, ensuring adoption of best practices and reducing bugs in production
  • Designed and developed a monitoring and observability system for 500+ processes using Spark and Looker, cutting operational costs by 30%
  • Led and mentored a team of new developers, accelerating onboarding through training and task delegation
Scala · Apache Spark · Looker · Python · SQL · Hadoop

Data Engineer at Santander Global Tech UK

Sep 2020 — Mar 2021

Grupo AVALON

Built data processing pipelines for Santander's business intelligence platform, developing Scala/Spark and PySpark processes and designing cost-efficient data pipelines on hybrid cloud with AWS.

  • Developed Scala/Spark and PySpark processes for business intelligence projects, improving processing efficiency by 20%
  • Created Spark processes based on non-technical requirements, reducing development time by 25% and improving stakeholder alignment
  • Designed data pipelines to reduce cluster costs on hybrid cloud with AWS
  • Translated Hive/Impala processes to Spark code, increasing clarity and maintainability by 40% and reducing technical debt
Scala · Apache Spark · PySpark · Python · AWS · SQL · Hadoop · Hive

Projects

Data Platform Architecture at Telefónica CDO

Featured

Architected and led a scalable, cost-effective BI data lake/warehouse on Google Cloud processing billions of events daily. Implemented real-time data pipelines with Spark, Kafka, and Hadoop, reducing costs by 65% and saving over €80k on infrastructure in one year.

data-engineering architecture gcp real-time leadership
active

ML Pipeline Framework

Built an end-to-end ML pipeline framework automating model training, validation, and deployment on Google Cloud Platform, reducing ML model delivery time and enabling production-grade ML systems.

machine-learning mlops gcp data-pipelines
completed

Programming is not an art

Featured

Free programming course focused on enabling people with no technical knowledge to get started in software development. Two editions covering Java, OOP, SQL, and cloud fundamentals with a pragmatic, engineering-focused approach.

education programming community engineering
active

Skills

Cloud

  • Amazon Web Services (familiar)
  • Google Cloud Platform (expert)

Frameworks & Libraries

  • Apache Airflow (proficient)
  • Apache Kafka (proficient)
  • Apache Spark (expert)
  • Hadoop (proficient)
  • Machine Learning (proficient)

Languages

  • English (proficient)
  • Spanish (expert)

Methodologies

  • Agile & Scrum (expert)

Programming

  • Bash (proficient)
  • Go (familiar)
  • Java (intermediate)
  • JavaScript (familiar)
  • Python (proficient)
  • Scala (proficient)
  • SQL (proficient)
  • TypeScript (familiar)

Soft Skills

  • Leadership (expert)

Tools

  • Docker (proficient)
  • Git & GitHub (proficient)
  • Jenkins (intermediate)
  • Looker (proficient)
  • Terraform (proficient)

Certifications

AWS Solutions Architect Associate

Amazon Web Services

Issued January 2025

Google Professional Cloud Architect

Google Cloud

Issued April 2024

Google Professional Data Engineer

Google Cloud

Issued April 2024

Google Professional Machine Learning Engineer

Google Cloud

Issued April 2024

Education

Universidad Politécnica de Madrid (UPM)

BSc in Telematics Engineering

Madrid, Spain

University of Granada

ERP Systems Program (SICUE Exchange)

Granada, Spain

IES Palomeras-Vallecas

Technical Diploma in Network & Information Services Management

Madrid, Spain

Get in Touch

Interested in collaborating, discussing data engineering, or just saying hello? I'd love to hear from you.