Abstract data network visualization with glowing blue connections
13+ Years of Experience

Arshiya (r-she-yeah) Hussain

Senior Data Engineer & Architect

AWSKafkaSparkAirflowJavaPython
About

Professional Profile

Senior Data/Software Engineer and Architect specialised in building scalable data pipelines and lakes on AWS, with extensive experience in Kafka, Spark, EMR, and Airflow. Proven ability to lead autonomous, high-throughput technical solutions for enterprise clients, delivering major improvements in efficiency, processing time, and data governance.

Cloud Architecture

Deep expertise in AWS services including EMR, S3, Lambda, Kinesis, SQS, SNS, and CloudFormation.

Data Pipelines

Scalable ETL/ELT pipelines with Spark, Kafka, Airflow, and Apache Beam for batch and stream processing.

System Design

Microservices architecture, event-driven systems, and high-throughput data lake design.

Enterprise Delivery

Proven track record delivering autonomous, cost-efficient solutions for major enterprise clients.

Career

Professional Experience

Senior Data Engineer (Cloud Data Architecture)

BBC Worldwide, UK
Feb 2016 – Present

Senior developer designing and deploying microservices-based applications and high-throughput data pipelines on AWS for the Audience Data team, leveraging multiple languages (Java, Python, Scala).

  • Engineered end-to-end data analytics pipeline for Single Customer View (SCV), orchestrating dependencies with Airflow and metadata cataloguing using AWS Glue
  • Designed Amazon S3-based data lake architecture, providing aggregated data to analysts via Redshift
  • Developed core ingestion service (Java Spring Boot, EC2/ASG) and performed heavy data transformations on Amazon EMR (Spark)
  • Led technical design for greenfield Audience Gateway system, optimising transactional performance with Spring Boot and Amazon MS Kafka
  • Optimised pipeline performance by 70% and achieved 300% cost-effective scalability through architectural choices and EMR tuning
  • Developed end-to-end automated testing framework for AWS stacks and reusable pipelines for ID/Campaign Attribution using Apache Beam

Software Developer

IBM UK - Monitise, UK
Jul 2013 – Feb 2016

Developed APIs for Standing Order Maintenance and Mobile Payment Service using Apache CXF web services. Established CI/CD via Jenkins.

  • Built SOAP and REST API services for core payment products
  • Implemented CI/CD strategies boosting deployment reliability

Lead Developer

Trafalgar Management Services, UK
Jan 2011 – Jul 2013

Developed and maintained Java/J2EE CMS applications and search engine integration, and managed core systems administration.

  • Created CCAPI web services and integrated SOLR search for CMS
  • Acted as Senior System Administrator (Linux, MySQL, Apache2, Tomcat)
Expertise

Technical Skills

Programming

Java 11Spring BootGroovyScalaPython 3.7

Cloud (AWS)

SQSSNSLambdaEC2EMR (Spark/Livy)KinesisMS KafkaCloudFormationDynamoDB

Data Pipelining

Apache BeamSparkGlueHiveLivyApache AirflowAVRO Schemas

MLOps & CI/CD

JenkinsMavenANTGITGERRITDockerKubernetes (COSMOS ECS)

Testing & Quality

TestContainersLocalStackGatlingCucumber BDDJUnitMockito
Impact

Key Achievements

90%

Pipeline Processing Time Reduction

Drastically reduced data pipeline processing time through architectural optimisation.

300%

Cost-Effective Scalability

Achieved through technical redesign and EMR cluster tuning.

70%

Performance Optimisation

Optimised transactional system performance with multi-threading strategies.

E2E

Microservices on AWS

Designed, deployed, and automated end-to-end microservices solutions.

Scale

Enterprise Systems

Delivered high-throughput, scalable, and autonomous enterprise systems.

TDD/BDD

Quality Practices

Implemented comprehensive automated testing across all projects.

Education

Academic Background

MSc Enterprise Architecture

Northumbria University, UK

2025

B.Sc. (Hons) in Multimedia Technology

University of Huddersfield, UK

2:1 Honours