About

Hello! I am Rana Hamza Intisar. I am a Senior Data Engineer, Data Scientist, and Photographer. I am a curious engineer behind a computer, bridging the entire data engineering stack with the data science world. When I am not a digital guy, I am a movie lover with a great passion for photography and discovering new cultures.

Basic Information
Age:
30
Email:
ranahamzaintisar@gmail.com
Phone:
+49-1514-3622712
Address:
Berlin, Germany
Language:
English: C1, German: B1, Urdu: Native
Work Experience

April 2024 – Present

LexisNexis IPlytics GmbH

(A company of LexisNexis)

Berlin, Germany

Senior Data Engineer
  • Modernized a mission-critical legacy system by migrating C#-based batch jobs to Databricks with PySpark, improving maintainability and reducing processing time by ~50%
  • Designed and optimized high-throughput, scalable ETL pipelines handling 3–5× growth in data volume, ensuring end-to-end reliability and low-latency delivery
  • Accelerated delivery by introducing Agile workflow optimizations, improving sprint predictability and throughput by ~20%
  • Supported team scaling by hiring, onboarding, and mentoring data engineers and data scientists, raising overall delivery quality and standards within the team
  • Tools and Technologies: PySpark, Databricks, PostgreSQL, Airflow, Dremio, Jenkins, Delta Lake, Azure, Docker, Kubernetes, Java, Python

January 2022 – May 2024

LexisNexis IPlytics GmbH

(A company of LexisNexis)

Berlin, Germany

Data Scientist
  • Created NLP-based semantic similarity models (Gensim) improving text-analysis accuracy and information-retrieval relevance by >25%
  • Tools and Technologies: Spark, Airflow, SQL, Jenkins, Delta Lake, Azure, Kubernetes, Python

May 2021 – April 2024

LexisNexis IPlytics GmbH

(A company of LexisNexis)

Berlin, Germany

Data Engineer
  • Developed and maintained Spark-based streaming and batch pipelines powering live platform features, improving data freshness from daily to near real-time
  • Built and productionized Kubernetes-orchestrated Spark workflows with Apache Airflow, cutting operational interventions by 40% through automation
  • Improved Spark cluster usage efficiency through profiling and resource tuning, reducing cost per run by 30% and eliminating key performance bottlenecks
  • Designed and maintained Delta Lake pipelines on Azure Data Lake, improving governance, lineage tracking, and downstream developer autonomy
  • Accelerated delivery by introducing Agile workflow optimizations, improving sprint predictability and throughput by ~20%
  • Supported team scaling by hiring, onboarding, and mentoring data engineers and data scientists, raising overall delivery quality and standards within the team
  • Tools and Technologies: Spark, PostgreSQL, Airflow, Dremio, Jenkins, Delta Lake, Azure, Docker, Kubernetes, Java, Python

February 2021 – April 2021

Afiniti

Islamabad, Pakistan

Data Scientist - Artificial Intelligence
  • Developed and productionized generative AI models to optimize customer–agent pairing in call centers, improving first-call resolution by ~15% and reducing average handling time by 10–20% through smarter skill-based routing
  • Designed and built real-time ML deployment pipelines enabling seamless model updates, low-latency inference, and automated monitoring, significantly improving model uptime and operational reliability
  • Technologies and Tools: R, Python, STAN

February 2020 – June 2020

Syncier GmbH

(A company of Allianz)

Hannover, Germany

Master Thesis (Data and Machine Learning Engineer)
  • Engineered scalable real-time data ingestion and transformation pipelines using Java Spring Boot and Python, reducing processing latency and improving data delivery reliability across multiple cloud environments (AWS & Azure)
  • Converted real-time data into time series and stored it in a time-series database (InfluxDB)
  • Applied unsupervised anomaly detection models on live data streams, generating hourly and daily performance metrics that improved early-issue detection and system visibility
  • Developed monitoring dashboards and analytics visualizations in Metabase and Grafana, enabling real-time decision-making for product stakeholders
  • Technologies and Tools: Spring Boot, Scikit-Learn, Grafana, Docker, Kubernetes, Kafka, AWS, InfluxDB, Jenkins, Bash, Python

July 2019 – June 2020

Syncier GmbH

(A company of Allianz)

Hannover, Germany

Data Engineer
  • Built microservices consuming the OpenSky Network live API, converting aviation telemetry into time-series datasets stored in InfluxDB, enabling continuous operational analytics
  • Designed and deployed containerized applications with Docker and Kubernetes, improving deployment repeatability and streamlining CI/CD workflows
  • Developed monitoring dashboards and analytics visualizations in Metabase and Grafana, enabling real-time decision-making for product stakeholders
  • Actively contributed to Agile ceremonies and cross-functional delivery, driving feature releases and platform scalability improvements for the Granary product
  • Technologies and Tools: Spring Boot, Metabase, Grafana, Prometheus, Kafka, PostgreSQL, Docker, Kubernetes, Jenkins, AWS, Azure, Bash, Java, Python

March 2019 – May 2019

Lobensberg Gute Weine

Bremen, Germany

Data Analyst - Internship
  • Built and deployed a personalized product recommendation engine using TensorFlow and Google Cloud Analytics, driving an 18–25% uplift in click-through rates and increasing cross-sell conversions through predictive customer preference modeling
  • Analyzed dormant customer behavior and developed targeted promotion strategies based on historical purchase patterns, contributing to reactivation of churned users and boosting repeat-purchase rates
  • Independently delivered data-driven insights through clear, narrative-focused presentations, shaping marketing campaign decisions and improving stakeholder alignment on customer engagement strategies
  • Technologies and Tools: TensorFlow, Google Cloud Analytics Platform, Tableau, Python

August 2017 – August 2018

Teradata Global Delivery Centre

Islamabad, Pakistan

Data Engineer
  • Built and optimized large-scale Data Lake architectures for Tier-1 telecom operators (Jazz, Banglalink, Djezzy), integrating distributed technologies including Hive, Spark, Ignite, Cassandra, Kafka, NiFi, and Tez to enable high-throughput ingestion and analytics
  • Engineered Source-to-Target data mappings to ensure accurate business rule implementation and seamless downstream consumption across enterprise systems
  • Led requirements discovery sessions with client stakeholders, translating business objectives into scalable technical designs and ensuring delivery alignment
  • Automated job scheduling, testing, and operational monitoring, improving pipeline reliability and reducing execution failures
  • Enhanced data integration workflows by developing robust ETL components in Teradata UDI Studio, reducing manual processing overhead
  • Supported full release lifecycle including UAT, validation, and controlled deployments, ensuring smooth production rollout and better feature adoption
  • Technologies and Tools: Teradata, Hadoop, Spark, Hive, Ignite, Cassandra, Kafka, NiFi, Tez, Bash, Java, Docker

June 2016 – August 2016

Teradata Pakistan PVT LTD

Islamabad, Pakistan

Data Engineer - Internship
  • Increasing the security of the cluster with Ranger
  • Handling large data sets and transformations with Presto
  • Data Visualizations with Tableau
  • SQL Development

July 2015 – August 2015

PITB

Lahore, Pakistan

Android Developer Internship
  • Developed an Android app based on vehicle number plates to track unregistered vehicles
  • User acceptance testing

June 2015 – July 2015

Huawei Technologies

Islamabad, Pakistan

Computer Networking Engineer - Internship
  • Computer networking for Huawei servers in the local telecommunications sector
  • Maintaining server networks
Education

2018 - 2020

Master's Degree
Master of Science in Data Engineering

Jacobs University, Bremen, Germany

Focusing on:

  • Data Analytics
  • Data Mining
  • Machine Learning
  • Statistical Modeling
  • Big Databases and Cloud Services
  • Databases
  • Data Visualization and Image Processing
  • Data Security

2013 - 2017

Bachelor's Degree
Bachelor of Science in Computer Sciences

National University of Computer and Emerging Sciences, Islamabad, Pakistan

Focusing on:

  • Data Structures
  • Advanced Programming
  • Database Systems
  • Data Warehousing
  • Artificial Intelligence
  • Concurrent and Distributed Systems

2011 - 2013

Cambridge International Examinations

A-Levels
Pre-Engineering

Beaconhouse School System Margalla Campus, Islamabad, Pakistan

Focusing on:

  • Mathematics
  • Physics
  • Chemistry

2009 - 2011

Cambridge International Examinations

O-Levels
Pre-Engineering

The City School, Bahawalpur, Pakistan

Focusing on:

  • Mathematics
  • Physics
  • Chemistry
  • Biology
Professional Skills
Programming Languages
Java (SE+EE+FX)
80%
Python
80%
R
60%
UNIX
80%
C++
75%
Databases
MySQL
85%
PostgreSQL
85%
Delta Lake
80%
HQL
70%
Cassandra
80%
SQLite
80%
Oracle
80%
Teradata
70%
InfluxDB
80%
Big Data Tools
Apache Spark
85%
Databricks
80%
Apache Airflow
85%
Apache Kafka
80%
Apache Ignite
70%
Apache Hadoop
80%
MapReduce
60%
Apache Pig
60%
Sqoop
60%
Apache Solr
70%
Elasticsearch
60%
Hive
70%
Data Science Tools
Pandas
80%
NumPy
70%
Scikit-Learn
75%
Keras
70%
Matplotlib
70%
TensorFlow
60%
Cloud Technologies
Microsoft Azure
80%
Amazon Web Services
70%
Google Cloud Platform
50%
Programming Tools
Google Kubernetes
80%
Git
80%
Docker
80%
JIRA Software
75%
CI/CD (Jenkins)
70%
Grafana
70%
Tableau
60%
Metabase
60%
Oracle
80%
SQL Server
80%
Helm
70%
Dremio
85%
Professional Courses
March 2025
Python Essentials - O'Reilly
March 2025
Databricks Data Engineer Associate - O'Reilly
June 2024
Data Engineering on Databricks - Databricks
June 2024
Data Analysis on Databricks - Databricks
February 2023
Kubernetes Certified Application Developer - Udemy
November 2022
Apache Airflow: The Operators Guide - Udemy
June 2022
Data Analytics with PySpark - Udemy
April 2021
AWS Foundations: Getting Started with the AWS Cloud Essentials - Pluralsight
April 2021
Getting Started with Spark 2 - Pluralsight
April 2021
Programming with R - Pluralsight
December 2017
Apache Ignite and Apache Cassandra Training - Teradata
References
Contact Me
Feel free to contact me

Address

Berlin, Germany

Phone

+49-1514-3622712

Email

ranahamzaintisar@gmail.com