Hello! I am Rana Hamza Intisar. I am a Senior Data Engineer, Data Scientist and a Photographer. I am a curious engineer behind a computer, bridging the entire data engineering stack with the data science world. When I am not a digital guy, I am a movie lover with great passion for photography and discovering new cultures.
April 2024 – Present
(A company of LexisNexis)
Berlin, Germany
- Modernized a mission-critical legacy system by migrating C# based batch jobs to Databricks with PySpark, improving maintainability and reducing processing time by ~50%
- Designed and optimized high-throughput, scalable ETL pipelines handling 3–5× growth in data volume, ensuring end-to-end reliability and low-latency delivery
- Accelerated delivery by introducing Agile workflow optimizations, improving sprint predictability and throughput by ~20%
- Supported team scaling — hiring, onboarding, and mentoring data engineers and data scientists, raising overall delivery quality and standards within the team
- Tools and Technologies: PySpark, Databricks, PostgreSql, Airflow, Dremio, Jenkins, Delta Lake, Azure, Docker, Kubernetes, Java, Python
January 2022 – May 2024
(A company of LexisNexis)
Berlin, Germany
- Created NLP-based semantic similarity models (Gensim) improving text-analysis accuracy and information-retrieval relevance by >25%
- Tools and Technologies: Spark, Airflow, SQL,Jenkins, Delta Lake, Azure, Kubernetes, Python
May 2021 – April 2024
(A company of LexisNexis)
Berlin, Germany
- Developed and maintained Spark-based streaming and batch pipelines powering live platform features, improving data freshness from daily to near real-time
- Built and productionized Kubernetes-orchestrated Spark workflows with Apache Airflow, cutting operational interventions by 40% through automation
- Profiling and resource tuning improved Spark cluster usage efficiency, reducing cost per run by 30% and eliminating key performance bottlenecks
- Designed and maintained Delta Lake pipelines on Azure Data Lake, improving governance, lineage tracking, and downstream developer autonomy
- Accelerated delivery by introducing Agile workflow optimizations, improving sprint predictability and throughput by ~20%
- Supported team scaling — hiring, onboarding, and mentoring data engineers and data scientists, raising overall delivery quality and standards within the team
- Tools and Technologies: Spark, PostgreSql, Airflow, Dremio, Jenkins, Delta Lake, Azure, Docker, Kubernetes, Java, Python
February 2021 – April 2021
Islamabad, Pakistan
- Developed and productionized generative AI models to optimize customer–agent pairing in call centers, improving first-call resolution by ~15% and reducing average handling time by 10–20% through smarter skill-based routing
- Designed and built real-time ML deployment pipelines enabling seamless model updates, low- latency inference, and automated monitoring, significantly improving model uptime and operational reliability
- Technologies and Tool: R, Python, STAN
February 2020 – June 2020
(A company of Allianz)
Hannover, Germany
- Engineered scalable real-time data ingestion and transformation pipelines using Java Spring Boot and Python, reducing processing latency and improving data delivery reliability across multiple cloud environments (AWS & Azure)
- Converted real-time data into time series and stored in time series database (InfluxDB)
- Applied unsupervised anomaly detection models on live data streams, generating hourly and daily performance metrics that improved early-issue detection and system visibility
- Developed monitoring dashboards and analytics visualizations in Metabase and Grafana, enabling real-time decision-making for product stakeholders
- Technologies and Tool: Spring Boot, Scikit-Learn, Graffana, Docker, Kubernetes, Kafka, AWS, InfluxDB, Jenkins, Bash, Python
July 2019 – June 2020
(A company of Allianz)
Hannover, Germany
- Built microservices consuming the OpenSky Network live API, converting aviation telemetry into time-series datasets stored in InfluxDB, enabling continuous operational analytics
- Designed and deployed containerized applications with Docker and Kubernetes, improving deployment repeatability and streamlining CI/CD workflows
- Developed monitoring dashboards and analytics visualizations in Metabase and Grafana, enabling real-time decision-making for product stakeholders
- Actively contributed to Agile ceremonies and cross-functional delivery, driving feature releases and platform scalability improvements for the Granary product
- Technologies and Tools: Spring Boot, Metabase, Graffana, Prometheus, Kafka, PostGreSQL, Docker, Kubernetes, Jenkins, AWS, Azure, Bash, Java, Python
March 2019 – May 2019
Bremen, Germany
- Built and deployed a personalized product recommendation engine using TensorFlow and Google Cloud Analytics, driving +18–25% uplift in click-through-rates and increasing cross-sell conversions through predictive customer preference modeling
- Analyzed dormant customer behavior and developed targeted promotion strategies based on historical purchase patterns, contributing to reactivation of churned users and boosting repeat-purchase rates
- Independently delivered data-driven insights through clear, narrative-focused presentations, shaping marketing campaign decisions and improving stakeholder alignment on customer engagement strategies
- Technologies and Tools: TensorFlow, Google Cloud Analytics Platform, Tableau, Python
August 2017 – August 2018
Islamabad, Pakistan
- Built and optimized large-scale Data Lake architectures for Tier-1 telecom operators (Jazz, Banglalink, Djezzy), integrating distributed technologies including Hive, Spark, Ignite, Cassandra, Kafka, NiFi, and Tez to enable high-throughput ingestion and analytics
- Engineered Source-to-Target data mappings to ensure accurate business rule implementation and seamless downstream consumption across enterprise systems
- Led requirements discovery sessions with client stakeholders, translating business objectives into scalable technical designs and ensuring delivery alignment
- Automated job scheduling, testing, and operational monitoring, improving pipeline reliability and reducing execution failures
- Enhanced data integration workflows by developing robust ETL components in Teradata UDI Studio, reducing manual processing overhead
- Supported full release lifecycle including UAT, validation, and controlled deployments, ensuring smooth production rollout and better feature adoption
- Technologies and Tools: Teradata, Hadoop, Spark, Hive, Ignite, Cassandra, Kafka, Nifi, Tez, Bash, Java, Docker
June 2016 – August 2016
Islamabad, Pakistan
- Increasing the security of the cluster with Ranger
- Handling large data sets and transformations with Presto
- Data Visualizations with Tableau
- SQL Development
July 2015 – August 2015
Lahore, Pakistan
- Android app development on number plates of vehicles to track the unregistered vehicles
- User acceptance testing
June 2015 – July 2015
Islamabad, Pakistan
- Computer networking for huawei servers in local telecommunication sector
- Maintaining server networks
2018 - 2020
Jacobs University, Bremen, Germany
Focusing on:
- Data Analytics
- Data Mining
- Machine Learning
- Statistical Modeling
- Big Databases and cloud services
- Databases
- Data Visualization and Image Processing
- Data Security
2013 - 2017
National University of Computer and Emerging Sciences, Islamabad, Pakistan
Focusing on:
- Data Structures
- Advanced Programming
- Database Systems
- Data Warehousing
- Artificial Intelligence
- Concurrent and Distributed systems
2011 - 2013
Cambridge International Examinations
Beaconhouse School System Margalla Campus, Islamabad, Pakistan
Focusing on:
- Mathematics
- Physics
- Chemistry
2009 - 2011
Cambridge International Examinations
The City School, Bahawalpur, Pakistan
Focusing on:
- Mathematics
- Physics
- Chemistry
- Biology
Address
Berlin, Germany
Phone
+49-1514-3622712
ranahamzaintisar@gmail.com
