Profile Photo

Senior Data Engineer

7+ years building robust, scalable data pipelines for fraud detection, analytics, and real-time systems

Contact Me

About Me

Senior Data Engineer with 7+ years of experience specializing in building scalable data pipelines, optimizing big data workloads, and implementing machine learning solutions. Proven expertise in anti-money laundering systems, fraud detection, and A/B testing platforms. Strong background in both batch and streaming data processing using modern cloud-native technologies. Currently leading data engineering teams while driving strategic AML initiatives at BNP Paribas CIB.

Technical Skills

Programming Languages

  • Scala
  • Python
  • Java
  • SQL
  • OCaml
  • JavaScript(React.js, Node.js)
  • HTML

Big Data & Processing

  • Apache Spark
  • Kafka
  • Hadoop Ecosystem
  • HBase
  • Airflow
  • Quantexa Platform
  • ELK Stack (Elasticsearch, Logstash, Kibana)

Databases & Storage

  • MongoDB
  • MySQL
  • ElasticSearch
  • HBase
  • S3

Cloud Platforms

  • AWS
  • Databricks
  • Private Cloud

DevOps & Infrastructure

  • Kubernetes (Advanced)
  • Docker
  • Jenkins
  • GitHub Actions
  • Terraform
  • Git
  • Skaffold & Kustomize

Leadership & Management

  • Team Building & Mentorship
  • Technical Leadership
  • Agile / Scrum
  • Cross-functional Collaboration

Professional Experience

BNP Paribas Corporate & Institutional Banking

Paris, France

Lead Data Engineer

Big Data Team - IT Trade Finance | Technical Leadership & Team Building

05/2025 - Present

Promoted to Lead Data Engineer, now responsible for technical leadership, team management, and establishing engineering excellence within the Big Data team while continuing to drive strategic AML & fraud detection initiatives.

  • Team Building & Management: Led the establishment of a new data engineering team, including recruitment strategy, team structure design, and resource allocation to support growing AML/fraud detection initiatives
  • Technical Leadership & Mentorship: Provided technical direction on complex data engineering challenges, performed code reviews, and made key architectural decisions for scalable and resilient data platform evolution
  • Engineering Excellence: Established coding standards, architectural patterns, and development best practices for data pipelines, ensuring consistency, maintainability, and high code quality across the team
  • Performance & Quality: Implemented monitoring frameworks, performance benchmarks, and quality gates to ensure optimal pipeline performance and data accuracy for AML detection
Technical Stack: Scala, Spark, Kubernetes, Jenkins, Git, Quantexa Platform, Skaffold, Kustomize, S3, CI/CD, DataOps, Team Leadership, Architecture Design

Senior Data Engineer

Big Data Team - IT Trade Finance | Strategic AML & Fraud Detection Project

05/2022 - 05/2025

Worked within BNP Paribas CIB's Big Data team on a strategic anti-money laundering and financial fraud detection project. Designed, developed, and maintained comprehensive data pipelines for identifying suspicious cases.

  • End-to-end Data Pipeline Development: Designed and implemented complete data workflows from ingestion to alert generation and business exposure, delivering actionable fraud alerts to compliance teams
  • Advanced Spark Optimization: Performed advanced tuning of Spark jobs, including code optimization, data skew management, and performance improvements on massive data volumes
  • Quantexa Platform Integration: Integrated Quantexa AI-powered platform to enrich data, build relational graphs for network analysis, and provide contextual insights around AML alerts
  • Cloud Infrastructure & CI/CD: Deployed and orchestrated data pipelines on private cloud using Kubernetes, and established automated CI/CD pipelines using Jenkins and Git
Technical Stack: Scala, Spark (Advanced Tuning), Kubernetes, Jenkins, Git, Skaffold, Kustomize, S3, Deep Java Library, Quantexa Platform, CI/CD, DataOps, Anti-Money Laundering, Fraud Detection

Bedrock Streaming

Lyon, France

Data Engineer

A/B Testing Team

01/2022 - 05/2022
  • Development of data pipelines for A/B testing framework supporting millions of users
  • Setting up CI/CD chain with GitHub Actions and Jenkins for automated deployments
  • Airflow DAGs implementations for workflow orchestration and data pipeline management
  • Building AWS and Databricks infrastructure with Terraform for scalable data processing
Technical Stack: Scala, Python, Spark, Jenkins, GitHub Actions, AWS, Databricks, Terraform, Docker, Airflow, Scrum, Confluence

Société Générale

Paris, France

Data Engineer

MOSAIC Team - Check Fraud Perimeter

11/2020 - 01/2022

As a member of the MOSAIC team, we developed a platform for detecting fraud across various payment methods such as checks, instant payments, and wire transfers. I was responsible for the check fraud perimeter, working on the design and implementation of scoring pipelines under a microservices architecture, in both batch and streaming modes.

  • Scalable Data Pipeline Development: Built scalable data pipelines for parsing, profiling, and scoring financial transactions under a microservices architecture, supporting both batch and streaming processing modes
  • Application Maintenance & Evolution: Maintained and evolved applications handling large-scale payment data, ensuring system reliability and performance for fraud detection operations
  • Agile Collaboration: Represented the check fraud perimeter in agile ceremonies and daily stand-ups, facilitating cross-functional communication and alignment on fraud detection initiatives
  • Machine Learning Implementation: Developed and deployed a machine learning model for automated check fraud detection with high accuracy, contributing to the platform's fraud prevention capabilities
  • Stakeholder Reporting: Produced weekly reports to update stakeholders on progress and planned actions, ensuring transparency and alignment on fraud detection project milestones
Technical Stack: Scala, Python, Java, Spark, Kafka, HBase, Hive, Hadoop, Cloudera, DevOps, Machine Learning, Microservices

Moobifun

Lyon, France

Data Engineer

Data Platform & Machine Learning

10/2018 - 11/2020
  • Responsible for data platform managing over 7 terabytes of data based on ELK stack
  • Developed and maintained systems for collecting, storing, and making available internally produced data
  • Conducted POCs on data tools and implemented new data platform architecture (Lambda architecture)
  • Developed machine learning engines for customer segmentation and retention
  • Productionized machine learning engines via REST API with Flask
  • Developed WhatsApp and Telegram bots in Node.js for customer engagement
  • Supported newcomers and interns through mentorship and technical guidance
Technical Stack: MySQL, Python3, Scala, Spark, Hadoop, Kafka, ElasticSearch (ELK), Hive, Scikit-learn, Hortonworks, Flask, Node.js, REST APIs

LIP6 - Computer Science Laboratory

Paris, France

Research Intern

Graph Theory & Algorithm Development

01/2017 - 05/2017
  • Analysis of thesis chapter showing construction rules for a class of graphs
  • Description and implementation (OCaml) of algorithm for random generation of graphs
  • Conducted research on graph theory applications and algorithmic complexity
Technical Stack: OCaml, Graph Theory, Algorithms, Research Methodology

Education & Certifications

Master's Degree in Software Engineering

Sorbonne University - Computer Science

2016 - 2018

Certifications

Databricks Certified Associate Developer for Apache Spark 3.0 Machine Learning (Coursera)

Get In Touch

I'm currently open to new opportunities. Feel free to reach out to discuss how I can contribute to your data engineering needs.