Available for opportunities

Hello, I'm Sidi Camara

|

Building robust data architectures and real-time processing pipelines. Specialized in Big Data solutions for the financial sector with expertise in AML, Fraud Detection, and scalable cloud infrastructures.

SC
0 Years Experience
0 TB Data Managed
0 Teams Built
Scala Spark AWS
Scroll to explore

About Me

Lead/Senior Data Engineer with 6+ years of experience designing and implementing robust data architectures in demanding environments.

I specialize in building scalable Big Data solutions and real-time processing pipelines for the financial sector. My expertise spans from low-latency streaming systems to complex analytical platforms, with a particular focus on AML (Anti-Money Laundering) and Fraud Detection systems.

Throughout my career at major financial institutions like BNP Paribas CIB and Société Générale, I've led technical teams, architected mission-critical systems, and delivered high-impact projects that process millions of transactions in real-time.

Big Data Architecture

Designing scalable distributed systems handling petabytes of data

Real-time Processing

Building low-latency streaming pipelines for instant insights

Financial Security

AML & Fraud Detection systems protecting billions in transactions

0 +
Years of Experience
0 + TB
Data Platform Managed
0
Major Financial Institutions
0 +
Engineers Mentored

Technical Skills

Languages

Scala Python Java PySpark SQL Node.js React / React Native

Big Data & Streaming

Apache Spark Apache Kafka Filebeat Logstash Flume Apache Hive HBase

Cloud & Infrastructure

AWS Kubernetes Docker Terraform Databricks Hortonworks Cloudera

CI/CD & Orchestration

Jenkins GitHub Actions Skaffold Kustomize Airflow Maven SBT

AI & Analytics

Quantexa H2O Scikit-learn Deep Java Library

Databases

MongoDB MySQL ElasticSearch HBase S3

Professional Experience

May 2022 - Present 2+ years
Current

Lead / Senior Data Engineer

BNP Paribas CIB

IT Trade Finance - AML & Fraud Detection Program, Paris

Leadership & Management

  • Built and structured a new data engineering team from scratch: hiring, onboarding, resource planning, and upskilling
  • Drove architectural decisions and conducted code reviews, ensuring maintainability and code quality
  • Defined development standards, architectural patterns, and best practices adopted team-wide

Data Engineering

  • Designed end-to-end data pipelines (ingestion, transformation, scoring, alert generation) processing high-volume transactional data
  • Optimized Spark jobs through advanced tuning and data skew management, significantly reducing execution times
  • Integrated Quantexa AI platform for building entity-resolution graphs and enriching AML alerts with contextual intelligence
  • Deployed and orchestrated services on private Kubernetes cloud using Skaffold, Kustomize, and S3
Scala Spark Kubernetes Jenkins Git Skaffold S3 Deep Java Library Quantexa
Jan 2022 - May 2022 5 months

Data Engineer (Freelance)

Bedrock Streaming

A/B Testing Team, Lyon

  • Developed and maintained data pipelines for the A/B testing platform, processing large-scale audience data for France's leading streaming services
AWS Terraform Python Scala Spark Databricks Airflow
Nov 2020 - Jan 2022 1+ years

Data Engineer - Technical Lead

Société Générale

MOSAIC Team - Check Fraud Detection, Paris

  • Built microservices-based data pipelines from raw data parsing to transaction scoring in batch and streaming modes
  • Developed and deployed a Machine Learning model (H2O / GBM) for automated check fraud detection into production
  • Served as technical point of contact for check fraud: daily stand-ups, weekly reporting, and cross-team coordination
Scala Python Java Spark Kafka HBase Hive Cloudera
Oct 2018 - Nov 2020 2 years

Data Engineer & Scrum Master

Moobifun

Data Platform Owner (7+ TB), Lyon

  • Managed the full data platform built on the ELK stack: collection, storage, transformation, and data serving
  • Led a POC and implemented a Lambda Architecture, improving both real-time and batch processing capabilities
  • Built Machine Learning engines for customer segmentation and retention, deployed via REST API (Flask)
  • Acted as Scrum Master for a 6-person team and mentored junior engineers and interns
MySQL Python Node.js Spark Hadoop Kafka ElasticSearch Scikit-learn
Feb 2018 - Aug 2018 6 months

Process Automation Intern

AXA France

Nanterre, France

  • Automated business processes and functional testing using C#, Java, LeanFT, and Azure DevOps
C# Java LeanFT Azure DevOps
Jan 2017 - May 2017 5 months

Research Intern

LIP6 - Sorbonne University

Paris, France

  • Analyzed and implemented (OCaml) a random graph generation algorithm derived from graph theory research
OCaml Graph Theory Algorithms

Education & Certifications

Education

2016 - 2018

M.Sc. Computer Science

Sorbonne University (UPMC), Paris

Software Science & Technology

2013 - 2016

B.Sc. Mathematics & Computer Science

University of Hassiba Ben Bouali, Chlef, Algeria

Ranked 1st out of 300 students

Certifications

Machine Learning

Stanford University (Coursera)

Deep Learning Specialization

deeplearning.ai (Coursera)

Hadoop Platform and Application Framework

Big Data Certification

Languages

French Native
English Fluent (C1)

Let's Work Together

Interested in collaborating or have a project in mind? I'd love to hear from you.

LinkedIn LinkedIn Profile
Location Paris, France

Prefer a more traditional approach?

Download my CV