Understanding Isolation Forest for Anomaly Detection
Isolation Forest is one of those machine learning algorithms that seems almost too simple, yet surprisingly powerful.
Oct 7, 2025

I'm a Data Scientist, Data Engineer, and AI Engineer - the kind of profile that bridges the gap between research and production.
Over the past few years, I've worked across the full AI stack: designing Big Data pipelines that process 500M+ time-series data points at SNCF, building hybrid recommendation systems combining knowledge graphs and NLP, and shipping LLM-powered agents and RAG pipelines for real clients as a freelance AI developer.
My current focus is Generative AI engineering - RAG architectures, multi-agent systems with LangGraph, and LLM deployment. I care about systems that actually work in production, not just in notebooks.
When I'm not building, I write about ML on Medium and contribute to open-source AI projects on GitHub.
Master's in Data Science
Université Lumière - Lyon, France
Bachelor's in Data Science
Université Lumière - Lyon, France
Bachelor's in Software Engineering
IFRI - Calavi, Benin
French
Native
English
Professional B2 Level
Python - Kaggle
Machine Learning Engineer - Google Skills
GCP Essentials - Google Skills
Freelance
November 2025 - Present
SNCF Réseau
March 2025 - September 2025
EffetB
April 2024 - August 2024
Intside LLC
December 2021 - August 2023
Alibora SARL
December 2021 - August 2022
3+
Years Experience
15+
Projects Done
30+
Technologies

A production LLM-powered SaaS for LinkedIn prospecting - AI scoring, automated scraping pipeline, and outreach sequencing at scale.

Voice-based emotion detection system using Wav2Vec2 for audio feature extraction and LSTM networks for multi-class emotion classification.
E-commerce platform with recommendation engine using collaborative filtering.
Production-grade anomaly detection pipeline on 500M+ time-series records. LSTM Autoencoder for pattern learning combined with Isolation Forest for outlier scoring - deployed on Azure Databricks with PySpark.
Hybrid recommendation engine combining Neo4j knowledge graphs with NLP embeddings - 30% cold start reduction via contextual embeddings and community detection (GraphSAGE, FastRP).
End-to-end RAG system for intelligent document Q&A. Hybrid retrieval (dense + BM25), re-ranking, and grounding check to minimize hallucinations. Built with LangChain, FastAPI, and a vector database.
Isolation Forest is one of those machine learning algorithms that seems almost too simple, yet surprisingly powerful.
Oct 7, 2025
How to detect anomalies in sequential data using a deep learning-based LSTM Autoencoder, followed by KMeans clustering.
Oct 7, 2025
I'm actively looking for a CDI position in AI Engineering, LLM systems, or Data Engineering - ideally working on ambitious, production-focused AI products. Open to hybrid or remote roles across France. If you're building something interesting or want to talk RAG, agents, or pipelines, I'd love to connect.
Get in Touch