Understanding Isolation Forest for Anomaly Detection
Isolation Forest is one of those machine learning algorithms that seems almost too simple, yet surprisingly powerful.
Oct 7, 2025

I'm a Data Scientist, Data Engineer, and AI Engineer - the kind of profile that bridges the gap between research and production.
Over the past few years, I've worked across the full AI stack: designing Big Data pipelines that process 500M+ time-series data points at SNCF, building hybrid recommendation systems combining knowledge graphs and NLP, and shipping LLM-powered agents and RAG pipelines for real clients as a freelance AI developer.
My current focus is Generative AI engineering - RAG architectures, multi-agent systems with LangGraph, and LLM deployment. I care about systems that actually work in production, not just in notebooks.
When I'm not building, I write about ML on Medium and contribute to open-source AI projects on GitHub.
Master's in Data Science
Université Lumière - Lyon, France
Bachelor's in Data Science
Université Lumière - Lyon, France
Bachelor's in Software Engineering
IFRI - Calavi, Benin
French
Native
English
Professional B2 Level
Python - Kaggle
Machine Learning Engineer - Google Skills
GCP Essentials - Google Skills
Freelance
November 2025 - Present
SNCF Réseau
March 2025 - September 2025
EffetB
April 2024 - August 2024
Intside LLC
December 2021 - August 2023
Alibora SARL
December 2021 - August 2022
3+
Years Experience
15+
Projects Done
30+
Technologies

A production LLM-powered SaaS for LinkedIn prospecting - AI scoring, automated scraping pipeline, and outreach sequencing at scale.

Voice-based emotion detection system using Wav2Vec2 for audio feature extraction and LSTM networks for multi-class emotion classification.
E-commerce platform with recommendation engine using collaborative filtering.
Production-grade anomaly detection pipeline on 500M+ time-series records. LSTM Autoencoder for pattern learning combined with Isolation Forest for outlier scoring - deployed on Azure Databricks with PySpark.
Hybrid recommendation engine combining Neo4j knowledge graphs with NLP embeddings - 30% cold start reduction via contextual embeddings and community detection (GraphSAGE, FastRP).
End-to-end RAG system for intelligent document Q&A. Hybrid retrieval (dense + BM25), re-ranking, and grounding check to minimize hallucinations. Built with LangChain, FastAPI, and a vector database.
Isolation Forest is one of those machine learning algorithms that seems almost too simple, yet surprisingly powerful.
Oct 7, 2025
How to detect anomalies in sequential data using a deep learning-based LSTM Autoencoder, followed by KMeans clustering.
Oct 7, 2025