Tools & technologies
Machine learning pipelines
From raw data to a trained model, with a process you can repeat: data preparation, leakage-free validation, model comparison and honest evaluation.
Leakage-free, from the start
Preprocessing lives inside the pipeline, so every transformation is fit only on the training portion of each cross-validation fold and never sees the held-out rows. What I measure in validation is an estimate of what you can expect in production, not an inflated number.
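To make the idea concrete, here is a minimal sketch in plain Python. It assumes a single numeric feature and standardization as the only preprocessing step; the data and the helper names (`kfold_indices`, `fit_scaler`, `leakage_free_folds`) are hypothetical and chosen for illustration. In a real project a pipeline object from an ML library would normally handle this bookkeeping.

```python
from statistics import mean, pstdev


def kfold_indices(n, k):
    """Yield (train_idx, val_idx) for k contiguous folds over range(n)."""
    fold_sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        val = list(range(start, start + size))
        train = [i for i in range(n) if i < start or i >= start + size]
        yield train, val
        start += size


def fit_scaler(values):
    """Learn mean and standard deviation from the given values only."""
    mu = mean(values)
    sigma = pstdev(values) or 1.0  # guard against a constant feature
    return mu, sigma


def transform(values, scaler):
    """Standardize values with statistics learned elsewhere."""
    mu, sigma = scaler
    return [(v - mu) / sigma for v in values]


def leakage_free_folds(x, k=3):
    """Per fold: fit the scaler on training rows, then apply it to both splits."""
    folds = []
    for train_idx, val_idx in kfold_indices(len(x), k):
        scaler = fit_scaler([x[i] for i in train_idx])  # training rows only
        x_train = transform([x[i] for i in train_idx], scaler)
        x_val = transform([x[i] for i in val_idx], scaler)  # reuses training statistics
        folds.append((scaler, x_train, x_val))
    return folds


# Hypothetical feature with one outlier in the last position.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 100.0]
folds = leakage_free_folds(x, k=3)
```

In the last fold the outlier sits in the validation split, so the scaler never sees it: the training mean stays at 2.5 instead of being pulled toward the global mean. Fitting the scaler on all six rows before splitting would let that held-out value shape the training features, which is the leak this approach avoids.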
Evaluation that tells the truth
Metrics that fit the problem, not just accuracy: ROC-AUC and PR-AUC on imbalanced data, probability calibration and per-segment error analysis to see where the model fails and why.
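As an illustration of why accuracy alone can mislead, the sketch below computes ROC-AUC and a per-segment error rate in plain Python. The labels, scores, segment names and the 0.5 threshold are made-up assumptions, and the helpers (`roc_auc`, `error_rate_by_segment`) are hypothetical; PR-AUC and calibration are not shown here.

```python
from collections import defaultdict


def roc_auc(y_true, scores):
    """ROC-AUC as the chance that a random positive outranks a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("ROC-AUC needs at least one positive and one negative")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def error_rate_by_segment(y_true, scores, segments, threshold=0.5):
    """Share of misclassified rows in each segment at the given threshold."""
    counts = defaultdict(lambda: [0, 0])  # segment -> [errors, total]
    for y, s, seg in zip(y_true, scores, segments):
        pred = 1 if s >= threshold else 0
        counts[seg][0] += int(pred != y)
        counts[seg][1] += 1
    return {seg: errors / total for seg, (errors, total) in counts.items()}


# Hypothetical imbalanced sample: 6 negatives, 2 positives, two segments.
y_true = [0, 0, 0, 0, 0, 0, 1, 1]
scores = [0.1, 0.2, 0.3, 0.4, 0.2, 0.6, 0.7, 0.4]
segments = ["a", "a", "a", "a", "b", "b", "a", "b"]

accuracy = sum(int((s >= 0.5) == bool(y)) for y, s in zip(y_true, scores)) / len(y_true)
auc = roc_auc(y_true, scores)
by_segment = error_rate_by_segment(y_true, scores, segments)
```

On this toy sample the overall accuracy looks acceptable, yet every error falls in segment "b". That is the kind of pattern a single headline number hides and a per-segment breakdown exposes.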
Operating locations
Studio in Madrid
Based in Madrid, working remotely with brands, studios and agencies inside and outside Spain.
Infographic (Decision flow): Reproducible, not a loose notebook.
Infographic (Friction map): Leakage-free validation.
Infographic (Journey shape): Honest evaluation.
Work with JMWEB
Let's build something that reaches production.
It all starts with a conversation. Bring a dataset, a goal or a model that is stuck; I will take care of the rest.
Start a project
Next projects:

Credit Risk Platform
End-to-end MLOps platform for credit-default scoring: reproducible training with a versioned model registry, a FastAPI inference API with input validation, PSI drift monitoring and a Prometheus and Grafana observability stack, all containerised with CI. Code at github.com/delcenjo/credit-risk-platform.
View project
Transformer from scratch
A GPT-style language model implemented from scratch in PyTorch: multi-head attention, causal masking and residual blocks written by hand, plus a byte-pair tokenizer and an ablation study. Code at github.com/delcenjo/transformer-from-scratch.
View project
Credit Risk Model
Credit-default risk model with rigorous evaluation: a leakage-free pipeline, cross-validated model comparison, calibrated probabilities, a cost-based decision threshold and per-segment error analysis. Code at github.com/delcenjo/credit-risk.
View project
AI Insight Assistant
An assistant that combines RAG and a SQL agent over your data: it retrieves from a document corpus and queries a database, served with FastAPI and Streamlit and packaged in Docker. Code at github.com/delcenjo/ai-insight-assistant.
View project