Experience
A summary of my professional roles and educational background. I thrive on tackling challenging problems and continuously learning new skills.
Professional Experience
NLP Research Engineer | Fondation des Sciences du Patrimoine (FSP) | Apr 2024 – Apr 2025
- Extracted named entities (metadata, paradata) from unstructured cultural heritage analysis data using LLMs, Prompting, and RAG to ensure provenance and traceability.
- Optimized named entity extraction through the implementation of LLM reasoning techniques (Chain-of-Thought, internal reasoning).
- Fine‑tuned Whisper models with LoRA on cultural heritage audio data, enhancing transcription accuracy of domain‑specific named entities by 5%.
- Linked extracted entities to Opentheso and ORCID knowledge bases, strengthening data accessibility and traceability.
- Partnered with heritage science experts (documentalists, conservators, restorers) to create training and evaluation datasets.
Data Science Research Engineer | Centre national de la recherche scientifique (CNRS) | Feb 2023 – Feb 2024
- Automated the release of new versions of the PARSEME corpora across 26 languages via GitLab CI/CD, considerably reducing publication times.
- Designed and implemented new modules for quality control of linguistic corpora.
- Unified and integrated PARSEME’s disparate tools, increasing operational efficiency by 50%.
- Enhanced French multiword expression identification F1-score by 10% leveraging CamemBERT and FlauBERT models.
- Built an inference pipeline on lab clusters to identify multiword expressions in a large 21GB corpus.
Deep Learning Intern | Capgemini Engineering | Mar 2022 – Aug 2022
- Designed and implemented a lightweight, efficient CNN for avionics protocol recognition using Tensorflow, achieving aerospace‑standard \(10\)ms latency and error rate below \(10^{-5}\).
- Collaborated with simulation team to enhance protocol recognition through data collection, analysis and production testing.
- Orchestrated the deployment of the recognition model on AWS Cloud and Raspberry Pi 4 using TensorFlow Lite, ensuring optimal performance.
- Developed and launched an intuitive, interactive dashboard using Dear PyGui for real‑time monitoring of model performance in production.
- Developed substitution models using Transformer for dynamic systems.
Artificial Intelligence Intern | OLGHAM | Apr 2021 – Aug 2021
- Programmed Scannol, an AI‑based auditing tool using expert systems to detect anomalies in C/C++ code.
- Tracked down and identified efficiently functional errors in each of the 4 source codes analyzed.
- Adopted and mastered the demanding DO‑178C development standards.
Education
M.S. in Computer Science and Computer Security | INSA Centre Val de Loire | 2017 – 2022
- Project: Fused question and image attention to optimize visual question answering (VQA) using deep learning techniques.
Skills
- Programming: Python, C/C++, Java, JavaScript, R, HTML
- Machine Learning: Tensorflow, PyTorch, Hugging Face, Scikit‑learn, LlamaIndex, OpenCV, spaCy, Kubeflow, Spark, Slurm
- Database: SQL, NoSQL, PostgreSQL, Oracle
- Exploratory Data Analysis: Seaborn, Matplotlib
- Cloud Computing: Docker, Kubernetes, AWS, GCP
- Software Development: Git, Jira, Agile, Anaconda, Visual Studio, Django, FastAPI, Jenkins, CI/CD, Bash