Skip to content
View michaelromagne's full-sized avatar

Block or report michaelromagne

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
michaelromagne/README.md

👋

As a Machine Learning Engineer with 5 years of experience, I’ve contributed end-to-end to the productionization of multiple AI products at Ubisoft, GitGuardian, and Sanofi.

I’ve worked on complex use cases such as:

  • E-commerce fraud detection using XGBoost, advanced feature engineering, MLOps tooling, AWS, and Kubernetes.
  • Deployment of state-of-the-art NLP models for Secrets Detection (credentials) in source code. Stack: Transformers, PyTorch, FastAPI, ONNX Runtime, AWS EKS.
  • Development of a full Terraform module for an Unstructured Data Pipeline, turning PDFs, PPTX, DOCX into vector embeddings in Pinecone. We used Weave to optimise and monitor the Pipeline. Stack: AWS Lambda, S3, ECR, Step Functions, Claude Sonnet, Amazon Nova Pro, Docling, HuggingFace models, AWS Textract, PyMuPDF, Pinecone, Weave

Portfolio LinkedIn Malt

👨‍🔬 Skills

Programming language: Python

Machine Learning: ML, NLP, GenAI, Pytorch, Tensorflow, Scikit-Learn

Generative AI: OpenAI API, AWs Bedrock, HuggingFace, Langchain

DevOps: AWS, Kubernetes, Docker, Gitlab CI, Github Actions, Helm, Argo CD, Terraform

MLOps: DVC, SkyPilot, Okteto, BentoML, ClearML, Mlflow

Dataviz: Streamlit, Grafana, Tableau

Data Engineering: Dagster, Airflow, Spark, Hadoop (HDFS, Hive), Snowflake

And Team Work, Being friendly with colleagues and Goal oriented 😄

Contact

Please contact me through Linkedin, Malt or email.

Pinned Loading

  1. iterative/dvc iterative/dvc Public

    🦉 Data Versioning and ML Experiments

    Python 14.7k 1.2k

  2. explodinggradients/ragas explodinggradients/ragas Public

    Supercharge Your LLM Application Evaluations 🚀

    Python 10.3k 1k

  3. wandb/weave wandb/weave Public

    Weave is a toolkit for developing AI-powered applications, built by Weights & Biases.

    Python 955 116

  4. dataforgoodfr/batch11_e_cartomobile dataforgoodfr/batch11_e_cartomobile Public

    Encourager et planifier la mobilité électrique dans les territoires avec l’Open-Data

    Jupyter Notebook 6 4