- ๐ MSc in Computing (Data Analytics) Dublin City University, Ireland
- ๐ฌ Researching LLM safety, adversarial robustness, and interpretability for financial decision-making
- ๐ฏ Applying for a PhD in Interpretable & Behavioural Risk Assessment of Language Models (DCU)
- ๐ผ Building production-grade Data Analytics and Data Engineering projects
- ๐ฎ๐ช Based in Dublin, Ireland
Python |
SQL |
PyTorch |
Hugging Face |
TensorFlow |
Scikit-Learn |
Pandas |
NumPy |
Plotly |
Streamlit |
Power BI |
Tableau |
Azure |
AWS |
Databricks |
Airflow |
GitHub |
Google Colab |
VS Code |
Excel |
Stress-testing financial LLMs (Qwen2.5-3B) across 4 environments baseline, panic, pressure, and prompt injection with cognitive bias detection (Kahneman & Tversky) and emotional contagion analysis.
Results: 86.7% baseline safety ยท 17.8% attack success rate
Tech: PyTorch, Hugging Face Transformers, BART-MNLI, Streamlit, Plotly
๐ Repository ยท ๐ Live Demo
Statistical calibration of FinBERT on financial sentiment using Temperature Scaling, Platt Scaling, and Conformal Prediction to quantify and reduce model overconfidence.
Results: 56% ECE reduction (0.095 โ 0.041) ยท 91.7% conformal coverage ยท avg set size 1.54
Tech: PyTorch, Hugging Face Transformers, Scikit-Learn, Streamlit, Plotly
๐ Repository ยท ๐ Live Demo
Comparing three attribution methods Integrated Gradients, Attention Rollout, and Leave-One-Out on FinBERT financial sentiment to measure whether explainability methods actually agree.
Key finding: Methods largely disagree (IG-vs-Attn ฯ = 0.10, IG-vs-LOO ฯ = 0.31) choosing one method alone gives an incomplete picture.
Tech: PyTorch, Captum, Hugging Face Transformers, Streamlit, Plotly
๐ Repository ยท ๐ Live Demo
A two-stage cascaded deep learning framework using ResNet50 for accurate early and advanced diabetic retinopathy detection, trained on APTOS 2019 and Diabetic Retinopathy Resized datasets. (MSc Dissertation)
Tech: Python, TensorFlow, Pandas, NumPy
๐ Repository
Cloud ETL pipeline for LendingClub 2018Q4 loan data using Azure Databricks (Spark), ADLS Gen2, and Azure SQL. Includes notebooks, PySpark modules, and SQL scripts.
Tech: Azure Databricks, PySpark, ADLS Gen2, Azure SQL
๐ Repository
Power BI inventory analytics dashboard for monitoring stock, WIP, in-transit inventory, and Days on Hand across plants and materials. Built with Power Query, DAX, and data modelling.
Tech: Power BI, Power Query, DAX
๐ Repository
Hierarchical e-commerce product categorization using TF-IDF, SMOTE, and an LR/RF/LightGBM ensemble (top-level) and Ridge (bottom-level).
Tech: Python, Scikit-Learn, LightGBM
๐ Repository
Machine learning model to predict loan default risk using borrower profiles, credit history, and financial features.
Tech: Python, Scikit-Learn, imbalanced-learn
๐ Repository
Visualizing busiest airline routes (2015โ2019) using Python + Tableau.
Tech: Tableau, Pandas, Matplotlib
๐ Repository
| Area | Metric |
|---|---|
| ๐ก๏ธ LLM Safety | 86.7% baseline safety score, 17.8% attack success across 4 adversarial environments |
| ๐ Calibration | 56% ECE reduction via temperature scaling, 91.7% conformal coverage |
| ๐ Explainability | 3 attribution methods compared surfaced low inter-method agreement (ฯ = 0.10โ0.31) |
| ๐ณ Risk Modelling | 0.99 recall on loan defaulters fewer missed high-risk customers |
| โก Automation | Cleaning scripts โ ~40% faster preprocessing pipelines |
