Skip to content
View wuyuyu1024's full-sized avatar

Block or report wuyuyu1024

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
wuyuyu1024/README.md

Yu Wang

Software Engineer · AI Engineer · Explainable ML · Visual Analytics · Agentic AI

Dual Ph.D. background in Computer Science and Geology. I build research software that makes complex models, high-dimensional data, and scientific workflows easier to inspect, reproduce, and use.

Website · Google Scholar · Repositories · GitHub


About

I am based in Utrecht, Netherlands, with a Ph.D. in Computer Science from Utrecht University and a Ph.D. in Geology from China University of Geosciences, Beijing. My work connects machine learning, visual analytics, geoscience, and research-oriented software development: implementing methods from papers, building usable prototypes, and turning research workflows into software that other people can run and inspect.

I am especially interested in research software engineering roles where software quality, reproducibility, and collaboration across scientific domains matter. Recently, I have also been focusing on agentic AI, including retrieval-augmented generation, tool-using agents, and AI-assisted workflows. My broader research interests include explainable machine learning, human-in-the-loop data generation, inverse projection, and decision maps.

Research Credentials

Start Here

For a quick review of my research software work:

  • Best RSE example: LCIP shows tested Python research software, GUI tooling, CUDA/PyTorch workflows, and reproducibility scripts.
  • Best Earth-science example: SDBM for Pyrite is a reproducible geoscience ML workflow linked to an American Mineralogist paper.
  • Best reusable-package example: InverseProjections exposes inverse-projection methods through a scikit-learn-style API.

Selected Research Software

Loss-Controlled Inverse Projection of High-Dimensional Data. This is my strongest current research-software project: a paper-linked Python implementation with a Qt GUI, command-line entry points, tests, CUDA/PyTorch-based demos, reproducibility scripts, and documented workflows for inverse projection experiments.

In plain terms, LCIP supports human-in-the-loop, visually guided generation of high-dimensional data from a 2D embedding. It connects visualization, generative modelling, and interactive model steering.

Signals: research software engineering, scientific visualization, human-in-the-loop generative ML, interactive tooling, reproducibility, GPU-enabled workflows.

Reproducibility workflow for interpreting mineral-genesis classification with supervised decision maps on pyrite trace-element data. The project combines geoscience data, classifier evaluation, SSNP-based projection, inverse feature mapping, and notebook-generated manuscript figures.

Signals: Earth-science-facing ML, geochemistry, explainable classification, visual analytics, reproducible computational research.

Implementation accompanying research on fast and accurate decision maps for explaining classification models.

Signals: explainable AI, model inspection, visual analytics, research-method implementation.

A Python package implementing inverse projection techniques such as NNinv, iLAMP, RBF inverse mapping, and MDS multilateration with a scikit-learn-style API.

Signals: reusable research code, dimensionality reduction, interpretable high-dimensional data analysis.

Agentic AI And Tools

A lightweight local shell agent that turns natural-language requests into executable shell commands, explains the proposed command, and asks for confirmation before running it.

Signals: LLM tooling, CLI user experience, safe tool use, practical agent design.

Exploration of retrieval-augmented generation workflows and knowledge-aware LLM application design.

Signals: retrieval, context construction, LLM application engineering.

A browser-based interactive decision-map demo using TensorFlow.js and D3 to inspect MNIST projections, inverse projections, decision regions, and observation windows directly in the browser.

Signals: interactive visualization, browser ML, scientific demos, user-facing research prototypes.

Technical Profile

Languages:        Python · JavaScript · SQL
ML/data:          PyTorch · TensorFlow · scikit-learn · pandas · NumPy · XGBoost
Visualization:    Matplotlib · seaborn · D3.js · PySide/PyQt · pyqtgraph · vispy
RSE strengths:    Reproducibility · Scientific Workflows · Documentation · Interactive Tools
AI systems:       Agentic AI · RAG · LLM Applications · Tool Use · Workflow Automation
Engineering:      FastAPI · SQLAlchemy · Pydantic · pytest · PostgreSQL · TensorFlow.js

Open To

I am interested in research software engineering, AI engineering, and data science roles, especially in teams that build software for scientific research, environmental and Earth-science applications, visual analytics, explainable AI, agentic AI, or knowledge-intensive workflows.

Contact

Website: http://yuwang-vis.github.io/

GitHub: https://github.com/wuyuyu1024

Pinned Loading

  1. RAG_thesis RAG_thesis Public

    FastAPI RAG app for querying a PhD thesis with hybrid retrieval, reranking, and citation lookup.

    Python

  2. browser-decision-map browser-decision-map Public

    Browser-based TensorFlow.js and D3 decision-map demo for MNIST inverse projection.

    JavaScript

  3. lcip lcip Public

    Loss-Controlled Inverse Projection research software with Qt GUI, CUDA/PyTorch demos, tests, and reproducibility scripts.

    Python

  4. InverseProjections InverseProjections Public

    Python package for inverse projection methods with a scikit-learn-style API.

    Python

  5. SDBM_for_Pyrite SDBM_for_Pyrite Public

    Reproducible decision-map workflow for pyrite trace-element mineral-genesis classification.

    Jupyter Notebook 1

  6. generalized_fastDBM generalized_fastDBM Public

    Forked from yuwang-vis/generalized_fastDBM

    Research software for fast decision maps explaining classification models.

    Jupyter Notebook