HieuNTg/README.md

Hi, I'm Hieu (Nguyen Trung Hieu) 👋

AI Engineer · Computer Vision × LLM / Agentic AI × Speech (ASR)

I turn research into production — systems that see, listen, reason, and create.



👀 About Me

AI Engineer with 2 years of experience building and shipping production ML systems across Computer Vision, OCR, LLM / Agentic AI, and Speech (ASR). I own the full lifecycle — fine-tuning (PEFT/LoRA, 4-bit), evaluation, and GPU-optimized inference — and enjoy taking a model from a research paper all the way to a reliable, end-to-end pipeline.

  • 🔭 Currently building OCR, object-detection, and LLM-agent systems @ WorkerBot AI
  • 🧠 Deepest expertise: speech recognition & efficient LLM fine-tuning
  • ⚡ Fun fact: I fine-tuned Gemma 3N for Vietnamese ASR down to 7.21% WER

⭐ Flagship Project — Audio2Text (Vietnamese ASR)

End-to-end Vietnamese speech recognition on a fine-tuned Gemma 3N — built from scratch.

  • 🎯 7.21% WER on a 5,000-sample test set (0 empty predictions, ~97K reference words)
  • 🧩 Production inference pipeline: Demucs → denoise → VAD → overlap-aware chunking → context-aware decoding
  • ⚙️ PEFT/LoRA + 4-bit quantization via Unsloth — trainable on a single consumer GPU
  • 📦 Clean, reproducible codebase: separate train / evaluate / predict modules
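For readers unfamiliar with the metric, here is a minimal sketch of how a corpus-level word error rate can be computed: total word-level edits (substitutions, insertions, deletions) divided by total reference words. The function names are hypothetical, and whether the 7.21% figure above was computed at corpus level or averaged per utterance is an assumption, not something this README states.

```python
def edit_distance(ref_words, hyp_words):
    """Word-level Levenshtein distance using a rolling DP row."""
    prev = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def word_error_rate(references, hypotheses):
    """Corpus-level WER (assumed convention): total edits / total reference words."""
    total_edits = 0
    total_ref_words = 0
    for ref, hyp in zip(references, hypotheses):
        ref_words = ref.split()
        hyp_words = hyp.split()
        total_edits += edit_distance(ref_words, hyp_words)
        total_ref_words += len(ref_words)
    return total_edits / total_ref_words


# Toy data, not taken from the Audio2Text test set.
refs = ["xin chào các bạn", "hôm nay trời đẹp"]
hyps = ["xin chào bạn", "hôm nay trời đẹp quá"]
print(word_error_rate(refs, hyps))
```

Under the corpus-level reading, 7.21% of roughly 97K reference words would correspond to about 7,000 word-level errors across the test set.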

🔗 Explore the repo → github.com/HieuNTg/Audio2Text
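The overlap-aware chunking step in the pipeline above can be illustrated with a small sketch. It assumes each speech segment coming out of VAD is split into fixed-length windows that share a short overlap, so words at a boundary appear in both neighbouring chunks. The function name and the 30 s / 2 s defaults are hypothetical and may not match what the repository does.

```python
def overlapping_chunks(start, end, chunk_s=30.0, overlap_s=2.0):
    """Split the span [start, end) in seconds into windows of at most
    chunk_s seconds, each starting overlap_s seconds before the previous
    window ended. Defaults are illustrative assumptions only."""
    if not 0 <= overlap_s < chunk_s:
        raise ValueError("overlap_s must satisfy 0 <= overlap_s < chunk_s")
    if end <= start:
        return []

    chunks = []
    pos = start
    while True:
        chunk_end = min(pos + chunk_s, end)
        chunks.append((pos, chunk_end))
        if chunk_end >= end:
            break
        # Step back by the overlap so boundary words are seen twice.
        pos = chunk_end - overlap_s
    return chunks


print(overlapping_chunks(0.0, 70.0))
```

A decoder that consumes these windows still has to reconcile the duplicated text in each overlap; one option is to pass the previous chunk's transcript as context, which may be what "context-aware decoding" refers to here.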


🚀 Featured Projects

| Project | What it does | Tech |
| --- | --- | --- |
| 🎙️ Audio2Text | Vietnamese ASR toolkit on fine-tuned Gemma 3N: training, eval & production inference. 7.21% WER | Gemma 3N, PEFT/LoRA, Unsloth, Demucs, VAD |
| 📖 StoryForge | Multi-agent story generator: 13-agent drama simulation, LLM-as-judge auto-revision & RAG | FastAPI, LLM, Multi-Agent, RAG |
| 🧑‍💼 AI HR Interview | Full-stack AI interviewer with real-time voice/video via Gemini Live + JD↔CV matching | Next.js, Gemini Live, PostgreSQL, Redis |
| 💊 MedGraph | Drug-interaction cascade analyzer: knowledge graph over CYP450 pathways on real FDA data | FastAPI, React, Knowledge Graph |
| 🔢 Date-Recognition | Expiry-date OCR: YOLOv8 detection + CRNN/CTC recognition with Streamlit UI | YOLOv8, OCR, CRNN |
| 🙂 FaceReg | Real-time face recognition: MTCNN + FaceNet across image, video & live camera | PyTorch, MTCNN, FaceNet |
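As an illustration of the CRNN/CTC recognition used in Date-Recognition, here is a minimal sketch of greedy CTC decoding: take the most likely class per frame, collapse consecutive repeats, then drop the blank symbol. The alphabet and frame sequence are made up for the example, and greedy decoding is the simplest variant, not necessarily the decoder the project uses.

```python
def ctc_greedy_decode(frame_ids, alphabet, blank=0):
    """Collapse repeated frame labels, then remove blanks.

    frame_ids: per-frame argmax class indices from the recognizer.
    alphabet:  string where alphabet[i] is the character for class i
               (the character at the blank index is never emitted).
    """
    chars = []
    prev = None
    for idx in frame_ids:
        # Emit only when the label changes and is not the blank.
        if idx != prev and idx != blank:
            chars.append(alphabet[idx])
        prev = idx
    return "".join(chars)


# Hypothetical alphabet: index 0 is the CTC blank, then digits and '/'.
ALPHABET = "-0123456789/"
frames = [2, 2, 0, 3, 11, 11, 2, 0, 2, 0]
print(ctc_greedy_decode(frames, ALPHABET))
```

The blank between the last two `2` frames is what lets the decoder emit a doubled character, which matters for dates such as `11` or `22`.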

🛠️ Tech Stack

Languages

Python, SQL, Bash

ML / LLM

PyTorch, TensorFlow, Hugging Face, PEFT, Unsloth

Computer Vision · Speech

OpenCV, YOLOv8, Demucs, FFmpeg

Backend · Infra

FastAPI, Docker, PostgreSQL, Redis, Streamlit


🔍 Currently Exploring

  • Efficient LLM fine-tuning & on-device / low-VRAM inference
  • Multi-agent systems and autonomous research workflows
  • Advanced OCR & document understanding

"Turn research into systems people can actually use."

📫 Reach me at nt.hieu2207@gmail.com · ⭐ Star anything you find useful!

Pinned

  1. STORYFORGE (Public)

    AI-powered story generator with multi-agent drama simulation, branching narratives, and character-consistent image generation. Self-hosted & privacy-first.

    Python

  2. medgraph (Public)

    Drug Interaction Cascade Analyzer — Detect dangerous multi-drug interactions using knowledge graph + CYP450 enzyme cascade analysis on real FDA data

    Python

  3. Date-Recognition (Public)

    Expiry Date Recognition System — YOLOv8 detection + CTC-OCR text recognition pipeline with Streamlit UI

    Jupyter Notebook 1

  4. FaceReg (Public)

    Real-time face recognition system using MTCNN + InceptionResnetV1 (FaceNet) with Streamlit web UI. Supports image, video & live camera recognition with dynamic face enrollment. 97% accuracy on 31-c…

    Jupyter Notebook