Skip to content
View candytabata's full-sized avatar

Block or report candytabata

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
candytabata/README.md

Hi, I'm Candy Tabata ๐Ÿ‘‹๐Ÿพ

๐ŸŽ“ MSc Computer Science โ˜๏ธ Ex-AWS Cloud Engineer ๐Ÿ“Š Data Engineering & Machine Learning


๐Ÿš€ About Me

Iโ€™m a Computer Science Master's student with a background in AWS cloud engineering, focused on building data-driven systems and practical machine learning solutions.

My work combines:

  • Data engineering (ETL pipelines, data modelling, SQL)
  • Machine learning systems and MLOps practices
  • Cloud-based system design on AWS

Iโ€™m interested in building systems that are scalable, efficient and usable in real-world environments.


๐Ÿง  Current Focus

  • Data pipelines and ETL workflows
  • Advanced SQL and data modelling
  • Machine learning in practical applications
  • End-to-end systems combining data and ML

Pinned Loading

  1. computer-vision computer-vision Public

    I will be using this repository for codes based on computer vision.

    Jupyter Notebook

  2. NLP NLP Public

    I will be using this repository for codes based on NLP.

    Jupyter Notebook

  3. NLP-Flask-Application-On-EC2 NLP-Flask-Application-On-EC2 Public

    I will be using this repository for cloud deployment projects

    Shell

  4. aws-loadshedding-telecom-churn aws-loadshedding-telecom-churn Public

    Tests whether power outage exposure predicts telco churn in South Africa โ€” SageMaker training pipeline + model registry, infra managed with AWS CDK.

    Python

  5. medallion-aws-data-pipeline medallion-aws-data-pipeline Public

    AWS CDK medallion pipeline for synthetic SA bank transactions using Glue PySpark and Great Expectations data quality checks.

    Jupyter Notebook