Senior Data Engineer | 6+ Years | GCP · AWS · Azure | Healthcare & Fintech
I design and build production-grade data pipelines, streaming architectures, and analytics platforms across multi-cloud environments — with deep expertise in healthcare data standards (Epic EHR, HIPAA, ICD/CPT) and regulated financial services.
Cloud & Platforms
GCP Pub/Sub Dataflow Dataproc BigQuery AWS EMR Glue Redshift Lambda Azure Databricks Synapse Analytics Data Factory Delta Lake Microsoft Fabric
Data Engineering
Apache Spark PySpark Apache Airflow dbt Kafka Snowflake PostgreSQL CDC Pipelines Medallion Architecture
Languages & Tools
Python SQL T-SQL PySpark Git Docker
Domain Expertise
Healthcare Epic EHR ICD/CPT Claims HIPAA/PHI SDOH FHIR R4 Financial Services
End-to-end healthcare data engineering platform built with Python, PostgreSQL 15, and dbt.
- 1,134 synthetic patients (Synthea FHIR R4) across 9 SDOH domains via PRAPARE framework
- Bronze / Silver / Gold Medallion Architecture with custom dbt schema macros
- Targets healthcare data engineering roles requiring HIPAA, claims, and EHR domain knowledge
Stack: Python PostgreSQL dbt FHIR R4 PRAPARE Medallion Architecture
🌐 Portfolio Site · Repo
Personal portfolio showcasing data engineering projects, skills, and experience.
Stack: React 19 Vite JavaScript
| Company | Role | Stack |
|---|---|---|
| Digimarc (OR, USA) | Senior Data Engineer | GCP · Pub/Sub · Dataflow · BigQuery · Snowflake · dbt · Airflow |
| Wells Fargo | Data Engineer | Azure · Databricks · Synapse · Delta Lake · Data Factory |
| HCLTech | Data Engineer | AWS · EMR · Glue · Redshift · Lambda · Epic EHR · HIPAA |
📧 kavyasreede@gmail.com · 📍 Baltimore, MD