Senior Data Engineer · Healthcare & Enterprise · Sacramento, CA
Senior Data Engineer and IT Strategist with over 15 years of full-stack software and analytics experience, including 7 years designing and operating scalable, HIPAA-compliant data platforms. Proven track record architecting ETL pipelines, big-data solutions (Spark, Databricks, Microsoft Fabric), and data warehouses for healthcare operations — driving up to 50% reductions in pipeline latency and ensuring 100% PII/PHI regulatory adherence. Recognized mentor and cross-functional leader skilled at translating business requirements into high-value BI dashboards and predictive models.
Real-time telemetry pipeline using Spark Structured Streaming enabling sub-second latency for 50+ clinical devices.
Spark Streaming · Kafka · Databricks · Azure

HIPAA-compliant Azure Blob data lake with automated PII masking supporting 5M+ records/month, zero breaches.
Azure Blob · Python · PII Masking · RBAC

End-to-end Bronze/Silver/Gold architecture with star schema and automated Power BI dashboards via PySpark.
Microsoft Fabric · PySpark · Power BI · DAX

Python framework in Airflow pipelines for automated schema-drift detection, reducing production errors 75%.
Python · Airflow · SQL Agent

Migrated on-premises SQL Server DW to cloud platform, improving query performance by 60% across healthcare ops.
SQL Server · Azure · .NET · SSIS

Optimized SSIS workflows and Python scripts for parallel execution, significantly reducing compute overhead.
SSIS · Python · Parallelism