Professional Summary
Data engineer with a background in enterprise IT security and systems administration,
specializing in high-performance columnar analytics using DuckDB, Apache Spark, and cloud-native
data pipelines (AWS, Azure, GCP). Since December 2022, while managing a long-term medical
disability, I have been in continuous full-time technical development — earning industry
certifications, architecting distributed data systems, and benchmarking production workloads
at 171M rows/sec on a single node using DuckDB against 685M rows of real-world data.
- Certified: Databricks Associate Developer for Apache Spark (Scala) · AWS Certified Solutions Architect (Associate)
- Benchmark: 171M rows/sec — 5 analytical queries across 685M NYC taxi records, single node, zero cluster
- Stack: DuckDB · Apache Spark 3.5 · Scala (Pure FP) · Python · AWS S3 · Azure Blob · GCP Storage · Parquet · Streamlit
- Background: HIPAA compliance, HITRUST audit remediation, CrowdStrike, CyberArk, Qualys — enterprise security depth
- Inventor: skr8tr — a sovereign, masterless distributed systems framework secured with post-quantum cryptography (ML-DSA-65)
Technical Skills
Data Engineering
DuckDB, Apache Spark 3.5, Scala (FP), PySpark, Parquet, sbt, Databricks
Cloud Platforms
AWS (S3, EC2, Glue, Athena) · Azure (Blob, ADLS Gen2, Synapse) · GCP (GCS, BigQuery)
Analytics & Visualization
DuckDB SQL, Streamlit, Plotly, Pandas, columnar pipeline design
Security & Compliance
Qualys, CrowdStrike, CyberArk, HITRUST, HIPAA, vulnerability management
Systems
Linux (Ubuntu/RHEL), Active Directory, VMware, Hyper-V, SCCM, PLESK
Infrastructure
VPS hardening, NixOS, post-quantum crypto (ML-DSA-65), bare-metal orchestration
Professional Experience
- Benchmarked DuckDB at 171M rows/sec processing 685M NYC Yellow Cab records (2022–2025) across 48 Parquet files on a single node — zero cluster, zero JVM overhead.
- Architected multi-cloud data ingestion pipelines reading directly from AWS S3, Azure Blob Storage, and GCP Cloud Storage using DuckDB's httpfs, azure, and gcs extensions.
- Engineered HazyNet: a multi-node standalone Apache Spark 3.5 cluster in Scala with pure functional programming patterns and CUDA integration on RTX 3060.
- Earned Databricks Certified Associate Developer for Apache Spark (Scala) and AWS Certified Solutions Architect (Associate) while maintaining full technical stack depth.
- Built real-time Streamlit analytics dashboards backed by DuckDB — interactive columnar analytics with zero cloud dependency.
- Invented skr8tr — a masterless distributed systems framework using ML-DSA-65 post-quantum signatures for command authentication across a UDP mesh network. No certificate authorities. No central broker. Sovereign by design.
- Automated enterprise-wide hardware deployments by creating and pushing OS images via MS SCCM.
- Administered Active Directory and Office 365/Teams deployments for a global user base.
- Governed VMware Horizon virtual machines and managed CCURE secure access systems.
- Managed CrowdStrike endpoint protection and CyberArk privilege management platforms.
- Executed rigorous patch management schedules through SCCM to maintain system integrity.
- Restored mission-critical "Store Down" scenarios under pressure to ensure business continuity.
- Supported virtual servers and in-store infrastructure via VNC and Hyper-V remote control.
- Governed remote access to POS registers and backend retail applications for 24/7 uptime.
- Engineered an AWS-based medical records retrieval system with HIPAA-compliant secure portals.
- Analyzed security data using Qualys; led intrusion detection audits and firewall security configuration.
- Remediated audit findings for HITRUST certification and led HIPAA compliance initiatives.
- Orchestrated vulnerability management and patch cycles for the entire server fleet.
- Advanced to Level 3 Linux support; configured VPS setups on PLESK and managed high-traffic Linux servers.
- Managed 120+ client accounts including WordPress architecture, security hardening, and VPS configuration.
Projects & Inventions
- Designed a sovereign, masterless distributed systems framework built from first principles — no certificate authorities, no central broker, no single point of failure.
- Implemented post-quantum authenticated command propagation using ML-DSA-65 (NIST FIPS 204) signed tokens over a UDP mesh network — every packet cryptographically verified on arrival.
- Architected for air-gapped, regulated, and adversarial environments where trusting the network is not an option — healthcare, finance, government, and critical infrastructure.
- Built entirely in C23 and Haskell with zero external identity dependencies — the mesh is the authority.
Certifications
🏆 Databricks Certified Associate Developer for Apache Spark
Databricks · Scala track
Verify Credential ↗
Education
Western Governors University
Information Technology · 2008 – 2011 · 3 years completed
The Knox School
High School Diploma · 1985 – 1990