Principal Java Data Engineer at Purple Hires Inc — NeverHard
Principal Java Data Engineer at Purple Hires Inc in Regent Park, City of Toronto. Skills: AI, AWS, Apache Hudi, Azure, Azure Data Lake. Apply on NeverHard.
Company
Purple Hires Inc
Location
Regent Park, City of Toronto
Type
contract
Remote: Yes
Required skills:
AI
AWS
Apache Hudi
Azure
Azure Data Lake
CI/CD
DBT
Data Governance
Data Pipelines
Event-Driven Architectures
Role:
Principal Java Data Engineer
Location: Canada (Remote)
Long Term contract
Mandatory skill:- Spec-Driven Development (SDD) is Must with AI
Ideal Experience
Lead and guide the design and implementation of scalable distributed systems based on Java microservices
Engineer and optimize data pipelines using solutions like Apache Hudi, Apache Trino, Azure ADLS
Collaborate cross-functionally with product, analytics, and AI teams to ensure data is a strategic asset
Principal Software Data Engineer with at least 10 years of professional experience in software or data engineering, including a minimum of 4 years focused on data pipelines (batch and streaming)
Proven experience driving technical direction and mentoring engineers while delivering complex, high-scale solutions as a hands-on contributor
Strong understanding of event-driven architectures and distributed systems, with hands-on experience implementing resilient, low-latency pipelines
Practical experience with cloud platforms (AWS, Azure, or GCP) and containerized deployments for data workloads
Fluency in data quality practices and CI/CD integration, including schema management, automated testing, and validation frameworks (e.g., dbt, Great Expectations)
Operational excellence in observability, with experience implementing metrics, logging, tracing, and alerting for data pipelines using modern tools
Solid foundation in data governance and performance optimization, ensuring reliability and scalability across batch and streaming environments
Proven experience with Lakehouse architectures and related technologies, including Apache Hudi, Azure ADLS Gen2, HDFS, and other big data technologies (Trino, Databricks, Spark)
Strong collaboration and communication skills, with the ability to influence stakeholders and evangelize modern data practices within your team and organization.