Lead Data Engineer (AWS Cloud)

Remote, USA Full-time
Position: - Lead Data Engineer (AWS Cloud) Location: - Remote Type: - Contract to Hire Job Description • Design, develop, and maintain ETL/ELT pipelines using PySpark on Databricks. • Build and optimize batch and streaming data pipelines. • Implement Delta Lake solutions (Delta tables, time travel, ACID transactions). • Collaborate with data scientists, analysts, and architects to deliver analytics-ready datasets. • Optimize Spark jobs for performance, scalability, and cost. • Integrate data from multiple sources (RDBMS, APIs, files, cloud storage). • Implement data quality checks, validation, and monitoring. • Manage Databricks notebooks, jobs, clusters, and workflows. • Follow data governance, security, and compliance standards. • Participate in code reviews and contribute to best practices. Qualifications • Hands-on experience with Data Frames, RDDs, joins, transformations, and actions within PySpark. • Proven experience leading teams and mentoring engineers. • Job optimization, cluster configuration, repartitioning, and Shuffle mechanics in Databricks. • S3 buckets, IAM, CloudWatch, and integration with Databricks and AWS. • Strong query skills for analytics and ETL with SQL. • Performance tuning: Partitioning, caching, broadcast joins, and skew handling. • Delta Lake, Medallion Architecture, Spark Streaming, Spark ML, and CI/CD pipelines. • ETL/ELT design patterns. - Handling large-scale structured and semi-structured data. • Performance tuning (partitioning, caching, broadcast joins). • Understanding of data warehousing concepts. • Excellent communication and stakeholder management skills. • Ability to work in Agile delivery environments. • Ownership mindset and delivery-focused approach. • Strong technical decision-making and problem-solving skills. Apply tot his job
Apply Now

Similar Jobs

Systems Programmer - AI Data Pipelines

Remote, USA Full-time

[Remote] C# Data Platform Software Engineer / C# + SQL Developer (Remote), 25-14061

Remote, USA Full-time

Senior Software Engineer (Data Connectors & Integration Platform)

Remote, USA Full-time

Digital Solutions - Data Engineer

Remote, USA Full-time

Sr Backend Engineer, Platform Engineering - Network Data New

Remote, USA Full-time

Ads Privacy Engineer (L6)

Remote, USA Full-time

Remote - Lead Data Product Manager

Remote, USA Full-time

Product Manager/Product Owner, Data Science Storm Insights- REMOTE

Remote, USA Full-time

Product Manager, Data and Reporting

Remote, USA Full-time

Virtual Data Privacy Compliance Officer

Remote, USA Full-time

Head of Risk (Contract)

Remote, USA Full-time

Nurse Case Management Unit Manager, Public Health Consultant Manager

Remote, USA Full-time

Principal Contracts Specialist

Remote, USA Full-time

TD Bank: Azure Data Engineer(Remote)

Remote, USA Full-time

**Experienced Customer Support Executive – Healthcare Solutions Specialist (Onsite, New York, USA)**

Remote, USA Full-time

Pre-Licensing Training Agent - Remote Opportunity with Competitive Salary and Comprehensive Benefits

Remote, USA Full-time

Coder IV - Claim Edits Coder (medical coding)

Remote, USA Full-time

Online Night Positions Part Time | $25–$35/Hour Remote Work Evenings – Quiet Shifts, Adaptable Hours, No College Degree Needed

Remote, USA Full-time

Experienced Remote Benefits Representative and Customer Service Agent – Delivering Exceptional Support and Guidance to Clients

Remote, USA Full-time

Experienced Remote Customer Service Representative – Delivering Exceptional Pet Parent Experiences for arenaflex in Kentucky

Remote, USA Full-time
Back to Home