Staff Data Engineer - RCM (Remote)

Remote, USA Full-time
We believe that mental health is just as important as physical health. We recognize that mental health issues can be complex and multifaceted, and we are dedicated to treating the whole person, not just the symptoms. We aim to create a world where mental health is no longer stigmatized or marginalized, but rather is embraced as an integral part of one's overall well-being. We believe that by providing quality care that is both evidence-based and compassionate, we can empower individuals to take charge of their mental health and achieve their full potential. We are passionate about making a positive impact on the lives of those struggling with mental health issues and we strive to be a force for positive change in the field of mental healthcare. Rula is a remote-first company. We currently hire in most U.S. states, with the exception of Hawaii. About the Role At Rula, our mission is to make mental health care more accessible and effective for those who need it. As a Staff Data Engineer for Operational Reporting, you will oversee the design and implementation of a greenfield near real-time data platform, starting with micro-batching pipelines using Kafka to deliver critical operational reports and evolving into a scalable Apache Flink architecture for sub-second analytics. Your work will power real-time dashboards and insights that enable our providers, leadership, and operational teams to make data-driven decisions, ultimately improving patient outcomes. You will join our collaborative data team, nested within the broader engineering organization, working closely with business analysts, product managers, and data experts to transform raw event streams into reliable, actionable reporting data. Your daily responsibilities—building fault-tolerant pipelines, ensuring data accuracy, and optimizing for low-latency delivery—will lay the foundation for Rula’s near real-time data capabilities. This role offers the opportunity to own a strategic transition from micro-batching to a Flink-based streaming architecture, driving innovation in how we harness data to support our mission. If you’re passionate about turning complex data into impactful insights that advance mental health care, this is your chance to make a meaningful difference. Required Qualifications • Data Pipeline Development (8+ yrs). Experience designing and maintaining scalable ETL/ELT pipelines for operational reporting using Kafka, Glue, dbt, Dagster, and Airflow. Leveraging Python and SQL for data transformation and quality checks, and working with Flink and Spark Streaming to build low-latency, near real-time pipelines. • Cloud Infrastructure & Data Warehousing (8+ yrs overall, 4+ yrs in AWS). Proficiency building and optimizing data pipelines using AWS services such as S3, Redshift, Glue, IAM, Kinesis, and EMR. Experience across GCP (BigQuery, Dataflow) and Azure (Synapse, Data Factory). Optimizing data warehouses (Redshift, Snowflake, BigQuery) and managing Data Lakes (S3, Delta Lake) for scalable, low-latency analytics. Ensuring cost efficiency, scalability, and compliance (CPRA, HIPAA) while supporting a migration toward Flink-based near real-time architecture. • Data Quality & Governance (8+ Years). Experience implementing scalable data validation, quality checks (e.g., deduplication, consistency), and error-handling mechanisms tailored for operational reporting pipelines, ensuring high-fidelity data for real-time dashboards and analytics. Proficiency in designing and enforcing data governance practices, including metadata management, lineage tracking for auditable reporting, and compliance with regulations like CPRA or HIPAA in Data Lake environments (e.g., AWS S3, Delta Lake). • Performance Optimization (3+ Years). Experience optimizing data pipelines, queries, and large-scale datasets for efficiency and scalability in operational reporting systems, with a focus on achieving low-latency delivery. Proficiency in tuning high-throughput streaming systems, including optimizing resource usage and implementing best practices for partitioning, caching, and indexing. • Security & Compliance (3+ Years). Experience implementing data security measures, including encryption, role-based access control (RBAC), and data masking, to protect sensitive data in operational reporting pipelines and Data Lakes (e.g., AWS S3, Delta Lake). Strong understanding of compliance standards such as HIPAA and CPRA, with hands-on expertise in applying these standards to streaming systems like Apache Kafka and Apache Flink. Demonstrated ability to ensure auditability and security in data workflows, supporting reliable and compliant near real-time analytics during the transition from micro-batching to a Flink-based architecture. • Collaboration & Communication (5+ Years). Strong ability to work cross-functionally with business analysts, product managers, leadership, and other stakeholders to define and deliver operational reporting requirements. Exceptional communication skills to translate complex technical concepts into clear, actionable insights for non-technical audiences. Proven adaptability to thrive in a fast-paced startup environment, collaborating effectively to support the rapid development and evolution of a near real-time data platform while aligning with Rula’s mission to improve mental health care outcomes. Preferred Qualifications While having the preferred qualifications enhances your candidacy, having all of them is not mandatory. We encourage all interested applicants to apply, even those who may not meet every preferred requirement. • Hands-on experience with AWS tools like S3, Glue, EMR, SageMaker, and Lambda for building scalable ETL/ELT pipelines optimized for ML/LLM training, including feature engineering, data versioning, and handling large-scale unstructured data • Demonstrated ability to maintain data integrity and accuracy in streaming systems like Apache Kafka and Apache Flink, supporting reliable operational insights during the transition from micro-batching to a near real-time architecture. • Familiarity with infrastructure as code (IaC) tools like Terraform or CloudFormation for managing cloud resources. • Experience implementing and maintaining CI/CD pipelines for data workflows. • Demonstrated ability to enhance pipeline performance to support near real-time analytics while maintaining cost efficiency and reliability during the transition from micro-batching to a streaming architecture. • Strong ability to partner with data scientists and ML engineers to design efficient pipelines, using orchestration tools (e.g., Airflow, Dagster) for incremental loading and cost optimization, while monitoring performance metrics like latency and resource utilization in AWS environments. We're serious about your well-being! As part of our team, full-time employees receive: • 100% remote work environment: Working hours to support a healthy work-life balance, ensuring you can meet both professional and personal commitments (must be based in United States, currently not hiring in Hawaii) • Attractive pay and benefits: Full transparency of pay ranges regardless of where you live in the United States • Comprehensive health benefits: Medical, dental, vision, life, disability, and FSA/HSA • 401(k) plan access: Start saving for your future • Generous time-off policies: Including 2 company-wide shutdown weeks each year for self-care (for most employees) • Paid parental leave: Available for all parents, including birthing, non-birthing, adopting, and fostering • Employee Assistance Program (EAP): Support for your mental and physical health • New hire home office stipend: Set up your workspace for success • Quarterly department stipend: Fund team-building activities or in-person gatherings • Wellness events and lunch & learns: Explore a variety of engaging topics • Community and employee resource groups: Participate in groups that celebrate employee identity and lived experiences, fostering a sense of community and belonging for all Our team We believe that diversity, equity, and inclusion are fundamental to our mission of making mental healthcare work for everyone. We are dedicated to having a culture of inclusion that will support our employees in feeling safe, seen, heard, and valued. Apply tot his job
Apply Now

Similar Jobs

Engineering Manager, Data Engineering - Remote-eligible (U.S. + select international)

Remote, USA Full-time

Data Engineer - Remote with meetings onsite in New York City

Remote, USA Full-time

Data Engineering Manager - Ads

Remote, USA Full-time

Manager Data Engineering 3

Remote, USA Full-time

online data entry clerk

Remote, USA Full-time

Data Entry Clerk - Typist / Full-time (Remote)

Remote, USA Full-time

Flexible Remote Data Entry Clerk - Work from Home

Remote, USA Full-time

Home-Based Data Entry & Typing Operations Associate

Remote, USA Full-time

[Remote] Data Entry Clerk

Remote, USA Full-time

Data Entry Clerk at Pacific Sun Electric Los Angeles, CA

Remote, USA Full-time

Experienced Content Marketing Specialist – Remote Opportunity to Drive Brand Awareness and Engagement through Strategic Content Creation

Remote, USA Full-time

Software Development Engineer in Test

Remote, USA Full-time

Remote Senior Internal Auditor- International Bank in Chicago, IL

Remote, USA Full-time

Marketing & Outreach Coordinator (Remote, Flexible Hours)

Remote, USA Full-time

Entry Level Data-Entry Clerk Work From Home / Remote-Part-Time

Remote, USA Full-time

SAP Retail Finance Technology, SENIOR MANAGER (NO 3rd Party) (GC/US Only)

Remote, USA Full-time

**National Customer Operations Manager – USA at blithequark**

Remote, USA Full-time

**Experienced Hybrid Data Entry Clerk – Onsite with Remote Opportunities**

Remote, USA Full-time

Experienced Data Entry Clerk – Remote Opportunities for Detail-Oriented Individuals with Strong Organizational Skills at arenaflex

Remote, USA Full-time

**Experienced Customer Service Representative – Remote Work Opportunity with blithequark**

Remote, USA Full-time
Back to Home