Software Engineer, Inference AI/ML

Remote, USA Full-time
CoreWeave is The Essential Cloud for AI™, providing a platform for innovators to build and scale AI. The role involves joining the Inference team to implement features that enhance model serving on the GPU platform, focusing on improving latency, reliability, and cost.

Responsibilities
Implement well-scoped features and fixes in Python/Go/C++ for model-serving services (e.g., Triton, vLLM, TensorRT-LLM, Ray Serve)
Write tests, code comments, and short design docs; participate in code reviews
Add basic metrics and dashboards; assist with alarms and runbooks
Follow on-call runbooks and learn incident response in a guided rotation
Contribute to performance experiments (e.g., request batching, concurrency, caching) with guidance

Skills
BS/MS in CS, EE, or related field, or equivalent practical experience
Foundations in data structures, algorithms, and networked services
Experience with Python or Go (C++ a plus) and Linux fundamentals; Git/CI basics
Exposure to containers and Kubernetes (coursework or projects welcome)
Curiosity about GPU inference concepts (micro-batching, KV cache, streaming)
Internship or project that deployed a microservice or ML inference demo
Coursework/research with PyTorch or TensorFlow; simple CUDA projects a plus
Familiarity with Grafana/Prometheus/OpenTelemetry or similar tooling

Benefits
Medical, dental, and vision insurance - 100% paid for by CoreWeave
Company-paid Life Insurance
Voluntary supplemental life insurance
Short and long-term disability insurance
Flexible Spending Account
Health Savings Account
Tuition Reimbursement
Ability to Participate in Employee Stock Purchase Program (ESPP)
Mental Wellness Benefits through Spring Health
Family-Forming support provided by Carrot
Paid Parental Leave
Flexible, full-service childcare support with Kinside
401(k) with a generous employer match
Flexible PTO
Catered lunch each day in our office and data center locations
A casual work environment
A work culture focused on innovative disruption

Company Overview
CoreWeave is a cloud-based AI infrastructure company offering GPU cloud services to simplify AI and machine learning workloads. It was founded in 2017 and is headquartered in Livingston, New Jersey, USA, with a workforce of 1001-5000 employees. Its website is