Machine Learning Engineering Manager – LLM Serving, Infrastructure

Remote, USA Full-time
• Lead a high-performing engineering team to develop, build, and deploy a high-scale, low-latency LLM serving infrastructure.
• Drive the implementation of a unified serving layer to support multiple LLMs and inference types (batch, offline evaluation flows, and real-time/streaming).
• Lead all aspects of the development of the Model Registry for deploying, versioning, and running LLMs across production environments.
• Ensure successful integration with the core Personalization and Recommendation systems to deliver LLM-powered features.
• Define and champion standardized technical interfaces and protocols for efficient model deployment and scaling.
• Establish and monitor the serving infrastructure's performance, cost, and reliability, including load balancing, autoscaling, and failure recovery.
• Collaborate closely with data science, machine learning research, and feature teams (Autoplay, Home, Search, etc.) to drive active adoption of the serving infrastructure.
• Scale up the serving architecture to handle hundreds of millions of users and high-volume inference requests for internal domain-specific LLMs.
• Drive Latency and Cost Optimization: partner with SRE and ML teams to implement techniques like quantization, pruning, and efficient batching to minimize serving latency and cloud compute costs.
• Develop Observability and Monitoring: build dashboards and alerting for service health, tracing, A/B test traffic, and latency trends to ensure adherence to defined SLAs.
• Contribute to Core LPM Serving: focus on the technical strategy for deploying and maintaining the core Large Personalization Model (LPM).
