RemoteDevJobs
Browse JobsCompaniesFull StackFrontendBackendPro Newsletter
⚡ Boost ListingPost a Job
RemoteDevJobs

The #1 job board for remote developer positions. Updated daily with the best opportunities.

Categories

  • Full Stack
  • Frontend
  • Backend
  • DevOps / Cloud
  • Mobile
  • AI / ML

Resources

  • Browse All Jobs
  • Companies
  • Post a Job
  • Pricing

Job Alerts

Get the best remote dev jobs delivered to your inbox weekly.

© 2026 RemoteDevJobs. All rights reserved.

PrivacyTermsContact
Back to Jobs

Forward Deployed Engineer AI Inference

Red Hat, Inc.

Apply Now
Remote / Worldwide Full-time $120,000 - $160,000 Jan 10 59 views
KubernetesPythonGoAI InferenceBackend SystemsSREInfrastructure Engineering

Job Description

### About the Role The vLLM and LLM-D Engineering team at Red Hat is seeking a customer-focused developer to join as a **Forward Deployed Engineer**. In this role, you will bridge our cutting-edge inference platform (LLM-D and vLLM) with our customers' critical production environments. ### Responsibilities - **Orchestrate Distributed Inference**: Deploy and configure LLM-D and vLLM on Kubernetes clusters, setting up advanced deployments to maximize hardware utilization. - **Optimize for Production**: Run performance benchmarks, tune vLLM parameters, and configure intelligent inference routing policies to meet SLOs for latency and throughput. - **Code Side-by-Side**: Collaborate with customer engineers to write production-quality code (Python/Go/YAML) that integrates our inference engine into their Kubernetes ecosystem. - **Solve the "Unsolvable"**: Debug complex interactions between model architectures, hardware accelerators, and Kubernetes networking. - **Feedback Loop**: Act as the "Customer Zero" for our engineering teams, channeling field learnings back to product development. ### Requirements - **Experience**: 8+ years in Backend Systems, SRE, or Infrastructure Engineering. - **Skills**: Deep Kubernetes expertise, understanding of AI inference, and proficiency in coding. - **Customer Fluency**: Ability to communicate effectively between systems engineering and business value. - **Bias for Action**: Preference for rapid prototyping and iteration. ### Travel Travel only as needed to present, demo, or help execute proof-of-concepts.

Interested in this role?

Apply Now

Opens company application page

Actively hiring

Apply Now

Via company website

59 viewed Jan 10

About Red Hat, Inc.

Similar Jobs

Senior Manager, Site Reliability Engineering

Ping Identity

DevOps Technical Lead

3Pillar

Site Reliability Engineer

MinIO

Performance & Capacity Engineer – Capacity Planning Optimization

Meta

Forward Deployed Engineer AI Inference

Red Hat, Inc.

Apply Now