Sr. Software Engineer - K8s - GPU Orchestration - REMOTE Job at Living Talent, San Jose, CA

M2pqMjJ6U3lBRzhKUTFFSFk1OFpwUFlEdVE9PQ==
  • Living Talent
  • San Jose, CA

Job Description

GPU Orchestration
  • Startup
  • Company size: 30
  • Remote within North America
  • Compensation: Base Salary 250k + Equity

Key Responsibilities

  • Lead Design, Architecture & Development of K8s-based cloud infrastructure.
  • Use K8s Controllers, Operators & CRs to Implement scalable, high-availability solutions.
  • Integrate Karpenter, and/or other advanced tools for infrastructure optimization.
  • Architect MLOps Middleware integration (dynamic workload migration, resource disaggregation).
  • Build monitoring, logging & alerting systems.
  • Drive infrastructure cost optimization through FinOps best practices in K8s deployments.
  • Promote K8s best practices & mentor software engineers.
  • Collaborate across teams to drive K8s adoption in multi-cloud and hybrid environments.
  • Open-Source Contributions in the Kubernetes community.

Qualifications

Kubernetes Expertise

  • Designing, deploying, and managing K8s clusters (AKS, EKS, GKE, OpenStack, etc.).
  • Hands-on experience with K8s core components (Karpenter, cluster autoscaler, CNI, CSI, CRI, CRD, operators).
  • 5+ years in Kubernetes infrastructure.
  • Contributing to open-source Kubernetes projects.
  • 10+ years: software engineering experience.
  • Go, Python, Bash, etc. (one or more).
  • Excellent communication skills for both technical and non-technical stakeholders.
  • Bachelor’s or Master’s degree in Computer Science or related field (preferred).

Preferred Experience

  • GPU scheduling, container orchestration, HPC (high-performance computing) workloads.
  • Multi-cloud & hybrid cloud deployments familiarity.
  • MLOps platforms experience (Kubeflow, TFX, etc.).
  • FinOps practices & cloud cost management experience/knowledge

Job Tags

Remote job,

Similar Jobs

Maxion Research

Secret Shopper Hero - Shop, Share, and Earn - Flexible Scheduling with No Experience Required (Hiring Immediately) Job at Maxion Research

 ...experience is needed. Participants will have the option to choose particular studies based on their ability to participate either online, in person or over the telephone. Participants are needed on a wide range of topics such as: Health Issues (Research for... 

Excite Health Partners

Travel CT Tech - $2,079 per week in Houston, TX Job at Excite Health Partners

 ...Shift Details: Night Shift, 8:30pm - 7:00am (Wed-Sat), 5:30pm - 6:00am (Sun-Tue) Required Certifications: ARRT (CT), BLS (American Red Cross or American Heart Association) Scrub Attire: Hunter Green Floating Requirements: Not specified EMR Used: Meditech Relevant Skills:... 

GardaWorld

Surveillance Security Guard - Days Job at GardaWorld

 ...Job Description GardaWorld Security Services is Now Hiring a Surveillance Security Officer! Ready to suit up as a Surveillance Security...  ...- even better! Must be able to obtain/maintain an Iowa Private Security License (GardaWorld will help to obtain) In the United... 

Netflix

Software Engineer 5 Job at Netflix

Netflix is one of the worlds leading entertainment services with 278 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime... 

Russell Tobin

Localization Project Manager Job at Russell Tobin

 ...Russell Tobin & Associates is currently seeking a Localization Project Manager, Culver City, CA 90232 (Hybrid) Contract role for one of our Fortune 500 clients. Apply today for immediate consideration. Position: Localization Project Manager Location: Culver City...