AI Inference Engineer Job at Signify Technology, Alameda, CA

  • Signify Technology
  • Alameda, CA

Job Description

AI Inference Engineer – Stealth Startup | San Francisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.
