AI Inference Engineer Job at Signify Technology, Alameda, CA

dlNuRFJ3MEpTeE1ZRGcxeVpwN0FQNW43SWc9PQ==
  • Signify Technology
  • Alameda, CA

Job Description

AI Inference Engineer – Stealth Startup | San Fransisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.

Job Tags

Similar Jobs

Binding Minds Inc. (Certified Disability Owned Business Ente...

Graphic Designer Job at Binding Minds Inc. (Certified Disability Owned Business Ente...

 ...As the Analyst, Restaurant Training, you are a junior Instructional Designer, responsible for supporting the development of client's operational training materials used every day by 130,000+ team members in 3,800+ restaurants located across the United States and Canada... 

Primrose Schools

School Director Job at Primrose Schools

 ...POSITION SUMMARY The School Director at Primrose School is primarily responsible for driving enrollments and managing the overall operations...  ...back to your local community through Spring Fling and charity events. As the leader in early education and care, our research-... 

The Jewish Federation of Palm Beach County

Senior Communications Specialist Job at The Jewish Federation of Palm Beach County

 ...Utilize your creativity, passion, and strategic thinking to inspire others. As a Senior Communications Specialist at Jewish Federation of Palm Beach County, you will play a crucial role in advancing our philanthropic mission through impactful storytelling and meaningful... 

TELUS Digital AI Data Solutions

Online Task Contributor | Part-time, Remote in the US Job at TELUS Digital AI Data Solutions

 ...working from home while learning more about and contributing to the development of AI technologies, look no further. This flexible freelance role will help you to make your spare time pay off. A Day in the Life of an Online Task Contributor: In this role, you... 

Upward Health

Triage Nurse Job at Upward Health

 ...and will ensure interdisciplinary care is optimized toward targeted outcomes. By collaborating with the Clinical Operations team to assess, plan, implement, coordinate, monitor, and evaluate services and outcomes to maximize the patients health. The Triage will...