AI Inference Engineer Job at Signify Technology, Alameda, CA

dlNuRFJ3MEpTeE1ZRGcxeVpwN0FQNW43SWc9PQ==
  • Signify Technology
  • Alameda, CA

Job Description

AI Inference Engineer – Stealth Startup | San Fransisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.

Job Tags

Similar Jobs

Webull Financial LLC

Internship Job at Webull Financial LLC

Internship 2025 Summer Internship Program Saint Petersburg, FL 2025 Summer Internship - New York New York, NY

Tailor Made Compounding

Staff Pharmacist Job at Tailor Made Compounding

 ...Tired of working for a retail pharmacy? Look no further! We are hiring a Staff Pharmacist for our growing team! Established in 2016, Tailor Made Compounding has become one of the top compounding pharmacies in the nation, dedicated to providing quality medications... 

Love Pool Care

VP of Finance Job at Love Pool Care

 ...rapidly expanding organization. Position Summary: The VP of Finance is a senior financial and executive management support role...  ... Treasury Management: Serve as the key point of contact for banks and cash management. Payroll Processing: Approve and QA payroll... 

SCA Health

Director Practice Revenue Cycle Operations Job at SCA Health

 ...the need or challenge . Responsibilities Reporting to the Vice President of Practice Revenue Cycle Operations, the Director of Practice Revenue Cycle Operations will work directly with local leadership to support specialty practice partnerships within the... 

Krete

Chemist Job at Krete

 ...Krete Industries is looking for a Chemist with a strong formulation background and experience with chemicals commonly used in the concrete admixture industry such as stearates, surfactants/wetting agents, silanes, and air entrainers. The Chemist will be responsible for...