Stellon Labs Openings

Open Positions

Research Engineer

Location: San Francisco / Remote

Type: Full-time

Are you passionate about making AI models smaller, faster, and more efficient? We're looking for a Research Engineer to lead our edge AI optimization efforts and develop cutting-edge solutions for deploying models on resource-constrained devices.

What You'll Do:

Design and implement advanced quantization techniques to compress neural networks while maintaining accuracy
Develop novel training methodologies optimized for edge deployment scenarios
Research and prototype new approaches to model compression, pruning, and knowledge distillation
Collaborate with hardware teams to optimize models for specific chip architectures
Publish findings and contribute to the broader research community
Mentor junior engineers and drive technical excellence across the team

What We're Looking For:

Quantization Expertise: Deep understanding of post-training quantization, quantization-aware training, and mixed-precision techniques
PyTorch Mastery: Extensive experience with PyTorch for research and production, including custom operators and model optimization
Model Training: Proven track record in training large-scale models, distributed training, and hyperparameter optimization
Advanced degree in Computer Science, Machine Learning, or related field (or equivalent experience)
Publications in top-tier ML conferences/journals preferred

Bonus Points: