Job Description
We are looking for a visionary Senior AI Engineer to join our elite research team in San Francisco. At Nebula AI, we are building the next generation of Large Language Models and generative AI systems that will redefine human-computer interaction. You will work on cutting-edge architectures, optimize deep learning pipelines, and deploy scalable models that power millions of users.
Why Join Us?
β’ Work on state-of-the-art Generative AI and LLMs.
β’ Competitive compensation and equity packages.
β’ Collaborative, diverse, and inclusive culture.
β’ Flexible remote-first policy with a hub in SF.
Responsibilities
- Design, implement, and optimize large-scale machine learning models, specifically focusing on Transformers and Diffusion models.
- Collaborate with research scientists to prototype novel algorithms and improve model performance (accuracy, latency, and throughput).
- Manage the end-to-end machine learning lifecycle, including data preprocessing, model training, validation, and deployment.
- Conduct rigorous code reviews and mentor junior engineers to maintain high engineering standards.
- Stay abreast of the latest academic research and industry trends to integrate best practices into our production systems.
- Work closely with product teams to translate complex technical requirements into robust AI solutions.
Qualifications
- Masterβs or PhD degree in Computer Science, Mathematics, Statistics, or a related field.
- 5+ years of professional experience in software development and machine learning engineering.
- Deep expertise in Python, PyTorch, TensorFlow, or JAX.
- Strong understanding of deep learning fundamentals, natural language processing (NLP), and neural architecture design.
- Experience with distributed training systems (e.g., Ray, Kubernetes, Slurm) and MLOps tools (e.g., MLflow, DVC).
- Excellent problem-solving skills and the ability to communicate complex technical concepts clearly.