Job Description
We are seeking a visionary Senior AI/LLM Engineer to join our elite engineering team in San Francisco. As a leader in the AI space, we are building the next generation of generative AI solutions that will redefine how humans interact with technology. If you are passionate about pushing the boundaries of Large Language Models (LLMs), fine-tuning, and deploying scalable AI infrastructure, we want to hear from you.
In this role, you will collaborate with world-class researchers and product engineers to architect robust, efficient, and safe AI systems. You will be at the forefront of innovation, working on projects that have a tangible impact on millions of users worldwide.
Responsibilities
- Design, train, and deploy state-of-the-art Large Language Models (LLMs) and transformer architectures.
- Optimize model inference performance and reduce latency for real-time applications.
- Implement advanced fine-tuning strategies (e.g., LoRA, P-Tuning) to enhance model accuracy and domain specificity.
- Collaborate with data science teams to curate high-quality datasets and ensure ethical AI practices.
- Build and maintain MLOps pipelines for continuous integration and deployment of AI models.
- Research emerging trends in NLP and contribute to the technical roadmap for future AI capabilities.
Qualifications
- Masterβs or PhD in Computer Science, Artificial Intelligence, or a related technical field.
- 5+ years of professional experience in machine learning, deep learning, or NLP.
- Proficiency in Python, PyTorch, or TensorFlow.
- Strong experience with model serving frameworks (e.g., TensorFlow Serving, TorchServe, Ray Serve).
- Deep understanding of transformer architectures, attention mechanisms, and LLM training pipelines.
- Experience with cloud platforms (AWS, GCP, or Azure) and containerization (Docker, Kubernetes).