Job Description
At Zai Nexus, we are not just building software; we are architecting the digital reality for 2026 and beyond. We are seeking a visionary Senior AI Architect to lead our cutting-edge R&D division. In this role, you will define the roadmap for autonomous agents and next-generation generative models that will power the enterprise of tomorrow.
You will be at the forefront of technical innovation, bridging the gap between theoretical AI breakthroughs and scalable, production-grade systems. If you are passionate about pushing the boundaries of what is possible with artificial intelligence, we want to hear from you.
Why Join Us?
- Work on projects that directly influence the future of global industries.
- Competitive compensation package and equity options.
- Top-tier benefits and a collaborative, forward-thinking culture.
Responsibilities
- Architect Next-Gen Systems: Design, implement, and optimize state-of-the-art Large Language Models (LLMs) tailored for complex 2026 enterprise requirements.
- Research & Development: Lead internal research initiatives to explore emerging AI paradigms, including Reinforcement Learning and Transformer architectures.
- Model Optimization: Improve model inference latency, throughput, and cost-efficiency for real-time applications.
- Technical Leadership: Mentor a team of brilliant engineers and data scientists, fostering a culture of technical excellence and continuous learning.
- Strategic Planning: Define the long-term AI strategy and collaborate with product management to align technical roadmaps with business goals.
- Production Deployment: Manage the end-to-life cycle of AI models from training data preparation to model serving and monitoring.
Qualifications
- Education: Ph.D. or Masterβs degree in Computer Science, Mathematics, Statistics, or a related field.
- Experience: 7+ years of experience in Machine Learning, Deep Learning, or AI research, with at least 3 years in a senior leadership role.
- Technical Stack: Proficiency in Python, PyTorch, TensorFlow, or JAX. Experience with cloud platforms (AWS, GCP, or Azure) is required.
- Domain Expertise: Deep understanding of Natural Language Processing (NLP), Computer Vision, or Generative AI models.
- Problem Solving: Proven track record of solving complex, unstructured problems using data-driven approaches.
- Communication: Exceptional ability to communicate complex technical concepts to both technical and non-technical stakeholders.