Job Description
We are seeking a visionary Generative AI Architect to lead our next-generation AI infrastructure. As we look toward the horizon of 2026, your expertise will define the technical roadmap for scalable, secure, and high-performance models. If you are passionate about pushing the boundaries of Large Language Models (LLMs) and building the foundation for the future of intelligent systems, we want to hear from you.
In this role, you will bridge the gap between cutting-edge research and production-grade engineering, ensuring our AI solutions are robust, efficient, and ready for global deployment.
Responsibilities
- Architect and design scalable Generative AI pipelines and infrastructure to support 2026 enterprise standards.
- Optimize large language models for latency, throughput, and cost-efficiency in production environments.
- Lead the MLOps strategy, implementing CI/CD pipelines for machine learning models.
- Collaborate with product and engineering teams to translate business requirements into technical AI solutions.
- Ensure data privacy, security, and compliance across all AI workloads.
- Research and evaluate emerging AI frameworks and technologies to maintain a competitive edge.
Qualifications
- Ph.D. or Masterβs degree in Computer Science, Machine Learning, or a related field.
- Minimum of 5+ years of experience in software engineering with a focus on AI/ML.
- Deep expertise in Python, PyTorch, TensorFlow, and Hugging Face Transformers.
- Proven experience deploying and scaling LLMs on cloud platforms (AWS, GCP, or Azure).
- Strong understanding of MLOps, Kubernetes, and distributed systems.
- Excellent problem-solving skills and the ability to thrive in a fast-paced, innovative environment.