Information Technology · Full Time · Verified

Senior AI & LLM Architect

Apex Future Tech
San Francisco
Estimated Salary
USD 165,000 – USD 230,000
Live Update
12 May 2026
Deadline
12 May 2027

Job Description

About Apex Future Tech: We are a premier engineering firm dedicated to shaping the technological landscape of the next decade. We are currently seeking a visionary Senior AI & LLM Architect to join our elite R&D division. This role is pivotal in designing the infrastructure that will power our generative AI solutions through 2026 and beyond.

The Role: You will be at the forefront of artificial intelligence innovation, tasked with architecting robust, scalable, and secure Large Language Models (LLMs). You will bridge the gap between theoretical research and production-grade deployment, ensuring our systems are not only cutting-edge but also ethically sound and highly efficient.

Why Join Us? We offer a dynamic environment where your work directly impacts the future of human-computer interaction. Enjoy comprehensive benefits, stock options, and the opportunity to work with industry leaders in AI.

Responsibilities

  • Architect and deploy scalable Large Language Models (LLMs) designed for 2026 enterprise standards.
  • Optimize model inference speeds and reduce latency for real-time generative applications.
  • Lead the research and development of proprietary fine-tuning methodologies and RLHF pipelines.
  • Collaborate with cross-functional teams to integrate AI capabilities into consumer and B2B products.
  • Establish best practices for MLOps, ensuring reproducibility and governance of AI models.
  • Mentor junior engineers and data scientists, fostering a culture of technical excellence and continuous learning.

Qualifications

  • PhD or Master’s degree in Computer Science, Artificial Intelligence, or a related quantitative field.
  • 7+ years of professional experience in machine learning engineering and deep learning frameworks.
  • Expert proficiency in Python, PyTorch, and TensorFlow.
  • Proven track record of deploying LLMs (e.g., GPT-4, LLaMA) in high-traffic production environments.
  • Deep understanding of distributed systems, cloud infrastructure (AWS/GCP/Azure), and containerization (Docker/Kubernetes).
  • Strong knowledge of NLP concepts, transformer architectures, and prompt engineering.

Required Skills

Python, PyTorch, TensorFlow, MLOps, AWS, GCP, Kubernetes, Docker, LLM, NLP, Generative AI, Machine Learning Engineering

Ready to Take This Challenge?

Make sure your resume is ready. Submit your application now before the deadline.