Posted on 2026/03/17
Account Executive / Territory Leader - AI EdTech (Dallas)
EQL Tech
Dallas, TX, United States
Job highlights Identified by Google from the original job post Qualifications • We are looking for someone who has built and shipped AI systems into production and understands the challenges of scalable inference and model serving • Strong Python development experience (3.10+) • Hands-on experience building production APIs with FastAPI • Experience with HuggingFace Transformers and PyTorch • Solid understanding of REST API design • Experience deploying containerized applications with Docker • 3 more items(s) Responsibilities • This role focuses on developing high-performance APIs for model inference, optimizing GPU workloads, and deploying AI services in cloud environments • Develop high-performance APIs using Python (3.10+) and FastAPI • Build and deploy LLM inference services using HuggingFace Transformers and PyTorch • Optimize GPU workloads and CUDA memory usage • Implement streaming inference APIs for real-time model responses • Containerize and deploy services using Docker and GPU-enabled infrastructure • Deploy AI workloads in Azure environments (AKS, ACI, or Container Apps) • 4 more items(s) More job highlights Job description We are looking for a Senior Python / AI API Engineer to build and deploy production-grade services powering Large Language Model (LLM) applications.
This role focuses on developing high-performance APIs for model inference, optimizing GPU workloads, and deploying AI services in cloud environments.
This is an engineering-focused role, not research. We are looking for someone who has built and ship...ped AI systems into production and understands the challenges of scalable inference and model serving.
Key Responsibilities
• Develop high-performance APIs using Python (3.10+) and FastAPI
• Build and deploy LLM inference services using HuggingFace Transformers and PyTorch
• Optimize GPU workloads and CUDA memory usage
• Implement streaming inference APIs for real-time model responses
• Containerize and deploy services using Docker and GPU-enabled infrastructure
• Deploy AI workloads in Azure environments (AKS, ACI, or Container Apps)
Required Skills
• Strong Python development experience (3.10+)
• Hands-on experience building production APIs with FastAPI
• Experience with HuggingFace Transformers and PyTorch
• Solid understanding of REST API design
• Experience deploying containerized applications with Docker Show full description Report this listing Loading...

Zero to AI Engineer
Skip the degree. Learn real-world AI skills used by AI researchers and engineers. Get certified in 8 weeks or less. No experience required.
Find AI, ML, Data Science Jobs By Location
Find Jobs By Position