< More Jobs

Posted on 2026/03/17

Account Executive / Territory Leader - AI EdTech (Dallas)

EQL Tech

Dallas, TX, United States

Full-time

Job highlights Identified by Google from the original job post Qualifications • We are looking for someone who has built and shipped AI systems into production and understands the challenges of scalable inference and model serving • Strong Python development experience (3.10+) • Hands-on experience building production APIs with FastAPI • Experience with HuggingFace Transformers and PyTorch • Solid understanding of REST API design • Experience deploying containerized applications with Docker • 3 more items(s) Responsibilities • This role focuses on developing high-performance APIs for model inference, optimizing GPU workloads, and deploying AI services in cloud environments • Develop high-performance APIs using Python (3.10+) and FastAPI • Build and deploy LLM inference services using HuggingFace Transformers and PyTorch • Optimize GPU workloads and CUDA memory usage • Implement streaming inference APIs for real-time model responses • Containerize and deploy services using Docker and GPU-enabled infrastructure • Deploy AI workloads in Azure environments (AKS, ACI, or Container Apps) • 4 more items(s) More job highlights Job description We are looking for a Senior Python / AI API Engineer to build and deploy production-grade services powering Large Language Model (LLM) applications.

This role focuses on developing high-performance APIs for model inference, optimizing GPU workloads, and deploying AI services in cloud environments.

This is an engineering-focused role, not research. We are looking for someone who has built and ship...ped AI systems into production and understands the challenges of scalable inference and model serving.

Key Responsibilities

• Develop high-performance APIs using Python (3.10+) and FastAPI

• Build and deploy LLM inference services using HuggingFace Transformers and PyTorch

• Optimize GPU workloads and CUDA memory usage

• Implement streaming inference APIs for real-time model responses

• Containerize and deploy services using Docker and GPU-enabled infrastructure

• Deploy AI workloads in Azure environments (AKS, ACI, or Container Apps)

Required Skills

• Strong Python development experience (3.10+)

• Hands-on experience building production APIs with FastAPI

• Experience with HuggingFace Transformers and PyTorch

• Solid understanding of REST API design

• Experience deploying containerized applications with Docker Show full description Report this listing Loading...

Zero to AI Engineer Program

Zero to AI Engineer

Skip the degree. Learn real-world AI skills used by AI researchers and engineers. Get certified in 8 weeks or less. No experience required.