Full Description
Responsibilities
Experience: 7+ years in AI development/architecture, with a focus on Generative AI and large-scale deep learning systems.
Technical skills:
• LLMs and Generative Models: Proven expertise in LLMs (customization, fine-tuning), Retrieval-Augmented Generation (RAG), and other generative models like GANs and Diffusion Models.
• Programming: Proficient in Python and deep learning frameworks like PyTorch or TensorFlow.
• NLP: Advanced understanding of modern NLP techniques, such as transformer models and tokenization.
• Cloud: Extensive experience with cloud platforms (AWS, Azure, GCP) and their respective AI/ML services (e.g., SageMaker, Vertex AI).
• MLOps & Infrastructure: Experience with MLOps practices and building high-availability, low-latency systems using containers (Docker, Kubernetes) and microservices.
• Monitor and increase the performance of RAG system.
• Understand the basics of RAG and how to validate its output.
• Determine how to increase the accuracy of RAG.
• Identify general tools/techniques that will be used on the back for retrieving/generating the data.
Find AI, ML, Data Science Jobs By Location
Find Jobs By Position