Posted on 3/19/2025
Product Development Manager - Large Language Models
Cohere
Toronto, ON
Full Description
Job Summary
We are seeking an experienced AI Research Engineer to join our team, focusing on designing, building, and scaling AI systems that underpin our suite of dev-centric enterprise products. As a key member of our AI research team, you will collaborate closely with our top researchers and engineers to develop novel AI-powered technologies that drive our business forward.
The ideal candidate will have strong software engineering skills, particularly in Python and related ML frameworks such as Tensorflow, TF-Serving, JAX, and XLA/MLIR. They will also have direct experience in building and deploying large-scale language models, a strong portfolio of successful product releases, and experience in creating and curating large-scale datasets.
Responsibilities
• Design, build, and scale AI systems: You will be responsible for designing, building, and scaling AI systems that underpin our suite of dev-centric enterprise products.
• Develop novel AI-powered technologies: You will collaborate closely with our top researchers and engineers to develop novel AI-powered technologies that drive our business forward.
• Work on North, Cohere's all-in-one secure AI workspace platform: You will work on North, driving agent development in RAG, tool use, and language agents embedded in North.
• Experiment with novel ideas: With access to our supercomputer and data infrastructure, you will be empowered to experiment with novel ideas and bring them to market quickly.
Requirements
• Expertise in software engineering: You should have strong programming skills, particularly in Python and related ML frameworks.
• Leadership experience in a product-centric organization: You should have a proven track record of managing teams and delivering high-quality products.
• Hands-on experience in building Large Language Models: You should have direct experience in developing and deploying large-scale language models.
• A strong portfolio of successful product releases: You should have released multiple features with several iterations.
• Experience in creating and curating large-scale datasets: You should have collected, processed, and managed large datasets.
• Familiarity with distributed training strategies: You should have knowledge of distributed training methods and their applications.
• Experience working with Transformer-based architectures: You should have experience working with autoregressive sequence models, such as Transformers.
• Collaboration skills: You should be able to work seamlessly with annotators and other stakeholders to deliver high-quality products.
• Publishing experience in top-tier venues: Having publications in top-tier conferences such as NeurIPS, ICML, ICLR, AIStats, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP would be an added advantage.
What We Offer
• Inclusive culture and work environment: We foster a culture of respect, open communication, and teamwork.
• Collaborative work environment: You will be part of a talented team of researchers and engineers pushing the boundaries of AI.
• Lunch stipend and in-office meals: We provide regular meals and snacks to keep your energy levels up.
• Comprehensive health and dental benefits: We offer medical and dental coverage for you and your family.
• Generous parental leave policy: We support new parents with top-up leave policies.
• Personal enrichment benefits: You can use these benefits to learn new skills, attend conferences, or pursue hobbies.
• Flexible work arrangements: We allow flexible work schedules to balance work and personal life.
• Ample vacation time: You will have plenty of time to relax and recharge.
Find AI, ML, Data Science Jobs By Location
Find Jobs By Position
Subscribe to the AI Search Newsletter
Get top updates in AI to your inbox every weekend. It's free!