About the job
At Cohere, we believe in the power of multimodal AI to revolutionise the way we interact with technology. Our engineering teams push the boundaries of what's possible, and we're looking for talented individuals to join us on this exciting journey. With an exceptional ratio of compute resources to engineers, we provide an ideal environment for you to explore, innovate and shape the future of AI.
Responsibilities
Design and develop cutting-edge multimodal AI systems, integrating various modalities such as text, speech, and vision.
Conduct research and experiments on our advanced compute infrastructure, exploring novel ideas in areas such as multimodal representation learning and transfer learning.
Collaborate closely with our world-class teams, learning from and contributing to their expertise in the field.
Qualifications
Minimum
Possess exceptional software engineering skills, with a proven track record of building robust and scalable systems.
Have a strong command of Python and be well-versed in popular deep learning frameworks such as JAX, PyTorch, and TensorFlow, including their multimodal capabilities.
Knowledge of distributed training strategies, especially for large-scale multimodal models.
Familiarity with autoregressive models, particularly their application to multimodal tasks such as image or video captioning and speech-to-text generation.
Preferred
Publications in top-tier venues demonstrating your expertise in multimodal AI research.
Experience writing efficient GPU kernels in CUDA to optimise performance for multimodal tasks.