Cohere

Latest active Cohere jobs

Product • Startup


Total funding: $942.95M

Latest funding: $500M · Jul 2024

Employees: 201-500

Founded: 2019

Cohere is a data security-focused AI company that develops safe, scalable NLP solutions using innovative large language models, enhancing business efficiency and privacy.


Cohere jobs

AI Developer

Security Engineer

DevOps Engineer


Latest active Cohere jobs

AI Developer

Security Engineer

DevOps Engineer

Digital Marketing

Product Designer

Data Analyst

Finance

Machine Learning Developer

Business Development Specialist

Director

Fullstack Developer

Product Manager

Solutions Architect (Public Sector)


Washington

Office

5+ years of public-sector, customer-facing technical pre/post-sales solutions architect (or similar) experience. 2+ years architecting or deploying NLP/AI/LLM solutions. Experience in Python. Experience in Jupyter Notebooks.

Member of Technical Staff, Modeling


New York

Remote

Extremely strong software engineering skills. Proficiency in Python and related ML frameworks such as TensorFlow, TF-serving, JAX, and XLA/MLIR. Experience writing kernels for GPUs using CUDA. Experience using large-scale distributed training strategies. Familiarity with autoregressive sequence models, such as transformers.

Infrastructure Software Engineer, Security


San Francisco

Remote

5+ years in software engineering. Experience in Kubernetes. Experience in GCP, Azure, AWS, OCI, multi-cloud on-prem / hybrid serving. Experience in OIDC specifications, including OAuth 2.0, JWT, and related protocols. Experience in secure coding practices.

Member of Technical Staff, Model Serving Infrastructure


New York

Hybrid

5+ years of engineering experience running production infrastructure at large scale. Experience designing large, highly available distributed systems with Kubernetes, and GPU workloads on those clusters. Experience with Kubernetes dev and production coding and support. Experience with GCP, Azure, AWS, OCI, multi-cloud on-prem / hybrid serving. Experience in compute/storage/network resource and cost management. Experience designing, deploying, supporting, and troubleshooting complex Linux-based computing environments. Experience in Golang, C++, or other languages designed for high-performance scalable servers.

Senior Software Engineer, Secure Agents


San Francisco

Remote

5+ years in software engineering with a strong focus on developing user-facing security features. Experience in Python. Experience in secure coding practices.

Infrastructure Software Engineer, Security


New York

Hybrid

5+ years in software engineering with a strong focus on developing user-facing security features. Experience in Kubernetes dev and production coding and support. Experience with GCP, Azure, AWS, OCI, multi-cloud on-prem / hybrid serving. Experience with OIDC specifications, including OAuth 2.0, JWT, and related protocols. Experience in secure coding practices. Experience in session management. Experience in multi-factor authentication.

Senior Software Engineer, Secure Agents


New York

Hybrid

5+ years in software engineering with a strong focus on developing user-facing security features. Experience in Python. Experience in session management. Experience in multi-factor authentication. Experience in secure coding practices.

Member of Technical Staff, Model Serving Infrastructure


San Francisco

Hybrid

5+ years of engineering experience running production infrastructure at large scale. Experience designing large, highly available distributed systems with Kubernetes, and GPU workloads on those clusters. Experience with Kubernetes dev and production coding and support. Experience with GCP, Azure, AWS, OCI, multi-cloud on-prem / hybrid serving. Experience in compute/storage/network resource and cost management. Experience designing, deploying, supporting, and troubleshooting complex Linux-based computing environments. Experience with AI, AI tools, and AI agents. Experience with Golang and C++. Experience with distributed systems, GPUs, TPUs, and custom accelerators.

Member of Technical Staff, Model Serving Infrastructure


Seattle

Remote

5+ years of engineering experience running production infrastructure at large scale. Experience designing large, highly available distributed systems with Kubernetes, and GPU workloads on those clusters. Experience with Kubernetes dev and production coding and support. Experience with GCP, Azure, AWS, OCI, multi-cloud on-prem / hybrid serving. Experience in compute/storage/network resource and cost management. Experience designing, deploying, supporting, and troubleshooting complex Linux-based computing environments. Experience in Golang, C++, or other languages designed for high-performance scalable servers.

Senior Member of Technical Staff - Multimodal AI


New York

Remote

Experience in Python. Experience in deep learning frameworks like JAX, PyTorch, and TensorFlow. Knowledge of distributed training strategies. Familiarity with autoregressive models. Experience in writing efficient GPU kernels using CUDA.

Senior Member of Technical Staff - Multimodal AI


San Francisco

Remote

Experience in Python. Experience in deep learning frameworks like JAX, PyTorch, and TensorFlow. Knowledge of distributed training strategies. Familiarity with autoregressive models. Experience in writing efficient GPU kernels using CUDA.

Member of Technical Staff, MLE


New York

Remote

3+ years in model training, deployment, and maintenance in a production environment. Experience in NLP and deep learning. Experience scaling products at a hyper-growth startup. Experience improving LLM performance for custom domains via fine-tuning or RLHF. Experience in information retrieval systems for document question answering. Experience in day-to-day NLP for industry using Python and related toolchains (spaCy, HuggingFace, NLTK, etc.). Published research in areas of machine learning at major conferences and/or journals.

Senior Social Media Strategist


New York

Remote

7+ years in digital/social marketing.

Marketing Analytics Consultant


Boston

Remote

5+ years in marketing analytics. Experience in CRM. Experience in marketing automation integrations. Experience in setting up relevant fields for measurement. Experience in campaign performance. Experience in e2e lead to cash. Experience in web traffic analytics. Experience in attribution modeling. Experience in working across sales and finance teams.

Member of Technical Staff, Search


San Francisco

Remote

Experience leveraging large language models as part of training data or evaluation pipelines. Experience working with a wide range of technologies. Experience building training and/or evaluation datasets for practical use cases. Experience using large-scale distributed training strategies with GPUs.

Senior Social Media Strategist


San Francisco

Remote

7+ years in digital/social marketing.

Senior Social Media Strategist


San Francisco

Hybrid

7+ years in digital/social marketing.

Product Manager, Search and Embeddings


New York

4+ years in product management. 1+ years in search products and embedding/reranking models. Experience in working on search products and embedding / cross-encoder models.

Machine Learning Intern/Co-op (Fall 2025)


San Francisco

Hybrid

Experience in Python. Experience in TensorFlow. Experience in TF-Serving. Experience in JAX. Experience in XLA/MLIR. Experience in large-scale distributed training strategies. Experience in autoregressive sequence models, such as transformers. Experience in continual and active learning strategies for streaming data.

Senior Tech Lead Manager, Model Efficiency


San Francisco

Remote

3+ years managing engineering teams with demonstrable impact on system performance metrics and team growth. Experience with transformer architecture optimizations, attention mechanism enhancements, and kv-cache optimization techniques. Experience implementing LLM inference optimizations such as continuous batching, speculative decoding, or decoder-specific memory optimizations. Experience in at least one ML framework's execution pipeline (PyTorch, JAX, TensorFlow) and corresponding compiler stack.