Jobs for the people who keep systems observable.

Hand-curated observability roles — OpenTelemetry, Prometheus, Grafana, tracing, metrics, logs and SRE — pulled straight from the companies building the stack. No recruiters, no noise.

394 open roles across the observability ecosystem
Latest roles
Senior Software Engineer - JVM Language Clients
Develop JVM language clients for ClickHouse database
ClickHouse Tel Aviv senior
Profiling, DevRel, ClickHouse
today
Senior Software Engineer - JVM Language Clients
Develop JVM language clients for ClickHouse database
ClickHouse Spain (remote) ◆ Remote senior
Profiling, Remote, DevRel, ClickHouse
today
Program Manager, Global Technical Services
New Relic Atlanta, Georgia, USA; Austin, Texas, USA; Portland, Oregon, USA
Metrics, Logs
today
Senior AI Engineer | Canada | Remote
Grafana Labs Canada (Remote) ◆ Remote
Metrics, Logs, Remote, Backend, Grafana
today
Senior AI Engineer | US | Remote
Grafana Labs United States (Remote) ◆ Remote
Metrics, Logs, Remote, Backend, Grafana
today
Senior Field Engineer | Germany | Remote
Grafana Labs Germany (Remote) ◆ Remote
Remote, SRE, Backend, Prometheus, Grafana
today
Senior SRE - Platform (Managed Kubernetes Infrastructure)
Design, build, and scale multi-cloud platform for hosted and serverless services, ensuring reliability and automating system engineering efforts.
Elastic Canada senior
Metrics, Incident Response, SRE, Backend, Prometheus
today
Staff AI Engineer - Data Visualization
Datadog Bordeaux, France; Grenoble, France; Lyon, France; Madrid, Spain; Montpellier, France; Nantes, France; Nice, France; Paris, France; Sophia Antipolis, France
today
Sr Software Engineer, Storage
Design and build scalable storage infrastructure for petabytes of telemetry data on AWS
Cribl Remote - United States ◆ Remote senior
Metrics, Incident Response, Remote, Data
1d ago
Director, Security Channels (North America)
Datadog New York, New York, USA
Metrics
1d ago
Senior Software Engineer - Streaming Platform (NORAM)
Datadog New York, New York, USA
Metrics, APM, Incident Response, Backend
2d ago
Software Engineer - Core Product
Design and build highly available services, scalable databases, and reliable data streams for PagerDuty's scheduling platform.
PagerDuty Toronto senior
Incident Response, Backend, Frontend
3d ago
Platform Engineer
Design, maintain, and scale infrastructure for incident response platform
incident.io London ◆ Remote mid £110k – £200k
Incident Response, Remote, SRE, Platform, DevRel
3d ago
Principal Data Scientist - Agent Builder
Elastic Sweden
Metrics, Backend
3d ago
Principal Data Scientist - Agent Builder
Elastic Portugal
Metrics, Backend
3d ago
Principal Data Scientist - Agent Builder
Elastic Greece
Metrics, Backend
3d ago
Principal Data Scientist - Agent Builder
Elastic Spain
Metrics, Backend
3d ago
Staff Technical Program Manager
Technical program manager for platform org, driving strategic execution and partnering with senior engineering leaders.
Sentry San Francisco, California ◆ Remote staff $200k – $240k
Metrics, Incident Response, Remote, SRE
3d ago
Staff Software Engineer - K9 Security
Datadog Portugal, Remote ◆ Remote
Remote, eBPF
3d ago
Staff Software Engineer - K9 Security
Datadog France, Remote; Germany, Remote; Ireland, Remote; Italy, Remote; Spain, Remote ◆ Remote
Remote, eBPF
3d ago
Staff Software Engineer - K9 Security
Datadog Paris, France
eBPF
3d ago
Manager, Software Engineering (Fullstack Team)
New Relic Bangalore, India
SRE, Backend, Frontend, Engineering Management
3d ago
Resident Architect - LATAM
Honeycomb Remote - Brazil ◆ Remote
Remote, SRE, OpenTelemetry
4d ago
Senior Software Engineer, Control Plane
Design and operate core platform primitives for distributed coordination, routing, replication, and lifecycle orchestration.
Sentry Toronto, Ontario, Canada ◆ Remote senior CA$200k – CA$295k
Incident Response, Remote, Backend, Kubernetes
6d ago
Sr Information Systems Engineer, IT Engineering
Design and run core systems for a telemetry infrastructure company, focusing on identity, access, endpoints, cloud infrastructure, and automation.
Cribl Remote - United States ◆ Remote senior
Metrics, Logs, Incident Response, Remote
6d ago
Senior Services Architect - New York
Datadog New York, New York, USA
Kubernetes
6d ago
Senior Software Engineer - Observability Visibility
Datadog New York, New York, USA
SRE
6d ago
Principal Software Engineer I - Distributed Systems - Elasticsearch
Design and improve Elasticsearch's distributed systems for scale, performance, and resilience
Elastic Sweden senior
Backend
6d ago
Principal Software Engineer I - Distributed Systems - Elasticsearch
Design and improve Elasticsearch's distributed systems for scale, performance, and resilience
Elastic Spain senior
Backend
6d ago
Principal Software Engineer I - Distributed Systems - Elasticsearch
Design and improve Elasticsearch's distributed systems for scale, performance, and resilience
Elastic Romania senior
Backend
6d ago
Principal Software Engineer I - Distributed Systems - Elasticsearch
Design and improve Elasticsearch's distributed systems for scale, performance, and resilience
Elastic Portugal senior
Backend
6d ago
Principal Software Engineer I - Distributed Systems - Elasticsearch
Design and improve Elasticsearch's distributed systems for scale, performance, and resilience
Elastic Norway senior
Backend
6d ago
Principal Software Engineer I - Distributed Systems - Elasticsearch
Design and improve Elasticsearch's distributed systems for scale, performance, and resilience
Elastic Ireland senior
Backend
6d ago
Principal Software Engineer I - Distributed Systems - Elasticsearch
Design and improve Elasticsearch's distributed systems for scale, performance, and resilience
Elastic Hungary senior
Backend
6d ago
Principal Software Engineer I - Distributed Systems - Elasticsearch
Design and improve Elasticsearch's distributed systems for scale, performance, and resilience
Elastic Greece senior
Backend
6d ago
Principal Software Engineer I - Distributed Systems - Elasticsearch
Design and improve Elasticsearch's distributed systems for scale, performance, and resilience
Elastic United States senior
Backend
6d ago
Principal Software Engineer I - Distributed Systems - Elasticsearch
Design and improve Elasticsearch's distributed systems for scale, performance, and resilience
Elastic United Kingdom senior
Backend
6d ago
Principal Software Developer I - Distributed Systems - Elasticsearch
Design and improve Elasticsearch's distributed systems for scale, performance, and resilience
Elastic Canada senior
Backend
6d ago
Senior Software Engineer - Fullstack (Backend Focused)
New Relic Bangalore, India
Backend, Frontend, Kubernetes
6d ago
Technical Escalations Engineer 2 (APM - Java/Proxy/C++) - Mexico City
Datadog Mexico City, Mexico
Metrics, Tracing, APM, Backend
1w ago
Senior Java Engineer - Distributed Systems - Elasticsearch
Design and improve Elasticsearch's distributed systems for scale, performance, and resilience
Elastic Sweden senior
Backend
1w ago
Senior Java Engineer - Distributed Systems - Elasticsearch
Design and improve Elasticsearch's distributed systems for scale, performance, and resilience
Elastic Spain senior
Backend
1w ago
Senior Java Engineer - Distributed Systems - Elasticsearch
Design and improve Elasticsearch's distributed systems for scale, performance, and resilience
Elastic Romania senior
Backend
1w ago
Senior Java Engineer - Distributed Systems - Elasticsearch
Design and improve Elasticsearch's distributed systems for scale, performance, and resilience
Elastic Norway senior
Backend
1w ago
Senior Java Engineer - Distributed Systems - Elasticsearch
Design and improve Elasticsearch's distributed systems for scale, performance, and resilience
Elastic Portugal senior
Backend
1w ago
Senior Java Engineer - Distributed Systems - Elasticsearch
Design and improve Elasticsearch's distributed systems for scale, performance, and resilience
Elastic Ireland senior
Backend
1w ago
Senior Java Engineer - Distributed Systems - Elasticsearch
Design and improve Elasticsearch's distributed systems for scale, performance, and resilience
Elastic Hungary senior
Backend
1w ago
Senior Java Engineer - Distributed Systems - Elasticsearch
Design and improve Elasticsearch's distributed systems for scale, performance, and resilience
Elastic Greece senior
Backend
1w ago
Senior Security Engineer - Cloud SIEM
Datadog Lisbon, Portugal
Incident Response, Security, Kubernetes
1w ago
Senior Security Engineer - Cloud SIEM
Datadog Dublin, Ireland; Madrid, Spain; Paris, France
Incident Response, Security, Kubernetes
1w ago
Sr Software Engineer, Edge
Develop back-end features for data collection across multiple platforms
Cribl Remote - United States ◆ Remote senior
Incident Response, Remote, Backend, Kubernetes
1w ago
Senior Software Engineer - Code Gen
Datadog New York, New York, USA
Backend
1w ago
Staff Professional Services Consultant
Partner with IT and Security teams to implement telemetry infrastructure for large enterprises.
Cribl Remote - United States ◆ Remote staff
Remote, SRE, Grafana, Kubernetes
1w ago
Staff Software Engineer - Platform, SysEng | Canada | Remote
Grafana Labs Canada (Remote) ◆ Remote
Metrics, Logs, Tracing, Incident Response, Remote
1w ago
Staff Software Engineer - Platform, SysEng | USA | Remote
Grafana Labs United States (Remote) ◆ Remote
Metrics, Logs, Tracing, Incident Response, Remote
1w ago
Software Engineer II - Lisbon
Design and implement integrations with various tools and technologies, leveraging APIs, webhooks, and SaaS platforms.
PagerDuty Lisbon mid
Incident Response, Backend, Frontend
1w ago
Principal Product Manager Infrastructure, Observability
Define and drive Observability product strategy for infrastructure systems and services
Elastic Canada senior
1w ago
Principal Product Manager Infrastructure, Observability
Define and drive vision and strategy for Infrastructure Observability
Elastic United States senior
1w ago
Senior Observability Architect | West Coast | PST | Remote
Grafana Labs United States (Remote) ◆ Remote
Remote, Grafana
1w ago
Staff Product Designer, APM
Datadog New York, New York, USA ◆ Remote
APM, Remote, Backend
1w ago
Senior Software Engineer
Backend development with TypeScript, AWS, and cloud technologies for identity management and authentication.
Cribl Remote - United States ◆ Remote senior
Incident Response, Remote, Backend
1w ago
Software Engineer - Fullstack
New Relic Hyderabad, India
SRE, Backend, Frontend, Data
1w ago
Manager, Software Engineering
New Relic Hyderabad, India
Backend, Engineering Management
1w ago
Staff Software Engineer, Cribl AI
Design and develop observability data search and analytics using Generative AI technologies
Cribl Remote - United States ◆ Remote staff
Incident Response, Remote, Backend, Frontend
1w ago
Senior Site Reliability Engineer
Honeycomb Remote - United Kingdom ◆ Remote
Incident Response, Remote, SRE, Backend, DevRel
1w ago
Senior Developer Advocate - Modern App Development
Datadog California, USA, Remote; Nevada, USA, Remote; Texas, USA, Remote; Washington, USA, Remote ◆ Remote
Remote, DevRel
1w ago
AI Product Engineer - ClickStack
Build AI capabilities for a petabyte-scale observability platform
ClickHouse Netherlands (remote) ◆ Remote senior
Metrics, Logs, Tracing, Incident Response, Remote
1w ago
AI Product Engineer - ClickStack
Build AI-powered observability platform capabilities for developer experience
ClickHouse Canada (remote) ◆ Remote senior
Metrics, Logs, Tracing, Incident Response, Remote
1w ago
AI Product Engineer - ClickStack
Build AI-powered observability platform capabilities for developer experience
ClickHouse United Kingdom (remote) ◆ Remote senior
Metrics, Logs, Tracing, Incident Response, Remote
1w ago
AI Product Engineer - ClickStack
Build AI-powered observability platform capabilities for developer experience
ClickHouse Germany (remote) ◆ Remote senior
Metrics, Logs, Tracing, Incident Response, Remote
1w ago
AI Product Engineer - ClickStack
Build AI capabilities for a petabyte-scale observability platform
ClickHouse United States (remote) ◆ Remote senior
Metrics, Logs, Tracing, Incident Response, Remote
1w ago
Site Reliability Engineer (Hosted Infra) - Platform
Design and implement large-scale system automation, optimize host reliability, and strengthen observability across multiple cloud providers
Elastic United States senior
Incident Response, SRE, Platform, Prometheus, Kubernetes
1w ago
Staff Applied Scientist - Dashboards
Datadog New York, New York, USA
Metrics, Logs, Tracing, SRE
1w ago
Software Engineer - Database Integrations
Design and implement database integrations for real-time data replication at petabyte scale
ClickHouse United Kingdom senior
Metrics, Logs, Tracing, Incident Response, Backend
1w ago
Senior Software Engineer, Events Analytics Platform
Design and implement scalable data storage and query services for event data, expanding search capabilities and performance.
Sentry Toronto, Ontario, Canada ◆ Remote senior CA$200k – CA$295k
Metrics, Remote, Backend, DevRel, ClickHouse
2w ago
Senior Software Engineer, Events Analytics Platform
Design and implement scalable data storage and querying infrastructure for event data, pushing the boundaries of data visibility at Sentry.
Sentry San Francisco, California ◆ Remote senior $190k – $280k
Metrics, Remote, Backend, DevRel, ClickHouse
2w ago
Staff Software Engineer, AI Developer Tooling
Design and implement API access to internal systems for AI coding agents, improving AI-generated pull requests and automating engineering work.
Sentry San Francisco, California ◆ Remote staff $240k – $320k
Logs, Remote, Backend
2w ago
Senior Software Engineer - Linux Kernel/eBPF
Datadog Alabama, USA, Remote; Arizona, USA, Remote; Arkansas, USA, Remote; California, USA, Remote; Colorado, USA, Remote; Connecticut, USA, Remote; Delaware, USA, Remote; District of Columbia, USA, Remote; Florida, USA, Remote; Georgia, USA, Remote; Idaho, USA, Remote; Illinois, USA, Remote; Indiana, USA, Remote; Iowa, USA, Remote; Kansas, USA, Remote; Kentucky, USA, Remote; Louisiana, USA, Remote; Maine, USA, Remote; Maryland, USA, Remote; Massachusetts, USA, Remote; Michigan, USA, Remote; Minnesota, USA, Remote; Missouri, USA, Remote; Montana, USA, Remote; Nebraska, USA, Remote; Nevada, USA, Remote; New Hampshire, USA, Remote; New Jersey, USA, Remote; New Mexico, USA, Remote; New York, USA, Remote; North Carolina, USA, Remote; Ohio, USA, Remote; Oklahoma, USA, Remote; Oregon, USA, Remote; Pennsylvania, USA, Remote; Rhode Island, USA, Remote; South Carolina, USA, Remote; South Dakota, USA, Remote; Tennessee, USA, Remote; Texas, USA, Remote; Utah, USA, Remote; Vermont, USA, Remote; Virginia, USA, Remote; Washington, USA, Remote; Wisconsin, USA, Remote ◆ Remote
Remote, eBPF
2w ago
Senior Product Manager, AAA/Enterprise Growth
Datadog New York, New York, USA
DevRel
2w ago
Security Engineer 2 - Cyber Threat Intelligence
Datadog New York, New York, USA
Security
2w ago
Senior Cloud Engineer
Design, develop, deploy, and secure a ClickHouse Cloud database platform for regulated and mission-critical environments.
ClickHouse United States (remote) ◆ Remote senior
Remote, Backend, Kubernetes, ClickHouse
2w ago
Product Manager II, AI & Data Security
Datadog Boston, Massachusetts, USA; New York, New York, USA
2w ago
Senior Site Reliability Engineer
Honeycomb Remote - Ireland ◆ Remote
Incident Response, Remote, SRE, Backend, DevRel
2w ago
Staff Software Engineer | Canada | Remote
Grafana Labs Canada (Remote) ◆ Remote
Incident Response, Remote, Grafana
2w ago
Senior Software Engineer - AI Intelligence
Honeycomb Remote - United States ◆ Remote
Incident Response, Remote, Backend, Frontend
2w ago
Senior Product Manager - Search
Datadog New York, New York, USA
Backend
2w ago
Staff GenAI Engineer - Application Performance Monitoring (APM)
Datadog New York, New York, USA ◆ Remote
Tracing, Profiling, APM, Incident Response, Remote
2w ago
Product Manager - Log Management
New Relic Atlanta, Georgia, USA; Chicago, Illinois, USA; Dallas, Texas, USA; Houston, Texas, USA; Portland, Oregon, USA
Logs
2w ago
Engineering Manager, Developer Infrastructure
Lead team building developer infrastructure, CI/CD systems, and development environments for a large tech company.
Sentry San Francisco, California ◆ Remote manager $220k – $300k
Remote, Platform, Backend, Engineering Management
2w ago
Security Engineer, IAM
Security Engineer for IAM, responsible for access control and identity management practices, working with infrastructure and platform teams to enable secure self-service workflows.
Sentry San Francisco, California ◆ Remote senior $155k – $240k
Logs, Incident Response, Remote, Security, Kubernetes
2w ago
Senior Software Engineer - Infrastructure R&D
Datadog Denver, Colorado, USA; New York, New York, USA
Backend, Kubernetes
2w ago
Senior Software Engineer - Environments Accelerator
Datadog Denver, Colorado, USA; New York, New York, USA
Backend, Kubernetes
2w ago
Senior Backend Engineer - Alerting | Sweden | Remote
Grafana Labs Sweden (Remote) ◆ Remote
Incident Response, Remote, Backend, Prometheus, Grafana
2w ago
Senior Backend Engineer - Alerting | UK | Remote
Grafana Labs United Kingdom (Remote) ◆ Remote
Incident Response, Remote, Backend, Prometheus, Grafana
2w ago
Senior Backend Engineer - Alerting | Spain | Remote
Grafana Labs Spain (Remote) ◆ Remote
Incident Response, Remote, Backend, Prometheus, Grafana
2w ago
Senior Backend Engineer - Alerting | Ireland | Remote
Grafana Labs Republic of Ireland (Remote) ◆ Remote
Incident Response, Remote, Backend, Prometheus, Grafana
2w ago
Senior Backend Engineer - Alerting | Germany | Remote
Grafana Labs Germany (Remote) ◆ Remote
Incident Response, Remote, Backend, Prometheus, Grafana
2w ago
Distinguished Architect, AI
Datadog New York, New York, USA; San Francisco, California, USA
Backend
2w ago
Staff Applied Scientist - Agentic Interfaces
Datadog New York, New York, USA
Metrics, Tracing, SRE
2w ago
Engineering Manager I - AI Platform - Evaluation & Annotation
Datadog Paris, France
Backend, Engineering Management, Data
2w ago