SRE as a Service

Site reliability engineering (SRE) is our passion and our core specialisation. With our SRE-as-a-Service, your software runs safely around the clock, with prompt technical support whenever you need it.

Get started

WHAT WE OFFER

24/7 service

Build the server infrastructure for your software and maintain it 24/7 with a guaranteed SLA.

Holistic SRE

Ensure reliability, observability, performance, and security for your applications.

Technical details

We will design, put into operation, and thoroughly maintain all essential Site Reliability Engineering practices for your applications. They will cover the following:

An Infrastructure as Code (IaC) approach, ensuring all infrastructure components are defined in code, rendering all modifications transparent and easily recoverable.

Multi-layered observability based on the Prometheus/Grafana stack and additional services, monitoring all host-level components, Kubernetes, and business apps, as well as the external availability of web services.

24/7 on-call duties, powered by our observability system and its business metrics, a unique incident management system, and strict SLA (Service Level Agreement) regulations.

Availability and performance troubleshooting, based on observability insights, software-specific metrics, and active communication between our site reliability engineers and your developers.

Scalable and highly available design implemented in networking and Kubernetes-based infrastructure by us and in your software under our guidance.

Prompt emergency response aided by backups and disaster recovery plans (DRPs) and followed by written post-mortems.

Reliable software release processes based on GitLab CI/CD or GitHub and best practices, such as automatic caching and reproducible builds.

Enforced security measures at various levels, from data centres to your code, involving proper configurations, automated image scanning, network and runtime policies, auditing and event logging.

Outcome

Guaranteed availability for your applications and services. 10-minute reaction time.

Ongoing SRE assistance, involving tight cooperation with your development team.

Transparent and efficient handling of outages, overloads, and other emergencies.

Business value

Business-critical services are operated in a reliable, resilient fashion, reducing financial and reputational risk. Furthermore, operating costs are reduced thanks to the outsourcing of non-core competencies.

Common starting points

Launch a new startup or new software/digital services in an existing organisation.

Eliminate longstanding SRE-related issues that affect product reliability, performance, or security.

Ensure 24/7/365 service availability and an acceptable MTTR (mean time to recovery).

Support a growing business whose in-house engineers can no longer handle SRE tasks.

Replace a departing site reliability engineer or your current SRE agency.

Subscription plans

Plan                                            M*                  L*
Price per month                                 9,100               11,700
Dedicated technical lead
Dedicated project manager
Dedicated DevOps team (6-8 people)
Team in-project hours per month (minimum)       120                 160
  incl. consultancy hours                       12                  16
Kubernetes clusters supported                   1 dev + 1 prod**    1 dev + 1 prod**
Kubernetes clusters federation (Istio-based)
24×7 on-call SRE
Monitoring and alerting solutions
SLA on reaction time (minutes)                  10                  10
Number of incidents we process                  unlimited           unlimited
Escalating incidents to your team
SLA on application uptime                       TBA after 3 months  TBA after 3 months

* Looking for a more affordable start? Check out our subscription plans for DevOps as a Service (with no 24×7 on-call SRE involved).

** Support for additional K8s clusters: dev, €500; prod, €700.

Contact us to work out the details for your case!

How to get started?

Collaboration model:

  • Subscription, based on the amount of work to be performed.
  • Packages starting at 6,200 EUR per month.
Let's discuss your project!

Related services

DevOps as a Service
Infrastructure & CI/CD audit
All services