Site Reliability Engineering services

Site reliability engineering (SRE) based on Open Source tooling is our passion and the very essence of what we do and specialise in. With our SRE as a Service, you’ll get all your software running safe and sound 24/7 with prompt technical support whenever needed.

Get started

OFFER

WHAT WE OFFER

24/7 support service

Build the server and cloud infrastructure for your software and maintain it 24/7 with guaranteed SLA.

Holistic SRE

Ensure reliability, observability, performance, and security for your applications.

details

Technical details

We will elaborate, launch into operation, and thoroughly maintain all essential Site Reliability Engineering practices based on well-tested Open Source software for your applications. They will cover the following:

An Infrastructure as Code (IaC) approach using Terraform, Pulumi, Ansible or other tools. It ensures all infrastructure components are defined in code, rendering all modifications transparent and easily recoverable.

Multi-layered observability based on Prometheus/Grafana stack and additional services to monitor all host-level components, Kubernetes and business apps, as well as web services’ external availability.

24/7 on-call duties, powered by our observability system and its business metrics, a unique incident management system, thoughtful workflows, and strict SLA (Service Level Agreement) regulations.

Availability and performance troubleshooting, based on performing RCA (Root Cause Analysis), observability insights, software-specific metrics, and active communication between our site reliability engineers and your developers.

Scalable and highly available design implemented in networking and Kubernetes-based infrastructure by us and in your software under our guidance.

Prompt emergency response aided by backups and disaster recovery plans (DRPs) and followed by written post-mortems.

Reliable software release processes based on GitLab CI/CD or GitHub Actions and best practices. The latter includes automatic caching, reproducible builds, rebuilding and redeploying only modified components, and smart container registry cleanup.

Enforced security measures at various levels, from data centres to your code. They involve proper configuration, automated image scanning for known vulnerabilities, code quality and security scans, network and runtime policies, auditing, event logging, and implementing a SIEM.

outcome

Guaranteed availability for your applications and services. 10-minute reaction time.

Ongoing SRE assistance, involving tight cooperation with your development team.

Transparent and efficient handling of outages, overloads, and other emergencies.

Business value

Business-critical services are operated in a reliable, resilient fashion, reducing financial and reputational risk. Furthermore, operating costs are reduced thanks to the outsourcing of non-core competencies.

origins

Frequent origins

Launch a new startup or new software/digital services in an existing organisation.

Eliminate longstanding SRE-related issues that affect product reliability, performance, or security.

Ensure 24/7/365 service availability and eligible MTTR (mean time to recovery).

Growing business with in-house engineers no longer able to handle SRE tasks.

Replace a quitting SRE engineer or your current SRE agency.

plans

Subscription plans

The table below shows the pricing for our basic SRE as a Service monthly subscriptions, along with their deliverables and limitations.

€ EUR

$ USD

€

9,100

per month

€

11,700

per month

XL*

€

14,000

per month

Dedicated technical lead

Dedicated project manager

Dedicated DevOps team (6-8 pax.)

Team In-Project Hours per month (not less than)

120

160

200

incl. Consultancy hours

Kubernetes clusters supported

1 dev + 1 prod**

Kubernetes clusters federation (Istio based)

24×7 On-call SRE

Monitoring and alerting solutions

SLA on reaction time (mins)

Number of incidents we process

unlimited

Escalating incidents to your team

SLA on application uptime

TBA after 3 months

€ EUR

$ USD

Dedicated technical lead

Dedicated project manager

Dedicated DevOps team (6-8 pax.)

Team In-Project Hours per month (not less than)

120

incl. Consultancy hours

Kubernetes clusters supported

1 dev + 1 prod**

Kubernetes clusters federation (Istio based)

24×7 On-call SRE

Monitoring and alerting solutions

SLA on reaction time (mins)

Number of incidents we process

unlimited

Escalating incidents to your team

SLA on application uptime

TBA after 3 months

Dedicated technical lead

Dedicated project manager

Dedicated DevOps team (6-8 pax.)

Team In-Project Hours per month (not less than)

160

incl. Consultancy hours

Kubernetes clusters supported

1 dev + 1 prod**

Kubernetes clusters federation (Istio based)

24×7 On-call SRE

Monitoring and alerting solutions

SLA on reaction time (mins)

Number of incidents we process

unlimited

Escalating incidents to your team

SLA on application uptime

TBA after 3 months

Dedicated technical lead

Dedicated project manager

Dedicated DevOps team (6-8 pax.)

Team In-Project Hours per month (not less than)

200

incl. Consultancy hours

Kubernetes clusters supported

1 dev + 1 prod**

Kubernetes clusters federation (Istio based)

24×7 On-call SRE

Monitoring and alerting solutions

SLA on reaction time (mins)

Number of incidents we process

unlimited

Escalating incidents to your team

SLA on application uptime

TBA after 3 months

* Looking for more affordable start? Check out our subscription plans for DevOps as a Service (with no 24×7 on-call SRE involved).

** Additional K8s clusters support: dev – €500, prod – €700.

Need more hours? Contact us to get a price quote for your case!

How to get started?

Collaboration model:

Subscription, based on the amount of work to be performed.
Packages starting at €9,100 per month (see above).

Let’s discuss your project!

FAQ

Who will be working on my project?

What grade do your engineers have?

How will we communicate?

Will I be able to adjust the amount of work?

Can I cancel the subscription when I want?

Will I be able to use the infrastructure after we terminate our agreement?

services

Related services

DevOps as a Service

Infrastructure & CI/CD audit

All services

case study

ComplAi offers SaaS for end-to-end management of compliance. Its production environment could not handle a growing number  of users, and CI/CD was holding the business back from growing. Learn how Palark helped ComplAi to succeed!

Get the PDF