Talari Pradeep

DevOps Engineer

I architect scalable cloud infrastructure and automate the full path from code commit to production deployment. I care about reliability, speed, and building systems that engineers love to work with.


The Engineer Behind the Stack

Talari Pradeep
India
Open to Work

I'm a Cloud & DevOps Engineer with a passion for building robust, scalable infrastructure that powers great products.

With hands-on expertise across AWS, Kubernetes, Docker, and Terraform, I specialize in designing CI/CD pipelines that cut deployment time, infrastructure automation that saves engineering hours, and observability stacks that catch problems before users do.

Working on an enterprise healthcare project at Accenture, I focus on reliability, infrastructure security, and incident response, helping teams move from manual operations to fully automated, version-controlled cloud infrastructure.

Enterprise AWS EKS Cluster Management
Jenkins CI/CD & Terraform Automation
Automated OS Patching via Ansible
Trivy & SonarQube Security Scanning

DevOps & SRE Interactive Sandbox

Experiment with live infrastructure reliability parameters, trace packets at the kernel level, scale clusters, and audit IaC drift.

⏱️

SLA & Error Budget

Configure a Service Level Agreement (SLA) to calculate allowed downtime budgets, and test your reaction speed when chaos hits.

Target SLA: 99.9%
99.0% 99.9% 99.99% 99.999%
Weekly Budget 10.08 min
Monthly Budget 43.83 min
Yearly Budget 8.77 hrs
System: OPERATIONAL
Error Budget Remaining: 100%
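
The downtime budget shown by this widget follows from simple arithmetic: allowed downtime = period length × (1 − SLA). Below is a minimal Python sketch of that calculation. The function names, the 365.25-day year, and the way remaining budget is clamped at zero are assumptions made for illustration, not the sandbox's actual code.

```python
# Period lengths in hours. The 365.25-day year is an assumption.
HOURS_PER_WEEK = 7 * 24
HOURS_PER_YEAR = 365.25 * 24
HOURS_PER_MONTH = HOURS_PER_YEAR / 12


def downtime_budget_hours(sla_percent: float, period_hours: float) -> float:
    """Return the downtime (in hours) an SLA target allows over a period."""
    if not 0 < sla_percent <= 100:
        raise ValueError("sla_percent must be in the range (0, 100]")
    return period_hours * (1 - sla_percent / 100)


def error_budget_remaining(
    sla_percent: float, period_hours: float, downtime_hours: float
) -> float:
    """Return the unspent share of the error budget, as a percentage."""
    budget = downtime_budget_hours(sla_percent, period_hours)
    if budget == 0:
        # A 100% target leaves no budget: any downtime exhausts it.
        return 100.0 if downtime_hours == 0 else 0.0
    return max(0.0, 100 * (1 - downtime_hours / budget))


if __name__ == "__main__":
    for target in (99.0, 99.9, 99.99, 99.999):
        weekly_minutes = downtime_budget_hours(target, HOURS_PER_WEEK) * 60
        print(f"{target}% -> {weekly_minutes:.2f} min of downtime per week")
```

At a 99.9% target, for example, the weekly budget works out to roughly 10 minutes, and every additional nine shrinks it by a factor of ten.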

ArgoCD GitOps Rollout

Simulate a GitOps continuous deployment pipeline. Push a commit and watch ArgoCD synchronize container pods in real time.

💻
Git Repo
v1.2.0
ArgoCD
Synced
Ingress Route: us-east-1 (Primary)

us-east-1 (Primary) Online

pod-0
pod-1
pod-2
pod-3

eu-central-1 (DR) Standby

pod-dr-0
pod-dr-1
pod-dr-2
pod-dr-3
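
The rollout this panel animates can be sketched as a small reconcile loop: compare the version declared in Git with what each pod is running, then update pods one at a time until the two match. The Python below is only an illustrative simulation. It does not call Argo CD's API, and the function names and the target version `v1.3.0` are hypothetical. Only the status labels `Synced` and `OutOfSync` mirror the terms Argo CD uses.

```python
from typing import Dict, Iterator, Tuple


def sync_status(desired: str, pods: Dict[str, str]) -> str:
    """Report 'Synced' when every pod runs the version declared in Git."""
    if all(version == desired for version in pods.values()):
        return "Synced"
    return "OutOfSync"


def rolling_sync(desired: str, pods: Dict[str, str]) -> Iterator[Tuple[str, str]]:
    """Update one stale pod at a time, yielding (pod name, status) per step."""
    for name in sorted(pods):
        if pods[name] != desired:
            pods[name] = desired  # simulate replacing the pod
            yield name, sync_status(desired, pods)


if __name__ == "__main__":
    # Four pods on v1.2.0, matching the primary region shown above.
    live = {f"pod-{i}": "v1.2.0" for i in range(4)}
    # Hypothetical new commit that bumps the declared version.
    for pod, status in rolling_sync("v1.3.0", live):
        print(f"{pod} updated -> {status}")
```

Updating pods one at a time keeps most replicas serving traffic during the rollout, which is the behavior the panel visualizes.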

Chaos & Auto-Healing

Inject live infrastructure faults into the metrics dashboard and observe the autonomous SRE control loop restore operations.

CPU Usage
28%
RAM Usage
45%
API Latency
120ms
Error Rate
0.0%
SRE Automation Log (Live Stream)
[HEALER] Health check active - all probes OK.
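
An auto-healing control loop of this kind can be sketched as: read the metrics, compare each against a threshold, apply a remediation for every breach, and log the result. The Python below is a minimal sketch under stated assumptions. The thresholds, the remediation descriptions, and the idea that a remediation returns a metric to its baseline are all hypothetical. Only the baseline values and the healthy log line come from the dashboard above.

```python
from dataclasses import dataclass, replace
from typing import List, Tuple


@dataclass(frozen=True)
class Metrics:
    cpu_percent: float
    ram_percent: float
    latency_ms: float
    error_rate_percent: float


# Baseline taken from the dashboard values shown above.
BASELINE = Metrics(28.0, 45.0, 120.0, 0.0)

# Hypothetical alert thresholds, chosen only for illustration.
THRESHOLDS = Metrics(85.0, 90.0, 500.0, 1.0)

# Hypothetical remediation for each metric.
REMEDIATIONS = {
    "cpu_percent": "scaling out replicas",
    "ram_percent": "restarting leaking pods",
    "latency_ms": "shifting traffic to healthy pods",
    "error_rate_percent": "rolling back the last release",
}


def reconcile(current: Metrics) -> Tuple[Metrics, List[str]]:
    """Run one control-loop pass and return (healed metrics, log lines)."""
    healed = current
    log: List[str] = []
    for field, action in REMEDIATIONS.items():
        if getattr(current, field) > getattr(THRESHOLDS, field):
            log.append(f"[HEALER] {field} above threshold - {action}.")
            # Assumption: the remediation returns the metric to baseline.
            healed = replace(healed, **{field: getattr(BASELINE, field)})
    if not log:
        log.append("[HEALER] Health check active - all probes OK.")
    return healed, log


if __name__ == "__main__":
    # Inject a CPU spike and a burst of errors, then run one healing pass.
    faulty = replace(BASELINE, cpu_percent=97.0, error_rate_percent=4.2)
    state, lines = reconcile(faulty)
    print("\n".join(lines))
    print(state)
```

A real control loop would run this pass on a timer and verify that each remediation actually took effect before declaring the system healthy.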
📖

SRE Runbook Simulator

Execute interactive checklists to triage outages, resolve state drift, and roll back canary releases.

Select a runbook scenario to start the SRE checklist.

Let's Connect

Open to full-time roles, freelance contracts, and interesting collaborations. Drop me a message!