VOOZH
about
URL: https://dev.to/t/reliability
⇱ Reliability - DEV Community
Ten 95% Reliable Agents Chained Together Give You a 60% System. Microservices Solved This a Decade Ago.
👁 kavinkimcreator profile
Kavin Kim
👁 Image
Kavin Kim
Jun 18
Ten 95% Reliable Agents Chained Together Give You a 60% System. Microservices Solved This a Decade Ago.
#
ai
#
agents
#
distributed
#
reliability
👁 Image
👁 Image
2
reactions
Add Comment
4 min read
Your MCP Agent is Logging "Sucess: true" While the task never ran
👁 sasi_sundar profile
Sasi Sundar
👁 Image
Sasi Sundar
Jun 15
Your MCP Agent is Logging "Sucess: true" While the task never ran
#
mcp
#
aiagents
#
reliability
#
devtool
👁 Image
1
reaction
Add Comment
3 min read
Three AI providers went down on the same day. Here's the architecture that didn't care.
👁 rikuq profile
Ravi Patel
👁 Image
Ravi Patel
Jun 15
Three AI providers went down on the same day. Here's the architecture that didn't care.
#
ai
#
reliability
#
llm
#
failover
Add Comment
5 min read
Surviving the region you run in: failover on Aurora DSQL, and what the demo proves
👁 hocmemini profile
Jonathan
👁 Image
Jonathan
Jun 15
Surviving the region you run in: failover on Aurora DSQL, and what the demo proves
#
aws
#
database
#
reliability
#
sre
Add Comment
5 min read
Sliding-Window Spend Guard: the $47K Loop Per-Call Caps Miss
👁 alex_spinov profile
Alexey Spinov
👁 Image
Alexey Spinov
Jun 13
Sliding-Window Spend Guard: the $47K Loop Per-Call Caps Miss
#
finops
#
ai
#
python
#
reliability
Add Comment
11 min read
Graceful Degradation: Circuit Breakers for External API Dependencies
👁 helperx profile
HelperX
👁 Image
HelperX
Jun 12
Graceful Degradation: Circuit Breakers for External API Dependencies
#
architecture
#
node
#
reliability
#
webdev
Add Comment
5 min read
Building a Chaos Testing Harness for Multi-Region Video API Endpoints
👁 ahmet_gedik778845 profile
ahmet gedik
👁 Image
ahmet gedik
Jun 10
Building a Chaos Testing Harness for Multi-Region Video API Endpoints
#
testing
#
php
#
reliability
#
go
Add Comment
10 min read
Error budgets when downtime costs money: reliability engineering for payment-critical systems
👁 errorbudget profile
errorbudget
👁 Image
errorbudget
Jun 8
Error budgets when downtime costs money: reliability engineering for payment-critical systems
#
sre
#
devops
#
reliability
#
fintech
Add Comment
10 min read
Distributed Tracing 101: The Mental Model, the Standards, and Your First Pipeline
👁 devhelm profile
DevHelm
👁 Image
DevHelm
Jun 8
Distributed Tracing 101: The Mental Model, the Standards, and Your First Pipeline
#
guides
#
infrastructure
#
reliability
Add Comment
5 min read
Safe Operating Throughput (SOT) as a First-Class SRE Metric: Derivation and Operationalization
👁 npayyappilly profile
Nijo George Payyappilly
👁 Image
Nijo George Payyappilly
Jun 8
Safe Operating Throughput (SOT) as a First-Class SRE Metric: Derivation and Operationalization
#
sre
#
devops
#
kubernetes
#
reliability
Add Comment
17 min read
Monitoring and Logging: How They Work Together and When You Need Both
👁 devhelm profile
DevHelm
👁 Image
DevHelm
Jun 8
Monitoring and Logging: How They Work Together and When You Need Both
#
guides
#
infrastructure
#
reliability
Add Comment
8 min read
AI SRE: What an Autonomous Agent Doing On-Call Actually Looks Like
👁 devhelm profile
DevHelm
👁 Image
DevHelm
Jun 8
AI SRE: What an Autonomous Agent Doing On-Call Actually Looks Like
#
ai
#
engineering
#
reliability
Add Comment
6 min read
MCP Server Monitoring: How to Keep AI Agent Infrastructure Reliable
👁 devhelm profile
DevHelm
👁 Image
DevHelm
Jun 8
MCP Server Monitoring: How to Keep AI Agent Infrastructure Reliable
#
ai
#
guides
#
reliability
Add Comment
6 min read
Deploying Production Systems on Raspberry Pi: Lessons from the Field
👁 ranaweerasupun profile
Supun Sriyananda
👁 Image
Supun Sriyananda
Jun 7
Deploying Production Systems on Raspberry Pi: Lessons from the Field
#
raspberrypi
#
deployment
#
linux
#
reliability
Add Comment
7 min read
maskedcauses: Maximum Likelihood Estimation for Masked Series System Failures
👁 queelius profile
Alex Towell
👁 Image
Alex Towell
Jun 7
maskedcauses: Maximum Likelihood Estimation for Masked Series System Failures
#
r
#
statistics
#
reliability
#
seriessystems
Add Comment
5 min read
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
👁 DEV Community
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account
👁 Image
👁 Image
👁 Image
👁 Image
👁 Image