CloudCloudWatch Metric Streams + IAM

AWS ElastiCache Integration

Monitor Redis and Memcached node metrics, eviction rates, and replication health. AI memory pressure analysis surfaces the root cause of cache evictions and hit rate degradations before applications are impacted.

Connect ElastiCache Book a Demo

Setup

How It Works

Deploy CloudFormation Stack

Launch the TigerOps CloudFormation template to configure Metric Streams for the AWS/ElastiCache namespace and create the IAM role with read permissions for cluster metadata.

Auto-Discover Clusters

TigerOps uses the ElastiCache DescribeCacheClusters API to enumerate all Redis and Memcached clusters. Each node in a cluster is monitored individually for granular fault isolation.

Configure Redis Replication Monitoring

For Redis replication groups, TigerOps tracks ReplicationLag and primary/replica role assignments. Failover events are captured from EventBridge and surfaced as incidents.

Set Memory & Eviction Alerts

Define memory utilisation and eviction rate thresholds. TigerOps alerts when evictions indicate memory pressure and predicts OOM conditions using trend analysis.

Capabilities

What You Get Out of the Box

Memory & Hit Rate Metrics

FreeableMemory, BytesUsedForCache, CacheHits, CacheMisses, and CacheHitRate per node and cluster. TigerOps alerts when hit rate degrades and correlates it with eviction events or cluster configuration changes.

Eviction Rate Monitoring

Evictions per second tracked per node. Rising eviction rates indicate memory pressure and potential cache thrashing. TigerOps alerts on sustained eviction spikes and recommends right-sizing or maxmemory-policy changes.

Redis Replication Health

ReplicationLag per replica node, MasterLinkHealthStatus, and SaveInProgress tracking. TigerOps alerts when replication lag exceeds your threshold and captures failover events with RTO measurements.

Connection & CPU Metrics

CurrConnections, NewConnections, and EngineCPUUtilization per node. TigerOps alerts on connection spike anomalies and correlates CPU saturation with high command latency.

Command Latency Tracking

For ElastiCache Serverless and Redis 7.x clusters, TigerOps ingests ElastiCache command latency metrics (EcCommandLatency) to surface slow-command patterns before they impact applications.

AI Memory Pressure Analysis

When evictions spike, TigerOps AI analyses the correlated signals — key count growth, large key sizes, TTL distribution — and generates a root cause summary with remediation options.

Configuration

ElastiCache Monitoring Setup

Enable slow log export and deploy the TigerOps ElastiCache monitoring stack.

elasticache-setup.sh

# Enable Redis slow log export to CloudWatch Logs
aws elasticache modify-replication-group \
  --replication-group-id my-redis-cluster \
  --log-delivery-configurations \
    "LogType=slow-log,DestinationType=cloudwatch-logs,DestinationDetails={CloudWatchLogsDetails={LogGroup=/elasticache/slow-logs}},LogFormat=json" \
  --apply-immediately

# Deploy TigerOps ElastiCache monitoring stack
aws cloudformation deploy \
  --template-url https://tigerops-cfn.s3.amazonaws.com/elasticache-integration.yaml \
  --stack-name tigerops-elasticache \
  --capabilities CAPABILITY_IAM \
  --parameter-overrides \
    TigerOpsApiKey=${TIGEROPS_API_KEY} \
    EnableSlowLogIngestion=true \
    SlowLogGroup=/elasticache/slow-logs \
    MemoryWarningPercent=80 \
    EvictionRateWarning=100

# Enable EventBridge for failover notifications
aws events put-rule \
  --name tigerops-elasticache-events \
  --event-pattern '{"source":["aws.elasticache"],"detail-type":["ElastiCache Replication Group Failover Complete"]}'

FAQ

Common Questions

Does TigerOps support both Redis and Memcached on ElastiCache?

Yes. TigerOps monitors both Redis (including Redis Cluster mode) and Memcached clusters. Redis provides richer metrics including replication lag and keyspace statistics, while Memcached monitoring focuses on cache hit rate, evictions, and connection metrics.

How does TigerOps detect a Redis primary failover in ElastiCache?

TigerOps subscribes to ElastiCache EventBridge events for replication group failover notifications (ElastiCache:ReplicationGroupFailoverComplete). The failover event is surfaced as an incident with the RTO, the new primary node, and the metric state at the time of failure.

Can TigerOps alert when ElastiCache memory is almost full?

Yes. TigerOps monitors the DatabaseMemoryUsagePercentage metric (Redis) and alerts when memory utilisation exceeds your configured threshold. It also tracks the eviction rate and computes how quickly the cluster is approaching its memory limit.

Does TigerOps support ElastiCache Serverless?

Yes. TigerOps ingests ElastiCache Serverless metrics including ECPUUtilization, StorageUtilization, and EcCommandLatency. Since Serverless scales automatically, TigerOps focuses on ECPU utilisation relative to your provisioned ECPU limit.

Can TigerOps help identify Redis hot keys causing latency?

TigerOps can ingest slow log data from Redis via a CloudWatch Logs subscription on your ElastiCache log export. Slow commands are indexed alongside metrics so you can correlate command latency spikes with specific key patterns and time windows.

Get Started

Keep Your Cache Healthy and Hit Rates High

Eviction alerts, replication health monitoring, and AI memory pressure analysis. Connect your ElastiCache clusters in minutes.

Start Free Talk to an Engineer

Explore More

Related Integrations

View all 275+ integrations