Obsfly

Redis Monitoring

Redis monitoring that finds the 5 fields that actually matter.

Obsfly pulls INFO, slowlog, latency monitor, and keyspace statistics, and turns 200+ fields into the 8 alerts that predict every outage.

Why monitor Redis

Redis is famously fast and famously easy to mis-tune. Big keys, blocking commands, and unbounded sorted sets are the production killers — and they're invisible without a tool that reads slowlog and tracks keyspace size over time.

What we scrape

Obsfly reads Redis through the surfaces operators already know. No driver changes, no extensions installed by us, no agent on the database itself.

INFO sections

server / clients / memory / stats / replication / commandstats / keyspace.

SLOWLOG GET

Recent slow commands with arguments and execution time.

LATENCY HISTORY / LATENCY GRAPH

Latency monitor events: timestamped spikes per event type, recorded in milliseconds.

CLUSTER NODES / CLUSTER INFO

Cluster topology, slot ownership, node failures.

DEBUG OBJECT / MEMORY USAGE

Per-key inspection (sampled, not continuous).

CLIENT LIST

Connected clients with idle time, address, name, and subscription state.
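
As an illustration of what reading the INFO surface involves, here is a minimal sketch of splitting raw INFO text into sections. It is not Obsfly's collector; the function names `parse_info` and `parse_kv_list` and the sample payload are assumptions made for this example. It relies only on the documented INFO layout: `# Section` headers followed by `field:value` lines.

```python
from typing import Dict

# Illustrative sample of raw INFO output (values are made up).
SAMPLE_INFO = (
    "# Memory\r\n"
    "used_memory:1048576\r\n"
    "maxmemory:4194304\r\n"
    "\r\n"
    "# Keyspace\r\n"
    "db0:keys=10,expires=2,avg_ttl=0\r\n"
)


def parse_info(raw: str) -> Dict[str, Dict[str, str]]:
    """Split raw INFO text into {section: {field: value}} (values stay strings)."""
    sections: Dict[str, Dict[str, str]] = {}
    current = "default"  # used only if fields appear before any header
    for line in raw.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith("#"):
            # "# Memory" -> "memory"
            current = line.lstrip("#").strip().lower()
            sections.setdefault(current, {})
            continue
        key, sep, value = line.partition(":")
        if sep:
            sections.setdefault(current, {})[key] = value
    return sections


def parse_kv_list(value: str) -> Dict[str, str]:
    """Parse a composite value such as 'keys=10,expires=2' into a dict."""
    return dict(item.split("=", 1) for item in value.split(",") if "=" in item)


info = parse_info(SAMPLE_INFO)
db0 = parse_kv_list(info["keyspace"]["db0"])
```

Keeping values as strings at this layer leaves numeric conversion to whichever metric needs it, since INFO mixes integers, floats, and free text.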

Key metrics tracked

Memory used / maxmemory
Eviction risk and OOM forecast.
Hit rate (keyspace_hits / (keyspace_hits + keyspace_misses))
Cache health.
Slowlog rate
Commands crossing slowlog-log-slower-than threshold per minute.
Blocking command count
BLPOP / BRPOP / WAIT — visibility into blocking workloads.
Replication lag (offset)
The master's master_repl_offset minus the replica's slave_repl_offset, in bytes.
Evictions / expired keys per minute
Pressure indicators.
Big-key alerts
Keys above size threshold detected via DEBUG OBJECT sampling.
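
Several of the metrics above are simple arithmetic over INFO fields. The sketch below shows one way to derive them, assuming the fields have already been read into dicts of strings. The function names are hypothetical, and the counter-reset handling in `per_minute` is an assumption of this example, not a description of Obsfly's pipeline.

```python
from typing import Mapping, Optional


def hit_rate(stats: Mapping[str, str]) -> Optional[float]:
    """keyspace_hits / (keyspace_hits + keyspace_misses); None before any lookup."""
    hits = int(stats["keyspace_hits"])
    misses = int(stats["keyspace_misses"])
    total = hits + misses
    return hits / total if total else None


def memory_ratio(memory: Mapping[str, str]) -> Optional[float]:
    """used_memory / maxmemory; None when maxmemory is 0 (no limit configured)."""
    used = int(memory["used_memory"])
    limit = int(memory["maxmemory"])
    return used / limit if limit > 0 else None


def replication_lag_bytes(master_repl_offset: int, replica_offset: int) -> int:
    """Bytes of replication stream the replica has not yet acknowledged."""
    return max(0, master_repl_offset - replica_offset)


def per_minute(prev_count: int, curr_count: int, elapsed_seconds: float) -> float:
    """Rate of a cumulative counter (e.g. evicted_keys) between two scrapes."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    # Assumption: a counter that went backwards means the server restarted,
    # so everything counted since the restart belongs to this interval.
    delta = curr_count - prev_count if curr_count >= prev_count else curr_count
    return delta / elapsed_seconds * 60.0
```

Returning `None` instead of 0 for an undefined ratio keeps "no data yet" and "no limit configured" distinguishable from a genuinely bad value.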

Common Redis pains, and how Obsfly surfaces each

Latency spikes under no obvious load change

Sign

LATENCY HISTORY shows fork or aof-write-pending-fsync events, usually correlated with BGSAVE or an AOF rewrite.

Fix

Tune save schedule; consider AOF-only persistence with everysec fsync. On RDB, save during low-traffic windows.

Memory growth despite consistent traffic

Sign

INFO memory shows used_memory growing; eviction policy not kicking in.

Fix

The usual cause: maxmemory is not set (0 means no limit), or maxmemory-policy is noeviction. Set a memory limit and an eviction policy, then re-deploy.
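
A check for this misconfiguration can be sketched in a few lines. The example assumes the two settings have been read into a dict of strings, for instance from CONFIG GET. The function name `memory_config_findings` and the message wording are hypothetical.

```python
from typing import List, Mapping


def memory_config_findings(config: Mapping[str, str]) -> List[str]:
    """Return human-readable findings for risky memory settings."""
    findings: List[str] = []
    # maxmemory is reported in bytes; 0 means no limit is enforced.
    if int(config.get("maxmemory", "0")) == 0:
        findings.append("maxmemory is 0: no memory limit is enforced")
    # noeviction is the Redis default: writes fail instead of keys being evicted.
    if config.get("maxmemory-policy", "noeviction") == "noeviction":
        findings.append("maxmemory-policy is noeviction: nothing will be evicted")
    return findings


risky = memory_config_findings({"maxmemory": "0", "maxmemory-policy": "noeviction"})
healthy = memory_config_findings(
    {"maxmemory": "1073741824", "maxmemory-policy": "allkeys-lru"}
)
```

Whether noeviction is actually wrong depends on the workload: it is a reasonable choice when Redis holds primary data rather than a cache, so a finding here is a prompt to review, not a verdict.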

Slow commands with KEYS *

Sign

Slowlog dominated by KEYS / FLUSHDB / SMEMBERS on huge sets.

Fix

Replace KEYS with SCAN and SMEMBERS with SSCAN; where a flush is unavoidable, prefer FLUSHDB ASYNC. Educate the team. Add a slowlog-based alert.
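
For teams making the switch, the cursor loop that replaces `KEYS pattern` looks roughly like this. It is a sketch that assumes a client object exposing a redis-py-style `scan(cursor=..., match=..., count=...)` method returning `(next_cursor, keys)`. `InMemoryClient` is a stand-in invented for this example so the loop can run without a server.

```python
import fnmatch
from typing import Iterator, List, Optional, Tuple


def iter_keys(client, match: Optional[str] = None, count: int = 100) -> Iterator[str]:
    """Walk the keyspace in small batches instead of one blocking KEYS call."""
    cursor = 0
    while True:
        cursor, keys = client.scan(cursor=cursor, match=match, count=count)
        yield from keys
        if cursor == 0:  # the server signals a completed iteration with cursor 0
            break


class InMemoryClient:
    """Hypothetical stand-in that mimics the shape of a SCAN reply."""

    def __init__(self, keys: List[str], page: int = 2) -> None:
        self._keys = keys
        self._page = page

    def scan(self, cursor: int = 0, match: Optional[str] = None,
             count: Optional[int] = None) -> Tuple[int, List[str]]:
        batch = self._keys[cursor:cursor + self._page]
        next_cursor = cursor + self._page
        if next_cursor >= len(self._keys):
            next_cursor = 0
        if match is not None:
            # Like real SCAN, filtering happens after the batch is selected,
            # so a batch can come back empty while the cursor is still non-zero.
            batch = [k for k in batch if fnmatch.fnmatchcase(k, match)]
        return next_cursor, batch


user_keys = list(iter_keys(InMemoryClient(["user:1", "job:1", "user:2"]), match="user:*"))
```

Against a real server, SCAN may return the same key more than once during a full iteration, so callers that need uniqueness should deduplicate.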

Replication breaks under burst writes

Sign

Replicas disconnect; master_repl_offset jumps; replicas full-resync.

Fix

Raise repl-backlog-size so the backlog can absorb the burst. Verify that client-output-buffer-limit for the replica class (named slave in older Redis versions) isn't disconnecting replicas.
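
One common rule of thumb for sizing the backlog is peak write rate multiplied by the longest disconnect a replica should survive without a full resync, plus headroom. The sketch below applies that rule; the function names, the 2x default headroom, and the example figures are assumptions for illustration, not Obsfly recommendations.

```python
import math


def write_rate_bytes_per_sec(offset_then: int, offset_now: int,
                             elapsed_seconds: float) -> float:
    """Estimate write throughput from two master_repl_offset readings."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return (offset_now - offset_then) / elapsed_seconds


def suggested_backlog_bytes(peak_write_rate: float, disconnect_seconds: float,
                            headroom: float = 2.0) -> int:
    """Backlog large enough to replay `disconnect_seconds` of peak writes."""
    return math.ceil(peak_write_rate * disconnect_seconds * headroom)


# Example: the offset advanced by 30 MB in 10 s during a burst, and replicas
# should be able to reconnect after a 60 s network blip without a full resync.
rate = write_rate_bytes_per_sec(0, 30_000_000, 10.0)
backlog = suggested_backlog_bytes(rate, 60)
```

The rate should come from the burst window itself; an average over a quiet hour will undersize the backlog.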

vs Datadog DBM for Redis

Datadog Redis is INFO-based with limited slowlog parsing. Obsfly adds structured slowlog with command attribution, big-key sampling, and CLUSTER topology visualization with forecasted slot rebalances.
Full Datadog DBM comparison →

FAQ

Standalone, Sentinel, Cluster — all supported?

Yes. The agent auto-detects topology and scrapes accordingly. Cluster mode collects from every node and reconstructs cluster-wide views.

Does it work with managed Redis (ElastiCache, MemoryDB, Cloud Memorystore, Upstash)?

Yes. Standard Redis protocol; all commands we use are supported by managed providers.

Big-key detection — how does it not DoS my Redis?

We sample at most 1 key per 200ms via SCAN + DEBUG OBJECT, with a configurable budget. No KEYS *, no MEMORY DOCTOR loops.
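
To make the budget concrete, here is a minimal sketch of a pacer that allows at most one inspection per 200 ms, plus a sampling loop with a hard cap. It is not the agent's actual code: `SamplePacer`, `sample_big_keys`, and the `size_of` callback (which would wrap whatever per-key size command is in use) are hypothetical names for this example.

```python
import time
from typing import Callable, Iterable, List, Optional, Tuple


class SamplePacer:
    """Enforce a minimum interval between successive samples."""

    def __init__(self, min_interval: float = 0.2,
                 clock: Callable[[], float] = time.monotonic,
                 sleep: Callable[[float], None] = time.sleep) -> None:
        self._min_interval = min_interval
        self._clock = clock
        self._sleep = sleep
        self._last: Optional[float] = None

    def wait(self) -> None:
        """Block until at least min_interval has passed since the last sample."""
        now = self._clock()
        if self._last is not None:
            remaining = self._min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()  # re-read in case sleep overshot
        self._last = now


def sample_big_keys(keys: Iterable[str],
                    size_of: Callable[[str], Optional[int]],
                    pacer: SamplePacer,
                    threshold_bytes: int,
                    max_samples: int) -> List[Tuple[str, int]]:
    """Inspect at most max_samples keys, paced, and return those over threshold."""
    big: List[Tuple[str, int]] = []
    for n, key in enumerate(keys):
        if n >= max_samples:
            break
        pacer.wait()
        size = size_of(key)  # None if the key expired between SCAN and inspection
        if size is not None and size >= threshold_bytes:
            big.append((key, size))
    return big
```

Injecting `clock` and `sleep` keeps the pacing logic testable without real delays. In a real collector `keys` would come from an incremental SCAN rather than a list held in memory.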

· · ·

See Obsfly on your Redis.

20-min demo. We connect to a sample Redis on the call and reproduce your slowest query in the tool.
