Problem Description
This article explains scenarios which would lead to an Aerospike cluster node rejecting write operations (insert or update).
Explanation
The Aerospike database server has mechanisms to protect against running out of memory or disk space.
The server is designed to stop writes on the disk (and the memory) if any of the following are breached:
- Memory utilization is above a certain threshold (
stop-writes-pct). - Available Percentage on the disk goes below a certain threshold (
min-avail-pct). Situations leading to such low available percent include:- Defragmentation not keeping up with the number of objects evicted.
- Eviction is not able to keep up.
- For strong-consistency enabled namespaces,
clock_skew_stop_writesis triggered off whencluster_clock_skew_msis above thecluster_clock_skew_stop_writes_secthreshold. - As of Aerospike Server 4.5.1, for each Available mode (AP) namespace where nsup is enabled (i.e.
nsup-periodis not zero) writes will be suspended if the cluster clock skew exceeds 40 seconds.