ScyllaDB Docs Developers Switching from Aerospike: A Technical Comparison

Switching from Aerospike: A Technical Comparison¶

This page covers the main differences between ScyllaDB and Aerospike. The motivation was to answer a paid promotional benchmark that Aerospike distributed.

Aerospike is a stellar K/V database/cache. Caching keys in RAM while the values are saved in fast NVMe storage is a novel approach for a simple K/V model. Using this model, Aerospike achieves high throughput and low latency.

ScyllaDB is designed as a more sophisticated database. It is part of the wide-column NoSQL family, a wide superset of K/V only. The model allows a full set of clustering keys inside each row, millions of entries can be saved in a single row. Independent cells can be accessed without rewriting the whole row. HA and DR are deep in the model with configurable replication, local and global indexes, best in-class elasticity and a battle-tested database as a service model (with connectivity, bring-your-own-cloud, encryption, bring-your-own-keys, security, and more features).

This page compares the performance of Aerospike vs ScyllaDB to show that flexibility doesn’t need to come at the expense of performance. ScyllaDB can beat Aerospike at its own game.

At a glance¶

Key-Key-Value, not just Key-Value. Clustering keys give every row millions of independently addressable cells. ScyllaDB updates each cell directly (vs. Aerospike’s read-modify-write).
Indexes don’t live in RAM. Aerospike always pins 64 bytes per key in memory. ScyllaDB serves comparable latency straight from NVMe. RAM is cache rather than a constraint.
Dense instances, fully used. Limited disk:RAM ratios lock Aerospike out of i8ge / i7ie / i3en-class hardware. ScyllaDB runs them at 90% utilization.

One write, or read-modify-write?¶

In a K/V model, the row is the unit of allocation. Updating a single element inside a large value bin means reading the whole object, merging the change, and writing it all back (so two I/O actions instead of one). ScyllaDB updates a single cell directly; the LSM tree merges cells in the background using idle-time I/O schedulers and compaction controllers, without any user involvement or performance impact.

Aerospike — Key/Value

namespace › set › record › bins (one allocation unit)

user:9137 name email prefs[…] last_seen cart[…]

→ UPDATE last_seen on record user:9137 1. READ entire record from disk (all bins)… 2. Merge new value into the object in memory 3. WRITE entire record back to disk ✗ 2 I/O actions + write amplification → defrag pressure

I/O ops: 2

ScyllaDB — Key-Key-Value

keyspace › table › partition key › clustering key › cell

user:9137 ck: name ck: email ck: prefs ck: last_seen ck: cart

→ UPDATE last_seen WHERE pk = user:9137 1. WRITE single cell — done. Background: LSM compaction merges cells on idle I/O ✓ 1 I/O action. No read. No user impact.

I/O ops: 1

The same clustering keys make ScyllaDB a natural time-series database: partition by sensor_id, cluster by timestamp, and range queries become sequential reads straight off disk. This is well-suited for event logs, messaging, IoT, AI feature stores. K/V can only fake this with secondary indexes or by overloading bins with ever-growing lists.

Access patterns¶

Clustering keys aren’t a one trick pony. Each pattern below is a different way to slice a partition.

01 · Time-series — Partition by entity, cluster by time

PRIMARY KEY ((sensor_id), reading_ts)
WITH CLUSTERING ORDER BY (reading_ts DESC)

Pairs with TTL, time-window compaction, and tiered storage. Range slice: WHERE sensor_id=? AND reading_ts >= ? AND < ?

02 · Time-bucketing — Bound the partition size

PRIMARY KEY ((sensor_id, day), reading_ts)
-- day = '2026-06-14'

Trade-off: cross-bucket queries fan out to multiple partitions. Pick a granularity (hour/day/month) to match your read window.

03 · Messaging / fan-out — The Discord shape

PRIMARY KEY ((channel_id, bucket), message_id)
WITH CLUSTERING ORDER BY (message_id DESC)
-- message_id = Snowflake / TimeUUID

Newest-first reads with zero sort at query time. A pure K/V store can’t express this pattern natively.

04 · Hierarchical — Compound clustering keys

PRIMARY KEY ((user_id), year, month, day, event_id)

Valid: filter year, then year+month… Invalid: filter day without year and month.

05 · Leaderboard — Cluster by score

PRIMARY KEY ((game_id, season), score, player_id)
WITH CLUSTERING ORDER BY (score DESC, player_id ASC)

LIMIT 100 returns the top 100 with no sort at query time.

06 · Key-Key-Value — Generic attribute store

PRIMARY KEY ((entity_id), attr_name)
-- rows of (attr_name, value)

Update one attribute without reading or rewriting the others. This is the core KKV-beats-RMW argument, in schema form.

07 · Inverted index — Query-table-per-access-pattern

-- base:   PRIMARY KEY ((user_id))
-- lookup: PRIMARY KEY ((email), user_id)

Direct partition read on email — no cluster-wide scan, unlike Aerospike’s scatter-gather secondary indexes.

08 · Graph edges — Adjacency list

PRIMARY KEY ((vertex_id), edge_type, target_id)

Slice by edge type within a vertex; range-scan neighbors in order.

09 · Event log — Append-only / event sourcing

PRIMARY KEY ((aggregate_id), event_seq)
WITH CLUSTERING ORDER BY (event_seq ASC)

Sequential write, sequential replay — LSM-friendly and contention-free.

10 · Hash-prefix — Split hot/wide partitions

PRIMARY KEY ((celebrity_id, shard), post_ts)
-- shard in 0..N

Trade-off: reads query all N shards and merge — a wider single partition becomes more partitions touched.

11 · Status + time — Slice by category, then time

PRIMARY KEY ((tenant_id), status, updated_ts)
WITH CLUSTERING ORDER BY (status ASC, updated_ts DESC)

One partition serves several status-scoped, time-ordered views — no extra index to maintain.

KKV is harder to build, but it pays off¶

Aerospike’s model is elegant precisely because it’s simple. Values are bounded atomic blobs, so they’re easy to cache and easy to locate. Just a single NVMe read (tens of microseconds) brings data to RAM. A wide-column store can’t take that shortcut, and we won’t pretend otherwise.

A single key can point to a giant partition. We have customers running a 1 TB partition (it’s not recommended, but the flexibility is there). Because partition size is unknown upfront, the engine has to do far more than fetch-a-blob:

MVCC — Multi-version concurrency control so parallel readers and writers see consistent results across a partition that may be mid-write.
Row-level cache — Caching has to work at row granularity, not whole-partition. You can’t evict or populate a terabyte as one unit.
Range tombstones — Deletes aren’t a single marker; whole ranges of clustering keys can be tombstoned and must be reconciled on read.
Multi-allocation parsing — Unknown partition sizes mean multiple memory allocations during parsing instead of one fixed-size slot.
Multi-I/O range reads — Fetching a slice of a partition may require several disk accesses, scheduled so they never stall user traffic.
Range scans — Users can scan arbitrary subsets of a partition — a query surface a pure K/V store simply doesn’t expose.

Altogether, this is a fundamentally harder problem. And ScyllaDB still matches Aerospike’s best benchmarks. The performance advantage of KV comes with a cost — read on for where each model wins.

64 bytes per key — in RAM, always¶

Aerospike keeps its primary index in RAM: a fixed 64 bytes per key, regardless of record size, scaling linearly with key count. Small partitions are where it hurts most — at 200 B records the key-to-data ratio approaches 1:3.5. ScyllaDB scales independently of memory: roughly 100 GB of data per 1 GB of RAM, with no hard ceiling on keys per node. The table below shows when RAM starts driving your infrastructure bill.

Memory footprint at scale · Interactive

Number of records

10M100M1B10B100B

Record size

Replication factor

ScyllaDB

~100 GB of data per 1 GB of RAM · indexes live on NVMe

RAM required (cache)—

Total storage (with replication)—

Hard ceiling on keys per nodeNone

Aerospike

64 B per key per replica — pinned in RAM

RAM required (primary index)—

Total storage (with replication)—

Key : data RAM ratio—

At this scale, Aerospike's index needs — more RAM than ScyllaDB — before a single row is cached.

Base-10 units (1 GB = 10⁹ B). Theoretical lower bounds — production adds overhead, secondary indexes, write amplification and headroom. Past a point this forces Aerospike operators to reshape the data model or fall back to all-flash mode, giving away the latency the architecture was built for.

Dense instances are wasted on Aerospike¶

Because the key cache must live in RAM, Aerospike has a limited disk:RAM ratio and can’t take advantage of storage-dense families like AWS i8ge, i7ie, i3en. Look at the hardware Aerospike chose for its own 2026 benchmark: 6 TB of raw data on 8× i4g.4xlarge, with 15 TB of storage per zone. That’s roughly 40% utilization — so you’re paying for disk that sits empty, on a deliberately less-dense instance family.

	Aerospike — as benchmarked	ScyllaDB — same 6 TB
Configuration	8 × i4g.4xlarge · ~15 TB per zone for 6 TB of data	2 × i8ge.3xlarge — one per zone · 12 vCPU + 7.5 TB NVMe each
Nodes	8	2 (one per zone)
Sized by	RAM ceiling, by design	Actual data, not RAM
Storage utilization	~40%	Up to 90%

This matters more over time. Most deployments become data-bound — the dataset grows until storage, not compute, dominates sizing. That’s exactly where Aerospike’s RAM-for-index architecture forces you onto more (and emptier) nodes.

Two more costs of keeping the index in RAM:

Slow boot — up to 30 min. The model assumes every key is cached in RAM, so a node rebuilds its entire index on boot. This takes as long as 30 minutes. Elasticity suffers (new nodes must build their index), and rolling restarts get heavier. Flushing the key cache to disk before reboot helps, but it’s still slower and more complex than a KKV model that has no such limitation.
Small partitions wreck RAM efficiency. Each key needs 64 bytes of RAM. With 128–256 B values, the key-to-data ratio gets ugly and you provision enormous amounts of RAM for a mediocre result. See the table above to compare RAM required and overhead for ScyllaDB KKV vs Aerospike KV at different record sizes.

Flexibility, without giving up the win¶

Despite managing all the complexity outlined above, ScyllaDB still beats Aerospike’s best published numbers in a head-to-head comparison. We keep up with their chosen workload, and we also do it while offering a data model their architecture can’t express.

7M+ ops/sec — 2021 benchmark, we repeated theirs at petabyte scale. Aerospike claimed 5M TPS read-only and 3.7M TPS 80/20 on 20 i3en.24xlarge. ScyllaDB hit 7M ops/sec and 7.5M inserts at 4ms P99 — 40% higher throughput on the same hardware class. See the petabyte benchmark blog and the presentation.

On par — 2026 McKnight Group benchmark, corrected config. The paid benchmark used incorrect configs: RF3 vs RF2, an old YCSB client. When ScyllaDB reran it with a corrected configuration, our wide-column KKV model matched throughput and the P99 was only 1–2ms higher at multi-terabyte scale. See the Aerospike whitepaper or prove it on your workload.

Note

About that benchmark. Aerospike’s recent paid whitepaper conveniently showed only one narrow scenario suited to its architecture: Key/Value only (the only schema it supports), large values (which masks the all-index-in-RAM problem), no node failures (masking slow recovery), and no scaling events (masking ScyllaDB’s elasticity). On that exact workload, ScyllaDB is on par. Change any one of those variables and the results shift in ScyllaDB’s favor. Let us run it on your data rather than ours.

Meet Logstor — our native K/V mode¶

With demand for pure K/V and the chance to solve the easier problem optimally, we built our own K/V-only format. It keeps the advantages of ScyllaDB — tablets, DBaaS, advanced consistency options — and adds a simple value type to push performance even further. We call it Logstor, after the LSA (log-structured allocator) that already manages ScyllaDB’s in-RAM memory. Logstor applies the same algorithm to managing space on disk.

Use KKV when you need the data model. Time-series, messaging, hierarchies, leaderboards, range scans, secondary indexes, multi-cell updates without RMW. The wide-column flexibility is the point.
Use Logstor KV when it’s genuinely key/value. Simple lookups by key with a bounded value — Aerospike’s home turf. Logstor delivers better performance than Aerospike on that workload, with ScyllaDB’s elasticity and DBaaS underneath.

One platform, two storage engines, your choice per workload. For some use cases KKV is the right tool; for others the new KV format is. Either way, you stay on the same elastic, fully-managed, multi-region database — and either way you beat the alternative on its own terms.

And there’s more coming beyond raw performance — including fresh benchmark results on the current i8g generation, not just the older hardware.

Feature by feature¶

How do Aerospike and ScyllaDB compare on flexibility, efficiency, scaling, resiliency, and DBaaS capabilities? Explore each category below.

Flexibility¶

Capability	Aerospike	ScyllaDB
Data model	Key-Value only. Namespaces › Sets › Records › Bins. Single-record lookups; complex data must be aggregated into one record, document-style.	Wide-column (KKV). Partition key routes data; clustering keys sort millions of independent cells per row. Strict superset of K/V — plus vector embeddings for AI retrieval.
Range queries	Not a design goal. Try secondary indexes (memory + performance overhead) or abuse Collection Data Types — bloated records, bottlenecks.	Clustering keys sort data on disk natively. “All readings for sensor A between Tuesday and Friday” is one efficient sequential read. Built for time-series, messaging, IoT.
Secondary access patterns	Local in-memory secondary indexes only. Queries scatter-gather across every node; strict-SLA users end up dual-writing two tables in the application.	Local and global secondary indexes, plus Materialized Views. Query by `email_address` with a direct partition read — no cluster-wide scan.
Schema & types	Schemaless beyond the primary key. Same bin can be a string in one record, an integer in the next — validation burden shifts to app code for years.	Schema enforced under CQL with online `ALTER TABLE`; collections plus reusable User-Defined Types. DynamoDB-style free attributes under Alternator.
Query APIs	Proprietary API and clients. The data layer is permanently attached to Aerospike.	CQL (Cassandra protocol) + DynamoDB-compatible API. Migrate from Cassandra, DataStax, Keyspaces, Cosmos DB or DynamoDB with little to no change — and inherit their Spark / Kafka / Trino ecosystems.

Efficiency¶

Capability	Aerospike	ScyllaDB
Memory	Primary index in RAM by default: 64 B per entry. At 90% memory utilization, a node enters `stop-writes` — writes halt until fixed. All-flash mode exists, at the cost of latency.	Memory is unified cache (indexes + rows, LRU). Nothing is force-pinned; billion-row workloads are dramatically cheaper. Comparable latency served straight from NVMe.
CPU	OS-native thread pools per task type; the OS owns scheduling, so context switching grows with scale. Vertical scaling means hand-tuning pools.	Shard-per-core, shared-nothing (Seastar). No locks, no contention; designed to run at 90%+ CPU with Workload Prioritization keeping user ops first — OLAP beside OLTP safely.
Storage	Proprietary, opaque raw-device format — backups and tooling must go through Aerospike products. Disk often tuned well below the 90% default for write-heavy loads.	Industry-standard XFS, open SSTable files. Incremental compaction lets you fill disks to 90% (ScyllaDB X Cloud) without risking availability.
Write-heavy workloads	Copy-on-write storage triggers aggressive defragmentation under heavy updates — write amplification, CPU overhead, latency spikes; admins throttle defrag by hand.	LSM-tree writes to immutable SSTables; compaction runs in background under I/O schedulers that guarantee user workloads aren’t impacted. Cell-level updates skip RMW entirely.

Scaling¶

Capability	Aerospike	ScyllaDB
Elasticity	Fixed 4096 partitions; scaling triggers Migrations whose disk/network I/O causes latency spikes — so admins throttle them and scaling stretches out.	Tablets move fine-grained data chunks between nodes. Scale from 500K to 2M OPS in ~10 minutes. No overprovisioning needed.
Vertical scale	RAM-for-indexes architecture caps useful node density; thread pools need retuning as cores grow.	Lockless shard-per-core design scales linearly to 256+ vCPU meganodes; streaming to a 60 TB node is as fast as to a small one.
Global scale	XDR for cross-DC; Multi-Site Clustering is a paid add-on, and active-active commits every region before a write succeeds — latency climbs.	Native multi-DC, eventually-consistent active-active replication. Reads and writes are served locally; replication never sits in the write path. No bolted-on CDC between DCs.

Resiliency¶

Capability	Aerospike	ScyllaDB
Node failure & maintenance	A reboot wipes the in-memory index — it’s rebuilt over the network (ASMT helps, with extra steps). Rack/AZ loss forces migrations that hit cluster performance. Rack awareness: Enterprise add-on.	Leaderless, flash-persisted design: nodes leave and rejoin gracefully, no availability or performance impact. Region and rack awareness are default — for replication and for AZ-local client routing.
Replication economics	RF=2 is common — it offsets the RAM cost. One node failure leaves a single copy of your data during the most dangerous window.	RF=3 is affordable because the architecture is hardware-efficient: quorum consistency, seamless rolling upgrades, true fault tolerance.
Consistency	Strong Consistency mode is a paid add-on, with higher write latency cluster-wide.	Per-query Consistency Level: `LOCAL_QUORUM` for AZ-spanning safety with local reads, global quorum where you need it, tunable along the request path.

DBaaS¶

Capability	Aerospike	ScyllaDB
Managed service	Newer offering — early-adopter territory.	ScyllaDB Cloud has run data-intensive production workloads since 2019: Tripadvisor, Freshworks, Supercell. Bring-your-own-cloud, BYO keys, encryption at rest, advanced networking.
Feature gating	Community edition lacks compression, encryption-at-rest, multi-region replication, rack awareness — gated behind Enterprise, some priced separately.	Free tier includes everything: advanced compression, enterprise encryption-at-rest, Workload Prioritization, cross-region replication, LDAP. Develop locally on your own workstation against the real thing.
Support	Standard commercial support tiers.	World-class 24×7 support with a 15-minute SLA. Consistently the top-rated NoSQL provider on G2.
Multi-DC	Cross-Datacenter Replication (XDR); true active-active multi-site clustering is a paid add-on with cross-region write latency.	Native multi-DC, eventually-consistent active-active — reads and writes served locally, replication off the write path. No bolted-on CDC between datacenters.
Elasticity & scale	Migrations throttled to protect traffic; scaling stretches out.	Tablets + X Cloud: scale up and out, 500K → 2M OPS in ~10 minutes, no overprovisioning.
What’s next	—	Logstor, our new KV storage mode, beats Aerospike on its home turf — and there’s more coming beyond raw performance. Watch for i8g-generation benchmark results.

You don’t have to compromise¶

Teams can enjoy the best DBaaS, elasticity and HA while having a feature-rich wide-column DB or even a K/V store with the best results in the industry. Reach out to our engineering team.