How does Akka Database sharding work?

AnonymousUser · August 26, 2024, 6:19am

This topic is based on a conversation between an anonymous user of Akka and our “Ask AI about Akka” bot in the documentation.

Ask-AI-Bot · August 26, 2024, 6:35am

Akka Persistence R2DBC is able to spread the load of your application over more than one database. The event journal, snapshot store and durable state can be split up over multiple tables and physical backend databases.

Database Sharding: Using a single non-distributed database can become a bottleneck for applications that have high throughput requirements. To address this, the event journal, snapshot store, and durable state can be split up over multiple tables and physical backend databases. This is achieved through database partitioning and database sharding. The data is partitioned by the slices that are used for eventsBySlices and Projections.
Data Partitions: You can configure how many data partitions are needed. Each data partition corresponds to a separate database table. The number of data partitions must be between 1 and 1024 and a whole number divisor of 1024 (the number of slices), e.g. 2, 4, 8, 16. The table names carry the data partition as a suffix, e.g. event_journal_0, event_journal_1.
Configuration: The data partitions are configured with the number of tables and databases that the data will be split into. The selection of data partition is made from the slice of the persistenceId. The number of Projection instances when using eventsBySlices must be greater than or equal to the number of data partitions.
Schema: Each data partition corresponds to a table. You can copy the DDL statements for the tables and indexes from Creating the schema, but change the table and index names to include the data partition suffix.
Example: If you configure 8 data partitions and 4 databases, an individual entity will map to a specific slice and the entity will read and write to the table that covers the corresponding slice range. Each database may host several of the data partition tables. Each database requires a separate connection factory and connection pool.
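The mapping described above can be sketched in a few lines of Java. This is only an illustration of the arithmetic, not the plugin's actual code: the class and method names are hypothetical, the slice formula (absolute value of the persistenceId hash code modulo 1024) is what Akka is understood to use but should be checked against the Akka version in use, and the assignment of contiguous data partitions to databases is an assumption made for the example.

```java
// Hypothetical sketch of slice -> data partition -> table -> database mapping.
// None of these names come from the Akka Persistence R2DBC API.
final class DataPartitionSketch {

    // Number of slices used by eventsBySlices and Projections.
    static final int NUMBER_OF_SLICES = 1024;

    // Valid counts are between 1 and 1024 and divide 1024 evenly (1, 2, 4, 8, ...).
    static boolean isValidPartitionCount(int numberOfPartitions) {
        return numberOfPartitions >= 1
            && numberOfPartitions <= NUMBER_OF_SLICES
            && NUMBER_OF_SLICES % numberOfPartitions == 0;
    }

    // Assumption: slice derived from the persistenceId's hash code.
    static int sliceFor(String persistenceId) {
        return Math.abs(persistenceId.hashCode() % NUMBER_OF_SLICES);
    }

    // Each data partition covers a contiguous range of slices,
    // e.g. 128 slices per partition when there are 8 partitions.
    static int dataPartitionFor(int slice, int numberOfPartitions) {
        if (!isValidPartitionCount(numberOfPartitions)) {
            throw new IllegalArgumentException(
                "number of data partitions must be a divisor of 1024");
        }
        int slicesPerPartition = NUMBER_OF_SLICES / numberOfPartitions;
        return slice / slicesPerPartition;
    }

    // Tables carry the data partition as a suffix, e.g. event_journal_0.
    static String journalTableFor(int dataPartition) {
        return "event_journal_" + dataPartition;
    }

    // Assumption: databases host contiguous groups of data partitions,
    // e.g. 8 partitions over 4 databases puts 2 tables in each database.
    static int databaseFor(int dataPartition, int numberOfPartitions, int numberOfDatabases) {
        if (numberOfDatabases < 1 || numberOfPartitions % numberOfDatabases != 0) {
            throw new IllegalArgumentException(
                "number of databases must evenly divide the number of data partitions");
        }
        int partitionsPerDatabase = numberOfPartitions / numberOfDatabases;
        return dataPartition / partitionsPerDatabase;
    }

    public static void main(String[] args) {
        int partitions = 8;
        int databases = 4;
        String persistenceId = "ShoppingCart|cart-1"; // made-up id for illustration
        int slice = sliceFor(persistenceId);
        int partition = dataPartitionFor(slice, partitions);
        System.out.println("slice=" + slice
            + " table=" + journalTableFor(partition)
            + " database=" + databaseFor(partition, partitions, databases));
    }
}
```

With 8 data partitions, slices 0 to 127 land in event_journal_0, slices 128 to 255 in event_journal_1, and so on. Under the grouping assumed here, each of the 4 databases would host two of those tables and need its own connection factory and connection pool.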

You can find more details on the source page.

This is a corrected answer to the reply of the “Ask AI about Akka” bot in the documentation. Originally the bot explained the concept of “Akka Cluster Sharding” which must not be confused with “Akka Database Sharding”.
