Choosing a primary key for maximum performance

    General recommendations for choosing a primary key:

    • Avoid situations where the main load falls on a single partition of a table. The more evenly the load is distributed across partitions, the higher the performance.
    • Reduce the number of partitions that can be affected by a single query. Moreover, if a query affects no more than one partition, it is executed using a special simplified protocol. This significantly increases speed and saves resources; see the sketch after this list.
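
    For example, assuming the event-log table discussed below (the user_events name and the ( userid, timestamp ) key layout are illustrative), a lookup that fixes the full primary key touches exactly one partition:

        -- Point lookup by the full primary key (userid, timestamp):
        -- the row, if it exists, lives in exactly one partition, so the
        -- query can use the simplified single-partition path.
        SELECT userevent
        FROM user_events
        WHERE userid = 42
          AND `timestamp` = Timestamp("2024-05-01T12:00:00Z");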

    All YDB tables are sorted by primary key in ascending order. This means that writes to a data table with a monotonically increasing primary key will cause new data to be written to the end of the table. As YDB splits table data into partitions based on key ranges, inserts are always processed by the same server that is responsible for the “last” partition. Concentrating the load on a single server results in slow data uploading and inefficient use of a distributed system.
    As an example, let’s take logging of user events to a table with the ( timestamp, userid, userevent, PRIMARY KEY (timestamp, userid) ) schema.
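
    In YQL this schema might look as follows (a minimal sketch; the user_events table name is assumed, since only the column tuple is given here):

        CREATE TABLE user_events (
            `timestamp` Timestamp,
            userid      Uint64,
            userevent   Utf8,
            -- The monotonically increasing timestamp leads the key, so every
            -- new row lands at the end of the table, in the "last" partition.
            PRIMARY KEY (`timestamp`, userid)
        );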

    timestamp monotonically increases, and as a result, all records are written to the end of the table, so the “last” partition, which is responsible for this range of keys, serves every write to the table. This makes it impossible to scale the write load: performance is limited by the single process servicing that partition and won’t increase as servers are added to the cluster.

    Writing data to a table with the ( timestamp, userid, userevent, PRIMARY KEY (timestamp, userid) ) schema results in an uneven load on table partitions due to a monotonically increasing primary key. Changing the sequence of key components so that the monotonically increasing part isn’t the first component can help distribute the load more evenly. If you change the table schema to ( timestamp, userid, userevent, PRIMARY KEY (userid, timestamp) ), then, with a sufficient number of users that generate events, writing to the database is distributed across the partitions more evenly.
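
    The reordered schema, in the same sketch form:

        CREATE TABLE user_events (
            `timestamp` Timestamp,
            userid      Uint64,
            userevent   Utf8,
            -- userid leads the key: writes from different users spread across
            -- partitions, while timestamp keeps each user's events ordered.
            PRIMARY KEY (userid, `timestamp`)
        );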

    Another way to avoid a monotonically increasing first key component is to use a hash of the source key as the entire primary key or as its first component.
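
    For example, a possible shape of such a table (a sketch: the userhash column and the choice of YQL’s Digest::NumericHash as the hash function are illustrative assumptions):

        CREATE TABLE user_events (
            userhash    Uint64,     -- hash of userid, filled in on every write
            userid      Uint64,
            `timestamp` Timestamp,
            userevent   Utf8,
            PRIMARY KEY (userhash, userid, `timestamp`)
        );

        -- The hash is computed by the application or in the query itself:
        UPSERT INTO user_events (userhash, userid, `timestamp`, userevent)
        VALUES (Digest::NumericHash(42), 42,
                Timestamp("2024-05-01T12:00:00Z"), "login"u);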

    Let’s assume that the main scenario for working with the table data is reading all events for a specific userid. Then, with the ( timestamp, userid, userevent, PRIMARY KEY (timestamp, userid) ) schema, each read affects all the partitions of the table. Moreover, each partition is fully scanned, since the rows related to a specific userid are located in an order that isn’t known in advance. Changing the sequence of key components to ( timestamp, userid, userevent, PRIMARY KEY (userid, timestamp) ) causes all rows related to a specific userid to follow each other. This arrangement of rows has a positive effect on the speed of reading information for a particular userid.
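
    With the ( userid, timestamp ) key, such a read becomes a scan of a single narrow key range (a sketch against the same assumed table):

        -- Rows for one userid are stored contiguously and are already sorted
        -- by timestamp, so this reads a single narrow key range instead of
        -- scanning every partition.
        SELECT `timestamp`, userevent
        FROM user_events
        WHERE userid = 42
        ORDER BY `timestamp`;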

    In YDB, all columns, including key ones, may contain a NULL value, but using NULL as a value in key columns isn’t recommended. According to the SQL standard (ISO/IEC 9075), NULL can’t be compared with other values: such comparisons never evaluate to TRUE. Therefore, concise SQL statements with simple comparison operators may, for example, skip rows containing NULL during filtering.
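
    For example (a sketch): if the key column userid contains NULL in some rows, neither of the following filters returns those rows:

        -- Neither query returns rows where userid is NULL:
        -- NULL < 100 and NULL >= 100 both evaluate to NULL, not TRUE.
        SELECT * FROM user_events WHERE userid < 100;
        SELECT * FROM user_events WHERE userid >= 100;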

    To achieve high performance, we don’t recommend writing rows larger than 8 MB or key columns larger than 2 KB to the database.