Configuring a Managed Service for YDB target endpoint
When creating or editing an endpoint, you can define:
- Yandex Managed Service for YDB DB connection settings. These are required parameters.
- Additional parameters.
Yandex Managed Service for YDB cluster
Connecting to the database with the cluster ID specified in Yandex Cloud. Available only for clusters deployed in Yandex Managed Service for YDB.
- Database: Select a Managed Service for YDB database from the list.
- Service account ID: Select or create a service account with the ydb.editor role that Data Transfer will use to connect to the database.
Additional settings
- Number of shards: Specify the required number of shards, N.
  If the setting is specified, the _shard_col column is added to tables. The values in it are calculated as the remainder of H/N, where H is the result of the hash function at the current time and N is the number of shards specified by the setting.
- Cleanup policy: Select a way to clean up data in the target database before the transfer:
  - Drop: Fully delete tables included in the transfer (default).
    Use this option so that the latest version of the table schema is always transferred to the target database from the source whenever the transfer is activated.
  - Disabled: Do not clean.
    Select this option if only replication without copying data is performed.
- Table rotation:
  - Unit of measurement: Hour, day, or month.
  - Table size: In the selected units of measurement.
    When the time interval equal to the selected unit of measurement ends, the oldest database table will be deleted and a new one will be created.
  - Number of tables: The required number of tables in the target database.
  - Partition by column: Split (partition) a table by the specified column's values. The column must be of the time type.
    For more information about table partitioning, see the Yandex Managed Service for YDB documentation.
  If this setting is used, the specified number of tables is created in the target database, each holding data for a different time interval. The name of each table is selected automatically based on the date and time of the start of its interval. Rows from the source table are distributed across the corresponding target tables according to their values in the specified column.
- Renaming tables: Fill in if you need to rename tables in the source database when transferring data to the target database.
- Sub directory for tables: Specify the subfolder to place tables in.
  Final table placement path: <Path in Yandex Database>/<subfolder>/<table>.
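The _shard_col calculation described under Number of shards can be sketched as follows. This is only an illustration, not the way Data Transfer computes the value: the documentation does not name the hash function, so the sketch assumes CRC32 over a nanosecond timestamp, and shard_col_value is a hypothetical helper.

```python
import time
import zlib


def shard_col_value(timestamp_ns: int, num_shards: int) -> int:
    """Return H mod N, where H is a hash of the given timestamp."""
    if num_shards <= 0:
        raise ValueError("num_shards must be a positive integer")
    # Assumption: CRC32 stands in for the unspecified hash function.
    h = zlib.crc32(str(timestamp_ns).encode("ascii"))
    # The remainder of H / N is what would be written to _shard_col.
    return h % num_shards


if __name__ == "__main__":
    n = 4  # the "Number of shards" setting
    value = shard_col_value(time.time_ns(), n)
    print(f"_shard_col = {value} (always in the range 0..{n - 1})")
```

Whatever hash is actually used, taking the remainder guarantees that the column holds one of N distinct values, which is what lets writes spread across N shards.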
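The way Table rotation routes rows to per-interval tables can be sketched in a similar spirit. The sketch rests on assumptions the documentation does not confirm: the table name format, the alignment of intervals to the Unix epoch (or to January for months), and the helpers interval_start and table_for_row are all hypothetical. The documentation only states that each table is named after the start date and time of its interval.

```python
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def interval_start(ts: datetime, unit: str, size: int) -> datetime:
    """Floor a timezone-aware UTC timestamp to the start of its interval.

    One interval spans `size` units, where unit is "hour", "day", or "month".
    """
    if size <= 0:
        raise ValueError("size must be a positive integer")
    if unit in ("hour", "day"):
        step = timedelta(hours=size) if unit == "hour" else timedelta(days=size)
        # Whole number of steps since the epoch, converted back to a datetime.
        return EPOCH + ((ts - EPOCH) // step) * step
    if unit == "month":
        months = ts.year * 12 + (ts.month - 1)
        months -= months % size
        return datetime(months // 12, months % 12 + 1, 1, tzinfo=timezone.utc)
    raise ValueError(f"unsupported unit: {unit}")


def table_for_row(ts: datetime, unit: str, size: int) -> str:
    """Pick the target table for a row from its time column value."""
    # Assumption: the name is the interval start in an ISO-like format.
    return interval_start(ts, unit, size).strftime("%Y-%m-%dT%H:%M:%S")


if __name__ == "__main__":
    rows = [
        datetime(2024, 5, 17, 9, 15, tzinfo=timezone.utc),
        datetime(2024, 5, 17, 9, 59, tzinfo=timezone.utc),
        datetime(2024, 5, 17, 10, 1, tzinfo=timezone.utc),
    ]
    for ts in rows:
        print(ts.isoformat(), "->", table_for_row(ts, "hour", 1))
```

With an hourly unit and a table size of 1, the first two rows map to the same table and the third to the next one. Deleting the oldest table once the configured number of tables is exceeded is not modeled here.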