Encrypted BACKUP
is an Enterprise-only feature. However, you can take full backups without an Enterprise license.
This doc provides information about how to take and restore encrypted backups in the following ways:
- Using AWS Key Management Service (KMS)
- Using Google Cloud Key Management Service (KMS)
- New in v23.1: Using Azure Key Vault
- Using a passphrase
Supported products
The feature described on this page is available in CockroachDB Dedicated, CockroachDB Serverless, and CockroachDB Self-Hosted clusters when you are running customer-owned backups. For a full list of features, see Backup and restore product support.
Use Key Management Service
You can encrypt full or incremental backups with AWS KMS, Google Cloud KMS, or Azure Key Vault by using the kms
option. Files written by the backup (BACKUP
manifest and data files) are encrypted using a 256-bit crypto-random generated data key. This data key is encrypted with the provided KMS URI(s) and stored alongside the BACKUP
data in an ENCRYPTION_INFO
file, which is used when restoring the backed-up data.
On RESTORE
, CockroachDB reads the ENCRYPTION_INFO
file and attempts to decrypt the encrypted data key using the KMS URI provided in the RESTORE
statement. Once CockroachDB successfully obtains the unencrypted data key, the BACKUP
manifest and data files will be decrypted and the restoration will proceed. Similarly, the same KMS URI is needed to decrypt the file to list the contents of the backup when using SHOW BACKUP
.
When used with incremental backups, the kms
option is applied to all the backup file URLs, which means each incremental must include at least one of the KMS URIs used to take the full backup. It can be any subset of the original URIs, but you cannot include any new KMS URIs. Similarly, when used with locality-aware backups, the KMS URI provided is applied to files in all localities.
For more information about AWS KMS, see the documentation. For more information about Google Cloud KMS, see the documentation.
Generate a KMS key
Before you can use a KMS to encrypt a CockroachDB backup, you must first generate a KMS key. This is the key generated by the cloud provider and it never leaves the KMS. It contains key-related metadata and key material to encrypt/decrypt other data. The key material can never be exported, deleted, or extracted. CockroachDB expects the key to be symmetric (256 bit).
CockroachDB also supports multi-region encryption for your backup. At the time of BACKUP
, you can provide multiple KMS URIs, each referencing a KMS key in a different region. This allows CockroachDB to save multiple versions of the encrypted data key used to encrypt the backup data, one per KMS URI. With these encrypted versions of the data key stored alongside the encrypted backup data, a user can RESTORE
the encrypted data using any one of the KMS URIs that were supplied during backup. In the case of a single KMS region outage, the data can be decrypted with any of the KMS keys from the other regions.
Add a new KMS key to an existing backup
To add a new KMS key to an existing backup, use the ALTER BACKUP
statement. ALTER BACKUP
allows for new KMS encryption keys to be applied to an existing chain of encrypted backups (full and incremental). Once completed, subsequent BACKUP
, RESTORE
, and SHOW BACKUP
statements can use any of the existing or new KMS URIs to decrypt the backup.
For examples on adding a new KMS key to an existing backup, see the ALTER BACKUP
examples.
URI formats
AWS KMS URI format
The AWS KMS URI must use the following format:
aws:///{key}?AUTH={auth_type>}®ION={region}
The AWS URI requires the following:
Component | Description |
---|---|
aws:/// |
The AWS scheme. Note the triple slash (/// ). |
{key} |
The key identifiers used to reference the KMS key that should be used to encrypt or decrypt. For information about the supported formats, see the AWS KMS docs. |
AUTH=<auth_type> |
The user-specified credentials. If you use AUTH=specified , you must provide access keys in the URI parameters (e.g., AWS_ACCESS_KEY_ID=<key_id>&AWS_SECRET_ACCESS_KEY=<secret_key> ). If you use AUTH=implicit , the access keys can be omitted and the credentials will be loaded from the environment. For details on setting up and using the different authentication types, see Authentication. |
REGION=<region> |
The region of the KMS key. |
See AWS's KMS keys documentation for guidance on creating an AWS KMS key.
Azure Key Vault URI format
New in v23.1: The Azure Key Vault URI must use one of two formats:
Explicit authentication using the
AUTH=specified
parameter (or omitting this, as it is the default option) with the tenant ID, client ID, client secret, and key vault name parameters:azure-kms:///{key}/{key version}?AZURE_TENANT_ID={tenant ID}&AZURE_CLIENT_ID={client ID}&AZURE_CLIENT_SECRET={client secret}&AZURE_VAULT_NAME={key vault name}
Implicit authentication with the
AUTH=implicit
and key vault name parameters:azure-kms:///{key}/{key version}?AUTH=implicit&AZURE_VAULT_NAME={key vault name}
See Cloud Storage Authentication for more detail on
implicit
authentication.
The Azure Key Vault URI uses the following parameters:
Component | Description |
---|---|
azure-kms:/// |
The Azure scheme. Note the triple slash (/// ). |
{key} |
Name of the key stored in your key vault. |
{key version} |
Current version of the key in your key vault. |
AZURE_TENANT_ID={tenant ID} |
Directory (tenant) ID for your App Registration. (This is not required for implicit authentication.) |
AZURE_CLIENT_ID={client ID} |
Application (client) ID for your App Registration. (This is not required for implicit authentication.) |
AZURE_CLIENT_SECRET={client secret} |
Client credentials secret generated for your App Registration. (This is not required for implicit authentication.) |
AZURE_VAULT_NAME={key vault name} |
Name of your key vault. |
To run an encrypted Azure backup, it is necessary to create the following:
- Azure Key Vault to generate and store your keys. See Microsoft's quickstart to Create a key vault using the Azure portal.
- Azure App Registration to manage role-based access control. See Microsoft's Register an application with the Microsoft identity platform to register an application.
Once you have created your App Registration you must assign it permissions to your key vault. To complete a successful encrypted backup, your App Registration needs the following permissions:
"dataActions": [
"Microsoft.KeyVault/vaults/keys/encrypt/action",
"Microsoft.KeyVault/vaults/keys/decrypt/action"
]
Follow Microsoft's Assign a Key Vault access policy tutorial for instructions.
Google Cloud KMS URI format
The Google Cloud KMS URI must use the following format:
gs:///projects/{project name}/locations/{location}/keyRings/{key ring name}/cryptoKeys/{key name}?AUTH={auth_type}
The Google Cloud URI requires the following:
Component | Description |
---|---|
gs:/// |
The Google Cloud scheme. Note the triple slash (/// ). |
projects/{project name} |
The name of the project that will hold the objects to encrypt. |
locations/{location} |
The location specified at key creation. |
keyRings/{key ring} |
The Google Cloud key ring created to group keys. |
cryptoKeys/{key name} |
The name of the key. |
AUTH=<auth_type> |
The user-specified credentials. If you use AUTH=specified , then you must include &CREDENTIALS= with your base-64 encoded key. To load credentials from your environment, use AUTH=implicit . For details on setting up and using the different authentication types, see Authentication. |
See Google Cloud's customer-managed encryption key documentation for guidance on creating a KMS key.
Examples
The following examples provide connection strings to Amazon S3 and Google Cloud Storage. For guidance using other authentication parameters, read Use Cloud Storage for Bulk Operations.
Take an encrypted Amazon S3 backup
To take an encrypted backup with AWS KMS, use the kms
option:
BACKUP INTO 's3://{BUCKET NAME}?AWS_ACCESS_KEY_ID={KEY ID}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}'
WITH kms = 'aws:///{key}?AWS_ACCESS_KEY_ID={KEY ID}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}®ION=us-east-1';
Take a backup with multi-region encryption
To take a backup with multi-region encryption, use the kms
option to specify a comma-separated list of KMS URIs:
BACKUP INTO 's3://{BUCKET NAME}?AWS_ACCESS_KEY_ID={KEY ID}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}'
WITH KMS=(
'aws:///{key}?AUTH=implicit®ION=us-east-1',
'aws:///{key}?AUTH=implict®ION=us-west-1'
);
Restore from an encrypted Amazon S3 backup
To decrypt an encrypted backup, use the kms
option and any subset of the KMS URIs that were used to take the backup:
RESTORE FROM LATEST IN 's3://{BUCKET NAME}?AWS_ACCESS_KEY_ID={KEY ID}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}'
WITH kms = 'aws:///{key}?AWS_ACCESS_KEY_ID={KEY ID}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}®ION=us-east-1';
Take an encrypted Azure Blob Storage backup
New in v23.1:
To take an encrypted backup with Azure KMS, use the kms
option:
BACKUP INTO 'azure://{container name}?AUTH=specified&AZURE_ACCOUNT_NAME={account name}&AZURE_CLIENT_ID={client ID}&AZURE_CLIENT_SECRET={client secret}&AZURE_TENANT_ID={tenant ID}'
WITH kms = 'azure-kms:///{key}/{key version}?AZURE_TENANT_ID={tenant ID}&AZURE_CLIENT_ID={client ID}&AZURE_CLIENT_SECRET={client secret}&AZURE_VAULT_NAME={key vault name}';
Take a backup with multi-region encryption
New in v23.1:
To take a backup with multi-region encryption, use the kms
option to specify a comma-separated list of KMS URIs:
BACKUP INTO 'azure://{container name}?AUTH=specified&AZURE_ACCOUNT_NAME={account name}&AZURE_CLIENT_ID={client ID}&AZURE_CLIENT_SECRET={client secret}&AZURE_TENANT_ID={tenant ID}'
WITH KMS=(
'azure-kms:///{key}/{key version}?AZURE_TENANT_ID={tenant ID}&AZURE_CLIENT_ID={client ID}&AZURE_CLIENT_SECRET={client secret}&AZURE_VAULT_NAME={key vault name}',
'azure-kms:///{key}/{key version}?AZURE_TENANT_ID={tenant ID}&AZURE_CLIENT_ID={client ID}&AZURE_CLIENT_SECRET={client secret}&AZURE_VAULT_NAME={key vault name}'
);
Restore from an encrypted Azure Blob Storage backup
New in v23.1:
To decrypt an encrypted backup, use the kms
option and any subset of the KMS URIs that were used to take the backup:
RESTORE FROM LATEST IN 'azure://{container name}?AUTH=specified&AZURE_ACCOUNT_NAME={account name}&AZURE_CLIENT_ID={client ID}&AZURE_CLIENT_SECRET={client secret}&AZURE_TENANT_ID={tenant ID}'
WITH kms = 'azure-kms:///{key}/{key version}?AZURE_TENANT_ID={tenant ID}&AZURE_CLIENT_ID={client ID}&AZURE_CLIENT_SECRET={client secret}&AZURE_VAULT_NAME={key vault name}';
Take an encrypted Google Cloud Storage backup
To take an encrypted backup with Google Cloud KMS, use the kms
option:
BACKUP INTO 'gs://{BUCKET NAME}?AUTH=specified&CREDENTIALS={ENCODED KEY}'
WITH kms = 'gs:///projects/{project name}/locations/us-east1/keyRings/{key ring name}/cryptoKeys/{key name}?AUTH=specified&CREDENTIALS={encoded key}';
Take a backup with multi-region encryption
To take a backup with multi-region encryption, use the kms
option to specify a comma-separated list of KMS URIs:
BACKUP INTO 'gs://{BUCKET NAME}?AUTH=specified&CREDENTIALS={ENCODED KEY}'
WITH KMS=(
'gs:///projects/{project name}/locations/us-east1/keyRings/{key ring name}/cryptoKeys/{key name}?AUTH=specified&CREDENTIALS={encoded key}',
'gs:///projects/{project name}/locations/us-west1/keyRings/{key ring name}/cryptoKeys/{key name}?AUTH=specified&CREDENTIALS={encoded key}'
);
Restore from an encrypted Google Cloud Storage backup
To decrypt an encrypted backup, use the kms
option and any subset of the KMS URIs that were used to take the backup:
RESTORE FROM LATEST IN 'gs://{BUCKET NAME}?AUTH=specified&CREDENTIALS={ENCODED KEY}'
WITH kms = 'gs:///projects/{project name}/locations/us-east1/keyRings/{key ring name}/cryptoKeys/{key name}?AUTH=specified&CREDENTIALS={encoded key}';
Use a passphrase
You can encrypt full or incremental backups with a passphrase by using the encryption_passphrase
option. Files written by the backup (including BACKUP
manifests and data files) are encrypted using the specified passphrase to derive a key. To restore the encrypted backup, the same encryption_passphrase
option (with the same passphrase) must be included in the RESTORE
statement.
When used with incremental backups, the encryption_passphrase
option is applied to all the backup file URLs, which means the same passphrase must be used when appending another incremental backup to an existing backup. Similarly, when used with locality-aware backups, the passphrase provided is applied to files in all localities.
Encryption is done using AES-256-GCM, and GCM is used to both encrypt and authenticate the files. A random salt is used to derive a once-per-backup AES key from the specified passphrase, and then a random initialization vector is used per-file. CockroachDB uses PBKDF2 with 64,000 iterations for the key derivation.
BACKUP
and RESTORE
will use more memory when using encryption, as both the plain-text and cipher-text of a given file are held in memory during encryption and decryption.
For an example of an encrypted backup, see Create an encrypted backup.
The examples in this section use one of the following storage URIs:
- External connections, which allow you to represent an external storage or sink URI. You can then specify the external connection's name in statements rather than the provider-specific URI. For detail on using external connections, see the
CREATE EXTERNAL CONNECTION
page. - Amazon S3 connection strings with the default
AUTH=specified
parameter. For guidance on usingAUTH=implicit
authentication with Amazon S3 buckets instead, read Cloud Storage Authentication.
For guidance on connecting to other storage options or using other authentication parameters instead, read Use Cloud Storage.
Take an encrypted backup using a passphrase
To take an encrypted backup, use the encryption_passphrase
option:
BACKUP INTO 's3://{BUCKET NAME}?AWS_ACCESS_KEY_ID={KEY ID}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}' WITH encryption_passphrase = 'password123';
To restore, use the same encryption_passphrase
. See the example below for more details.
Restore from an encrypted backup using a passphrase
To decrypt an encrypted backup, use the encryption_passphrase
option option and the same passphrase that was used to create the backup.
For example, the encrypted backup created in the previous example can be restored with:
RESTORE FROM LATEST IN 's3://{BUCKET NAME}?AWS_ACCESS_KEY_ID={KEY ID}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}' WITH encryption_passphrase = 'password123';
To restore from a specific backup, use RESTORE FROM {subdirectory} IN ...
.