Using the DD Boost Storage Plugin with gpbackup and gprestore

    Dell EMC Data Domain Boost (DD Boost) is Dell EMC software that can be used with the gpbackup and utilities to perform faster backups to the Dell EMC Data Domain storage appliance. You can also replicate a backup on a separate, remote Data Domain system for disaster recovery with gpbackup or gpbackup_manager. For information about replication, see .

    To use the DD Boost storage plugin application, you first create a configuration file to specify the location of the plugin, the DD Boost login, and the backup location. When you run or gprestore, you specify the configuration file with the option --plugin-config. For information about the configuration file, see DD Boost Storage Plugin Configuration File Format.

    If you perform a backup operation with the gpbackup option --plugin-config, you must also specify the --plugin-config option when you restore the backup with gprestore.

    The configuration file specifies the absolute path to the Greenplum Database DD Boost storage plugin executable, DD Boost connection credentials, and Data Domain location. The configuration file is required only on the master host. The DD Boost storage plugin application must be in the same location on every Greenplum Database host.

    The DD Boost storage plugin configuration file uses the YAML 1.1 document format and implements its own schema for specifying the DD Boost information.

    The configuration file must be a valid YAML document. The gpbackup and gprestore utilities process the configuration file document in order and use indentation (spaces) to determine the document hierarchy and the relationships of the sections to one another. The use of white space is significant. White space should not be used simply for formatting purposes, and tabs should not be used at all.

    This is the structure of a DD Boost storage plugin configuration file.

    executablepath

    Required. Absolute path to the plugin executable. For example, the Tanzu Greenplum installation location is $GPHOME/bin/gpbackup_ddboost_plugin. The plugin must be in the same location on every Greenplum Database host.

    options

    Required. Begins the DD Boost storage plugin options section.

    • hostname

      Required. The IP address or hostname of the host. There is a 30-character limit.

      username

      Required. The Data Domain Boost user name. There is a 30-character limit.

      Required. The passcode for the DD Boost user to access the Data Domain storage unit.

      storage-unit

      Required. A valid storage unit name for the Data Domain system that is used for backup and restore operations.

      directory

      Required. The location for the backup files, configuration files, and global objects on the Data Domain system. The location on the system is /<data-domain-dir> in the storage unit of the system.

    : During a backup operation, the plugin creates the directory location if it does not exist in the storage unit and stores the backup in this directory /<data-domain-dir>/YYYYMMDD/YYYYMMDDHHMMSS/.

    • replication

      Optional. Activates or deactivates backup replication with DD Boost managed file replication when gpbackup performs a backup operation. Value is either on or off. Default value is off, backup replication is deactivated. When the value is on, the DD Boost plugin replicates the backup on the Data Domain system that you specify with the remote_* options.

    : The replication option and remote_* options are ignored when performing a restore operation with . The remote_* options are ignored if replication is off.

    : This option is ignored when you perform replication with the gpbackup_manager replicate-backup command. For information about replication,see .

    • replication-streams

      Optional. Used with the gpbackup_manager replicate-backup command, ignored otherwise. Specifies the maximum number of Data Domain I/O streams that can be used when replicating a backup set on a remote Data Domain server from the Data Domain server that contains the backup. Default value is 1.

    : This option is ignored when you perform replication with gpbackup. The default value is used.

    : During a backup operation, the plugin creates the directory location if it does not exist in the storage unit of the remote Data Domain system and stores the replicated backup in this directory /<remote-dd-dir>/YYYYMMDD/YYYYMMDDHHMMSS/.

    This is an example DD Boost storage plugin configuration file that is used in the next gpbackup example command. The name of the file is .

    1. executablepath: $GPHOME/bin/gpbackup_ddboost_plugin
    2. options:
    3. hostname: "192.0.2.230"
    4. username: "test-ddb-user"
    5. password: "asdf1234asdf"
    6. storage_unit: "gpdb-backup"

    This gpbackup example backs up the database demo using the DD Boost storage plugin. The absolute path to the DD Boost storage plugin configuration file is /home/gpadmin/ddboost-test-config.yml.

    The DD Boost storage plugin writes the backup files to this directory of the Data Domain storage unit gpdb-backup.

    1. /test/backup/<YYYYMMDD>/<YYYYMMDDHHMMSS>/

    This is an example DD Boost storage plugin configuration file that enables replication.

    To restore from the replicated backup in the previous example, you can run gprestore with the DD Boost storage plugin and specify a configuration file with this information.

    1. executablepath: $GPHOME/bin/gpbackup_ddboost_plugin
    2. options:
    3. hostname: "192.0.3.20"
    4. remote_username: "test-dd-remote"
    5. remote_password: "qwer2345erty"
    6. directory: "test/replication"

    Dell EMC DD Boost is integrated with Tanzu Greenplum and requires a DD Boost license. Open source Greenplum Database cannot use the DD Boost software, but can back up to a Dell EMC Data Domain system mounted as an NFS share on the Greenplum master and segment hosts.

    When you perform a backup with the DD Boost storage plugin, the plugin stores the backup files in this location in the Data Domain storage unit.