Installing the Greenplum Load Tools
See the Greenplum Database Release Notes for the list of currently supported platforms for the Load Tools.
Parent topic:
The Greenplum Database load tools installer installs the following data loading tools:
- Greenplum parallel file distribution program ()
- Greenplum data loading utility (gpload)
Download the appropriate Greenplum Loaders installer package for your platform from VMware Tanzu Network.
Follow the instructions in to verify the integrity of the Greenplum Loaders software.
Unzip the installer:
The installer will prompt you to accept the license agreement and to provide an installation path. For the installation path, be sure to enter an absolute path if you choose not to accept the default location (for example, ). The default install location for the load tools is
/usr/local/greenplum-loaders-<version\>
Your Greenplum Database load tools installation contains the following files and directories:
- bin — data loading command-line tools
- ext — python dependencies
- lib — load tools library files
- greenplum_loaders_path.sh — environment variables
As a convenience, a greenplum_loaders_path.sh
file is provided in your load tools installation directory following installation. It has the following environment variable settings:
PATH
— Adds the path to the data loading command-line utilities.- — The path to additional Python library files needed for
gpload
.
You can source this file in your user’s startup shell profile (such as .bashrc
or .bash_profile
).
For example, you could add a line similar to the following to your chosen profile files (making sure the right install path is used):
After editing the chosen profile file, source it as the correct user to make the changes active. For example:
- — The name of the default Greenplum database to connect to.
PGHOST
— The Greenplum master host name or IP address.PGPORT
— The port number that the Greenplum master instance is running on.PGUSER
— The default database role name to use for login.
In order for Greenplum Database to be able to accept remote client connections, you must configure your Greenplum Database master so that connections are allowed from the client hosts and database users that will be connecting to Greenplum Database.
Make sure that the
pg_hba.conf
file of the Greenplum Database master is correctly configured to allow connections from the users to the database(s) using the authentication method you want. For details, see “Editing the pg_hba.conf File” in the Greenplum Database Administration Guide, and also see the Greenplum Database Security Configuration Guide.Make sure the authentication method you choose is supported by the client tool you are using.
If you edited file, the change requires a server reload (using the
gpstop -u
command) to take effect.Make sure that the databases and roles you are using to connect exist in the system and that the roles have the correct priviliges to the database objects.
Refer to the load tool reference documentation for further instructions:
- Greenplum Load Tools — See Load Tools Reference.
- Greenplum Database SQL Syntax — See .