Connecting to Hadoop

When launched, Aginity Workbench automatically displays the database connection dialog box. You can also establish a new connection at any time by clicking Connect on the application’s toolbar, and you can connect to several database servers simultaneously.

Establish a Database Connection

To establish a connection, use one of the following methods, depending on the Hadoop distribution that is used in your organization.

To connect to a Hadoop cluster using IBM Big SQL:

  1. In the drivers list in the lower left of the dialog box, select IBM Big SQL.
  2. In the Server box, provide the URL or IP address of the target server. In the Port field, either leave the default port or provide an alternative one.
  3. In the User ID and Password boxes, provide the user’s name for the connection and the corresponding password.
  4. In the Database list, provide the name of the Big SQL database to connect to.
  5. If desired, click the Advanced button to open a dialog box in which you can enable a Secure Sockets Layer (SSL) connection to the database. Select Enable SSL and specify the Keystore Database and Stash File names.
  6. To save the provided connection information as a connection entry, click Save. Once saved, a connection entry can be selected from the Saved list.
  7. Click OK. When the connection is established, the main window of the Aginity Workbench application, Query Analyzer, opens.

To connect to a Hadoop cluster using Hive:

  1. In the drivers list in the lower left of the dialog box, select Aginity Hive Native.
  2. In the HiveServer2 field, provide the domain name or IP address for the server that enables remote clients to execute queries against Hive and retrieve the results. In the Port field, either leave the default port or provide an alternative one.
  3. In the Metastore field, provide the domain name or IP address for the service that stores the metadata for Hive tables and partitions and gives clients access to this information. In the Port field, either leave the default port or provide an alternative one. Metastore information is optional, but omitting it may leave table properties and statistics incomplete and can affect reverse-engineered DDL. Provide this information whenever possible.
  4. In the User ID and Password fields, provide the user’s name for the connection and the corresponding password. If you connect through Kerberos with your current computer login, you can leave these fields blank. If you connect through Kerberos with a user name and password other than your current computer login credentials, the User ID must be in the form username@realm. The realm is covered in the advanced connection properties (described below).
  5. In the Database list, provide the name of the Hive database to connect to.
  6. In the Timeout field, either leave the default connection timeout or adjust it accordingly.
  7. If desired, click the Advanced button to open the advanced connection properties. You can also set Kerberos options from here.
  8. To save the provided connection information as a connection entry, click Save. Once saved, a connection entry can be selected from the Saved list.
  9. Click OK. When the connection is established, the main window of the Aginity Workbench application, Query Analyzer, opens.
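
If a Hive connection fails, it can help to check the values from steps 2 through 4 outside of Workbench. The following Python sketch is not part of Aginity Workbench. It assumes the stock Apache Hive default ports (10000 for HiveServer2 and 9083 for the Metastore), which your cluster may override, and the helper names is_reachable and split_user_id are hypothetical. It tests whether a host and port accept TCP connections and whether a User ID follows the username@realm form.

```python
import socket
from typing import Optional, Tuple

# Assumption: stock Apache Hive defaults; your cluster may use other ports.
DEFAULT_HIVESERVER2_PORT = 10000
DEFAULT_METASTORE_PORT = 9083


def is_reachable(host: str, port: int, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


def split_user_id(user_id: str) -> Tuple[str, Optional[str]]:
    """Split 'username@realm' into (username, realm).

    The realm is None when the User ID contains no '@' part.
    Raises ValueError if the user name or the realm is empty.
    """
    username, sep, realm = user_id.partition("@")
    if not username or (sep and not realm):
        raise ValueError(f"malformed User ID: {user_id!r}")
    return username, (realm if sep else None)
```

For example, is_reachable("hive.example.com", DEFAULT_HIVESERVER2_PORT) checks the HiveServer2 endpoint, where hive.example.com is a placeholder host name. A successful TCP check shows only that the port is open. It does not confirm that the service behind it is Hive or that your credentials are valid.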

Connection entries are stored in an encrypted form and can be decrypted by the currently logged-on Windows user only. For security reasons, you can choose not to save your password with your connection string. In this case, you must enter your password each time you connect to the database.

Connect to Multiple Database Servers Simultaneously

If you frequently work with objects in databases that reside on different servers, you may find the ability to simultaneously connect to multiple servers useful.

To connect to multiple database servers:

  1. In the Database Connection dialog box, specify the connection information for the first server that you want to connect to. For an explanation of the connection parameters, see the section Establish a Database Connection.
  2. Open the Database Connection dialog box again and specify the connection information for each additional server to connect to.

    A separate Query Analyzer window opens for each server. You can switch between the windows by clicking their buttons in the side bar of the main Workbench window.


Aginity Workbench for Hadoop * Topic last updated December 27, 2018