Linux Prerequisites
Java Runtime Environment installed and on the path - only Open JDK 11 is supported and tested with Azkaban and RED
PuTTY or equivalent tools for Windows (for remote installation and interactive task execution)
Linux Installation
Currently, there is no interactive install wizard for WhereScape Azkaban on Linux but the azkaban-installer.jar
file in the RED install directory under <install_dir>\Azkaban\ can be transferred to Linux and run directly using Java. This process requires a pre-prepared response file to be supplied with the parameters for the installation.
Azkaban Installation
Install the Azkaban Metadata
Install the Azkaban Web Server
Install one or more Azkaban Executor Servers
- Add the Scheduler configuration to RED
1. Install the Azkaban Metadata
Perform the Azkaban metadata install by executing the below commands in your command shell in Linux or Windows.
- First, set the environment variables for your Azkaban metadata user and pwd:
AZ_DB_USER=myuser
AZ_DB_PWD=mypwd
- Then, run the
azkaban-installer.jar
with theupgrade-schema
command:java -jar "azkaban-installer.jar" upgrade-schema --database=AZ_REPO --host=PG_HOST --port=5432 --schema=white --username-var=AZ_DB_USER --password-var=AZ_DB_PWD
2. Install the Azkaban Web Server
Copy the scheduler .jar file from <RED_install_dir>\Azkaban\azkaban-installer.jar to the Linux machine that will host the main Web Server and run the following commands (see example response files later in this section):
java -jar azkaban-installer.jar --log-file=./install.log install-server --response-file=web_server_response_file.xml
3. Install one or more Azkaban Executor Servers
Copy the scheduler .jar file from <RED_install_dir>\Azkaban\azkaban-installer.jar to the Linux machine that will host each Executor Server and run the following commands (see example response files later in this section):
java -jar azkaban-installer.jar --log-file=./install.log install-server --response-file=executor_server_response_file.xml
Azkaban-installer command line help
The same installer can be run from Windows or Linux using the following command to output the installer command line help:
java -jar azkaban-installer.jar --help
Which outputs:
Usage: azkaban-installer [-hV] [--log-file=FILE] [COMMAND] --log-file=FILE File name (including path) for the log file (default: C: \Users\Red1\AppData\Local\Temp\WsSchedulerInstaller_2023050 2_201952_803.log) -h, --help Show this help message and exit. -V, --version Print version information and exit. Commands: example-response-file Generates an example well-formed XML response file to standard output install-server Installs or upgrades an Azkaban server as described in the response file mandatory-executor-tags Outputs the executor tags that should always be added to executors running on the current machine. response-file-schema Writes the XML schema for the response file to standard output test-connection Tests that a JDBC connection to the specified database can be established. upgrade-schema Upgrades or installs the Scheduler metadata schema.
4. Add the scheduler configuration to RED
To start using this scheduler in RED first, the Scheduler Configuration needs to be added to RED via the Scheduler tab.
Once in RED, navigate to the Scheduler tab and select the menu item Scheduler→Configuration, this will open the configuration for the attached scheduler.
- Enter your URLs for the Azkaban Web Server you setup earlier into both the Dashboard and Rest API URL fields.
- Provide the credentials for Dashboard User into the Dashboard section
- Provide the credentials for Publish User in the Rest API section
- Add your default scheduler tags which you setup on your Executor servers (these are case sensitive so ensure you have the same case as in your response file)
- Set the default scripts as shown in the example below
Example response files:
Note
You must install the Azkaban Web Server first before installing any Executor Servers. The Web Server and Executor Servers can be installed on Windows or Linux, or some combination of both. It is important to ensure that connectivity between your Web and Executor Servers is open on the ports you assign to them.
RED has implemented two password values for the azkabanPasswordEncryption
property for Linux: WALLET and NONE
- Use
WALLET
and set a value forazkaban.walletCmd
which is used as a system command to retrieve the password from the wallet. For more information on Wallet configuration using a standard Linux password manager, visit https://www.passwordstore.org/ - Or, use
NONE
For more information about wherescapeJobWorkdir
property, refer to RED Scheduler Configuration.
Example Install Command:
Typical installation command assuming the azkaban-installer.jar
has been copied to the home directory for the running user on Linux:
java -jar ~/azkaban-installer.jar --log-file=~/install.log install-server --response-file=~/response.xml
Starting and Stopping the Azkaban Servervs
On Linux, the Azkaban Servers are not started automatically after installation to run the startup script.
<azkaban_install_dir>/<web_server_name>/azkaban-web-server/bin/start-web.sh
<azkaban_install_dir>/<exec_server_name>/azkaban-exec-server/bin/start-exec.sh
There are various ways to have these servers start when Linux starts. The simplest is to add it via crontab with the @reboot keyword.
For example:
crontab -e
Opens crontab editor for the current user
Add a line for each server, save, and close the editor:
@reboot <azkaban_install_dir>/<web_server_name>/azkaban-web-server/bin/start-web.sh
@reboot <azkaban_install_dir>/<exec_server_name>/azkaban-exec-server/bin/start-exec.sh
crontab -l
Lists the current users' cron jobs.