Azkaban is a batch workflow job scheduler created at LinkedIn to run Hadoop jobs. Azkaban resolves the ordering through job dependencies and provides an easy to use web user interface to maintain and track your workflows.
It is the hadoop job scheduling coordinator, so many components are linked in order
The more popular ones are azkaban and oozie azkaban is more lightweight
azkaban needs mysql, because the task execution is not ok after only one execution, so the task needs to be saved in mysql
Need to install azkaban-excutor-web-sql
Just unzip it, there are many plug-ins, only these three are needed at present,
Only need to install to a certain machine in the cluster
There is a sql script createall in sql and import it into mysql
mysql>create database azkaban
source /opt/azkaban-2.5.0/create-all-sql-2.5.0.sql
azkaban is encrypted https, so the server needs an encrypted certificate
keytool -keystore keystore -alias jetty -genkey -keyalg RSA
Password requires six digits
Some of the following do not need to be filled in, just fill in a country code to CN
The jetty password is the same and press enter
Copy the generated keystorecp to the azakban-web directory
Then unify the time, since task scheduling may need to be done on time
cp /usr/share/zoneinfo/Asia/Shanghai /etc/localtime
If there is no Shanghai, it can be generated with tzselect
Configure the properties in the conf in the web
Configure the mysql password and ssl password accordingly
Add a user in user. xml
<user username="admin" password="admin" roles="admin,metrics"/>
Configure excutor conf properties similar to the above
start up
Start the web first
bin/azkaban.start.sh
excutor
bin/azkaban
Visit https://mini01:8443 and it is recommended to use chrome ff to reject the link
Upload file task must be zip
zip xx.zip xx
~ ❯❯❯ cat command.job
#command.job
type=command
command=echo 'hello'