org.apache.atlas.hbase.bridge.HBaseBridge imports HBase metadata into Atlas using the model defined above. The import-hbase.sh script can be used to run this import.
<verbatim>
Usage 1: <atlas package>/hook-bin/import-hbase.sh
Usage 2: <atlas package>/hook-bin/import-hbase.sh [-n <namespace regex> OR --namespace <namespace regex>] [-t <table regex> OR --table <table regex>]
</verbatim>
The logs are in <atlas package>/logs/import-hbase.log
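The two usages can be wrapped in a small helper, sketched below. This is only an illustration: =ATLAS_HOME= (with its =/opt/atlas= default), =import_script= and =run_import= are hypothetical names standing in for the <atlas package> location, and the regex values in the comments are examples.

```shell
#!/bin/sh
# ATLAS_HOME is a hypothetical variable standing in for <atlas package>.
ATLAS_HOME="${ATLAS_HOME:-/opt/atlas}"
import_script="$ATLAS_HOME/hook-bin/import-hbase.sh"

# Runs import-hbase.sh with the given options, failing early if the
# script is missing or not executable.
run_import() {
  if [ ! -x "$import_script" ]; then
    echo "import script not found: $import_script" >&2
    return 1
  fi
  "$import_script" "$@"
}

# Usage 1, no filters:
#   run_import
# Usage 2, namespace and table regexes (example values):
#   run_import -n 'default' -t 'sales_.*'
# Afterwards, inspect the log:
#   tail -n 50 "$ATLAS_HOME/logs/import-hbase.log"
```

Quoting the regexes keeps the shell from expanding characters such as =*= before they reach the script.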
---++ HBase Hook
The Atlas HBase hook registers with HBase to listen for create/update/delete operations and, via Kafka notifications, updates the metadata in Atlas to reflect the changes in HBase.
Follow the instructions below to set up the Atlas hook in HBase:
* Register the Atlas hook in hbase-site.xml.
* Copy all files and folders under <atlas package>/hook/hbase/ to the HBase classpath. The HBase hook binaries are in apache-atlas-<release-version>-SNAPSHOT-hbase-hook.tar.gz
* Copy <atlas-conf>/atlas-application.properties to the HBase conf directory.
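For the hbase-site.xml step, the hook is typically registered as an HBase master coprocessor. The fragment below is a sketch under that assumption: the property name is the standard HBase setting for master coprocessors, and the class name is the one used in Atlas HBase hook documentation, but neither is stated in this section, so verify both against your Atlas release.

```xml
<!-- Assumed registration of the Atlas hook as a master coprocessor; -->
<!-- confirm the class name for your Atlas version before using it.  -->
<property>
  <name>hbase.coprocessor.master.classes</name>
  <value>org.apache.atlas.hbase.hook.HBaseAtlasCoprocessor</value>
</property>
```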
The following properties in <atlas-conf>/atlas-application.properties control the thread pool and notification details:
* atlas.hook.hbase.synchronous - boolean, true to run the hook synchronously. default false. Keeping it false is recommended to avoid delays in HBase operations.
* atlas.hook.hbase.numRetries - number of retries for notification failure. default 3
* atlas.hook.hbase.minThreads - core number of threads. default 1
* atlas.hook.hbase.maxThreads - maximum number of threads. default 5
* atlas.hook.hbase.keepAliveTime - keep-alive time in milliseconds. default 10
* atlas.hook.hbase.queueSize - queue size for the threadpool. default 10000
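As an illustration, the properties above can be written out in atlas-application.properties with their documented defaults. The values below simply restate those defaults; they are not tuning recommendations.

```properties
# Run the hook asynchronously (recommended) so HBase operations are not delayed
atlas.hook.hbase.synchronous=false
# Retries when sending a notification fails
atlas.hook.hbase.numRetries=3
# Thread pool: core and maximum number of threads
atlas.hook.hbase.minThreads=1
atlas.hook.hbase.maxThreads=5
# Keep-alive time in milliseconds
atlas.hook.hbase.keepAliveTime=10
# Queue size for the thread pool
atlas.hook.hbase.queueSize=10000
```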
Refer to [[Configuration][Configuration]] for notification-related configuration.
---++ NOTES
* Only namespace, table and column family create/update/delete operations are captured by the hook. Column changes are not captured or propagated.