---+ Configuring Apache Atlas ---++ Introduction All configuration in Atlas uses java properties style configuration. ---++ Application Properties The main configuration file is application.properties which is in the *conf* dir at the deployed location. It consists of the following sections: ---+++ Graph Database Configs ---++++ Graph persistence engine This section sets up the graph db - titan - to use a persistence engine. Please refer to <a href="http://s3.thinkaurelius.com/docs/titan/0.5.4/titan-config-ref.html">link</a> for more details. The example below uses BerkeleyDBJE. <verbatim> atlas.graph.storage.backend=berkeleyje atlas.graph.storage.directory=data/berkley </verbatim> ---+++++ Graph persistence engine - Hbase Basic configuration <verbatim> atlas.graph.storage.backend=hbase #For standalone mode , specify localhost #for distributed mode, specify zookeeper quorum here - For more information refer http://s3.thinkaurelius.com/docs/titan/current/hbase.html#_remote_server_mode_2 atlas.graph.storage.hostname=<ZooKeeper Quorum> </verbatim> Advanced configuration Refer http://s3.thinkaurelius.com/docs/titan/0.5.4/titan-config-ref.html#_storage_hbase ---++++ Graph Search Index This section sets up the graph db - titan - to use an search indexing system. The example configuration below setsup to use an embedded Elastic search indexing system. <verbatim> atlas.graph.index.search.backend=elasticsearch atlas.graph.index.search.directory=data/es atlas.graph.index.search.elasticsearch.client-only=false atlas.graph.index.search.elasticsearch.local-mode=true atlas.graph.index.search.elasticsearch.create.sleep=2000 </verbatim> ---++++ Graph Search Index - Solr <verbatim> atlas.graph.index.search.backend=solr5 atlas.graph.index.search.solr.mode=cloud atlas.graph.index.search.solr.zookeeper-url=<the ZK quorum setup for solr as comma separated value> eg: 10.1.6.4:2181,10.1.6.5:2181 </verbatim> ---+++ Hive Lineage Configs The higher layer services like hive lineage, schema, etc. are driven by the type system and this section encodes the specific types for the hive data model. # This models reflects the base super types for Data and Process <verbatim> atlas.lineage.hive.table.type.name=DataSet atlas.lineage.hive.process.type.name=Process atlas.lineage.hive.process.inputs.name=inputs atlas.lineage.hive.process.outputs.name=outputs ## Schema atlas.lineage.hive.table.schema.query=hive_table where name=?, columns </verbatim> ---+++ Notification Configs Refer http://kafka.apache.org/documentation.html#configuration for kafka configuration. All kafka configs should be prefixed with 'atlas.kafka.' <verbatim> atlas.notification.embedded=true atlas.kafka.data=${sys:atlas.home}/data/kafka atlas.kafka.zookeeper.connect=localhost:9026 atlas.kafka.bootstrap.servers=localhost:9027 atlas.kafka.zookeeper.session.timeout.ms=400 atlas.kafka.zookeeper.sync.time.ms=20 atlas.kafka.auto.commit.interval.ms=1000 </verbatim> ---+++ Client Configs <verbatim> atlas.client.readTimeoutMSecs=60000 atlas.client.connectTimeoutMSecs=60000 </verbatim> ---+++ Security Properties ---++++ SSL config The following property is used to toggle the SSL feature. <verbatim> atlas.enableTLS=false </verbatim>