These Ansible playbooks will build a Hadoop cluster.
You can pre-build a Rackspace cloud environment or run the playbooks against an existing environment.
See INSTALL-ENV.md for installation and build instructions.
Ansible == 22.214.171.124 (2.2 is not supported at the moment)
Expects RHEL/CentOS 6/7 or Ubuntu 14 hosts.
Building the Rackspace Cloud environment requires the
pyraxPython module: pyrax link.
It installs Hortonworks Data Platform using Ambari Blueprints.
It supports static inventory if the environment is pre-built (
The data drives can be customized and can be put on top of Cloud Block Storage (partitioning is automatic).
If there are 2 or 3 masternodes, it will also enable HA NameNode.
Memory settings are scaled with the hardware configuration of the nodes.
The cloud environment requires the standard
pyraxcredentials file that looks like this:
[rackspace_cloud] username = my_username api_key = 01234567890abcdef
This file will be referenced in
By default, the file is expected to be:
When provisioning HDP on existing infrastructure edit
inventory/staticand add the nodes.
[Configuration files] (id:configuration)
To customize, change the variables under
playbooks/group_vars/all: contains global cluster and cloud settings
playbooks/group_vars/master-nodes: master-nodes configuration
playbooks/group_vars/slave-nodes: slave-nodes configuration
playbooks/group_vars/edge-nodes: edge-nodes configuration
For a one-node cluster, set
cloud_nodes_count in master-nodes to 1 and
cloud_nodes_count in slave-nodes to 0.
To provision a cloud environment, run the
provision_rax.sh script after you've customized the variables under
Continue with the HDP deployment steps here : HDP Install