State Backend
Motivation
Section titled “Motivation”OpenTofu/Terraform supports remote state storage via various backends that you normally configure in your .tf files as follows:
terraform { backend "s3" { bucket = "my-tofu-state" key = "frontend-app/tofu.tfstate" region = "us-east-1" encrypt = true dynamodb_table = "my-lock-table" }}Unfortunately, the backend configuration does not currently support expressions, variables, or functions. This makes it hard to keep your code DRY if you have multiple OpenTofu/Terraform modules. For example, consider the following folder structure, which uses different OpenTofu/Terraform modules to deploy a backend app, frontend app, MySQL database, and a VPC:
Directorybackend-app
- main.tf
Directoryfrontend-app
- main.tf
Directorymysql
- main.tf
Directoryvpc
- main.tf
To use remote state with each of these modules, you would have to copy/paste the identical backend configuration into each of the main.tf files. The only thing that would differ between the configurations would be the key parameter: e.g., the key for mysql/main.tf might be mysql/terraform.tfstate and the key for frontend-app/main.tf might be frontend-app/terraform.tfstate.
In addition, the resources used for remote state will be provisioned somewhere else, and that somewhere else needs to be managed. Most users end up using “click-ops” to provision the S3 bucket and DynamoDB table used for AWS remote state (clicking around in the AWS console until they have what they need). This is error-prone, difficult to reproduce, and makes it hard to do the right thing consistently (e.g., enabling versioning, encryption, and access logging).
Luckily, Terragrunt has built-in tooling to make it easy to manage remote state.
Generating remote state settings with Terragrunt
Section titled “Generating remote state settings with Terragrunt”To fill in the settings via Terragrunt, create a root.hcl file in the root folder, plus one terragrunt.hcl file in each of the OpenTofu/Terraform modules:
- root.hcl
Directorybackend-app
- main.tf
- terragrunt.hcl
Directoryfrontend-app
- main.tf
- terragrunt.hcl
Directorymysql
- main.tf
- terragrunt.hcl
Directoryvpc
- main.tf
- terragrunt.hcl
In your root.hcl file, you can define your entire remote state configuration just once in a generate block, to generate a backend.tf file that includes the backend configuration:
generate "backend" { path = "backend.tf" if_exists = "overwrite_terragrunt" contents = <<EOFterraform { backend "s3" { bucket = "my-tofu-state" key = "${path_relative_to_include()}/tofu.tfstate" region = "us-east-1" encrypt = true dynamodb_table = "my-lock-table" }}EOF}This instructs Terragrunt to create the file backend.tf in the working directory (where Terragrunt calls tofu/terraform)
before it runs any OpenTofu/Terraform commands, including init. This allows you to inject this backend configuration
in all the units that include the root file and have terragrunt properly initialize the backend configuration with
interpolated values.
To inherit this configuration in each unit, such as mysql/terragrunt.hcl, you can
tell Terragrunt to automatically include all the settings from the root root.hcl file as follows:
include "root" { path = find_in_parent_folders("root.hcl")}The include block tells Terragrunt to use an identical Terragrunt configuration from the root.hcl file specified via the path parameter. It behaves exactly as if you had copy/pasted the OpenTofu/Terraform configuration from the included file generate configuration into mysql/terragrunt.hcl, but this approach is much easier to maintain!
The next time you run terragrunt, it will automatically configure all the settings for the backend, if they aren’t configured already, by calling tofu/terraform init.
The terragrunt.hcl files above use two Terragrunt built-in functions:
-
find_in_parent_folders(): This function returns the absolute path to the first file it finds in the parent folders above the current unit named something. In the example above, the call tofind_in_parent_folders("root.hcl")inmysql/terragrunt.hclwill return/your-root-folder/root.hcl. This way, you don’t have to hard code thepathparameter in every unit. -
path_relative_to_include(): This function returns the relative path between the unit and the path specified in itsincludeblock. We typically use this in a rootroot.hclfile so that each unit stores its OpenTofu/Terraform state at a differentkey. For example, themysqlunit will have itskeyparameter resolve tomysql/tofu.tfstateand thefrontend-appmodule will have itskeyparameter resolve tofrontend-app/tofu.tfstate.
Read Functions docs for more info.
Create remote state resources automatically
Section titled “Create remote state resources automatically”The generate block is useful for allowing you to set up the remote state backend configuration automatically, but
this introduces a bootstrapping problem: how do you create and manage the underlying storage resources for the remote
state? For example, when using the s3 backend, OpenTofu/Terraform
expects the S3 bucket to already exist for it to upload the state objects.
Ideally, you can manage the S3 bucket using OpenTofu/Terraform, but what about the state object for the module managing the S3
bucket? How do you create the S3 bucket, before you run tofu/terraform, if you need to run tofu/terraform to create the
bucket?
To handle this, Terragrunt supports a different block for managing the backend configuration: the remote_state block.
NOTE
remote_stateis an alternative way of managing the OpenTofu/Terraform backend compared togenerate. You cannot use both methods at the same time to manage the remote state configuration. When implementingremote_state, be sure to remove the correspondinggenerateblock for managing the backend.
The following backends are currently supported by remote_state:
For all other backends, the remote_state block operates in the same manner as generate. However, we may add
support for additional backends to remote_state blocks, which may disrupt your environment. If you do not want support
for automated management of remote state resources, we recommend sticking to generate blocks to configure the backend.
When you run terragrunt with a remote_state configuration, it will automatically create the following resources if they don’t already exist:
-
S3 bucket: If you are using the S3 backend for remote state storage and the
bucketyou specify inremote_state.configdoesn’t already exist, Terragrunt will create it automatically, with versioning, server-side encryption, and access logging enabled.In addition, you can let terragrunt tag the bucket with custom tags that you specify in
remote_state.config.s3_bucket_tags. -
DynamoDB table: If you are using the S3 backend for remote state storage and/or you specify a
dynamodb_table(a DynamoDB table used for locking) inremote_state.config, Terragrunt will create them automatically if they don’t already exist. They will be created with server-side encryption enabled, and the DynamoDB table will use the primary keyLockID.You may configure a custom endpoint for the AWS DynamoDB API using
remote_state.config.dynamodb_endpoint.In addition, you can let terragrunt tag the DynamoDB table with custom tags that you specify in
remote_state.config.dynamodb_table_tags. -
GCS bucket: If you are using the GCS backend for remote state storage and the
bucketyou specify inremote_state.configdoesn’t already exist, Terragrunt will create it automatically, with versioning enabled. For this to work correctly you must also specifyprojectandlocationkeys inremote_state.config, so Terragrunt knows where to create the bucket. You will also need to supply valid credentials using eitherremote_state.config.credentialsor by setting theGOOGLE_APPLICATION_CREDENTIALSenvironment variable. If you want to skip creating the bucket entirely, simply setskip_bucket_creationtotrueand Terragrunt will assume the bucket has already been created. If you don’t specifybucketinremote_statethen terragrunt will assume that you will passbucketthrough-backend-configinextra_arguments.We also strongly recommend you enable Cloud Audit Logs to audit and track API operations performed against the state bucket.
In addition, you can let Terragrunt label the bucket with custom labels that you specify in
remote_state.config.gcs_bucket_labels.
Note: If you specify a profile key in remote_state.config, Terragrunt will automatically use this AWS profile when creating the S3 bucket or DynamoDB table.
Note: You can disable automatic remote state initialization by setting remote_state.disable_init, this will skip the automatic creation of remote state resources and will execute terraform init passing the backend=false option. This can be handy when running commands such as run --all validate as part of a CI process where you do not want to initialize remote state.
The following example demonstrates using an environment variable to configure this option:
remote_state { # ...
disable_init = tobool(get_env("TG_DISABLE_INIT", "false"))}Here is an example of using the remote_state block to configure the S3 backend:
remote_state { backend = "s3" generate = { path = "backend.tf" if_exists = "overwrite" } config = { bucket = "my-terraform-state" key = "${path_relative_to_include()}/terraform.tfstate" region = "us-east-1" encrypt = true dynamodb_table = "my-lock-table" }}Like the approach with generate blocks, this will generate a backend.tf file that contains the remote state
configuration. However, in addition to that, terragrunt will also now manage the S3 bucket and DynamoDB table for you.
This means that if the S3 bucket my-terraform-state and DynamoDB table my-lock-table does not exist in your account,
Terragrunt will automatically create these resources before calling terraform and configure them based on the
specified configuration parameters.
Additionally, for the S3 backend only, Terragrunt will automatically update the S3 resource to match the
configuration specified in the remote_state bucket. For example, if you require versioning in the remote_state
block, but the underlying state bucket doesn’t have versioning enabled, Terragrunt will automatically turn on versioning
on the bucket to match the configuration.
If you do not want terragrunt to automatically apply changes, you can configure the following:
remote_state { # ... other args omitted for brevity ... config = { # ... other config omitted for brevity ... disable_bucket_update = true }}Check out the terragrunt-infrastructure-modules-example and terragrunt-infrastructure-live-example repos for fully-working sample code that demonstrates how to use Terragrunt to manage remote state.
S3-specific remote state settings
Section titled “S3-specific remote state settings”For the s3 backend, the following config options can be used for S3-compatible object stores, as necessary:
Note: The skip_bucket_accesslogging is now DEPRECATED. It is replaced by accesslogging_bucket_name. Please read below for more details on when to use the new config option.
remote_state { # ...
config = { skip_bucket_versioning = true # use only if the object store does not support versioning skip_bucket_ssencryption = true # use only if non-encrypted OpenTofu/Terraform State is required and/or the object store does not support server-side encryption skip_bucket_root_access = true # use only if the AWS account root user should not have access to the remote state bucket for some reason skip_bucket_enforced_tls = true # use only if you need to access the S3 bucket without TLS being enforced skip_credentials_validation = true # skip validation of AWS credentials, useful when is used S3 compatible object store different from AWS enable_lock_table_ssencryption = true # use only if non-encrypted DynamoDB Lock Table for the OpenTofu/Terraform State is required and/or the NoSQL database service does not support server-side encryption accesslogging_bucket_name = <string> # use only if you need server access logging to be enabled for your terraform state S3 bucket. Provide a <string> value representing the name of the target bucket to be used for logs output. accesslogging_target_prefix = <string> # use only if you want to set a specific prefix for your terraform state S3 bucket access logs when Server Access Logging is enabled. Provide a <string> value representing the TargetPrefix to be used for the logs output objects. If set to empty <string>, then TargetPrefix will be set to empty <string>. If attribute is not provided at all, then TargetPrefix will be set to default value `TFStateLogs/`.
shared_credentials_file = "/path/to/credentials/file" skip_metadata_api_check = true force_path_style = true }}If you experience an error for any of these configurations, confirm you are using OpenTofu or Terraform v0.12.2 or greater.
Further, the config options s3_bucket_tags, dynamodb_table_tags, accesslogging_bucket_tags, skip_bucket_versioning, skip_bucket_ssencryption, skip_bucket_root_access, skip_bucket_enforced_tls, skip_bucket_public_access_blocking, accesslogging_bucket_name, accesslogging_target_prefix, and enable_lock_table_ssencryption are only valid for backend s3. They are used by terragrunt and are not passed on to OpenTofu/Terraform. See section Create remote state resources automatically
GCS-specific remote state settings
Section titled “GCS-specific remote state settings”For the gcs backend, the following config options can be used for GCS-compatible object stores, as necessary:
remote_state { # ...
skip_bucket_versioning = true # use only if the object store does not support versioning
enable_bucket_policy_only = false # use only if uniform bucket-level access is needed (https://cloud.google.com/storage/docs/uniform-bucket-level-access)
encryption_key = "GOOGLE_ENCRYPTION_KEY"}If you experience an error for any of these configurations, confirm you are using Terraform v0.12.0 or greater.
Further, the config options gcs_bucket_labels, skip_bucket_versioning and enable_bucket_policy_only are only valid for the backend gcs. They are used by Terragrunt and are not passed on to OpenTofu/Terraform. See “Create remote state resources automatically” for more details.
Further reading
Section titled “Further reading”Managing your remote state like this is really valuable when you organize your units into a stack.
Reading about those concepts will help you understand how to organize your infrastructure such that different units stored in isolated state can interact with each other.