7 Terra and Google Cloud

7.1 Introduction to Terra and Google Cloud

7.1.1 General overview

7.1.2 Your account

7.1.3 The billing project

7.1.4 SQL queries

7.2 Using the Terra platform

7.2.1 General structure

7.2.1.1 Workspaces

7.2.1.2 Dashboard

7.2.1.3 Data

7.2.1.4 Notebooks

7.2.1.5 Workflows

7.2.1.5.1 Why use workflows

In comparison to Terra Notebooks in which downstream analysis/processing takes place, the Terra workflow is for doing the heavy lifting aspects of your work. For example, if you need to use samtools to filter 1000 .bam files for properly paired reads, you would use a workflow to do this. Workflows are flexible and as well as the 1000s that are available via the Broad Institute’s ‘Broad Methods Repository’, you can formulate your own pipelines using the intuitive Workflow Description Language (WDL). Implementing your chosen/custom workflow is made relatively simple by integration with the Terra data table structure, making it easy to define sample sets and other inputs, as well as being able to assign computing resources and run processes in parallel.

In this section, I am not going to try to re-write the excellent Broad Institute -provided tutorials, but instead outline how you might approach using particular elements of Terra workflows, pointing to relevant tutorials along the way.

7.2.1.5.2 The workflow view
The workflow tab

Figure 7.1: The workflow tab

Here you can see what the workflow tab looks like. This particular workspace has a collection of workflows, some of which are custom, others are imported from a workflow repository.

To find or create a new workflow, click on ‘Find a Workflow’. A pop-up will suggest some popular workflows, or there is the option to ‘Find additional workflows’. To search for a workflow, select the relevant repository and import the workflow into your workspace. To generate your own, click on ‘Broad Methods Repository’. You may need to sign in with your Google account here. Once you have done this, you will see the option to ‘Create New Method’ in the top right-hand corner. For more detail on how to create your own workflows, see this Terra tutorial. Here, the process of workflow generation is covered, but building the actual WDL script that underlies it is not. I will link to some resources for this below.

Returning to the workflow view, to use a workflow, simply select it’s tile and you will be taken to the input view.

7.2.2 Get help

7.3 Introduction to the AMP-PD data