Welcome to Viash!

Viash is your go-to script wrapper for building data pipelines from modular software components. All you need is your trusty script and a metadata file to embark on this journey.

Check out some of Viash’s key features:

Requirements

This guide assumes you’ve already installed Viash, Docker and Nextflow.

Quickstart example project

To get up and running fast, we provide a template project for you to use. It contains three components from the same package as well as 2 custom operators from vsh-pipeline-operators, which are combined into a Nextflow pipeline as follows:

graph TD
   A(file?.tsv) --> X[vsh_flatten] 
   X --file1.tsv--> B1[/remove_comments/] --> C1[/take_column/] --> Y
   X --file2.tsv--> B2[/remove_comments/] --> C2[/take_column/] --> Y
   Y[vsh_toList] --> D[/combine_columns/]
   D --> E(output)

This pipeline takes one or more TSV files as input and stores its output in an output folder.

Step 1: Get the template

First create a new repository by clicking the “Use this template” button. If you can’t see the “Use this template” button, log into GitHub first.

Next, clone the repository using the following command.

git clone https://github.com/youruser/my_first_pipeline.git && cd my_first_pipeline

Your new repository should contain the following files:

tree my_first_pipeline
.
├── CHANGELOG.md
├── LICENSE.md
├── README.md
├── README.qmd
├── _viash.yaml
├── main.nf
├── resources_test
│   ├── file1.tsv
│   └── file2.tsv
└── src
    └── template
        ├── combine_columns
        │   ├── config.vsh.yaml
        │   └── script.R
        ├── remove_comments
        │   ├── config.vsh.yaml
        │   ├── script.sh
        │   └── test.sh
        ├── take_column
        │   ├── config.vsh.yaml
        │   └── script.py
        └── workflow
            ├── config.vsh.yaml
            ├── main.nf
            └── nextflow.config

Step 2: Build the Viash components

With Viash you can turn the components in src/ into Dockerized Nextflow modules by running:

viash ns build --setup cachedbuild --parallel
Output
Exporting combine_columns (template) =nextflow=> target/nextflow/template/combine_columns
Exporting combine_columns (template) =executable=> target/executable/template/combine_columns
Exporting take_column (template) =executable=> target/executable/template/take_column
Exporting remove_comments (template) =nextflow=> target/nextflow/template/remove_comments
[notice] Building container 'ghcr.io/viash-io/project_template/template/take_column:0.3.0' with Dockerfile
[notice] Building container 'ghcr.io/viash-io/project_template/template/combine_columns:0.3.0' with Dockerfile
Exporting workflow (template) =nextflow=> target/nextflow/template/workflow
Exporting remove_comments (template) =executable=> target/executable/template/remove_comments
[notice] Building container 'ghcr.io/viash-io/project_template/template/remove_comments:0.3.0' with Dockerfile
Exporting take_column (template) =nextflow=> target/nextflow/template/take_column
All 7 configs built successfully

This command not only transforms the Viash components in src/ to Nextflow modules but it also builds the containers when appropriate (starting from the Docker cache when available using the cachedbuild argument). Once everything is built, a new target directory has been created containing the executables and modules grouped per platform:

ls -l
Output
total 80
-rw-r--r-- 1 runner docker  1482 Jan 16 10:45 CHANGELOG.md
-rw-r--r-- 1 runner docker 32219 Jan 16 10:45 LICENSE.md
-rw-r--r-- 1 runner docker 12142 Jan 16 10:45 README.md
-rw-r--r-- 1 runner docker  7840 Jan 16 10:45 README.qmd
-rw-r--r-- 1 runner docker   512 Jan 16 10:46 _viash.yaml
-rw-r--r-- 1 runner docker   245 Jan 16 10:45 main.nf
-rw-r--r-- 1 runner docker   222 Jan 16 10:45 nextflow.config
drwxr-xr-x 2 runner docker  4096 Jan 16 10:45 resources_test
drwxr-xr-x 3 runner docker  4096 Jan 16 10:45 src
drwxr-xr-x 4 runner docker  4096 Jan 16 10:45 target

Step 3: Run the pipeline

Now run the pipeline with Nextflow:

nextflow run . \
  -main-script target/nextflow/template/workflow/main.nf \
  -with-docker \
  --input resources_test/file*.tsv \
  --publish_dir output
Output
N E X T F L O W  ~  version 24.10.3
Launching `target/nextflow/template/workflow/main.nf` [dreamy_gutenberg] DSL2 - revision: 45d6e681b4
[56/e9030b] Submitted process > workflow:run_wf:remove_comments:processWf:remove_comments_process (run)
[f3/c4ef6e] Submitted process > workflow:run_wf:take_column:processWf:take_column_process (run)
[79/a0d629] Submitted process > workflow:run_wf:combine_columns:processWf:combine_columns_process (combined)
[c4/655c6c] Submitted process > workflow:publishStatesSimpleWf:publishStatesProc (combined)

This will run the different stages of the workflow , with the final result result being stored in a file named combined.workflow.output.tsv in the output directory output:

cat output/combined.workflow.output.tsv
Output
"1" 0.11
"2" 0.23
"3" 0.35
"4" 0.47

What’s next?

Congratulations, you’ve reached the end of this quickstart tutorial, and we’re excited for you to delve deeper into the world of Viash! Our comprehensive guide and reference documentation is here to help you explore various topics, such as:

So, get ready to enhance your skills and create outstanding solutions with Viash!