`task` destroy, need of being able to specify output folder #307

DavidGOrtega · 2021-11-25T15:21:09Z

If I launch a cluster of tasks in the same workdir but with different environment variables or scripts, I would like to be able to recover the results of each execution in a different folder. The problem right now is that directory is used both for the initial sync and for the final output.
We need to be able to specify a directoryOutput, aligned with directory by default, to prevent these outputs from being overwritten.

resource "iterative_task" "task1" {
  cloud     = "aws"
  machine   = "t2.micro"
  spot      = 0
  name      = "task1"
  directory = "."
  region    = "us-west"

  script = <<-END
    #!/bin/bash
    echo 'task1' >> report.md
  END
}

resource "iterative_task" "task2" {
  cloud     = "aws"
  machine   = "t2.micro"
  spot      = 0
  name      = "task2"
  directory = "."
  region    = "us-west"

  script = <<-END
    #!/bin/bash
    echo 'task2' >> report.md
  END
}

If I destroy the example above, I end up with just a mess in the current folder. We might want:

resource "iterative_task" "task1" {
  cloud     = "aws"
  machine   = "t2.micro"
  spot      = 0
  name      = "task1"
  directory = "."
  output    = "./task1"
  region    = "us-west"

  script = <<-END
    #!/bin/bash
    echo 'task1' >> report.md
  END
}

resource "iterative_task" "task2" {
  cloud     = "aws"
  machine   = "t2.micro"
  spot      = 0
  name      = "task2"
  directory = "."
  output    = "./task2"
  region    = "us-west"

  script = <<-END
    #!/bin/bash
    echo 'task2' >> report.md
  END
}

Note the output property.


0x2b3bfa0 · 2021-11-25T15:28:57Z

This probably belongs to an epic about parallel training. With the current behavior, users can solve that by writing the result of each machine to a different path:

resource "iterative_task" "task" {
  name        = "example"
  cloud       = "aws"
  parallelism = 4

  script = <<-END
    #!/bin/bash
    date >> result-$(uuidgen)
  END
}

0x2b3bfa0 · 2021-11-25T15:31:02Z

Machines have no means of knowing whether they're resuming an interrupted task or starting a new one. Without implementing some sort of leader election and task splitting mechanism, there won't be a 1:1 mapping between parallelism and the number of outputs.

DavidGOrtega · 2021-11-25T15:33:03Z

I'm not talking about training in parallel, just about launching the same task with different parameters within the same Terraform file.
It has nothing to do with parallelism.

DavidGOrtega · 2021-11-25T15:34:43Z

I'm updating the issue with a Terraform example.

0x2b3bfa0 · 2021-11-26T04:04:53Z

Thanks for the clarification! 🙏🏼 Still, it looks like my first reply is relevant: you can avoid overwriting artifacts by naming them differently on each task.

Nevertheless, there is a deeper problem with the current approach: running terraform destroy will overwrite all the files in the given directory, and destroying several tasks simultaneously will produce the errors described in #306.

0x2b3bfa0 · 2021-11-26T04:12:01Z

We should probably consider having separate directories for input and output:

Schema

Attributes

resource "iterative_task" "task" {
  name        = "example"
  cloud       = "aws"

  input_directory  = "."
  output_directory = "./output"

  script = <<-END
    #!/bin/bash
    date >> result
  END
}

Block

resource "iterative_task" "task" {
  name        = "example"
  cloud       = "aws"

  script = <<-END
    #!/bin/bash
    date >> result
  END

  directories {
    input  = "."
    output = "./output"
  }
}

Behavior

Droste effect prevention

If the input directory contains the output directory, the latter should be excluded from the upload.
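One way to detect the nested case is sketched below, assuming both directories are plain local paths. The helper name nestedOutput is hypothetical and is not part of the provider; it only shows how the relative path to exclude from the upload could be computed.

```go
package main

import (
	"fmt"
	"path/filepath"
	"strings"
)

// nestedOutput reports whether output lies strictly inside input and, if so,
// returns its path relative to input, which an uploader could use as an
// exclusion pattern. Hypothetical helper, not part of the provider.
func nestedOutput(input, output string) (string, bool) {
	absInput, err := filepath.Abs(input)
	if err != nil {
		return "", false
	}
	absOutput, err := filepath.Abs(output)
	if err != nil {
		return "", false
	}
	rel, err := filepath.Rel(absInput, absOutput)
	if err != nil {
		return "", false
	}
	// "." means both paths are the same directory; a ".." prefix means the
	// output lives outside the input, so there is nothing to exclude.
	if rel == "." || rel == ".." || strings.HasPrefix(rel, ".."+string(filepath.Separator)) {
		return "", false
	}
	return rel, true
}

func main() {
	rel, nested := nestedOutput(".", "./output")
	fmt.Println(rel, nested)
}
```

The sketch deliberately treats identical input and output directories as "not nested", since excluding the whole input would leave nothing to upload; how that case should behave is left open in the proposal.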

Input exclusion

The output directory should only contain the produced artifacts and, perhaps, the task/machine logs.
The output directory should not contain the files on the input directory.

Existence

The output directory will be automatically created if it doesn't exist.
The input directory should exist; otherwise, an error will be displayed.
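The existence rules above could be checked along these lines. This is a minimal sketch under the assumption that both directories are local paths; prepareDirectories is a hypothetical name, and the error wording is illustrative only.

```go
package main

import (
	"fmt"
	"os"
	"path/filepath"
)

// prepareDirectories validates the input directory and creates the output
// directory when it is missing. Hypothetical helper, not part of the provider.
func prepareDirectories(input, output string) error {
	// The input directory must already exist and must be a directory.
	info, err := os.Stat(input)
	if err != nil {
		return fmt.Errorf("input directory %q: %w", input, err)
	}
	if !info.IsDir() {
		return fmt.Errorf("input path %q is not a directory", input)
	}
	// The output directory is created on demand, including missing parents.
	if err := os.MkdirAll(output, 0o755); err != nil {
		return fmt.Errorf("output directory %q: %w", output, err)
	}
	return nil
}

func main() {
	// Work inside a temporary directory so the example leaves nothing behind.
	base, err := os.MkdirTemp("", "task-example-")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	defer os.RemoveAll(base)

	err = prepareDirectories(base, filepath.Join(base, "output"))
	fmt.Println("error:", err)
}
```

The input is checked before anything is created, so a missing input directory fails without side effects.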

Example

The output directory should have subdirectories with the Long task name, each of them containing the output for a given task.

./
├── output/
│   ├── tpi-first-a1b2-c3d4.log
│   ├── tpi-first-a1b2-c3d4/
│   │   └── result.model
│   ├── tpi-second-1a2b-3c4d.log
│   └── tpi-second-1a2b-3c4d/
│       └── result.model
├── main.tf
└── train.py
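The layout above could be derived from the output directory and the task identifier roughly as follows. This is only a sketch: taskOutputPaths is a hypothetical helper, and the identifier is taken from the example tree rather than generated.

```go
package main

import (
	"fmt"
	"path/filepath"
)

// taskOutputPaths returns the per-task artifact directory and log file inside
// the shared output directory. Hypothetical helper, not part of the provider.
func taskOutputPaths(outputDir, taskID string) (dir string, logFile string) {
	dir = filepath.Join(outputDir, taskID)
	logFile = filepath.Join(outputDir, taskID+".log")
	return dir, logFile
}

func main() {
	// Identifiers copied from the example tree above.
	for _, id := range []string{"tpi-first-a1b2-c3d4", "tpi-second-1a2b-3c4d"} {
		dir, logFile := taskOutputPaths("output", id)
		fmt.Println(dir, logFile)
	}
}
```

Because every task writes under its own identifier, two tasks sharing the same output directory would not overwrite each other's artifacts.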

DavidGOrtega · 2021-11-26T14:33:40Z

directories {
  input  = "."
  output = "./output"
}

Love this idea.
As I said in other issues, I can recover a bunch of tasks by changing the directory and running apply, and then the output folder will be the one I gave. However, to make it work I first have to remove the tfstate, then apply, and then destroy... not very user-friendly.

0x2b3bfa0 · 2021-11-29T18:26:09Z

Regarding user experience, we may want to set ForceNew: false for the directory attribute in the schema, so it can be updated locally.

terraform-provider-iterative/iterative/resource_task.go

Lines 105 to 110 in fa9c7f8

    "directory": {
        Type:     schema.TypeString,
        ForceNew: true,
        Optional: true,
        Default:  "",
    },

DavidGOrtega · 2021-12-27T19:28:19Z

I have picked this up, @0x2b3bfa0.

DavidGOrtega changed the title ~~task destroy, need of being able to autogenerate subfolders to sync data~~ task destroy, need of being able to specify output folder Nov 25, 2021

DavidGOrtega added p1-important High priority resource-task iterative_task TF resource labels Nov 25, 2021

DavidGOrtega mentioned this issue Nov 25, 2021

task destroy multiple tasks may end up in transfer errors that makes impossible the task destruction #306

Closed

0x2b3bfa0 mentioned this issue Nov 26, 2021

task: optionally skip downloading artefacts on destroy #303

Closed

DavidGOrtega mentioned this issue Nov 26, 2021

task recovering two tasks specifying a different folder ends in error #300

Closed

casperdcl added the enhancement New feature or request label Nov 26, 2021

casperdcl mentioned this issue Nov 26, 2021

task make script field optional #305

Closed

DavidGOrtega mentioned this issue Dec 3, 2021

EPIC - task release #326

Closed

12 tasks

0x2b3bfa0 mentioned this issue Dec 7, 2021

task bucket usage vs "directory" within a bucket #299

Closed

DavidGOrtega self-assigned this Dec 27, 2021

DavidGOrtega mentioned this issue Dec 29, 2021

Task directory_out #340

Merged

DavidGOrtega closed this as completed in #340 Jan 10, 2022

casperdcl mentioned this issue Mar 8, 2022

task: make workdir.output more intuitive #414

Closed
