Commit Graph

12 Commits

Author SHA1 Message Date
Lennart Kats d68d054160 Merge remote-tracking branch 'databricks/main' into extend-deployment-modes 2023-07-15 16:08:44 +02:00
Lennart Kats (databricks) 57e75d3e22
Add development runs (#522)
This implements the "development run" functionality that we desire for DABs in the workspace / IDE.

## bundle.yml changes

In bundle.yml, there should be a "dev" environment that is marked as
`mode: debug`:
```
environments:
  dev:
    default: true
    mode: development # future accepted values might include pull_request, production
```

Setting `mode` to `development` indicates that this environment is used
just for running things for development. This results in several changes
to deployed assets:
* All assets will get '[dev]' in their name and will get a 'dev' tag
* All assets will be hidden from the list of assets (future work; e.g.
for jobs we would have a special job_type that hides it from the list)
* All deployed assets will be ephemeral (future work, we need some form
of garbage collection)
* Pipelines will be marked as 'development: true'
* Jobs can run on development compute through the `--compute` parameter
in the CLI
* Jobs get their schedule / triggers paused
* Jobs get concurrent runs (it's really annoying if your runs get
skipped because the last run was still in progress)

Other accepted values for `mode` are `default` (which does nothing) and
`pull-request` (which is reserved for future use).

## CLI changes

To run a single job called "shark_sighting" on existing compute, use the
following commands:
```
$ databricks bundle deploy --compute 0617-201942-9yd9g8ix
$ databricks bundle run shark_sighting
```

which would deploy and run a job called "[dev] shark_sightings" on the
compute provided. Note that `--compute` is not accepted in production
environments, so we show an error if `mode: development` is not used.

The `run --deploy` command offers a convenient shorthand for the common
combination of deploying & running:
```
$ export DATABRICKS_COMPUTE=0617-201942-9yd9g8ix
$ bundle run --deploy shark_sightings
```
The `--deploy` addition isn't really essential and I welcome feedback 🤔
I played with the idea of a "debug" or "dev" command but that seemed to
only make the option space even broader for users. The above could work
well with an IDE or workspace that automatically sets the target
compute.

One more thing I added is`run --no-wait` can now be used to run
something without waiting for it to be completed (useful for IDE-like
environments that can display progress themselves).
```
$ bundle run --deploy shark_sightings --no-wait
```
2023-07-12 08:51:54 +02:00
Lennart Kats f59e73a026 WIP 2023-07-10 09:21:14 +02:00
Lennart Kats 368796ba12 WIP 2023-07-10 09:12:50 +02:00
Lennart Kats 54a66c0120 Fix paths on Windows 2023-07-07 18:04:28 +02:00
Lennart Kats cbd306825b Support experiment names that use full paths 2023-07-07 11:49:02 +02:00
Lennart Kats 6b221c02e5 Cleanup 2023-07-07 11:12:14 +02:00
Lennart Kats 50b76bb41e Cleanup 2023-07-07 11:09:09 +02:00
Lennart Kats 4762716f73 Rename "debug" to "development" 2023-07-03 16:30:42 +02:00
Lennart Kats 7c654ec417 Pause schedule for debug jobs 2023-06-20 11:21:33 +02:00
Lennart Kats 823c868d39 Increase max concurrent runs for debug runs 2023-06-18 17:40:30 +02:00
Lennart Kats 48d6df3dfa Add support for "debug runs"
- Add "mode: debug" property for environments
- Add "--deploy", "--compute", "--no-wait" CLI flags
2023-06-18 16:50:20 +02:00