databricks-cli

Commit Graph

Author	SHA1	Message	Date
Pieter Noordhuis	e13fae7a1d	Update comment	2024-10-18 16:21:52 +02:00
Pieter Noordhuis	3201821a62	Don't double-index	2024-10-18 16:21:39 +02:00
Pieter Noordhuis	1b51c0cf0e	Update run_as tests for dashboards	2024-10-18 15:48:27 +02:00
Pieter Noordhuis	5fb35e358f	Add dashboards to summary output	2024-10-18 15:48:10 +02:00
Pieter Noordhuis	43f9155de5	Merge remote-tracking branch 'origin/main' into dashboards	2024-10-18 15:35:08 +02:00
Lennart Kats (databricks)	c5043c3d9d	Add `bundle summary` to display URLs for deployed resources (#1731 ) ## Changes Adds a textual output to the `databricks bundle summary` command, which includes URLs of deployed resources. Example usage: ``` $ databricks bundle summary Name: my_pipeline Target: dev Workspace: Host: https://domain.databricks.com User: user@databricks.com Path: /Users/user@databricks.com/.bundle/my_pipeline/dev Resources: Jobs: my_project_job: Name: [dev lennart] my_project_job URL: https://domain.databricks.com/jobs/206899209187287?o=6051921418418893 Pipelines: my_project_pipeline: Name: [dev lennart] my_project_pipeline URL: https://domain.databricks.com/pipelines/3f849fd5-ba7d-47fa-a34c-c6bf034b4f58?o=6051921418418893 ``` Notes: * The top headers of the output are the same as those from the existing `bundle validate` command * URLs are colored light blue in the output * For resources that haven't been deployed yet, we show `(not deployed)` in place of the URL --------- Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com> Co-authored-by: Pieter Noordhuis <pcnoordhuis@gmail.com>	2024-10-18 06:45:47 +00:00
Pieter Noordhuis	3270afaff4	Move utility functions dealing with IAM to libs/iamutil (#1820 ) ## Changes The two functions `GetShortUserName` and `IsServicePrincipal` are unrelated to auth or the purpose of the auth package. This change moves them into their own package and updates `IsServicePrincipal` to take an `*iam.User` argument instead of a string username. ## Tests Tests pass.	2024-10-10 13:02:25 +00:00
Lennart Kats (databricks)	e885794722	Show actionable errors for collaborative deployment scenarios (#1386 ) ## Changes This adds diagnostics for collaborative (production) deployment scenarios, including: - Bob deploys a bundle that is normally deployed by Alice, but this fails because Bob can't write to `/Users/Alice/.bundle`. - Charlie deploys a bundle that is normally deployed by Alice, but this fails because he can't create a new pipeline where Alice would be the owner. - Alice deploys a bundle where she didn't list herself as one of the CAN_MANAGE users in permissions. That can work, but is probably a mistake. ## Tests Unit tests, manual testing.	2024-10-10 11:18:23 +00:00
shreyas-goenka	bca9c2eda4	Add validation for files with a `.(resource-name).yml` extension (#1780 ) ## Changes We want to encourage a pattern of specifying only a single resource in a YAML file when the `.(resource-type).yml` extension is used (for example, `.job.yml`). This convention could allow us to bijectively map a resource YAML file to its corresponding resource in the Databricks workspace. This PR: 1. Emits a recommendation diagnostic when we detect this convention is being violated. We can promote this to a warning when we want to encourage this pattern more strongly. 2. Visualises the recommendation diagnostics in the `bundle validate` command. NOTE: While this PR also shows the recommendation for `.yaml` files, we do not encourage users to use this extension. We only support it here since it's part of the YAML standard and some existing users might already be using `.yaml`. ## Tests Unit tests and manually. Here's what an example output looks like: ``` Recommendation: define a single job in a file with the .job.yml extension. at resources.jobs.bar resources.jobs.foo in foo.job.yml:13:7 foo.job.yml:5:7 The following resources are defined or configured in this file: - bar (job) - foo (job) ``` --------- Co-authored-by: Lennart Kats (databricks) <lennart.kats@databricks.com>	2024-10-07 09:16:20 +00:00
Andrew Nester	a8cff48c0b	Always prepend bundle remote paths with /Workspace (#1724 ) ## Changes Due to platform changes, all paths to libraries, notebooks, and similar files used in Databricks must start with either the /Workspace or the /Volumes prefix. This PR makes sure that all bundle paths are correctly prefixed. Note: this is a breaking change if the user previously configured and used a `/Workspace/Workspace` folder in their workspace file system, or has a `/Workspace/${workspace.root_path}...` pattern configured anywhere in their bundle config. Fixes: #1751 AI: - [x] Scan DABs config and error out on `/Workspace/${workspace.root_path}...` pattern usage ## Tests Added unit tests --------- Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com>	2024-10-02 15:34:00 +00:00
Pieter Noordhuis	80d55f4540	Add resource path field to bundle workspace configuration (#1800 ) ## Changes Default workspace path for resources with a presence in the workspace tree. Note: this path is not created automatically (yet). We need this only for dashboards (so far), so can take care of creation if one or more dashboards are part of a deployment. This saves an API call for deployments where this is not necessary. ## Tests Expanded existing tests.	2024-10-02 13:55:40 +00:00
Pieter Noordhuis	b63e94366a	Rename mutator	2024-10-01 11:19:43 -07:00
Pieter Noordhuis	0301854a43	Comment	2024-10-01 10:47:20 -07:00
Pieter Noordhuis	8186da8e67	Simplify	2024-10-01 10:47:08 -07:00
Pieter Noordhuis	a2a794e5fa	Configure the default parent path for dashboards	2024-10-01 10:46:53 -07:00
Pieter Noordhuis	0f22ec6116	Remove generate changes from this branch	2024-10-01 04:51:50 -07:00
Pieter Noordhuis	c123cca275	Merge branch 'workspace-resource-path' into dashboards	2024-10-01 04:49:53 -07:00
Pieter Noordhuis	08f7f3b6b7	Add resource path field to bundle workspace configuration	2024-09-30 14:29:16 +02:00
Pieter Noordhuis	802be90687	Coverage for conversion logic	2024-09-30 11:54:08 +02:00
Lennart Kats (databricks)	da3b4f7c72	Fix panic in `apply_presets.go` (#1796 ) ## Changes This fixes the user-reported panic in `apply_presets.go`. I'm still unsure how to reproduce this, since the CLI just reports `ob broken_job is not defined` when I try to use `bundle deploy` with an empty job. That said — we may as well be defensive here and I see we have lots of checks for empty job/cluster/etc. settings scattered throughout our code base so at least we're somewhat consistent.	2024-09-29 14:08:10 +00:00
Pieter Noordhuis	3a1d92c75c	Comments	2024-09-27 16:39:51 +02:00
Pieter Noordhuis	ff15a046fc	Merge remote-tracking branch 'origin/main' into dashboards	2024-09-27 14:58:53 +02:00
Pieter Noordhuis	7a9355c02c	Merge remote-tracking branch 'origin/main' into dashboards	2024-09-27 14:58:37 +02:00
Pieter Noordhuis	1d1aa0a416	Rename `RootPath` -> `BundleRootPath` (#1792 ) ## Changes After introducing the `SyncRootPath` field on the bundle (#1694), the previous `RootPath` became ambiguous. Does it mean the bundle root path or the sync root path? This PR renames the field to `BundleRootPath` to remove the ambiguity. ## Tests n/a --------- Co-authored-by: shreyas-goenka <88374338+shreyas-goenka@users.noreply.github.com>	2024-09-27 10:03:05 +00:00
Pieter Noordhuis	56cd96cb93	Move trampoline code into trampoline package (#1793 ) ## Changes Doing this to make room for PyDABs under `bundle/python`. ## Tests n/a	2024-09-27 09:32:54 +00:00
Pieter Noordhuis	a1dca56abf	Trim trailing whitespace (#1794 ) ## Changes Trailing whitespace is trimmed per the VS Code settings for this repository. ## Tests n/a	2024-09-27 09:30:39 +00:00
Andrew Nester	66f2ba64a8	Simplified isFullVariableOverrideDef implementation (#1791 ) ## Changes Simplified isFullVariableOverrideDef implementation Follow up on https://github.com/databricks/cli/pull/1787	2024-09-26 12:55:07 +00:00
shreyas-goenka	495040e4cd	Modify SetLocation test utility to take full locations as argument (#1788 ) I plan to use this in https://github.com/databricks/cli/pull/1780, to set the line and column numbers as well for the locations. gopatch file used: ``` @@ var x expression var y expression var z expression @@ -bundletest.SetLocation(x, y, z) +bundletest.SetLocation(x, y, []dyn.Location{{File: z}}) ```	2024-09-25 16:13:48 +00:00
Andrew Nester	b3a3071086	Fixed full variable override detection (#1787 ) ## Changes Fixes #1786 ## Tests All valid override combinations are added as test cases	2024-09-25 12:35:16 +00:00
Gleb Kanterov	3d9decdda9	Add JobTaskClusterSpec validate mutator (#1784 ) ## Changes Add JobTaskClusterSpec validate mutator. It catches the case when tasks don't specify which cluster to use. For example, we can get this error with minor modifications to `default-python` template: ```yaml tasks: - task_key: python_file_task spark_python_task: python_file: ../src/my_project_10/main.py ``` ``` % databricks bundle validate Error: Missing required cluster or environment settings at resources.jobs.my_project_10_job.tasks[0] in resources/my_project_10_job.yml:17:11 Task "print_github_stars" requires a cluster or an environment to run. Specify one of the following fields: job_cluster_key, environment_key, existing_cluster_id, new_cluster. ``` We implicitly rely on "one of" validation, which does not exist. Many bundle fields can't co-exist, for instance, specifying: `JobTask.{existing_cluster_id,job_cluster_key}`, `Library.{whl,pypi}`, `JobTask.{notebook_task,python_wheel_task}`, etc. ## Tests Unit tests --------- Co-authored-by: Pieter Noordhuis <pcnoordhuis@gmail.com>	2024-09-25 11:30:14 +00:00
Gleb Kanterov	490259a14a	Refactor jobs path translation (#1782 ) ## Changes Extract package for other modules to transform different kinds of paths in job resources. ## Tests Unit tests	2024-09-24 13:51:54 +00:00
Andrew Nester	56ed9bebf3	Added support for creating all-purpose clusters (#1698 ) ## Changes Added support for creating all-purpose clusters Example of configuration ``` bundle: name: clusters resources: clusters: test_cluster: cluster_name: "Test Cluster" num_workers: 2 node_type_id: "i3.xlarge" autoscale: min_workers: 2 max_workers: 7 spark_version: "13.3.x-scala2.12" spark_conf: "spark.executor.memory": "2g" jobs: test_job: name: "Test Job" tasks: - task_key: test_task existing_cluster_id: ${resources.clusters.test_cluster.id} notebook_task: notebook_path: "./src/test.py" targets: development: mode: development compute_id: ${resources.clusters.test_cluster.id} ``` ## Tests Added unit, config and E2E tests	2024-09-23 10:42:34 +00:00
Andrew Nester	bcab6ca37b	Fixed detecting full syntax variable override which includes type field (#1775 ) ## Changes Fixes #1773 ## Tests Confirmed manually	2024-09-18 10:23:07 +00:00
Lennart Kats (databricks)	e220f9ddd6	Use the friendly name of service principals when shortening their name (#1770 ) ## Summary Use the friendly name of service principals when shortening their name. This change is helpful for the prefix in development mode. Instead of adding a prefix like `[dev 1706906c-c0a2-4c25-9f57-3a7aa3cb8123]`, we'll prefix like `[dev my_principal]`.	2024-09-16 18:35:07 +00:00
Pieter Noordhuis	6dadf667b5	Merge remote-tracking branch 'origin/main' into dashboards	2024-09-12 16:03:58 +02:00
Andrew Nester	66307134c1	Fixed generated YAML missing 'default' for empty values (#1765 ) ## Changes Fixed generated YAML missing 'default' for empty values ## Tests Added unit test	2024-09-11 09:49:58 +00:00
shreyas-goenka	5d2c0e3885	Alias variables block in the `Target` struct (#1748 ) ## Changes This PR aliases and overrides the schema associated with the variables block in `target` to allow for directly specifying a variable value in the JSON schema (without any levels of nesting). This is needed because this direct value is resolved by dynamically parsing the configuration tree. `ca6332a5a4/bundle/config/root.go (L424)` ## Tests Existing unit tests.	2024-09-10 14:49:34 +00:00
Pieter Noordhuis	7403101d59	Merge remote-tracking branch 'origin/main' into dashboards	2024-09-09 16:48:23 +02:00
Pieter Noordhuis	b7a952d22e	wip	2024-09-06 14:12:59 +02:00
Andrew Nester	02e83877f4	Added listing cluster filtering for cluster lookups (#1754 ) ## Changes We added a custom resolver for the cluster to add filtering for the cluster source when we list all clusters. Without the filtering, listing could take a very long time (5-10 mins), which leads to lookup timeouts. ## Tests Existing unit tests passing	2024-09-06 11:34:57 +00:00
Pieter Noordhuis	ceefa80d72	Pass copy of `dyn.Path` to callback function (#1747 ) ## Changes Some call sites hold on to the `dyn.Path` provided to them by the callback. It must therefore never be mutated after the callback returns, or these mutations leak out into unknown scope. This change means it is no longer possible for this failure mode to happen. ## Tests Unit test.	2024-09-05 11:05:16 +00:00
Andrew Nester	72030844c5	Fixed variable override in target with full variable syntax (#1749 ) ## Changes This PR makes sure that both of these override syntaxes for variables work correctly ``` targets: dev: variables: cluster1: spark_version: "14.2.x-scala2.11" node_type_id: "Standard_DS3_v2" num_workers: 4 spark_conf: spark.speculation: false spark.databricks.delta.retentionDurationCheck.enabled: false cluster2: default: spark_version: "14.2.x-scala2.11" node_type_id: "Standard_DS3_v2" num_workers: 4 spark_conf: spark.speculation: false spark.databricks.delta.retentionDurationCheck.enabled: false ``` ## Tests Added regression test --------- Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com>	2024-09-04 17:16:40 +00:00
Andrew Nester	ca6332a5a4	Fixed complex variables are not being correctly merged from include files (#1746 ) ## Changes Fixes an `Error: no value assigned to required variable <variable>.` when the main complex variable definition is in one file but the target override is defined in a separate file that is included in the main one. ## Tests Added regression test	2024-09-04 11:24:55 +00:00
Pieter Noordhuis	3461c66dc9	Add DABs support for AI/BI dashboards	2024-09-03 10:35:41 +02:00
Gleb Kanterov	ed448815b4	PythonMutator: explain missing package error (#1736 ) ## Changes Explain the error when the `databricks-pydabs` package is not installed or the Python environment isn't correctly activated. Example output: ``` Error: python mutator process failed: ".venv/bin/python3 -m databricks.bundles.build --phase load --input .../input.json --output .../output.json --diagnostics .../diagnostics.json: exit status 1", use --debug to enable logging .../.venv/bin/python3: Error while finding module specification for 'databricks.bundles.build' (ModuleNotFoundError: No module named 'databricks') Explanation: 'databricks-pydabs' library is not installed in the Python environment. If using Python wheels, ensure that 'databricks-pydabs' is included in the dependencies, and that the wheel is installed in the Python environment: $ .venv/bin/pip install -e . If using a virtual environment, ensure it is specified as the venv_path property in databricks.yml, or activate the environment before running CLI commands: experimental: pydabs: venv_path: .venv ``` ## Tests Unit tests	2024-09-02 09:49:30 +00:00
Andrew Nester	582558cac2	Do not suppress normalisation diagnostics for resolving variables (#1740 ) ## Changes Tested on the following bundle configuration ``` bundle: name: clusters mode: development variables: webhook_notifications: description: Webhook URL for notifications type: complex default: on_failure: id: 6a6c04c1-389c-4534-95af-b68b62a9dbe6 resources: jobs: test_job: name: "Andrew Nester Test Job" tasks: - task_key: test_task notebook_task: notebook_path: "./src/test.py" new_cluster: num_workers: 2 node_type_id: "i3.xlarge" autoscale: min_workers: 2 max_workers: 7 spark_version: "12.2.x-scala2.12" spark_conf: "spark.executor.memory": "2g" webhook_notifications: ${var.webhook_notifications} ``` bundle validate output is below ``` andrew.nester@HFW9Y94129 wheel % databricks bundle validate Warning: expected sequence, found map at resources.jobs.test_job.webhook_notifications.on_failure in bundle.yml:11:9 Name: clusters Target: default Workspace: User: andrew.nester@databricks.com Path: /Users/andrew.nester@databricks.com/.bundle/clusters/default ``` Note that error correctly points to the variable	2024-09-02 09:17:18 +00:00
shreyas-goenka	5d9910c8e0	Make lock optional in the JSON schema (#1738 ) Fixes https://github.com/databricks/cli/issues/1561	2024-09-02 08:39:08 +00:00
Gleb Kanterov	70ce802518	PythonMutator: preserve normalize diagnostics (#1735 ) ## Changes Preserve diagnostics if there are any errors or warnings when PythonMutator normalizes output. If anything goes wrong during conversion, diagnostics contain the relevant location and path. ## Tests Unit tests	2024-08-30 13:29:00 +00:00
Lennart Kats (databricks)	85459c1963	Improve error handling for /Volumes paths in mode: development (#1716 ) ## Changes * Provide a more helpful error when using an artifact_path based on /Volumes * Allow the use of short_names in /Volumes paths ## Example cases Example of a valid /Volumes artifact_path: * `artifact_path: /Volumes/catalog/schema/${workspace.current_user.short_name}/libs` Example of an invalid /Volumes path (when using `mode: development`): * `artifact_path: /Volumes/catalog/schema/libs` * Resulting error: `artifact_path should contain the current username or ${workspace.current_user.short_name} to ensure uniqueness when using 'mode: development'`	2024-08-28 12:14:19 +00:00
Lennart Kats (databricks)	84b47745e4	Ignore CLI version check on development builds of the CLI (#1714 ) ## Changes This change makes sure we ignore CLI version check on development builds of the CLI. Before: ``` $ cat databricks.yml | grep cli_version databricks_cli_version: ">= 0.223.1" $ cli bundle deploy Error: Databricks CLI version constraint not satisfied. Required: >= 0.223.1, current: 0.0.0-dev+06b169284737 ``` After: ``` ... $ cli bundle deploy ... Warning: Ignoring Databricks CLI version constraint for development build. Required: >= 0.223.1, current: 0.0.0-dev+d52d6f08fcd5 ``` ## Tests <!-- How is this tested? -->	2024-08-23 10:13:21 +00:00

266 Commits