databricks-cli

Commit Graph

Author	SHA1	Message	Date
Denis Bilenko	38efedcd73	Remove bundle.git.inferred (#2258 ) The only use case for it was to emit a warning and based on the discussion here https://github.com/databricks/cli/pull/2213/files#r1933558087 the warning it not useful and logging that with reduced severity is also not useful.	2025-01-29 14:15:52 +00:00
Gleb Kanterov	13596eb605	PythonMutator: Fix relative path error (#2253 ) ## Changes Fix relative path errors in the Python mutator that was failing during deployment since v0.239.1. Before that: ``` % databricks bundle deploy Deploying resources... Updating deployment state... Error: failed to compute relative path for job jobs_as_code_project_job: Rel: can't make resources/jobs_as_code_project_job.py relative to /Users/$USER/jobs_as_code_project ``` As a result, the bundle was deployed, but the deployment state wasn't updated. ## Tests Unit tests, adding acceptance tests in https://github.com/databricks/cli/pull/2254	2025-01-29 13:56:57 +00:00
shreyas-goenka	884b5f26ed	Set bundle auth configuration in command context (#2195 ) ## Changes This change is required to enable tracking execution time telemetry for bundle commands. In order to track execution time for the command generally, we need to have the databricks auth configuration available at this section of the code: `41bbd89257/cmd/root/root.go (L99)` In order to do this we can rely on the `configUsed` context key. Most commands rely on the `root.MustWorkspaceClient` function which automatically sets the client config in the `configUsed` context key. Bundle commands, however, do not do so. They instead store their workspace clients in the `&bundle.Bundle{}` object. With this PR, the `configUsed` context key will be set for all `bundle` commands. Functionally nothing changes. ## Tests Existing tests. Also manually verified that either `root.MustConfigureBundle` or `utils.ConfigureBundleWithVariables` is called for all bundle commands (except `bundle init`) thus ensuring this context key would be set for all bundle commands. refs for the functions: 1. `root.MustConfigureBundle`: `41bbd89257/cmd/root/bundle.go (L88)` 2. `utils.ConfigureBundleWithVariables`: `41bbd89257/cmd/bundle/utils/utils.go (L19)` --------- Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com>	2025-01-29 11:02:08 +00:00
Ilya Kuznetsov	0487e816cc	Reading variables from file (#2171 ) ## Changes New source of default values for variables - variable file `.databricks/bundle/<target>/variable-overrides.json` CLI tries to stat and read that file every time during variable initialisation phase <!-- Summary of your changes that are easy to understand --> ## Tests Acceptance tests	2025-01-23 14:35:33 +00:00
Andrew Nester	8af9efaa62	Show an error when non-yaml files used in include section (#2201 ) ## Changes `include` section is used only to include other bundle configuration YAML files. If any other file type is used, raise an error and guide users to use `sync.include` instead ## Tests Added acceptance test --------- Co-authored-by: Julia Crawford (Databricks) <julia.crawford@databricks.com>	2025-01-23 13:58:18 +00:00
Gleb Kanterov	3d91691f25	PythonMutator: propagate source locations (#1783 ) ## Changes Add a mechanism to load Python source locations in the Python mutator. Previously, locations pointed to generated YAML. Now, they point to Python sources instead. Python process outputs "locations.json" containing locations of bundle paths, examples: ```json {"path": "resources.jobs.job_0", "file": "resources/job_0.py", "line": 3, "column": 5} {"path": "resources.jobs.job_0.tasks[0].task_key", "file": "resources/job_0.py", "line": 10, "column": 5} {"path": "resources.jobs.job_1", "file": "resources/job_1.py", "line": 5, "column": 7} ``` Such locations form a tree, and we assign locations of the closest ancestor to each `dyn.Value` based on its path. For example, `resources.jobs.job_0.tasks[0].task_key` is located at `job_0.py:10:5` and `resources.jobs.job_0.tasks[0].email_notifications` is located at `job_0.py:3:5`, because we use the location of the job as the most precise approximation. This feature is only enabled if `experimental/python` is used. Note: for now, we don't update locations with relative paths, because it has a side effect in changing how these paths are resolved ## Example ``` % databricks bundle validate Warning: job_cluster_key abc is not defined at resources.jobs.examples.tasks[0].job_cluster_key in resources/example.py:10:1 ``` ## Tests Unit tests and manually	2025-01-22 15:37:37 +00:00
Denis Bilenko	e9902036b8	Set WorktreeRoot to sync root outside git repo (#2197 ) ## Changes If git is not detected, set default worktree root to sync root. Otherwise NewFileSet/View raise an error about worktree root being outside view root in acceptance/bundle/sync-paths-dotdot. This behavior is introduced in https://github.com/databricks/cli/pull/1945 Stacked on https://github.com/databricks/cli/pull/2202 ## Tests Existing tests.	2025-01-22 10:50:13 +00:00
Denis Bilenko	41bbd89257	Clean up unnecessary cleanup of inferred flag (#2193 ) ## Changes The SelectTarget mutator (part of Load phase) clears bundle.git.inferred flag but it is not set until later - Initialize phase / LoadGitDetails mutator. ## Tests Existing tests.	2025-01-20 17:21:34 +00:00
Ilya Kuznetsov	84a73052d2	fix: Detailed message for using source-linked deployment with file_path specified (#2119 ) ## Changes Resolves remaining comments from here https://github.com/databricks/cli/pull/2046 [This](https://github.com/databricks/cli/pull/2046#discussion_r1907121844) and [this](https://github.com/databricks/cli/pull/2046#discussion_r1908928239) are on hold until Pieter's response ## Tests <!-- How is this tested? -->	2025-01-20 16:16:51 +00:00
Pieter Noordhuis	9061635789	Default to forward slash-separated paths for path translation (#2145 ) ## Changes This came up in #2122 where relative library paths showed up with backslashes on Windows. It's hard to run acceptance tests where paths may be in either form. This change updates path translation logic to always use forward slash-separated paths, including for absolute paths. ## Tests * Unit tests pass. * Confirmed that code where library paths are used uses the `filepath` package for path manipulation. The functions in this package always normalize their inputs to be platform-native paths. * Confirmed that code that uses absolute paths works with forward slash-separated paths on Windows.	2025-01-17 09:38:01 +00:00
Denis Bilenko	2e70558dc1	Resolve variables in a loop (#2164 ) ## Changes - Instead of doing 2 passes on variable resolution, do a loop until there are no more updates (or we reach count 100). - Stacked on top of #2163 which is a regression test for this: acceptance/bundle/variables/complex-transitive-deep ## Tests Existing tests, new regression tests. These tests already passed before, added for completeness: - acceptance/bundle/variables/cycle - acceptance/bundle/variables/complex-cross-ref	2025-01-16 14:39:54 +00:00
shreyas-goenka	f2bba632cb	Patch references to UC schemas to capture dependencies automatically (#1989 ) ## Changes Fixes https://github.com/databricks/cli/issues/1977. This PR modifies the bundle configuration to capture the dependency that a UC Volume or a DLT pipeline might have on a UC schema at deployment time. It does so by replacing the schema name with a reference of the form `${resources.schemas.foo.name}`. For example: The following UC Volume definition depends on the UC schema with the name `schema_name`. This mutator converts this configuration from: ``` resources: volumes: bar: catalog_name: catalog_name name: volume_name schema_name: schema_name schemas: foo: catalog_name: catalog_name name: schema_name ``` to: ``` resources: volumes: bar: catalog_name: catalog_name name: volume_name schema_name: ${resources.schemas.foo.name}` schemas: foo: catalog_name: catalog_name name: schema_name ``` ## Tests Unit tests and manually.	2025-01-16 13:27:00 +00:00
Denis Bilenko	b273dc5942	Enable linter 'copyloopvar' and fix the issues (#2160 ) ## Changes - Remove all unnecessary copies of the loop variable, it is not necessary since Go 1.22 https://go.dev/blog/loopvar-preview - Enable the linter that catches this issue https://github.com/karamaru-alpha/copyloopvar ## Tests Existing tests.	2025-01-16 11:20:50 +00:00
Denis Bilenko	30dec59781	Improve resolution of complex variables within complex variables (#2157 ) ## Changes - Remove ResolveVariableReferencesInComplexVariables - it blocked complex-within-complex for no good reason. - Repeat regular resolution twice, it helps with a couple test cases we have. There may be a case for running it 3 times or more in a loop, but there is no test case for that, so this PR is simple incremental improvement. ## Tests Existing acceptance tests. Previously all unit tests for complex variables were converted to acceptance tests, to capture this change and ensure nothing breaks.	2025-01-15 18:03:43 +01:00
Denis Bilenko	39b03592d7	Migrate TestResolveComplexVariableWithVarReference (#2156 ) This is the last test referencing ResolveVariableReferencesInComplexVariables, allowing removal of that mutator.	2025-01-15 17:52:17 +01:00
Denis Bilenko	581565a1c4	Migrate more variable tests to acceptance (#2154 )	2025-01-15 15:59:42 +01:00
Denis Bilenko	b76eee0e8c	Migrate resolution tests to acceptance tests (#2143 )	2025-01-15 11:22:23 +01:00
Pieter Noordhuis	5d9bc3b553	Allow artifact path to be located outside the sync root (#2128 ) ## Changes We perform a check during path translation that the path being referenced is contained in the bundle's sync root. If it isn't, it's not a valid remote reference. However, this doesn't apply to paths that are _always_ local, such as the artifact path. An artifact's build command is executed in its path. Files created by the artifact build (e.g. wheels or JARs) don't need to be in the sync root because they have a dedicated and different upload path into `${workspace.artifact_path}`. Therefore, this check that a path is contained in the bundle's sync root doesn't apply to artifact paths. This change modifies the structure of path translation to allow opting out of this check. Fixes #1927. ## Tests * Existing and new tests pass. * Manually confirmed that building and using a wheel built outside the sync root path works as expected. * No acceptance tests because we don't run build as part of validate.	2025-01-14 08:34:55 +00:00
Andrew Nester	913e10a037	Added support for Databricks Apps in DABs (#1928 ) ## Changes Now it's possible to configure new `app` resource in bundle and point it to the custom `source_code_path` location where Databricks App code is defined. On `databricks bundle deploy` DABs will create an app. All consecutive `databricks bundle deploy` execution will update an existing app if there are any updated On `databricks bundle run <my_app>` DABs will execute app deployment. If the app is not started yet, it will start the app first. ### Bundle configuration ``` bundle: name: apps variables: my_job_id: description: "ID of job to run app" lookup: job: "My Job" databricks_name: description: "Name for app user" additional_flags: description: "Additional flags to run command app" default: "" my_app_config: type: complex description: "Configuration for my Databricks App" default: command: - flask - --app - hello - run - ${var.additional_flags} env: - name: DATABRICKS_NAME value: ${var.databricks_name} resources: apps: my_app: name: "anester-app" # required and has to be unique description: "My App" source_code_path: ./app # required and points to location of app code config: ${var.my_app_config} resources: - name: "my-job" description: "A job for app to be able to run" job: id: ${var.my_job_id} permission: "CAN_MANAGE_RUN" permissions: - user_name: "foo@bar.com" level: "CAN_VIEW" - service_principal_name: "my_sp" level: "CAN_MANAGE" targets: dev: variables: databricks_name: "Andrew (from dev)" additional_flags: --debug prod: variables: databricks_name: "Andrew (from prod)" ``` ### Execution 1. `databricks bundle deploy -t dev` 2. `databricks bundle run my_app -t dev` If app is started ``` ✓ Getting the status of the app my-app ✓ App is in RUNNING state ✓ Preparing source code for new app deployment. ✓ Deployment is pending ✓ Starting app with command: flask --app hello run --debug ✓ App started successfully You can access the app at <app-url> ``` If app is not started ``` ✓ Getting the status of the app my-app ✓ App is in UNAVAILABLE state ✓ Starting the app my-app ✓ App is starting... .... ✓ App is starting... ✓ App is started! ✓ Preparing source code for new app deployment. ✓ Downloading source code from /Workspace/Users/... ✓ Starting app with command: flask --app hello run --debug ✓ App started successfully You can access the app at <app-url> ``` ## Tests Added unit and config tests + manual test. ``` --- PASS: TestAccDeployBundleWithApp (404.59s) PASS coverage: 36.8% of statements in ./... ok github.com/databricks/cli/internal/bundle 405.035s coverage: 36.8% of statements in ./... ```	2025-01-13 16:43:48 +00:00
Lennart Kats (databricks)	3e40a0c2f1	Encourage the use of root_path in production to ensure single deployment (#1712 ) ## Changes This updates `mode: production` to allow `root_path` to indicate uniqueness. Historically, we required `run_as` for this, which isn't actually very effective for that purpose. `run_as` also had the problem that it doesn't work for pipelines. This is a cherry-pick from https://github.com/databricks/cli/pull/1387 --------- Co-authored-by: Pieter Noordhuis <pcnoordhuis@gmail.com>	2025-01-13 12:19:12 +00:00
Denis Bilenko	df17e4b4ea	Convert some resolve variables tests to acceptance test (#2100 )	2025-01-08 17:44:52 +00:00
Ilya Kuznetsov	0289becea8	Handle `${workspace.file_path}` references in source-linked deployments (#2046 ) ## Changes 1. Updates `workspace.file_path` during source-linked deployment to address cases like this https://github.com/databricks/bundle-examples/blob/main/default_python/resources/default_python_pipeline.yml#L13 2. Updates `workspace.file_path` in `metadata.json` 3. Prints warning for users when `workspace.file_path` is explicitly set but deploy is running in source-linked mode ## Tests Unit test	2025-01-08 12:43:56 +00:00
Gleb Kanterov	02c7df39f6	Add 'experimental/python' support (#2052 ) ## Changes Add `experimental/python` section replacing `experimental/pydabs`. Add 2 new mutators into existing pipeline: - `ApplyPythonMutator(load_resources)` - loads resources from Python code - `ApplyPythonMutator(apply_mutators)` - transforms existing resources defined in Python/YAML Example: ```yaml experimental: python: resources: - "resources:load_resources" mutators: - "mutators:add_email_notifications" ``` ## Tests Unit tests and manually --------- Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com>	2025-01-08 09:29:45 +00:00
Denis Bilenko	e2cd8c2f34	Enable perfsprint linter and apply autofix (#2071 ) https://github.com/catenacyber/perfsprint	2025-01-07 10:49:23 +00:00
Denis Bilenko	39d1e8093f	Enable intrange linter and apply autofix (#2069 ) New construct in Go1.22+ for integer iteration: https://github.com/ckaznocha/intrange?tab=readme-ov-file#intrange	2025-01-03 09:25:07 +00:00
shreyas-goenka	7beb0fb8b5	Add validation mutator for volume `artifact_path` (#2050 ) ## Changes This PR: 1. Incrementally improves the error messages shown to the user when the volume they are referring to in `workspace.artifact_path` does not exist. 2. Performs this validation in both `bundle validate` and `bundle deploy` compared to before on just deployments. 3. It runs "fast" validations on `bundle deploy`, which earlier were only run on `bundle validate`. ## Tests Unit tests and manually. Also, existing integration tests provide coverage (`TestUploadArtifactToVolumeNotYetDeployed`, `TestUploadArtifactFileToVolumeThatDoesNotExist`) Examples: ``` .venv➜ bundle-playground git:(master) ✗ cli bundle validate Error: cannot access volume capital.whatever.my_volume: User does not have READ VOLUME on Volume 'capital.whatever.my_volume'. at workspace.artifact_path in databricks.yml:7:18 ``` and ``` .venv➜ bundle-playground git:(master) ✗ cli bundle validate Error: volume capital.whatever.foobar does not exist at workspace.artifact_path resources.volumes.foo in databricks.yml:7:18 databricks.yml:12:7 You are using a volume in your artifact_path that is managed by this bundle but which has not been deployed yet. Please first deploy the volume using 'bundle deploy' and then switch over to using it in the artifact_path. ```	2025-01-02 17:23:15 +05:30
Denis Bilenko	0b80784df7	Enable testifylint and fix the issues (#2065 ) ## Changes - Enable new linter: testifylint. - Apply fixes with --fix. - Fix remaining issues (mostly with aider). There were 2 cases we --fix did the wrong thing - this seems to a be a bug in linter: https://github.com/Antonboom/testifylint/issues/210 Nonetheless, I kept that check enabled, it seems useful, just need to be fixed manually after autofix. ## Tests Existing tests	2025-01-02 12:03:41 +01:00
Denis Bilenko	3f523b45cc	Fix lost diags across different mutators (#2057 ) ## Changes Fix cases where accumulated diagnostics are lost instead of being propagated further. In some cases it's not possible, add a comment there. ## Tests Existing tests	2024-12-31 14:01:45 +00:00
Denis Bilenko	2fee243586	Fix finding Python within virtualenv on Windows (#2034 ) ## Changes Simplify logic for selecting Python to run when calculating default whl build command: "python" on Windows and "python3" everywhere. Python installers from python.org do not install python3.exe. In virtualenv there is no python3.exe. ## Tests Added new unit tests to create real venv with uv and simulate activation by prepending venv/bin to PATH.	2024-12-20 07:45:32 +00:00
Pieter Noordhuis	241fcfffb0	Consolidate helper functions to `internal/testutil` package (#2002 ) ## Changes This is one step (of many) toward moving the integration tests around. This change consolidates the following functions: * `ReadFile` / `WriteFile` * `GetEnvOrSkipTest` * `RandomName` ## Tests n/a	2024-12-12 12:35:38 +00:00
Denis Bilenko	2e018cfaec	Enable gofumpt and goimports in golangci-lint (#1999 ) ## Changes Enable gofumpt and goimports in golangci-lint and apply autofix. This makes 'make fmt' redundant, will be cleaned up in follow up diff. ## Tests Existing tests.	2024-12-12 10:28:42 +01:00
Lennart Kats (databricks)	2ee7d56ae6	Show an error when using a cluster override with 'mode: production' (#1994 ) ## Changes We should show a warning when using a cluster override with 'mode: production'. Right now, we inadvertently show an error for this state. This is a followup based on https://github.com/databricks/cli/pull/1899#discussion_r1877765148.	2024-12-11 14:57:31 +00:00
Denis Bilenko	8d5351c1c3	Enable errcheck everywhere and fix or silent remaining issues (#1987 ) ## Changes Enable errcheck linter for the whole codebase. Fix remaining complaints: - If we can propagate error to caller, do that - If we writing to stdout, continue ignoring errors (to avoid crashing in "cli \| head" case) - Add exception for cobra non-critical API such as MarkHidden/MarkDeprecated/RegisterFlagCompletionFunc. This keeps current code and behaviour, to be decided later if we want to change this. - Continue ignoring errors where that is desired behaviour (e.g. git.loadConfig). - Continue ignoring errors where panicking seems riskier than ignoring the error. - Annotate cases in libs/dyn with //nolint:errcheck - to be addressed later. Note, this PR is not meant to come up with the best strategy for each case, but to be a relative safe change to enable errcheck linter. ## Tests Existing tests.	2024-12-11 13:26:00 +01:00
Denis Bilenko	4236e7122f	Switch to `folders.FindDirWithLeaf` (#1963 ) ## Changes Remove two duplicate implementations of the same logic, switch everywhere to folders.FindDirWithLeaf. Add Abs() call to FindDirWithLeaf, it cannot really work on relative paths. ## Tests Existing tests.	2024-12-11 09:44:22 +01:00
Denis Bilenko	67f08ba924	Avoid panic if Config.Workspace.CurrentUser.User is not set (#1993 ) ## Changes Extra check to avoid panic if /api/2.0/preview/scim/v2/Me returns `{}` ## Tests Existing tests.	2024-12-11 09:40:14 +01:00
Lennart Kats (databricks)	f3c628e537	Allow overriding compute for non-development mode targets (#1899 ) ## Changes Allow overriding compute for non-development targets. We previously had a restriction in place where `--cluster-id` was only allowed for targets that use `mode: development`. The intention was to prevent mistakes, but this was overly restrictive. ## Tests Updated unit tests.	2024-12-10 10:02:44 +00:00
Denis Bilenko	1b2be1b2cb	Add error checking in tests and enable errcheck there (#1980 ) ## Changes Fix all errcheck-found issues in tests and test helpers. Mostly this done by adding require.NoError(t, err), sometimes panic() where t object is not available). Initial change is obtained with aider+claude, then manually reviewed and cleaned up. ## Tests Existing tests.	2024-12-09 13:56:41 +01:00
Pieter Noordhuis	6e754d4f34	Rewrite 'interface{} -> any' (#1959 ) ## Changes The `any` alias for `interface{}` has been around since Go 1.18. Now that we're using golangci-lint (#1953), we can lint on it. Existing commits can be updated with: ``` gofmt -w -r 'interface{} -> any' . ``` ## Tests n/a	2024-12-05 15:37:24 +00:00
Denis Bilenko	0ad790e468	Properly read Git metadata when running inside workspace (#1945 ) ## Changes Since there is no .git directory in Workspace file system, we need to make an API call to api/2.0/workspace/get-status?return_git_info=true to fetch git the root of the repo, current branch, commit and origin. Added new function FetchRepositoryInfo that either looks up and parses .git or calls remote API depending on env. Refactor Repository/View/FileSet to accept repository root rather than calculate it. This helps because: - Repository is currently created in multiple places and finding the repository root is becoming relatively expensive (API call needed). - Repository/FileSet/View do not have access to current Bundle which is where WorkplaceClient is stored. ## Tests - Tested manually by running "bundle validate --json" inside web terminal within Databricks env. - Added integration tests for the new API. --------- Co-authored-by: Andrew Nester <andrew.nester@databricks.com> Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com>	2024-12-05 10:13:13 +00:00
shreyas-goenka	0da17f6ec6	Add default value for `volume_type` for DABs (#1952 ) ## Changes The Unity Catalog volumes API requires a `volume_type` argument when creating volumes. In the context of DABs, it's unnecessary to require users to specify the volume type every time. We can default to "MANAGED" instead. This PR is similar to https://github.com/databricks/cli/pull/1743 which does the same for dashboards. ## Tests Unit test	2024-12-04 11:05:54 +00:00
Denis Bilenko	0e088eb9f8	Simplify load_git_details.go; remove unnecessary Abs() call (#1950 ) Suggested here https://github.com/databricks/cli/pull/1945#discussion_r1866088579	2024-12-02 22:41:38 +00:00
shreyas-goenka	2847533e1e	Add DABs support for Unity Catalog volumes (#1762 ) ## Changes This PR adds support for UC volumes to DABs. ### Can I use a UC volume managed by DABs in `artifact_path`? Yes, but we require the volume to exist before being referenced in `artifact_path`. Otherwise you'll see an error that the volume does not exist. For this case, this PR also adds a warning if we detect that the UC volume is defined in the DAB itself, which informs the user to deploy the UC volume in a separate deployment first before using it in `artifact_path`. We cannot create the UC volume and then upload the artifacts to it in the same `bundle deploy` because `bundle deploy` always uploads the artifacts to `artifact_path` before materializing any resources defined in the bundle. Supporting this in a single deployment requires us to migrate away from our dependency on the Databricks Terraform provider to manage the CRUD lifecycle of DABs resources. ### Why do we not support `preset.name_prefix` for UC volumes? UC volumes will not have a `dev_shreyas_goenka` prefix added in `mode: development`. Configuring `presets.name_prefix` will be a no-op for UC volumes. We have decided not to support prefixing for UC resources. This is because: 1. UC provides its own namespace hierarchy that is independent of DABs. 2. Users can always manually use `${workspace.current_user.short_name}` to configure the prefixes manually. Customers often manually set up a UC hierarchy for dev and prod, including a schema or catalog per developer. Thus, it's often unnecessary for us to add prefixing in `mode: development` by default for UC resources. In retrospect, supporting prefixing for UC schemas and registered models was a mistake and will be removed in a future release of DABs. ## Tests Unit, integration test, and manually. ### Manual Testing cases: 1. UC volume does not exist: ``` ➜ bundle-playground git:(master) ✗ cli bundle deploy Error: failed to fetch metadata for the UC volume /Volumes/main/caps/my_volume that is configured in the artifact_path: Not Found ``` 2. UC Volume does not exist, but is defined in the DAB ``` ➜ bundle-playground git:(master) ✗ cli bundle deploy Error: failed to fetch metadata for the UC volume /Volumes/main/caps/managed_by_dab that is configured in the artifact_path: Not Found Warning: You might be using a UC volume in your artifact_path that is managed by this bundle but which has not been deployed yet. Please deploy the UC volume in a separate bundle deploy before using it in the artifact_path. at resources.volumes.bar in databricks.yml:24:7 ``` --------- Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com>	2024-12-02 21:18:07 +00:00
shreyas-goenka	e86a949d99	Add the `bundle_uuid` helper function for templates (#1947 ) ## Changes This PR adds the `bundle_uuid` helper function that'll return a stable identifier for the bundle for the duration of the `bundle init` command. This is also the UUID that'll be set in the telemetry event sent during `databricks bundle init` and would be used to correlate revenue from bundle init with resource deployments. Template authors should add the uuid field to their `databricks.yml` file they generate: ``` bundle: # A stable identified for your DAB project. We use this UUID in the Databricks backend # to correlate and identify multiple deployments of the same DAB project. uuid: {{ bundle_uuid }} ``` ## Tests Unit test	2024-12-02 10:29:29 +00:00
Denis Bilenko	00bd98f898	Move loadGitDetails mutator to Initialize phase (#1944 ) This will require API call when run inside a workspace, which will require workspace client (we don't have one at the current point). We want to keep Load phase quick, since it's common across all commands.	2024-12-02 09:49:32 +00:00
Andrew Nester	8053e9c4e4	Fix segfault in bundle summary command (#1937 ) ## Changes This PR introduces use of new `isNil` method. It allows to ensure we filter out all improperly defined resources in `bundle summary` command. This includes deleted resources or resources with incorrect configuration such as only defining key of the resource and nothing else. Fixes #1919, #1913 ## Tests Added regression unit test case	2024-11-28 12:27:24 +00:00
shreyas-goenka	b323703c1b	Add validation for single node clusters (#1909 ) ## Changes This PR adds a warning validating that the configuration for a single node cluster is valid for interactive, job, job-task, and pipeline clusters. Note: We skip the validation if a cluster policy is configured because the policy is likely to configure `spark_conf` / `custom_tags` itself. Note: Terrform originally only had validation for interactive, job, and job-task clusters. This PR adding the validation for pipeline clusters as well is new. This PR follows the same logic as we used to have in Terraform. The validation was removed from Terraform because we had no way to demote the error to a warning: https://github.com/databricks/terraform-provider-databricks/pull/4222 ### Background Single-node clusters require `spark_conf` and `custom_tags` to be correctly set in the cluster definition for them to function optimally. The cluster will be created even if incorrectly configured, but its performance will not be great. For example, if both `spark_conf` and `custom_tags` are not set and `num_workers` is 0, then only the driver process will be launched on the cluster compute instance thus leading to sub-optimal utilization of available compute resources and no parallelization across worker processes when processing a spark query. ### Issue This PR addresses some issues reported in https://github.com/databricks/cli/issues/1546 ## Tests Unit tests and manually. Example output of the warning: ``` ➜ bundle-playground git:(master) ✗ cli bundle validate Warning: Single node cluster is not correctly configured at resources.pipelines.bar.clusters[0] in databricks.yml:29:11 num_workers should be 0 only for single-node clusters. To create a valid single node cluster please ensure that the following properties are correctly set in the cluster specification: spark_conf: spark.databricks.cluster.profile: singleNode spark.master: local[*] custom_tags: ResourceClass: SingleNode Name: foobar Target: default Workspace: User: shreyas.goenka@databricks.com Path: /Workspace/Users/shreyas.goenka@databricks.com/.bundle/foobar/default Found 1 warning ```	2024-11-22 15:48:09 +00:00
Ilya Kuznetsov	490dd058aa	Extended message for warning when source-linked mode is used outside of the workspace (#1929 ) ## Changes Added path and locations to the warning which displayed when source-linked mode is used outside of the workspace	2024-11-22 14:44:33 +00:00
Pieter Noordhuis	abfd1713e0	Skip sync warning if no sync paths are defined (#1926 ) ## Changes Users can configure the bundle to not synchronize any files with: ```yaml sync: paths: [] ``` If it is explicitly configured as an empty list, the validate command must not warn about not having any files to synchronize. The warning exists to alert users who are unintentionally not synchronizing any files (they might have a `.gitignore` pattern that matches everything). Closes #1663. ## Tests * New unit test.	2024-11-21 15:03:13 +00:00
Pieter Noordhuis	a3cea07c9e	Support lookup by name of notification destinations (#1922 ) ## Changes Add support for notification destinations in variable lookups. More information: https://docs.databricks.com/en/admin/workspace-settings/notification-destinations.html Depends on #1921. ## Tests * New unit test * Manually confirmed that the lookup works	2024-11-21 15:52:14 +01:00
shreyas-goenka	c2e2abcc35	Extend "notebook not found" error to warn about missing extension (#1920 ) ## Changes The full workspace path for a notebook does not contain the notebook's extension. If a user converts that file path to a relative path (like `/Workspace/bundle_root/bar/nb` -> `./bar/nb`), they can be confused as to why the new file path does not work. The changes in this PR nudge them to add the appropriate file extension (e.g., `./bar/nb.py` or `./bar/nb.ipynb`). One common way users can end up in this scenario is by using the view job as YAML functionality in the Databricks UI. ## Tests Unit test and manually. ``` (.venv) ➜ bundle-playground git:(master) ✗ cli bundle validate Error: notebook ./foo not found. Local notebook references are expected to contain one of the following file extensions: [.py, .r, .scala, .sql, .ipynb] ```	2024-11-21 16:21:21 +05:30

1 2 3 4 5 ...

309 Commits