databricks-cli

Commit Graph

Author	SHA1	Message	Date
Pieter Noordhuis	14abcb3ad7	Add `dynvar` package for variable resolution with a `dyn.Value` tree (#1143 ) ## Changes This is the `dyn` counterpart to the `bundle/config/interpolation` package. It relies on the paths in `${foo.bar}` being valid `dyn.Path` instances. It leverages `dyn.Walk` to get a complete picture of all variable references and uses `dyn.Get` to retrieve values pointed to by variable references. Depends on #1142. ## Tests Unit test coverage. I tried to mirror the tests from `bundle/config/interpolation` and added new ones where applicable (for example to test type retention of referenced values).	2024-01-24 18:49:06 +00:00
Pieter Noordhuis	ff6e0354b9	Add functionality to visit values in `dyn.Value` tree (#1142 ) ## Changes This change adds the following functions: * `dyn.Get(value, "foo.bar") -> (dyn.Value, error)` * `dyn.Set(value, "foo.bar", newValue) -> (dyn.Value, error)` * `dyn.Map(value, "foo.bar", func) -> (dyn.Value, error)` And equivalent functions that take a previously constructed `dyn.Path`: * `dyn.GetByPath(value, dyn.Path) -> (dyn.Value, error)` * `dyn.SetByPath(value, dyn.Path, newValue) -> (dyn.Value, error)` * `dyn.MapByPath(value, dyn.Path, func) -> (dyn.Value, error)` Changes made by the "set" and "map" functions are never reflected in the input argument; they return new `dyn.Value` instances for all nodes in the path leading up to the changed value. ## Tests New unit tests cover all critical paths.	2024-01-24 18:38:46 +00:00
Andrew Nester	70fe0e36ef	Added `databricks bundle generate job` command (#1043 ) ## Changes Now it's possible to generate bundle configuration for existing job. For now it only supports jobs with notebook tasks. It will download notebooks referenced in the job tasks and generate bundle YAML config for this job which can be included in larger bundle. ## Tests Running command manually Example of generated config ``` resources: jobs: job_128737545467921: name: Notebook job format: MULTI_TASK tasks: - task_key: as_notebook existing_cluster_id: 0704-xxxxxx-yyyyyyy notebook_task: base_parameters: bundle_root: /Users/andrew.nester@databricks.com/.bundle/job_with_module_imports/development/files notebook_path: ./entry_notebook.py source: WORKSPACE run_if: ALL_SUCCESS max_concurrent_runs: 1 ``` ## Tests Manual (on our last 100 jobs) + added end-to-end test ``` --- PASS: TestAccGenerateFromExistingJobAndDeploy (50.91s) PASS coverage: 61.5% of statements in ./... ok github.com/databricks/cli/internal/bundle 51.209s coverage: 61.5% of statements in ./... ```	2024-01-17 14:26:33 +00:00
shreyas-goenka	7dcdadde79	Add debug log line for when bundle init is run from non-TTY interface (#1117 )	2024-01-11 15:41:13 +00:00
Pieter Noordhuis	94112eaedb	Make libs/exec fallback to `sh` if `bash` cannot be found (#1114 ) ## Changes Falling back to `sh` is also what GitHub Actions do if `bash` is not found in the path. It is possible `bash` is not available when running from minimal Docker containers and we must not error out in this case. See: https://docs.github.com/en/actions/using-workflows/workflow-syntax-for-github-actions#jobsjob_idstepsshell. This change renames `interpreter` -> `shell`. ## Tests Unit tests pass.	2024-01-11 12:26:31 +00:00
Pieter Noordhuis	d8a64e6617	Define constant for the invalid `dyn.Value` (#1101 ) ## Changes The nil value is a real valid value that we need to represent. To accommodate this we introduced `dyn.KindInvalid` as the zero-value for `dyn.Kind` (see #904), but did not yet update the comments on `dyn.NilValue` or add tests for `kind.go`. This also moves `KindNil` to be last in the definition order (least likely to care about it). ## Tests Tests pass.	2024-01-05 13:02:04 +00:00
Pieter Noordhuis	bae220d1bc	Consolidate functions to convert `dyn.Value` to native types (#1100 ) ## Changes The file `value.go` had a couple `AsZZZ` and `MustZZZ` functions. This change backfills missing versions and moves all of them to a separate file. ## Tests Tests pass; full coverage.	2024-01-05 12:06:12 +00:00
Lennart Kats (databricks)	167deec8c3	Change recommended production deployment path from /Shared to /Users (#1091 ) ## Changes This PR changes the default and `mode: production` recommendation to target `/Users` for deployment. Previously, we used `/Shared`, but because of a lack of POSIX-like permissions in WorkspaceFS this meant that files inside would be readable and writable by other users in the workspace. Detailed change: * `default-python` no longer uses a path that starts with `/Shared` * `mode: production` no longer requires a path that starts with `/Shared` ## Related PRs Docs: https://github.com/databricks/docs/pull/14585 Examples: https://github.com/databricks/bundle-examples/pull/17 ## Tests * Manual tests * Template unit tests (with an extra check to avoid /Shared)	2024-01-02 19:58:24 +00:00
Lennart Kats (databricks)	10a8ce4562	Improve experience for multiple builtin templates (#1052 ) ## Changes This enhances the template selection experience a bit as we add more and more built-in templates (like https://github.com/databricks/cli/pull/1051 and https://github.com/databricks/cli/pull/1059): ### New experience: <img width="661" alt="image" src="https://github.com/databricks/cli/assets/58432911/afe3b84d-8a77-47f3-b9c2-f827f7893cd7"> ### Current experience: <img width="265" alt="image" src="https://github.com/databricks/cli/assets/58432911/36f8d568-819f-4920-83b1-fb76109ea3d1"> --------- Co-authored-by: shreyas-goenka <88374338+shreyas-goenka@users.noreply.github.com>	2023-12-27 12:03:08 +00:00
shreyas-goenka	f2408eda62	Add support for reprompts if user input does not match template schema (#946 ) ## Changes This PR adds retry logic to user input prompts, prompting users again if the value does not match the requirements specified in the bundle template schema. ## Tests Manually. Here's an example UX. The first prompt expects an integer and the second one a string made only from the letters "defg" ``` shreyas.goenka@THW32HFW6T cli % cli bundle init ~/mlops-stack Please enter an integer [123]: abc Validation failed: "abc" is not a integer Please enter an integer [123]: 123 Please enter a string [dddd]: apple Validation failed: invalid value for input_root_dir: "apple". Only characters the 'd', 'e', 'f', 'g' are allowed ```	2023-12-22 15:43:08 +00:00
Pieter Noordhuis	938eb1600c	Rename libs/config -> libs/dyn (#1086 ) ## Changes The name "dynamic value", or "dyn" for short, is more descriptive than the opaque "config". Also, it conveniently does not alias with other packages in the repository, or (popular ones) elsewhere. (discussed with @andrewnester) ## Tests n/a	2023-12-22 13:20:45 +00:00
Pieter Noordhuis	a1297d71fd	Functionality to walk a `config.Value` tree (#1081 ) ## Changes This change adds: * A `config.Walk` function to walk a configuration tree * A `config.Path` type to represent a value's path inside a tree * Functions to create a `config.Path` from a string, or convert one to a string ## Tests Additional unit tests with full coverage.	2023-12-22 10:38:09 +00:00
Andrew Nester	ac37a592f1	Added exec.NewCommandExecutor to execute commands with correct interpreter (#1075 ) ## Changes Instead of handling command chaining ourselves, we execute passed commands as-is by storing them, in temp file and passing to correct interpreter (bash or cmd) based on OS. Fixes #1065 ## Tests Added unit tests	2023-12-21 15:45:23 +00:00
Andrew Nester	42f21d82fe	Do not prompt for template values in Git Bash (#1082 ) ## Changes Follow up on #1077	2023-12-20 12:01:53 +00:00
Andrew Nester	f18094d943	Revert using IsPromptSupported from promptOrAssignDefaultValues (#1077 ) ## Changes Fixes nightly test `TestAccBundleInitErrorOnUnknownFields`. `TestAccBundleInitErrorOnUnknownFields` has an interactive shell by default so the test fails on waiting for prompt. This was introduced in #1069. ## Tests Nightly test succeed.	2023-12-19 09:58:46 +00:00
Andrew Nester	6dd6899b52	Do not allow input prompts in Git Bash terminal (#1069 ) ## Changes Likely due to fact that Git Bash does not correctly support ANSI escape sequences, we cannot use `promptui` package there. See known issues: - https://github.com/manifoldco/promptui/issues/208 - https://github.com/chzyer/readline/issues/191	2023-12-18 15:01:59 +00:00
Pieter Noordhuis	b17e845d44	Skip profile resolution if `DATABRICKS_AUTH_TYPE` is set (#1068 ) ## Changes If a user configures a workspace host in a bundle and wants to use the "azure-cli" authentication type, we would still run profile resolution. If the databrickscfg has a matching profile, we still load it, even though it should be a fallback. ## Tests * Unit test. * Manually confirmed that setting `DATABRICKS_AUTH_TYPE=azure-cli` now works as expected.	2023-12-18 09:57:07 +00:00
Lennart Kats (databricks)	8b9930a49a	Improve default template (#1046 ) ## Changes - Tweak strings, documentation in template - Extend requirements-dev.txt with setuptools/wheel for building whl files - Clarify what the "_job.yml" file is for for users who are only interested in DLT pipelines (answering a question that came up recently) ## Tests Existing tests exercise this template	2023-12-11 19:13:14 +00:00
Serge Smertin	42c06267eb	Stub out Python virtual environment installation for `labs` commands (#1057 ) This PR removes 15 seconds from `make test` runtime	2023-12-11 16:30:19 +00:00
Andrew Nester	cdf29da27b	Change default_python template to auto-update version on each wheel build (#1034 ) ## Changes Change default_python template to auto-update version on each wheel build	2023-12-01 13:24:55 +00:00
Pieter Noordhuis	60a8abdcd7	Rewrite the friendly log handler (#1038 ) ## Changes It wasn't working because it deferred to the regular `slog.TextHandler` for the `WithAttr` and `WithGroup` functions. Both of these functions don't mutate the handler but return a new one. When the top-level logger called one of these, log records in that context used the standard handler instead of ours. To implement tracking of attributes and groups, I followed the guide at https://github.com/golang/example/blob/master/slog-handler-guide/README.md for writing custom handlers. ## Tests The new tests demonstrate formatting through `t.Log` and look good.	2023-12-01 12:17:04 +00:00
shreyas-goenka	bdef0f7b23	Add support for conditional prompting in bundle init (#971 ) ## Changes This PR introduces the `skip_prompt_if` extension to the jsonschema library. If the inputs provided by the user match the JSON schema then the prompt for that property is skipped. Right now only constant checks are supported, but if in the future more complicated conditionals are required, this can be extended to support `allOf`, `oneOf`, `anyOf` etc allowing template authors to specify conditionals of arbitary complexity. ## Tests Unit tests and manually.	2023-11-30 16:07:45 +00:00
shreyas-goenka	1f1ed6db53	Add versioning for bundle templates (#972 ) ## Changes This PR adds versioning for bundle templates. Right now there's only logic for the maximum version of templates supported. At some point in the future if we make a breaking template change we can also include a minimum version of template supported by the CLI. ## Tests Unit tests.	2023-11-30 14:28:51 +00:00
Pieter Noordhuis	10c9eca06f	Filter out system clusters for `--configure-cluster` (#1031 ) ## Changes Only clusters with their source attribute equal to `UI` or `API` should be presented in the dropdown. ## Tests Unit test and manual confirmation.	2023-11-30 09:59:11 +00:00
Serge Smertin	65458cbde6	Fix `panic: $HOME is not set` (#1027 ) This PR adds error to `env.UserHomeDir(ctx)` Fixes https://github.com/databricks/setup-cli/issues/73 --------- Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com>	2023-11-29 19:08:27 +00:00
Andrew Nester	b5f34a1181	Removed unused `ToHttpsUrl` method and corresponding library (#1017 ) ## Changes Removed unused ToHttpsUrl method and corresponding library	2023-11-28 16:08:27 +00:00
Pieter Noordhuis	f5f57b6bf9	Populate struct field with `config.Value` instance if possible (#1010 ) ## Changes If a struct has a field of type `config.Value`, then we set it to the source value while converting a `config.Value` instance to a struct as part of a call to `convert.ToTyped`. This is convenient when dealing with deeply nested structs where functions on inner structs need access to the metadata provided by their corresponding `config.Value` (e.g. where they were defined). ## Tests Added unit tests pass.	2023-11-27 10:06:29 +00:00
Pieter Noordhuis	ef97e249ec	Add function to check if `config.Value` is valid (#1009 ) ## Changes Small function broken out from other work in progress.	2023-11-24 13:21:47 +00:00
Pieter Noordhuis	6187803007	Correctly overwrite local state if remote state is newer (#1008 ) ## Changes A bug in the code that pulls the remote state could cause the local state to be empty instead of a copy of the remote state. This happened only if the local state was present and stale when compared to the remote version. We correctly checked for the state serial to see if the local state had to be replaced but didn't seek back on the remote state before writing it out. Because the staleness check would read the remote state in full, copying from the same reader would immediately yield an EOF. ## Tests * Unit tests for state pull and push mutators that rely on a mocked filer. * An integration test that deploys the same bundle from multiple paths, triggering the staleness logic. Both failed prior to the fix and now pass.	2023-11-24 11:15:46 +00:00
shreyas-goenka	d9fe2ab43d	Improve error message when path is not a bundle template (#985 ) Adds better error message when input path is not a bundle template before: ``` shreyas.goenka@THW32HFW6T bricks % cli bundle init ~/bricks Error: open /Users/shreyas.goenka/bricks/databricks_template_schema.json: no such file or directory ``` after: ``` shreyas.goenka@THW32HFW6T bricks % cli bundle init ~/bricks Error: expected to find a template schema file at /Users/shreyas.goenka/bricks/databricks_template_schema.json ```	2023-11-22 12:25:16 +00:00
Lennart Kats (databricks)	92539d4b9b	Work around DLT issue with `$PYTHONPATH` not being set correctly (#999 ) ## Changes DLT currently doesn't always set `$PYTHONPATH` correctly (ES-947370). This restores the original workaround to make new pipelines work while that issue is being addressed. The workaround was removed in #832. Manually tested.	2023-11-20 19:25:43 +00:00
Pieter Noordhuis	2c908f8fea	Function to convert Go struct back to `config.Value` (#935 ) ## Changes This PR is the counterpart to #904. With this change, we are able to convert a `config.Value` into a Go struct, make modifications to the Go struct, and reflect those changes in a new `config.Value`. This functionality allows us to incrementally introduce this configuration representation to existing bundle mutators. Bundle mutators expect a `*bundle.Bundle` argument and mutate its configuration directly. These mutations are not reflected in the corresponding `config.Value` (once introduced), which means we cannot use the `config.Value` as source of truth until we update _all_ mutators. To address this, we can run `convert.ToTyped` and `convert.FromTyped` at the mutator boundary (from `bundle.Apply`) and capture changes made to the Go struct. Then we can incrementally make mutators aware of the `config.Value` configuration and have them mutate that structure directly. ## Tests New unit tests pass. Manual spot checks against the bundle configuration type.	2023-11-15 09:19:51 +00:00
shreyas-goenka	b397501880	Fix template initialization from current working directory (#976 ) ## Changes If args[0] == "." was provided to bundle init command, it would try to resolve it as a built in template and error out. ## Tests Manually before: ``` shreyas.goenka@THW32HFW6T mlops-stack % cli bundle init . Error: open /var/folders/lg/njll3hjx7pjcgxs6n7b290bw0000gp/T/templates3934264356/templates/databricks_template_schema.json: no such file or directory ``` after: ``` shreyas.goenka@THW32HFW6T mlops-stack % cli bundle init . Welcome to MLOps Stacks. For detailed information on project generation, see the README at https://github.com/databricks/mlops-stacks/blob/main/README.md. Project Name [my-mlops-project]: ^C ```	2023-11-14 22:09:18 +00:00
Andrew Nester	ea4153e323	Fixed flaky TestBackgroundCombinedOutputFailure (#978 ) ## Changes `TestBackgroundCombinedOutputFailure` was occasionally failing because combined output could have been in different order, see https://github.com/databricks/cli/actions/runs/6823883271/job/18558675165?pr=928	2023-11-10 14:09:02 +00:00
Serge Smertin	3284a8c56c	Improved usability of `databricks auth login ... --configure-cluster` flow by displaying cluster type and runtime version (#956 ) This PR adds selectors for Databricks-connect compatible clusters and SQL warehouses Tested in https://github.com/databricks/cli/pull/914	2023-11-09 16:38:45 +00:00
Serge Smertin	f111b0846e	Added process stubbing for easier testing of launched subprocesses (#963 ) ## Changes This PR makes unit testing with subprocesses fast. ``` ctx := context.Background() ctx, stub := process.WithStub(ctx) stub.WithDefaultOutput("meeee") ctx = env.Set(ctx, "FOO", "bar") out, err := process.Background(ctx, []string{"/usr/local/bin/meeecho", "1", "--foo", "bar"}) require.NoError(t, err) require.Equal(t, "meeee", out) require.Equal(t, 1, stub.Len()) require.Equal(t, []string{"meeecho 1 --foo bar"}, stub.Commands()) allEnv := stub.CombinedEnvironment() require.Equal(t, "bar", allEnv["FOO"]) require.Equal(t, "bar", stub.LookupEnv("FOO")) ``` This should make further iterations of https://github.com/databricks/cli/pull/914 easier ## Tests `make test`	2023-11-09 14:24:05 +00:00
shreyas-goenka	d4d4b7480f	Do not allow empty descriptions for bundle template inputs (#967 ) ## Changes We rely on the descriptions to render the prompts to a user. Thus we should not allow empty descriptions here. Note, both mlops stacks and the default-python template have descriptions for all their properties so this should not be an issue. ## Tests Unit test	2023-11-08 16:48:37 +00:00
Serge Smertin	e68a88e14d	Added `env.UserHomeDir(ctx)` for parallel-friendly tests (#955 ) ## Changes `os.Getenv(..)` is not friendly with `libs/env`. This PR makes the relevant changes to places where we need to read user home directory. ## Tests Mainly done in https://github.com/databricks/cli/pull/914	2023-11-08 14:50:20 +00:00
shreyas-goenka	283f24179d	Remove validation for default value against pattern (#959 ) ## Changes This PR removes validation for default value against the regex pattern specified in a JSON schema at schema load time. This is required because https://github.com/databricks/cli/pull/795 introduces parameterising the default value as a Go text template impling that the default value now does not necessarily have to match the pattern at schema load time. This will also unblock: https://github.com/databricks/mlops-stacks/pull/108 Note, this does not remove runtime validation for input parameters right before template initialization, which happens here: `fb32e78c9b/libs/template/materialize.go (L76)` ## Tests Changes to existing test.	2023-11-07 12:35:59 +00:00
shreyas-goenka	fb32e78c9b	Make to/from string methods private to the jsonschema package (#942 ) ## Changes This PR makes a few methods private, exposing cleaner interfaces to get the string representations for enums and default values of a JSON Schema. ## Tests Manually, template initialization for the `default-python` template still works as expected.	2023-11-06 15:05:17 +00:00
Pieter Noordhuis	8e1156edbd	Function to merge two instances of `config.Value` (#938 ) ## Changes Semantics for merging two instances of `config.Value`: * Merging x with nil or nil with x always yields x * Merging maps a and b means entries from map b take precedence * Merging sequences a and b means concatenating them These are the same semantics that we use today when calling into mergo in `bundle/config`. ## Tests Unit tests pass.	2023-11-03 19:15:47 +00:00
shreyas-goenka	d70d7445c4	Remove resolution of repo names against the Databricks Github account (#940 ) ## Changes This functionality is not exercised (and will not be anytime soon). Instead we use a map to have first party aliases for supported templates. `1e46b9f88a/cmd/bundle/init.go (L21)` ## Tests Existing tests and manually, bundle init still works.	2023-11-01 13:02:06 +00:00
Taiga Matsumoto	e408b701ac	Add override to support YAML inputs for apps (#921 ) ## Changes <!-- Summary of your changes that are easy to understand --> Take @andrefurlan-db 's original [commit](https://github.com/databricks/cli/compare/databricks:6e21ced...andrefurlan-db:12ed10c) to add `apps` support to the CLI and add the yaml file-support as an override (the apps routes are already apart of the Go SDK and are available for use in the CLI) NOTE: this feature is still private preview. CLI usage will be internal only ## Tests <!-- How is this tested? -->	2023-10-27 18:57:26 +00:00
shreyas-goenka	5a8cd0c5bc	Persist deployment metadata in WSFS (#845 ) ## Changes This PR introduces a metadata struct that stores a subset of bundle configuration that we wish to expose to other Databricks services that wish to integrate with bundles. This metadata file is uploaded to a file `${bundle.workspace.state_path}/metadata.json` in the WSFS destination of the bundle deployment. Documentation for emitted metadata fields: * `version`: Version for the metadata file schema * `config.bundle.git.branch`: Name of the git branch the bundle was deployed from. * `config.bundle.git.origin_url`: URL for git remote "origin" * `config.bundle.git.bundle_root_path`: Relative path of the bundle root from the root of the git repository. Is set to "." if they are the same. * `config.bundle.git.commit`: SHA-1 commit hash of the exact commit this bundle was deployed from. Note, the deployment might not exactly match this commit version if there are changes that have not been committed to git at deploy time, * `file_path`: Path in workspace where we sync bundle files to. * `resources.jobs.[job-ref].id`: Id of the job * `resources.jobs.[job-ref].relative_path`: Relative path of the yaml config file from the bundle root where this job was defined. Example metadata object when bundle root and git root are the same: ```json { "version": 1, "config": { "bundle": { "lock": {}, "git": { "branch": "master", "origin_url": "www.host.com", "commit": "7af8e5d3f5dceffff9295d42d21606ccf056dce0", "bundle_root_path": "." } }, "workspace": { "file_path": "/Users/shreyas.goenka@databricks.com/.bundle/pipeline-progress/default/files" }, "resources": { "jobs": { "bar": { "id": "245921165354846", "relative_path": "databricks.yml" } } }, "sync": {} } } ``` Example metadata when the git root is one level above the bundle repo: ```json { "version": 1, "config": { "bundle": { "lock": {}, "git": { "branch": "dev-branch", "origin_url": "www.my-repo.com", "commit": "3db46ef750998952b00a2b3e7991e31787e4b98b", "bundle_root_path": "pipeline-progress" } }, "workspace": { "file_path": "/Users/shreyas.goenka@databricks.com/.bundle/pipeline-progress/default/files" }, "resources": { "jobs": { "bar": { "id": "245921165354846", "relative_path": "databricks.yml" } } }, "sync": {} } } ``` This unblocks integration to the jobs break glass UI for bundles. ## Tests Unit tests and integration tests.	2023-10-27 12:55:43 +00:00
shreyas-goenka	a5815a0b47	Add welcome message to bundle templates (#907 ) ## Changes Adds a welcome_message field to templates and the default python template. ## Tests Manually. Here's the output logs during template init now: ``` shreyas.goenka@THW32HFW6T bricks % cli bundle init Template to use [default-python]: Welcome to the sample Databricks Asset Bundle template! Please enter the following information to initialize your sample DAB. Unique name for this project [my_project]: abcde Include a stub (sample) notebook in 'abcde/src': no Include a stub (sample) Delta Live Tables pipeline in 'abcde/src': yes Include a stub (sample) Python package in 'abcde/src': no ✨ Your new project has been created in the 'abcde' directory! Please refer to the README.md of your project for further instructions on getting started. Or read the documentation on Databricks Asset Bundles at https://docs.databricks.com/dev-tools/bundles/index.html. ```	2023-10-25 12:27:25 +00:00
Pieter Noordhuis	a60c40e71e	Add configuration normalization code (#915 ) ## Changes This is similar to #904 but instead of converting the dynamic configuration to Go structs, this normalizes a `config.Value` according to the type of a Go struct and returns the new, normalized `config.Value`. This will be used to ensure that two `config.Value` trees are type-compatible before we can merge them (i.e. instances from different files). Warnings and errors during normalization are accumulated and returned as a `diag.Diagnostics` structure. We can use this to surface warnings about unknown fields, or errors about invalid types, in aggregate instead of one-by-one. This approach is inspired by the pattern to accumulate diagnostics in Terraform provider code. ## Tests New unit tests.	2023-10-25 11:56:42 +00:00
shreyas-goenka	4a09ffc1ec	Add support for multiline descriptions when using template enums (#916 ) ## Changes This PR splits the question prompt at the last new line character to make multiline selection prompts work with `promptui` ## Tests Tested manually https://github.com/databricks/cli/assets/88374338/027e5210-f7f4-479d-98df-744d15b7a8fb	2023-10-25 09:37:25 +00:00
shreyas-goenka	f8d7e31118	Fix pattern validation for input properties (#912 ) ## Changes Fixes bug where input validation would only be done on the first input parameter in the template schema. ## Tests Unit test.	2023-10-24 15:56:54 +00:00
Pieter Noordhuis	3411b8aa37	Loading an empty file yields a nil (#906 ) ## Changes Empty YAML files are valid and should return a nil-equivalent when loaded. ## Tests Tests pass.	2023-10-24 11:24:43 +00:00
Pieter Noordhuis	5018059444	Library to convert config.Value to Go struct (#904 ) ## Changes Now that we have a new YAML loader (see #828), we need code to turn this into our Go structs. ## Tests New unit tests pass. Confirmed that we can replace our existing loader/converter with this one and that existing unit tests for bundle loading still pass.	2023-10-24 11:12:36 +00:00
Pieter Noordhuis	d4be40520c	Resolve configuration before performing verification (#890 ) ## Changes If a bundle configuration specifies a workspace host, and the user specifies a profile to use, we perform a check to confirm that the workspace host in the bundle configuration and the workspace host from the profile are identical. If they are not, we return an error. The check was introduced in #571. Previously, the code included an assumption that the client configuration was already loaded from the environment prior to performing the check. This was not the case, and as such if the user intended to use a non-default path to `.databrickscfg`, this path was not used when performing the check. The fix does the following: * Resolve the configuration prior to performing the check. * Don't treat the configuration file not existing as an error. * Add unit tests. Fixes #884. ## Tests Unit tests and manual confirmation.	2023-10-20 13:10:31 +00:00
Pieter Noordhuis	ab05f8e6e7	New YAML loader to support configuration location (#828 ) ## Changes In order to support variable interpolation on fields that aren't a string in the resource types, we need a separate representation of the bundle configuration tree with the type equivalent of Go's `any`. But instead of using `any` directly, we can do better and use a custom type equivalent to `any` that captures additional metadata. In this PR, the additional metadata is limited to the origin of the configuration value (file, line number, and column). The YAML in this commit uses the upstream YAML parser's `yaml.Node` type to get access to location information. It reimplements the loader that takes the `yaml.Node` structure and turns it into the configuration tree we need. Next steps after this PR: * Implement configuration tree type checking (against a Go type) * Implement configuration tree merging (to replace the current merge functionality) * Implement conversion to and from the bundle configuration struct * Perform variable interpolation against this configuration tree (to support variable interpolation for ints) * (later) Implement a `jsonloader` that produces the same tree and includes location information ## Tests The tests in `yamlloader` perform an equality check on the untyped output of loading a YAML file between the upstream YAML loader and this loader. The YAML examples were generated by prompting ChatGPT for examples that showcase anchors, primitive values, edge cases, etc.	2023-10-20 12:56:59 +00:00
shreyas-goenka	3700785dfa	Add support for validating CLI version when loading a jsonschema object (#883 ) ## Changes Updates to bundle templates can require updated versions of the CLI. This PR extends the JSON schema representation to allow template authors to set a min CLI version they require for their templates. This is required to make improvements/additions to the mlops-stacks repo ## Tests Tested using unit tests and manually. For manualy testing, I created a custom build of the CLI using go releaser and then tested it against a local instance of mlops-stack When mlops-stack schema has: ``` "min_databricks_cli_version": "v5000.1.1", ``` output (error as expected) ``` shreyas.goenka@THW32HFW6T bricks % ./dist/cli_darwin_arm64/databricks bundle init ~/mlops-stack Error: minimum CLI version "v5000.1.1" is greater than current CLI version "v0.207.2-dev+1b992c0". Please upgrade your current Databricks CLI ``` When the mlops-stack schema has: ``` "min_databricks_cli_version": "v0.1.1", ``` output (validation passes) ``` shreyas.goenka@THW32HFW6T bricks % ./dist/cli_darwin_arm64/databricks bundle init ~/mlops-stack Welcome to MLOps Stack. For detailed information on project generation, see the README at https://github.com/databricks/mlops-stack/blob/main/README.md. Project Name [my-mlops-project]: ^C ```	2023-10-19 14:01:48 +00:00
Lennart Kats (databricks)	a2ee8bb45b	Improve the output of the `databricks bundle init` command (#795 ) Improve the output of help, prompts, and so on for `databricks bundle init` and the default template. Among other things, this PR adds support for a new `welcome_message` property that lets a template print a custom message on success: ``` $ databricks bundle init Template to use [default-python]: Unique name for this project [my_project]: lennart_project Include a stub (sample) notebook in 'lennart_project/src': yes Include a stub (sample) Delta Live Tables pipeline in 'lennart_project/src': yes Include a stub (sample) Python package in 'lennart_project/src': yes ✨ Your new project has been created in the 'lennart_project' directory! Please refer to the README.md of your project for further instructions on getting started. Or read the documentation on Databricks Asset Bundles at https://docs.databricks.com/dev-tools/bundles/index.html. ``` --------- Co-authored-by: shreyas-goenka <88374338+shreyas-goenka@users.noreply.github.com>	2023-10-19 07:08:36 +00:00
Andrew Nester	8c1441ff71	Support .gitignore syntax in sync section and make sure it works recursively (#854 ) Fixes #815	2023-10-10 08:45:15 +00:00
shreyas-goenka	043e54950d	Add hint to delete sync snapshot if parsing fails (#853 ) ## Changes Example error: ``` Error: error parsing existing sync state. Please delete your existing sync snapshot file (.databricks/sync-snapshots/f3c00bc127903f9b.json) and retry: invalid sync state representation. Remote file footxt is missing the corresponding local file ``` ## Tests Manually	2023-10-09 15:26:46 +00:00
Lennart Kats (databricks)	0894584132	Minor template tweaks (#832 ) ## Changes Minor tweaks to the template.	2023-10-04 15:27:09 +00:00
shreyas-goenka	40ae23bb33	Refactor change computation for sync (#785 ) ## Changes This PR pays some tech debt by refactoring sync diff computation into interfaces that are more robust. Specifically: 1. Refactor the single diff computation function into a `SnapshotState` class that computes the target state only based on the current local files making it more robust and not carrying over state from previous iterations. 2. Adds new validations for the sync state which make sure that the invariants that downstream code expects are actually held true. This prevents a class of issues where these invariants break and the synchroniser behaves unexpectedly. Note, this does not change the existing schema for the snapshot, only the way the diff is computed, and thus is backwards compatible (ie does not require a schema version bump). ## Tests <!-- How is this tested? -->	2023-10-03 13:47:46 +00:00
Serge Smertin	7d0f170eee	Added `python.DetectInterpreters` and other utils (#805 ) This PR adds a few utilities related to Python interpreter detection: - `python.DetectInterpreters` to detect all Python versions available in `$PATH` by executing every matched binary name with `--version` flag. - `python.DetectVirtualEnvPath` to detect if there's any child virtual environment in `src` directory - `python.DetectExecutable` to detect if there's python3 installed either by `which python3` command or by calling `python.DetectInterpreters().AtLeast("v3.8")` To be merged after https://github.com/databricks/cli/pull/804, as one of the steps to get https://github.com/databricks/cli/pull/637 in, as previously discussed.	2023-10-03 10:47:09 +00:00
Pieter Noordhuis	f1b068cefe	Use normalized short name for tag value in development mode (#821 ) ## Changes The jobs backend propagates job tags to the underlying cloud provider's resources. As such, they need to match the constraints a cloud provider places on tag values. The display name can contain anything. With this change, we modify the tag value to equal the short name as used in the name prefix. Additionally, we leverage tag normalization as introduced in #819 to make sure characters that aren't accepted are removed before using the value as a tag value. This is a new stab at #810 and should completely eliminate this class of problems. ## Tests Tests pass.	2023-10-02 06:58:51 +00:00
Pieter Noordhuis	4226c88e98	Library to validate and normalize cloud specific tags (#819 ) ## Changes Prompted by the proposed fix for a tagging-related problem in #810, I investigated how tag validation works. This turned out to be quite a bit more complex than anticipated. Tags at the job level (or cluster level) are passed through to the underlying compute infrastructure and as such are tested against cloud-specific validation rules. GCP appears to be the most restrictive. It would be disappointing to always restrict to `\w+`, so this package implements validation and normalization rules for each cloud. It can pick the right cloud to use using a Go SDK configuration. ## Tests Exhaustive unit tests. The regular expressions were pulled by #814.	2023-09-29 08:49:08 +00:00
Serge Smertin	7171874db0	Added `process.Background()` and `process.Forwarded()` (#804 ) ## Changes This PR adds higher-level wrappers for calling subprocesses. One of the steps to get https://github.com/databricks/cli/pull/637 in, as previously discussed. The reason to add `process.Forwarded()` is to proxy Python's `input()` calls from a child process seamlessly. Another use-case is plugging in `less` as a pager for the list results. ## Tests `make test`	2023-09-27 09:04:44 +00:00
Lennart Kats (databricks)	0c1516c4ba	Make the default `databricks bundle init` template more self-explanatory (#796 ) This makes the default-python template more self-explanatory and adds a few other tweaks for a better out-of-the-box experience.	2023-09-26 09:12:34 +00:00
shreyas-goenka	757d5efe8d	Add support for regex patterns in template schema (#768 ) ## Changes This PR introduces support for regex pattern validation in our custom jsonschema validator. This allows us to fail early if a user enters an invalid value for a field. For example, now this is what initializing the default template looks like with an invalid project name: ``` shreyas.goenka@THW32HFW6T bricks % cli bundle init Template to use [default-python]: Unique name for this project [my_project]: (__) Error: invalid value for project_name: (__). Must consist of letter and underscores only. ``` ## Tests New unit tests and manually.	2023-09-25 09:53:38 +00:00
shreyas-goenka	2c58deb2c5	Fall back to full Git clone if shallow clone is not supported (#775 ) ## Changes Git repos hosted over HTTP do not support shallow cloning. This PR adds retry logic if we detect shallow cloning is not supported. Note I saw the match string `dumb http transport does not support shallow capabilities` being reported in for different hosts on the internet, so this should work accross a large class of git servers. Howerver, it's not strictly necessary to have the `--depth` flag so we can remove it if this issue is reported again. ## Tests Tested manually. `bundle init` successfully downloads the private HTTP repo reported during by internal user.	2023-09-15 09:14:51 +00:00
shreyas-goenka	327ab0e598	Error when unknown keys are encounters during template execution (#766 ) ## Tests New unit test and manually	2023-09-14 15:53:20 +00:00
shreyas-goenka	be55310cc9	Use enums for default python template (#765 ) ## Changes This PR changes schema to use the enum type for the default template yes/no questions. ## Tests Manually	2023-09-13 17:57:31 +00:00
Pieter Noordhuis	0cb05d1ded	Prompt once for a client profile (#727 ) ## Changes The previous implementation ran the risk of infinite looping for the account client due to a mismatch in determining what constitutes an account client between the CLI and SDK (see [here](`83443bae8d/libs/databrickscfg/profiles.go (L61)`) and [here](`0fdc5165e5/config/config.go (L160)`)). Ultimately, this code must never infinite loop. If a user is prompted and selects a profile that cannot be used, they should receive that feedback immediately and try again, instead of being prompted again. Related to #726. ## Tests <!-- How is this tested? -->	2023-09-11 15:32:24 +00:00
Lennart Kats (databricks)	a4e94e1b36	Fix author in setup.py (#761 ) Fix author in setup.py showing <no value>	2023-09-11 08:59:48 +00:00
Pieter Noordhuis	4ccc70aeac	Consolidate environment variable interaction (#747 ) ## Changes There are a couple places throughout the code base where interaction with environment variables takes place. Moreover, more than one of these would try to read a value from more than one environment variable as fallback (for backwards compatibility). This change consolidates those accesses. The majority of diffs in this change are mechanical (i.e. add an argument or replace a call). This change: * Moves common environment variable lookups for bundles to `bundles/env`. * Adds a `libs/env` package that wraps `os.LookupEnv` and `os.Getenv` and allows for overrides to take place in a `context.Context`. By scoping overrides to a `context.Context` we can avoid `t.Setenv` in testing and unlock parallel test execution for integration tests. * Updates call sites to pass through a `context.Context` where needed. * For bundles, introduces `DATABRICKS_BUNDLE_ROOT` as new primary variable instead of `BUNDLE_ROOT`. This was the last environment variable that did not use the `DATABRICKS_` prefix. ## Tests Unit tests pass.	2023-09-11 08:18:43 +00:00
Lennart Kats (databricks)	9e56bed593	Minor default template tweaks (#758 ) Minor template tweaks, mostly making the imports section for DLT notebooks a bit more elegant. Tested with DAB deployment + in-workspace UI.	2023-09-11 07:36:44 +00:00
shreyas-goenka	d9a276b17d	Fix minor typos in default-python template (#754 ) Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com>	2023-09-09 21:55:43 +00:00
shreyas-goenka	7c96270db8	Add enum support for bundle templates (#668 ) ## Changes This PR includes: 1. Adding enum field to the json schema struct 2. Adding prompting logic for enum values. See demo for how it looks 3. Validation rules, validating the default value and config values when an enum list is specified This will now enable template authors to use enums for input parameters. ## Tests Manually and new unit tests	2023-09-08 12:07:22 +00:00
shreyas-goenka	1a7bf4e4f1	Add schema and config validation to jsonschema package (#740 ) ## Changes At a high level this PR adds new schema validation and moves functionality that should be present in the jsonschema package, but resides in the template package today, to the jsonschema package. This includes for example schema validation, schema instance validation, to / from string conversion methods etc. The list below outlines all the pieces that have been moved over, and the new validation bits added. This PR: 1. Adds casting default value of schema properties to integers to the jsonschema.Load method. 2. Adds validation for default value types for schema properties, checking they are consistant with the type defined. 3. Introduces the LoadInstance and ValidateInstance methods to the json schema package. These methods can be used to read and validate JSON documents against the schema. 4. Replaces validation done for template inputs to use the newly defined JSON schema validation functions. 5. Moves to/from string and isInteger utility methods to the json schema package. ## Tests Existing and new unit tests.	2023-09-07 14:36:06 +00:00
Lennart Kats (databricks)	50b2c0b83b	Fix notebook showing up in template when not selected (#743 ) ## Changes This fixes a typo that caused the notebook.ipynb file to show up even if the user answered "no" to the question about including a notebook. ## Tests We have matrix validation tests for all the yes/no combinations and whether the build + validate. There is no current test for the absence of files.	2023-09-07 08:26:43 +00:00
Lennart Kats (databricks)	3c79181148	Remove unused file (#742 ) defaults.json was originally used in tests. It's no longer used and should be removed.	2023-09-06 18:18:15 +00:00
Lennart Kats (databricks)	f9e521b43e	databricks bundle init template v2: optional stubs, DLT support (#700 ) ## Changes This follows up on https://github.com/databricks/cli/pull/686. This PR makes our stubs optional + it adds DLT stubs: ``` $ databricks bundle init Template to use [default-python]: default-python Unique name for this project [my_project]: my_project Include a stub (sample) notebook in 'my_project/src' [yes]: yes Include a stub (sample) DLT pipeline in 'my_project/src' [yes]: yes Include a stub (sample) Python package 'my_project/src' [yes]: yes ✨ Successfully initialized template ``` ## Tests Manual testing, matrix tests. --------- Co-authored-by: Andrew Nester <andrew.nester@databricks.com> Co-authored-by: PaulCornellDB <paul.cornell@databricks.com> Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com>	2023-09-06 09:52:31 +00:00
Lennart Kats (databricks)	8c2cc07f7b	databricks bundle init template v1 (#686 ) ## Changes This adds a built-in "default-python" template to the CLI. This is based on the new default-template support of https://github.com/databricks/cli/pull/685. The goal here is to offer an experience where customers can simply type `databricks bundle init` to get a default template: ``` $ databricks bundle init Template to use [default-python]: default-python Unique name for this project [my_project]: my_project ✨ Successfully initialized template ``` The present template: - [x] Works well with VS Code - [x] Works well with the workspace - [x] Works well with DB Connect - [x] Uses minimal stubs rather than boiler-plate-heavy examples I'll have a followup with tests + DLT support. --------- Co-authored-by: Andrew Nester <andrew.nester@databricks.com> Co-authored-by: PaulCornellDB <paul.cornell@databricks.com> Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com>	2023-09-05 11:58:34 +00:00
Lennart Kats (databricks)	947d5b1e5c	Fix IsServicePrincipal() only working for workspace admins (#732 ) ## Changes The latest rendition of isServicePrincipal no longer worked for non-admin users as it used the "principals get" API. This new version relies on the property that service principals always have a UUID as their userName. This was tested with the eng-jaws principal (8b948b2e-d2b5-4b9e-8274-11b596f3b652).	2023-09-05 11:20:55 +00:00
shreyas-goenka	bbbeabf98c	Add support for ordering of input prompts (#662 ) ## Changes JSON schema properties are a map and thus unordered. This PR introduces a JSON schema extension field called `order` to allow template authors to define the order in which template variables should be resolved/prompted. ## Tests Unit tests. --------- Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com>	2023-09-05 11:08:25 +00:00
Pieter Noordhuis	1752e29885	Update Go SDK to v0.19.0 (#729 ) ## Changes * Update Go SDK to v0.19.0 * Update commands per OpenAPI spec from Go SDK * Incorporate `client.Do()` signature change to include a (nil) header map * Update `workspace.WorkspaceService` mock with permissions methods * Skip `files` service in codegen; already implemented under the `fs` command ## Tests Unit and integration tests pass.	2023-09-05 09:43:57 +00:00
Lennart Kats (databricks)	707fd6f617	Cleanup after "Add a foundation for built-in templates" (#707 ) ## Changes Add some cleanup based on @pietern's comments on https://github.com/databricks/cli/pull/685	2023-08-30 14:01:08 +00:00
Lennart Kats (databricks)	a5b86093ec	Add a foundation for built-in templates (#685 ) ## Changes This pull request extends the templating support in preparation of a new, default template (WIP, https://github.com/databricks/cli/pull/686): * builtin templates that can be initialized using e.g. `databricks bundle init default-python` * builtin templates are embedded into the executable using go's `embed` functionality, making sure they're co-versioned with the CLI * new helpers to get the workspace name, current user name, etc. help craft a complete template * (not enabled yet) when the user types `databricks bundle init` they can interactively select the `default-python` template And makes two tangentially related changes: * IsServicePrincipal now uses the "users" API rather than the "principals" API, since the latter is too slow for our purposes. * mode: prod no longer requires the 'target.prod.git' setting. It's hard to set that from a template. (Pieter is planning an overhaul of warnings support; this would be one of the first warnings we show.) The actual `default-python` template is maintained in a separate PR: https://github.com/databricks/cli/pull/686 ## Tests Unit tests, manual testing	2023-08-25 09:03:42 +00:00
Serge Smertin	5ed635a240	Added `databricks account o-auth-enrollment enable` command (#687 ) This command takes the user through the interactive flow to set up OAuth for a fresh account, where only Basic authentication works. --------- Co-authored-by: Andrew Nester <andrew.nester@databricks.com>	2023-08-21 16:17:02 +00:00
Pieter Noordhuis	c25bc041b1	Never ignore root directory when enumerating files in a repository (#683 ) ## Changes The pattern `.*` in a `.gitignore` file can match `.` when walking all files in a repository. If it does, then the walker immediately aborts and no files are returned. The root directory (an unnamed directory) must never be ignored. Reported in https://github.com/databricks/databricks-vscode/issues/837. ## Tests New tests pass.	2023-08-21 07:35:02 +00:00
Andrew Nester	e3e9bc6def	Added support for sync.include and sync.exclude sections (#671 ) ## Changes Added support for `sync.include` and `sync.exclude` sections ## Tests Added `sample-java` folder to gitignore ``` bundle: name: wheel-task sync: include: - "./sample-java/*.kts" ``` Kotlin files were correctly synced. ``` [DEBUG] Test execution command: /opt/homebrew/opt/go@1.21/bin/go test ./... -json -timeout 1h -coverpkg=./... -coverprofile=coverage.txt -run ^TestAcc [DEBUG] Test execution directory: /Users/andrew.nester/cli 2023/08/17 17:12:10 [INFO] ✅ TestAccAlertsCreateErrWhenNoArguments (2.320s) 2023/08/17 17:12:10 [INFO] ✅ TestAccApiGet (0.650s) 2023/08/17 17:12:12 [INFO] ✅ TestAccClustersList (1.060s) 2023/08/17 17:12:12 [INFO] ✅ TestAccClustersGet (0.760s) 2023/08/17 17:12:26 [INFO] ✅ TestAccFilerWorkspaceFilesReadWrite (13.270s) 2023/08/17 17:12:32 [INFO] ✅ TestAccFilerWorkspaceFilesReadDir (6.860s) 2023/08/17 17:12:46 [INFO] ✅ TestAccFilerDbfsReadWrite (13.380s) 2023/08/17 17:12:53 [INFO] ✅ TestAccFilerDbfsReadDir (7.460s) 2023/08/17 17:13:01 [INFO] ✅ TestAccFilerWorkspaceNotebookConflict (7.920s) 2023/08/17 17:13:10 [INFO] ✅ TestAccFilerWorkspaceNotebookWithOverwriteFlag (9.290s) 2023/08/17 17:13:10 [INFO] ✅ TestAccFilerLocalReadWrite (0.010s) 2023/08/17 17:13:11 [INFO] ✅ TestAccFilerLocalReadDir (0.010s) 2023/08/17 17:13:14 [INFO] ✅ TestAccFsCatForDbfs (3.180s) 2023/08/17 17:13:15 [INFO] ✅ TestAccFsCatForDbfsOnNonExistentFile (0.940s) 2023/08/17 17:13:15 [INFO] ✅ TestAccFsCatForDbfsInvalidScheme (0.560s) 2023/08/17 17:13:18 [INFO] ✅ TestAccFsCatDoesNotSupportOutputModeJson (2.910s) 2023/08/17 17:13:51 [INFO] ✅ TestAccFsCpDir (32.730s) 2023/08/17 17:14:06 [INFO] ✅ TestAccFsCpFileToFile (14.740s) 2023/08/17 17:14:20 [INFO] ✅ TestAccFsCpFileToDir (14.340s) 2023/08/17 17:14:53 [INFO] ✅ TestAccFsCpDirToDirFileNotOverwritten (32.710s) 2023/08/17 17:15:12 [INFO] ✅ TestAccFsCpFileToDirFileNotOverwritten (19.590s) 2023/08/17 17:15:32 [INFO] ✅ TestAccFsCpFileToFileFileNotOverwritten (19.950s) 2023/08/17 17:16:11 [INFO] ✅ TestAccFsCpDirToDirWithOverwriteFlag (38.970s) 2023/08/17 17:16:32 [INFO] ✅ TestAccFsCpFileToFileWithOverwriteFlag (21.040s) 2023/08/17 17:16:52 [INFO] ✅ TestAccFsCpFileToDirWithOverwriteFlag (19.670s) 2023/08/17 17:16:54 [INFO] ✅ TestAccFsCpErrorsWhenSourceIsDirWithoutRecursiveFlag (1.890s) 2023/08/17 17:16:54 [INFO] ✅ TestAccFsCpErrorsOnInvalidScheme (0.690s) 2023/08/17 17:17:10 [INFO] ✅ TestAccFsCpSourceIsDirectoryButTargetIsFile (15.810s) 2023/08/17 17:17:14 [INFO] ✅ TestAccFsLsForDbfs (4.000s) 2023/08/17 17:17:18 [INFO] ✅ TestAccFsLsForDbfsWithAbsolutePaths (4.000s) 2023/08/17 17:17:21 [INFO] ✅ TestAccFsLsForDbfsOnFile (3.140s) 2023/08/17 17:17:23 [INFO] ✅ TestAccFsLsForDbfsOnEmptyDir (2.030s) 2023/08/17 17:17:24 [INFO] ✅ TestAccFsLsForDbfsForNonexistingDir (0.840s) 2023/08/17 17:17:25 [INFO] ✅ TestAccFsLsWithoutScheme (0.590s) 2023/08/17 17:17:27 [INFO] ✅ TestAccFsMkdirCreatesDirectory (2.310s) 2023/08/17 17:17:30 [INFO] ✅ TestAccFsMkdirCreatesMultipleDirectories (2.800s) 2023/08/17 17:17:33 [INFO] ✅ TestAccFsMkdirWhenDirectoryAlreadyExists (2.700s) 2023/08/17 17:17:35 [INFO] ✅ TestAccFsMkdirWhenFileExistsAtPath (2.870s) 2023/08/17 17:17:40 [INFO] ✅ TestAccFsRmForFile (4.030s) 2023/08/17 17:17:43 [INFO] ✅ TestAccFsRmForEmptyDirectory (3.470s) 2023/08/17 17:17:46 [INFO] ✅ TestAccFsRmForNonEmptyDirectory (3.350s) 2023/08/17 17:17:47 [INFO] ✅ TestAccFsRmForNonExistentFile (0.940s) 2023/08/17 17:17:51 [INFO] ✅ TestAccFsRmForNonEmptyDirectoryWithRecursiveFlag (3.570s) 2023/08/17 17:17:52 [INFO] ✅ TestAccGitClone (0.890s) 2023/08/17 17:17:52 [INFO] ✅ TestAccGitCloneWithOnlyRepoNameOnAlternateBranch (0.730s) 2023/08/17 17:17:53 [INFO] ✅ TestAccGitCloneErrorsWhenRepositoryDoesNotExist (0.540s) 2023/08/17 17:18:02 [INFO] ✅ TestAccLock (8.800s) 2023/08/17 17:18:06 [INFO] ✅ TestAccLockUnlockWithoutAllowsLockFileNotExist (3.930s) 2023/08/17 17:18:09 [INFO] ✅ TestAccLockUnlockWithAllowsLockFileNotExist (3.320s) 2023/08/17 17:18:20 [INFO] ✅ TestAccSyncFullFileSync (10.570s) 2023/08/17 17:18:31 [INFO] ✅ TestAccSyncIncrementalFileSync (11.460s) 2023/08/17 17:18:42 [INFO] ✅ TestAccSyncNestedFolderSync (10.850s) 2023/08/17 17:18:53 [INFO] ✅ TestAccSyncNestedFolderDoesntFailOnNonEmptyDirectory (10.650s) 2023/08/17 17:19:04 [INFO] ✅ TestAccSyncNestedSpacePlusAndHashAreEscapedSync (10.930s) 2023/08/17 17:19:11 [INFO] ✅ TestAccSyncIncrementalFileOverwritesFolder (7.010s) 2023/08/17 17:19:18 [INFO] ✅ TestAccSyncIncrementalSyncPythonNotebookToFile (7.380s) 2023/08/17 17:19:24 [INFO] ✅ TestAccSyncIncrementalSyncFileToPythonNotebook (6.220s) 2023/08/17 17:19:30 [INFO] ✅ TestAccSyncIncrementalSyncPythonNotebookDelete (5.530s) 2023/08/17 17:19:32 [INFO] ✅ TestAccSyncEnsureRemotePathIsUsableIfRepoDoesntExist (2.620s) 2023/08/17 17:19:38 [INFO] ✅ TestAccSyncEnsureRemotePathIsUsableIfRepoExists (5.460s) 2023/08/17 17:19:40 [INFO] ✅ TestAccSyncEnsureRemotePathIsUsableInWorkspace (1.850s) 2023/08/17 17:19:40 [INFO] ✅ TestAccWorkspaceList (0.780s) 2023/08/17 17:19:51 [INFO] ✅ TestAccExportDir (10.350s) 2023/08/17 17:19:54 [INFO] ✅ TestAccExportDirDoesNotOverwrite (3.330s) 2023/08/17 17:19:58 [INFO] ✅ TestAccExportDirWithOverwriteFlag (3.770s) 2023/08/17 17:20:07 [INFO] ✅ TestAccImportDir (9.320s) 2023/08/17 17:20:24 [INFO] ✅ TestAccImportDirDoesNotOverwrite (16.950s) 2023/08/17 17:20:35 [INFO] ✅ TestAccImportDirWithOverwriteFlag (10.620s) 2023/08/17 17:20:35 [INFO] ✅ 68/68 passed, 0 failed, 3 skipped ```	2023-08-18 08:07:25 +00:00
shreyas-goenka	042fbaa614	Rename init project-dir flag to output-dir (#676 ) ## Changes This PR: 1. Renames the project-dir flag to output-dir 2. Makes the project dir flag optional. When unspecified we default to the current working directory. ## Tests Manually --------- Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com>	2023-08-17 20:32:30 +00:00
shreyas-goenka	6a843f28ef	Correct name for force acquire deploy flag (#656 ) ## Changes As discussed here, the name for this flag should be `force-lock`: https://github.com/databricks/cli/pull/578#discussion_r1276233445 ## Tests Manually and existing tests	2023-08-15 19:03:43 +00:00
shreyas-goenka	6c644e159c	Add map and pair helper functions for bundle templates (#665 ) ## Changes Go text templates allows only specifying one input argument for invocations of associated templates (ie `{{template ...}}`). This PR introduces the map and pair functions which allow template authors to work around this limitation by passing multiple arguments as key value pairs in a map. This PR is based on feedback from the mlops stacks migration where otherwise a bunch of duplicate code is required for computed values and fixtures. ## Tests Unit test	2023-08-15 16:07:22 +00:00
shreyas-goenka	61b103318f	Use custom prompter for bundle template inputs (#663 ) ## Changes Prompt UI glitches often. We are switching to a custom implementation of a simple prompter which is much more stable. This also allows new lines in prompts which has been an ask by the mlflow team. ## Tests Tested manually	2023-08-15 14:50:20 +00:00
shreyas-goenka	878bb6deae	Return better error messages for invalid JSON schema types in templates (#661 ) ## Changes Adds a function to validate json schema types added by the author. The default json unmarshaller does not validate that the parsed type matches the enum defined in `jsonschema.Type` Includes some other improvements to provide better error messages. This PR was prompted by usability difficulties reported by @mingyu89 during mlops stack migration. ## Tests Unit tests	2023-08-15 14:28:04 +00:00
Andrew Nester	6e708da6fc	Upgraded Go version to 1.21 (#664 ) ## Changes Upgraded Go version to 1.21 Upgraded to use `slices` and `slog` from core instead of experimental. Still use `exp/maps` as our code relies on `maps.Keys` which is not part of core package and therefore refactoring required. ### Tests Integration tests passed ``` [DEBUG] Test execution command: /opt/homebrew/opt/go@1.21/bin/go test ./... -json -timeout 1h -run ^TestAcc [DEBUG] Test execution directory: /Users/andrew.nester/cli 2023/08/15 13:20:51 [INFO] ✅ TestAccAlertsCreateErrWhenNoArguments (2.150s) 2023/08/15 13:20:52 [INFO] ✅ TestAccApiGet (0.580s) 2023/08/15 13:20:53 [INFO] ✅ TestAccClustersList (0.900s) 2023/08/15 13:20:54 [INFO] ✅ TestAccClustersGet (0.870s) 2023/08/15 13:21:06 [INFO] ✅ TestAccFilerWorkspaceFilesReadWrite (11.980s) 2023/08/15 13:21:13 [INFO] ✅ TestAccFilerWorkspaceFilesReadDir (7.060s) 2023/08/15 13:21:25 [INFO] ✅ TestAccFilerDbfsReadWrite (12.810s) 2023/08/15 13:21:33 [INFO] ✅ TestAccFilerDbfsReadDir (7.380s) 2023/08/15 13:21:41 [INFO] ✅ TestAccFilerWorkspaceNotebookConflict (7.760s) 2023/08/15 13:21:49 [INFO] ✅ TestAccFilerWorkspaceNotebookWithOverwriteFlag (8.660s) 2023/08/15 13:21:49 [INFO] ✅ TestAccFilerLocalReadWrite (0.020s) 2023/08/15 13:21:49 [INFO] ✅ TestAccFilerLocalReadDir (0.010s) 2023/08/15 13:21:52 [INFO] ✅ TestAccFsCatForDbfs (3.190s) 2023/08/15 13:21:53 [INFO] ✅ TestAccFsCatForDbfsOnNonExistentFile (0.890s) 2023/08/15 13:21:54 [INFO] ✅ TestAccFsCatForDbfsInvalidScheme (0.600s) 2023/08/15 13:21:57 [INFO] ✅ TestAccFsCatDoesNotSupportOutputModeJson (2.960s) 2023/08/15 13:22:28 [INFO] ✅ TestAccFsCpDir (31.480s) 2023/08/15 13:22:43 [INFO] ✅ TestAccFsCpFileToFile (14.530s) 2023/08/15 13:22:58 [INFO] ✅ TestAccFsCpFileToDir (14.610s) 2023/08/15 13:23:29 [INFO] ✅ TestAccFsCpDirToDirFileNotOverwritten (31.810s) 2023/08/15 13:23:47 [INFO] ✅ TestAccFsCpFileToDirFileNotOverwritten (17.500s) 2023/08/15 13:24:04 [INFO] ✅ TestAccFsCpFileToFileFileNotOverwritten (17.260s) 2023/08/15 13:24:37 [INFO] ✅ TestAccFsCpDirToDirWithOverwriteFlag (32.690s) 2023/08/15 13:24:56 [INFO] ✅ TestAccFsCpFileToFileWithOverwriteFlag (19.290s) 2023/08/15 13:25:15 [INFO] ✅ TestAccFsCpFileToDirWithOverwriteFlag (19.230s) 2023/08/15 13:25:17 [INFO] ✅ TestAccFsCpErrorsWhenSourceIsDirWithoutRecursiveFlag (2.010s) 2023/08/15 13:25:18 [INFO] ✅ TestAccFsCpErrorsOnInvalidScheme (0.610s) 2023/08/15 13:25:33 [INFO] ✅ TestAccFsCpSourceIsDirectoryButTargetIsFile (14.900s) 2023/08/15 13:25:37 [INFO] ✅ TestAccFsLsForDbfs (3.770s) 2023/08/15 13:25:41 [INFO] ✅ TestAccFsLsForDbfsWithAbsolutePaths (4.160s) 2023/08/15 13:25:44 [INFO] ✅ TestAccFsLsForDbfsOnFile (2.990s) 2023/08/15 13:25:46 [INFO] ✅ TestAccFsLsForDbfsOnEmptyDir (1.870s) 2023/08/15 13:25:46 [INFO] ✅ TestAccFsLsForDbfsForNonexistingDir (0.850s) 2023/08/15 13:25:47 [INFO] ✅ TestAccFsLsWithoutScheme (0.560s) 2023/08/15 13:25:49 [INFO] ✅ TestAccFsMkdirCreatesDirectory (2.310s) 2023/08/15 13:25:52 [INFO] ✅ TestAccFsMkdirCreatesMultipleDirectories (2.920s) 2023/08/15 13:25:55 [INFO] ✅ TestAccFsMkdirWhenDirectoryAlreadyExists (2.320s) 2023/08/15 13:25:57 [INFO] ✅ TestAccFsMkdirWhenFileExistsAtPath (2.820s) 2023/08/15 13:26:01 [INFO] ✅ TestAccFsRmForFile (4.030s) 2023/08/15 13:26:05 [INFO] ✅ TestAccFsRmForEmptyDirectory (3.530s) 2023/08/15 13:26:08 [INFO] ✅ TestAccFsRmForNonEmptyDirectory (3.190s) 2023/08/15 13:26:09 [INFO] ✅ TestAccFsRmForNonExistentFile (0.830s) 2023/08/15 13:26:13 [INFO] ✅ TestAccFsRmForNonEmptyDirectoryWithRecursiveFlag (3.580s) 2023/08/15 13:26:13 [INFO] ✅ TestAccGitClone (0.800s) 2023/08/15 13:26:14 [INFO] ✅ TestAccGitCloneWithOnlyRepoNameOnAlternateBranch (0.790s) 2023/08/15 13:26:15 [INFO] ✅ TestAccGitCloneErrorsWhenRepositoryDoesNotExist (0.540s) 2023/08/15 13:26:23 [INFO] ✅ TestAccLock (8.630s) 2023/08/15 13:26:27 [INFO] ✅ TestAccLockUnlockWithoutAllowsLockFileNotExist (3.490s) 2023/08/15 13:26:30 [INFO] ✅ TestAccLockUnlockWithAllowsLockFileNotExist (3.130s) 2023/08/15 13:26:39 [INFO] ✅ TestAccSyncFullFileSync (9.370s) 2023/08/15 13:26:50 [INFO] ✅ TestAccSyncIncrementalFileSync (10.390s) 2023/08/15 13:27:00 [INFO] ✅ TestAccSyncNestedFolderSync (10.680s) 2023/08/15 13:27:11 [INFO] ✅ TestAccSyncNestedFolderDoesntFailOnNonEmptyDirectory (10.970s) 2023/08/15 13:27:22 [INFO] ✅ TestAccSyncNestedSpacePlusAndHashAreEscapedSync (10.930s) 2023/08/15 13:27:29 [INFO] ✅ TestAccSyncIncrementalFileOverwritesFolder (7.020s) 2023/08/15 13:27:37 [INFO] ✅ TestAccSyncIncrementalSyncPythonNotebookToFile (7.380s) 2023/08/15 13:27:43 [INFO] ✅ TestAccSyncIncrementalSyncFileToPythonNotebook (6.050s) 2023/08/15 13:27:48 [INFO] ✅ TestAccSyncIncrementalSyncPythonNotebookDelete (5.390s) 2023/08/15 13:27:51 [INFO] ✅ TestAccSyncEnsureRemotePathIsUsableIfRepoDoesntExist (2.570s) 2023/08/15 13:27:56 [INFO] ✅ TestAccSyncEnsureRemotePathIsUsableIfRepoExists (5.540s) 2023/08/15 13:27:58 [INFO] ✅ TestAccSyncEnsureRemotePathIsUsableInWorkspace (1.840s) 2023/08/15 13:27:59 [INFO] ✅ TestAccWorkspaceList (0.790s) 2023/08/15 13:28:08 [INFO] ✅ TestAccExportDir (8.860s) 2023/08/15 13:28:11 [INFO] ✅ TestAccExportDirDoesNotOverwrite (3.090s) 2023/08/15 13:28:14 [INFO] ✅ TestAccExportDirWithOverwriteFlag (3.500s) 2023/08/15 13:28:23 [INFO] ✅ TestAccImportDir (8.330s) 2023/08/15 13:28:34 [INFO] ✅ TestAccImportDirDoesNotOverwrite (10.970s) 2023/08/15 13:28:44 [INFO] ✅ TestAccImportDirWithOverwriteFlag (10.130s) 2023/08/15 13:28:44 [INFO] ✅ 68/68 passed, 0 failed, 3 skipped ```	2023-08-15 13:50:40 +00:00
Miles Yucht	5b819cd982	Always resolve .databrickscfg file (#659 ) ## Changes #629 introduced a change to autopopulate the host from .databrickscfg if the user is logging back into a host they were previously using. This did not respect the DATABRICKS_CONFIG_FILE env variable, causing the flow to stop working for users with no .databrickscfg file in their home directory. This PR refactors all config file loading to go through one interface, `databrickscfg.GetDatabricksCfg()`, and an auxiliary `databrickscfg.GetDatabricksCfgPath()` to get the configured file path. Closes #655. ## Tests ``` $ databricks auth login --profile abc Error: open /Users/miles/.databrickscfg: no such file or directory $ ./cli auth login --profile abc Error: cannot load Databricks config file: open /Users/miles/.databrickscfg: no such file or directory $ DATABRICKS_CONFIG_FILE=~/.databrickscfg.bak ./cli auth login --profile abc Databricks Host: https://asdf ```	2023-08-14 12:45:08 +00:00
shreyas-goenka	6ea70c82a9	Execute paths without the .tmpl extension as templates (#654 ) ## Changes The `.tmpl` extension is only meant as a qualifier for whether the file content is executed as a template. All file paths in the `template` directory should be treated as valid go text templates. Before only paths with the `.tmpl` extensions would be resolved as templates, after this change, all file paths are interpreted as templates. ## Tests Unit test. The newly added unit tests also asserts that the file path is correct, even when the `.tmpl` extension is missing.	2023-08-11 13:48:32 +00:00
Pieter Noordhuis	2a58253d20	Consolidate functions in libs/git (#652 ) ## Changes The functions in `libs/git/git.go` assumed global state (e.g. working directory) and were no longer used. This change consolidates the functionality to turn an origin URL into an HTTPS URL. Closes #187. ## Tests Expanded existing unit test.	2023-08-10 09:36:42 +00:00
shreyas-goenka	6430d23453	Print y/n options when displaying prompts using cmdio.Ask (#650 ) ## Changes Adds `[y/n]` in `cmdio.Ask` to make the options obvious in all question prompts ## Tests Test manually. Works.	2023-08-09 09:22:42 +00:00
shreyas-goenka	d6f626912f	Fix bundle git branch validation (#645 ) ## Changes This PR: 1. Fixes the computation logic for `ActualBranch`. An error in the earlier logic caused the validation mutator to be a no-op. 2. Makes the `.git` string a global var. This is useful to configure in tests. 3. Adds e2e test for the validation mutator. ## Tests Unit test	2023-08-07 17:29:02 +00:00
shreyas-goenka	81ee031a04	Add bundle init command and support for prompting user for input values (#631 ) ## Changes This PR adds two features: 1. The bundle init command 2. Support for prompting for input values In order to do this, this PR also introduces a new `config` struct which handles reading config files, prompting users and all validation steps before we materialize the template With this PR users can start authoring custom templates, based on go text templates, for their projects / orgs. ## Tests Unit tests, both existing and new	2023-08-07 13:14:25 +00:00
shreyas-goenka	55e62366fa	Add unit test for file name execution during rendering (#640 ) ## Changes Adds a Unit test that directories and files in the file tree are executed as templates	2023-08-07 12:44:01 +00:00
shreyas-goenka	5df8935de4	Add JSON schema validation for input template parameters (#598 ) ## Changes This PR: 1. Adds code for reading template configs and validating them against a JSON schema. 2. Moves the json schema struct in `bundle/schema` to a separate library package. This struct is now reused for validating template configs. ## Tests Unit tests	2023-08-01 14:09:27 +00:00
shreyas-goenka	fc8729d162	Only treat files with .tmpl extension as templates (#594 ) ## Changes In a world before this PR, all files would be treated as `go text templates`, making the content in these files quake in fear since they would be executed (as a template). This PR makes it so that only files with the `.tmpl` extension are understood to be templates. This is useful for avoiding ambiguity in cases like where a binary file could be interpreted as a go text template otherwise. In order to do so, we introduce the `copyFile` struct which does a copy of the source file from the template without loading it into memory. ## Tests Unit tests	2023-08-01 13:43:27 +00:00
Lennart Kats (databricks)	d55652be07	Extend deployment mode support (#577 ) ## Changes This adds `mode: production` option. This mode doesn't do any transformations but verifies that an environment is configured correctly for production: ``` environments: prod: mode: production # paths should not be scoped to a user (unless a service principal is used) root_path: /Shared/non_user_path/... # run_as and permissions should be set at the resource level (or at the top level when that is implemented) run_as: user_name: Alice permissions: - level: CAN_MANAGE user_name: Alice ``` Additionally, this extends the existing `mode: development` option, * now prefixing deployed assets with `[dev your.user]` instead of just `[dev`] * validating that development deployments _are_ scoped to a user ## Related https://github.com/databricks/cli/pull/578/files (in draft) ## Tests Manual testing to validate the experience, error messages, and functionality with all resource types. Automated unit tests. --------- Co-authored-by: Fabian Jakobs <fabian.jakobs@databricks.com>	2023-07-30 07:19:49 +00:00
shreyas-goenka	2f4bf844fc	Fix git clone integration test for non-existing repo (#610 ) ## Changes This PR changes the integration test to just check an error is returned rather than asserting specific text is present in the error. This is required because the error returned can be different based on whether git ssh keys have been setup.	2023-07-27 13:51:57 +00:00
shreyas-goenka	ed972f7ae0	Add url parse helper function for templates (#600 ) ## Tests unit test	2023-07-27 09:51:31 +00:00
shreyas-goenka	34f196bb4e	Add unit test that raw strings are printed as is (#599 ) ## Changes Add unit test that raw strings are printed as is. This method is useful to print text that would otherwise be interpreted a go text template.	2023-07-25 17:18:43 +02:00
shreyas-goenka	47640b8b94	Add regexp compile helper function for templates (#601 ) ## Tests unit test	2023-07-25 16:42:53 +02:00
shreyas-goenka	8fdc0fec81	Add support for cloning repositories (#544 ) ## Changes Adds support for cloning public and private github repositories for databricks templates ## Tests Integration tests	2023-07-25 15:36:20 +02:00
shreyas-goenka	13731e144c	Fix formatting in renderer.go (#593 ) ## Changes Due to a bug in Github UI, https://github.com/databricks/cli/pull/589 got merged without passing the go/fmt formatting checks This PR fixes the formatting which breaks the PR checks	2023-07-21 11:23:47 +02:00
shreyas-goenka	02dbac7b8a	Add template renderer for Databricks templates (#589 ) ## Changes This PR adds the renderer struct, which is a walker that traverses templates and generates projects from them ## Tests Unit tests	2023-07-21 10:59:02 +02:00
shreyas-goenka	a7a109a5d8	Remove base path checks during sync (#576 ) ## Changes Earlier we removed recursive deletion from sync. This makes it safe enough for us to not restrict sync to just the namespace of the user. This PR removes that base path validation. Note: If the sync destination is under `/Repos` we still only create missing directories required if the path is under my namespace ie matches `/Repos/@me/` ## Tests Manually Before: ``` shreyas.goenka@THW32HFW6T hello-bundle % cli bundle deploy Starting upload of bundle files Error: path must be nested under /Users/shreyas.goenka@databricks.com or /Repos/shreyas.goenka@databricks.com ``` After: ``` shreyas.goenka@THW32HFW6T hello-bundle % cli bundle deploy Starting upload of bundle files Uploaded bundle files at /Shared/common-test/hello-bundle/files! Starting resource deployment Resource deployment completed! ```	2023-07-14 11:43:20 +02:00
Andrew Nester	650fb0e8b6	Correctly use --profile flag passed for all bundle commands (#571 ) ## Changes Correctly use --profile flag passed for all bundle commands. Also adds a validation that if bundle configured host mismatches provided profile, it throws an error. Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com>	2023-07-12 14:09:25 +02:00
Miles Yucht	f203731fe6	Support tab completion for profiles (#572 ) ## Changes Currently, `databricks --profile <TAB>` autocompletes with the shell default behavior, listing files in the local directory. This is not a great experience. Especially given that the suggested profile names for accounts are so long, it can be cumbersome to type them out by hand. This PR configures autocompletion for `--profile` to inspect the profiles of ~/.databrickscfg. One potential improvement is to filter the response based on whether the command is known to be account-level or workspace-level. ## Tests Manual test. <img width="579" alt="Screenshot_11_07_2023__18_31" src="https://github.com/databricks/cli/assets/1850319/d7a3acd0-2511-45ac-bd82-95567775c10a">	2023-07-12 12:05:51 +02:00
Pieter Noordhuis	db6313e99c	Fix secrets put-secret command (#545 ) ## Changes Two issues with this command: * The command line arguments for the secret value were ignored * If the secret value was piped through stdin, it would still prompt The second issue prevented users from using multi-line strings because the prompt reads until end-of-line. This change adds testing infrastructure for: * Setting up a workspace focused test (common between many tests) * Running a snippet of Python through the command execution API Porting more integration tests to use this infrastructure will be done in later commits. ## Tests New integration test passes. The interactive path cannot be integration tested just yet.	2023-07-05 17:30:54 +02:00
Pieter Noordhuis	f64c44285f	Decode contents by default in workspace export command (#531 ) ## Changes Also see #525. The direct download flag has been removed in newer versions because of the content type issue. Instead, we can make the command decode the base64 output when the output mode is text. ``` $ databricks workspace export /some/path/script.sh #!/bin/bash echo "this is a script" ``` ## Tests New integration test.	2023-06-27 20:42:29 +02:00
shreyas-goenka	30efe91c6d	Make local files default for fs commands (#506 ) ## Changes <!-- Summary of your changes that are easy to understand --> ## Tests <!-- How is this tested? -->	2023-06-23 16:07:09 +02:00
shreyas-goenka	d0e9953ad9	Use direct download for workspace filer read (#514 ) ## Changes Use the download method from the SDK in the read method for the WSFS implementation of the filer interface. Closes #452. ## Tests Tested by existing integration tests	2023-06-23 15:17:39 +02:00
shreyas-goenka	f9260125aa	Remove extra call to filer.Stat in dbfs filer.Read (#515 ) ## Changes This PR removes the stat call and instead relies on errors returned by the go SDK to return the appropriate errors ## Tests Tested using existing filer integration tests	2023-06-23 15:08:22 +02:00
Pieter Noordhuis	ae13135fc6	Follow up to #513 (#521 ) This reverts changes from #513 which ended up not being necessary.	2023-06-23 12:51:31 +02:00
Andrew Nester	d23d4aef3a	Fixed error on multiple profiles and failure to create a new profile with configured cluster (#513 )	2023-06-22 17:20:10 +02:00
shreyas-goenka	f2a2d058d1	Remove \r from new line print statments (#509 ) ## Changes Removes carriage character from new line prints for json output mode and sync events ## Tests Manually	2023-06-22 13:47:52 +02:00
Andrew Nester	7db1990c56	Added prompts for Databricks profile for auth login command (#502 ) ## Changes Added prompts for Databricks profile for auth login command ## Tests ``` andrew.nester@HFW9Y94129 cli % ./cli auth login https://xxxx-databricks.com ✔ Databricks Profile Name: my-profile█ ```	2023-06-21 12:58:28 +02:00
Pieter Noordhuis	e19eaca4d1	Add filer.Filer implementation backed by the Files API (#474 ) ## Tests New integration test for the read/write parts of the other filers. The integration test cannot be shared just yet because the Files API doesn't include support for creating/listing/removing directories yet.	2023-06-19 18:29:13 +00:00
shreyas-goenka	5d036ab6b8	Fix locker unlock for destroy (#492 ) ## Changes Adds ability for allowing unlock to succeed even if the deploy file is missing. ## Tests Using integration tests and manually	2023-06-19 15:57:25 +02:00
shreyas-goenka	bb32067a80	Add fs cp command (#463 ) ## Tests Tested using integration tests	2023-06-16 17:09:08 +02:00
shreyas-goenka	de47cf19f1	Use better error assertions and clean up locker API (#490 ) ## Changes Some cleanup work ## Tests Locker integration test passes	2023-06-16 16:29:04 +02:00
Pieter Noordhuis	b9406efd27	Update configure command (#482 ) ## Changes This now uses: * libs/cmdio to determine interactivity and perform prompting * libs/databrickscfg to persist the profile It loads a config.Config structure from the environment just like we do for unified authentication. It is therefore possible to specify both the host and token with environment variables. ## Tests ``` pieter.noordhuis@L4GHXDT29P /tmp % export DATABRICKS_CONFIG_FILE=.databrickscfg pieter.noordhuis@L4GHXDT29P /tmp % databricks configure Databricks Host: https://foo.bar Personal Access Token: *** pieter.noordhuis@L4GHXDT29P /tmp % cat .databrickscfg [DEFAULT] host = https://foo.bar token = token pieter.noordhuis@L4GHXDT29P /tmp % echo token \| databricks configure Error: host must be set in non-interactive mode pieter.noordhuis@L4GHXDT29P /tmp % echo token \| databricks configure --host foo Error: must start with https:// pieter.noordhuis@L4GHXDT29P /tmp % echo token \| databricks configure --host https://foo pieter.noordhuis@L4GHXDT29P /tmp % cat .databrickscfg [DEFAULT] host = https://foo token = token pieter.noordhuis@L4GHXDT29P /tmp % cat .databrickscfg pieter.noordhuis@L4GHXDT29P /tmp % databricks configure --host https://foo Personal Access Token: **** pieter.noordhuis@L4GHXDT29P /tmp % cat .databrickscfg [DEFAULT] host = https://foo token = token2 ```	2023-06-15 12:50:19 +00:00
shreyas-goenka	3d03df8844	Add mode default as a valid set value for progress-format flag (#472 ) Manually tested `progress-format` works fine for all three modes	2023-06-14 13:26:56 +02:00
Pieter Noordhuis	a17876480a	Include [DEFAULT] section header when writing ~/.databrickscfg (#464 ) ## Changes The ini library omits the default section header and in doing so breaks compatibility with Python's config parser. It raises: ``` Error: MissingSectionHeaderError: File contains no section headers. ``` This commit makes sure the DEFAULT section header is included. If the config file doesn't include a DEFAULT section itself, we include a comment describing its purpose. ## Tests New tests pass. Manually confirmed the DEFAULT section header is included. --------- Co-authored-by: PaulCornellDB <paul.cornell@databricks.com>	2023-06-13 16:41:56 +00:00
shreyas-goenka	d38649088c	Add workspace import-dir command (#456 ) ## Tests Testing using integration tests and manually	2023-06-12 21:03:46 +02:00
Pieter Noordhuis	960ce2e18e	Add implementation of filer.Filer for local filesystem (#460 ) ## Changes Local file reads on Windows require the file handle to be closed after using it. This commit includes an interface change to return an `io.ReadCloser` from `Read` to accommodate this. ## Tests The existing integration tests for the filer interface all pass.	2023-06-12 15:53:58 +02:00
Pieter Noordhuis	16bb224108	Add directory tracking to sync (#425 ) ## Changes This change replaces usage of the `repofiles` package with the `filer` package to consolidate WSFS code paths. The `repofiles` package implemented the following behavior. If a file at `foo/bar.txt` was created and removed, the directory `foo` was kept around because we do not perform directory tracking. If subsequently, a file at `foo` was created, it resulted in an `fs.ErrExist` because it is impossible to overwrite a directory. It would then perform a recursive delete of the path if this happened and retry the file write. To make this use case work without resorting to a recursive delete on conflict, we need to implement directory tracking as part of sync. The approach in this commit is as follows: 1. Maintain set of directories needed for current set of files. Compare to previous set of files. This results in mkdir of added directories and rmdir of removed directories. 2. Creation of new directories should happen prior to writing files. Otherwise, many file writes may race to create the same parent directories, resulting in additional API calls. Removal of existing directories should happen after removing files. 3. Making new directories can be deduped across common prefixes where only the longest prefix is created recursively. 4. Removing existing directories must happen sequentially, starting with the longest prefix. 5. Removal of directories is a best effort. It fails only if the directory is not empty, and if this happens we know something placed a file or directory manually, outside of sync. ## Tests * Existing integration tests pass (modified where it used to assert directories weren't cleaned up) * New integration test to confirm the inability to remove a directory doesn't fail the sync run	2023-06-12 11:44:00 +00:00
Pieter Noordhuis	e4415bfbcf	Tweak profile prompt (#454 ) ## Changes This includes the following changes: * Move profile loading code to libs/databrickscfg and add tests * Update prompt label to reflect workspace/account profiles * Start prompt in search mode by default * Custom error if `~/.databrickscfg` doesn't exist * Custom error if `~/.databrickscfg` doesn't contain profiles * Use stderr for prompt so that stdout redirection works (e.g. with `jq` or `jless`) ## Tests * New unit tests pass * Manual tests for both workspace and account commands * Search-by-default is really nice if you have many profiles	2023-06-09 13:56:35 +02:00
shreyas-goenka	4818541062	Add workspace export-dir command (#449 ) ## Changes This PR: 1. Adds the export-dir command 2. Changes filer.Read to return an error if a user tries to read a directory 3. Adds returning internal file structures from filer.Stat().Sys() ## Tests Integration tests and manually	2023-06-08 18:15:12 +02:00
shreyas-goenka	53164ae880	Add new line to cmdio JSON rendering (#443 ) ## Changes This PR adds a new line break to JSON rendering using cmdio. This is useful when we call `cmdio.Render` multiple times ## Tests Manually Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com>	2023-06-08 15:48:51 +02:00
Pieter Noordhuis	be10ff9a75	Include recursive deletion in filer interface (#442 ) ## Changes This captures the recursive deletion of a directory tree in the filer interface. Prompted by #433. ## Tests Integration tests pass (ran the filer ones on AWS and Azure).	2023-06-06 06:27:47 +00:00
shreyas-goenka	ae10419eb8	Add fs cat command for dbfs files (#430 ) ## Changes TSIA ## Tests Manually and integration tests	2023-06-06 01:16:23 +02:00
shreyas-goenka	6ff00122ad	Add fs ls command for dbfs (#429 ) ## Changes 1. Adds fs ls command 2. Adds ability to define multiple templates ## Tests Manually and integration tests	2023-06-05 17:41:30 +02:00
Andrew Nester	1f130f3722	Do not use FgWhite and FgBlack for terminal output (#435 ) ## Changes Using white / black color for terminal output will lead to poorly displayed content in either light or dark terminal backgrounds. Some other CLIs experienced same issues (https://github.com/qri-io/qri/pull/774) Instead, let's just use color to highlight some of the output so it's more compatible with different background styles ## Tests <img width="772" alt="Screenshot 2023-06-05 at 16 05 09" src="https://github.com/databricks/cli/assets/2969996/01790239-6a33-4059-86a8-d5117ea0b75f"> --- <img width="757" alt="Screenshot 2023-06-05 at 16 05 20" src="https://github.com/databricks/cli/assets/2969996/ea3b9fdc-3782-4f4f-a9df-19e66af0c04f">	2023-06-05 17:30:40 +02:00
Pieter Noordhuis	1c0d67f66c	Add fs.FS adapter for the filer interface (#422 ) ## Changes This enables the use of `io/fs` functions `fs.Glob` and `fs.WalkDir` with filers. We can't use `fs.FS` as the standard interface instead of `filer.Filer` because: 1. It was made for reading from filesystems only, not writing 2. It doesn't take a context for the core functions Therefore a wrapper will do. ## Tests * Added unit tests to cover the adapter through a fake filer. * Manually ran `fs.WalkDir` against both WSFS and DBFS filers.	2023-06-02 12:49:59 +00:00
Serge Smertin	a6c9533c1c	Add profile on `databricks auth login` (#423 ) ## Changes - added saving profile to `~/.databrickscfg` whenever we do `databricks auth login`. - we either match profile by account id / canonical host or introduce the new one from deployment name. - fail on multiple profiles with matching accounts or workspace hosts. - overriding `~/.databrickscfg` keeps the (valid) comments, but reformats the file. ## Tests <!-- How is this tested? --> - `make test` - `go run main.go auth login --account-id XXX --host https://accounts.cloud.databricks.com/` - `go run main.go auth token --account-id XXX --host https://accounts.cloud.databricks.com/` - `go run main.go auth login --host https://XXX.cloud.databricks.com/`	2023-06-02 13:49:39 +02:00
shreyas-goenka	91097856b5	Add check for path is a directory in filer.ReadDir (#426 ) ## Tests Integration tests	2023-06-02 12:28:35 +02:00
Pieter Noordhuis	2b56af6016	Add Stat function to filer.Filer interface (#421 ) ## Changes TSIA ## Tests New integration test passes.	2023-06-01 20:23:22 +02:00
Serge Smertin	24ebfdf31e	Add readable console logger (#370 ) ## Changes Add a readable colored console logger that is active only for TTYs: <img width="764" alt="image" src="https://user-images.githubusercontent.com/259697/235221427-ca482b32-9f88-4adb-ada3-8c4f35f50f06.png"> ## Tests Run `go run main.go clusters list --log-level debug --profile demo`	2023-06-01 11:37:33 +02:00
Pieter Noordhuis	349e2aff40	Allow equivalence checking of filer errors to fs errors (#416 ) ## Changes The pattern `errors.Is(err, fs.ErrNotExist)` is common to check for an error type. Errors can implement `Is(error) bool` with a custom equivalence checker. ## Tests New asserts all pass in the integration test.	2023-05-31 20:47:00 +02:00
Pieter Noordhuis	42cd8daee0	Make filer.Filer return fs.DirEntry from ReadDir (#415 ) ## Changes This allows for compatibility with the stdlib functions in io/fs. ## Tests Integration tests pass.	2023-05-31 14:22:26 +02:00
Pieter Noordhuis	27df4e765c	Implement DBFS filer (#139 ) Adds a DBFS implementation of the `filer.Filer` interface. The integration tests are reused between the workspace filesystem and DBFS implementations to ensure identical behavior.	2023-05-31 13:24:20 +02:00
Pieter Noordhuis	92cb52041d	Add Mkdir and ReadDir functions to filer.Filer interface (#414 ) ## Changes This cherry-picks the filer changes from #408. ## Tests Manually ran integration tests.	2023-05-31 11:11:17 +02:00
Andrew Nester	05eaf7ff50	Added secrets input prompt for secrets put-secret command (#413 ) ## Changes Added secrets input prompt for secrets put-secrets command ## Tests <img width="623" alt="Screenshot 2023-05-30 at 12 06 24" src="https://github.com/databricks/cli/assets/2969996/9338e6ba-c504-48cc-ac97-cac97dde7a3a">	2023-05-31 10:18:29 +02:00
Fabian Jakobs	aa85638070	Sync: Gracefully handle broken notebook files (#398 ) ## Changes Ignore broken notebook files during sync. Fixes https://github.com/databricks/databricks-vscode/issues/712	2023-05-23 12:10:15 +02:00
Pieter Noordhuis	8979ed1394	Fix tests for new repository name (#390 )	2023-05-16 19:02:07 +02:00
Pieter Noordhuis	98ebb78c9b	Rename bricks -> databricks (#389 ) ## Changes Rename all instances of "bricks" to "databricks". ## Tests * Confirmed the goreleaser build works, uses the correct new binary name, and produces the right archives. * Help output is confirmed to be correct. * Output of `git grep -w bricks` is minimal with a couple changes remaining for after the repository rename.	2023-05-16 18:35:39 +02:00
Andrew Nester	180dfc9a40	Added ability for deferred mutator execution (#380 ) ## Changes Added `DeferredMutator` and `bundle.Defer` function which allows to always execute some mutators either in the end of execution chain or after error occurs in the middle of execution chain. Usage as follows: ``` deferredMutator := bundle.Defer([]bundle.Mutator{ lock.Acquire() transform.DoSomething(), //... }, []bundle.Mutator{ lock.Release(), }) ``` In such case `lock.Release()` will always be executed: either when all operations above succeed or when any of them fails ## Tests Before the change ``` andrew.nester@HFW9Y94129 multiples-tasks % bricks bundle deploy Starting upload of bundle files Uploaded bundle files at /Users/andrew.nester@databricks.com/.bundle/simple-task/development/files! Error: terraform not initialized andrew.nester@HFW9Y94129 multiples-tasks % bricks bundle deploy Error: deploy lock acquired by andrew.nester@databricks.com at 2023-05-10 16:41:22.902659 +0200 CEST. Use --force to override ``` After the change ``` andrew.nester@HFW9Y94129 multiples-tasks % bricks bundle deploy Starting upload of bundle files Uploaded bundle files at /Users/andrew.nester@databricks.com/.bundle/simple-task/development/files! Error: terraform not initialized andrew.nester@HFW9Y94129 multiples-tasks % bricks bundle deploy Starting upload of bundle files Uploaded bundle files at /Users/andrew.nester@databricks.com/.bundle/simple-task/development/files! Error: terraform not initialized ```	2023-05-16 18:01:50 +02:00
shreyas-goenka	9e16140b6e	Add git config block to bundle config (#356 ) ## Changes This config block contains commit, branch and remote_url which will be automatically loaded if specified in the repo, and can also be specified by the user ## Tests Unit and black-box tests	2023-04-26 16:54:36 +02:00
Serge Smertin	4c4a293015	Added OpenAPI command coverage (#357 ) This PR adds the following command groups: ## Workspace-level command groups * `bricks alerts` - The alerts API can be used to perform CRUD operations on alerts. * `bricks catalogs` - A catalog is the first layer of Unity Catalog’s three-level namespace. * `bricks cluster-policies` - Cluster policy limits the ability to configure clusters based on a set of rules. * `bricks clusters` - The Clusters API allows you to create, start, edit, list, terminate, and delete clusters. * `bricks current-user` - This API allows retrieving information about currently authenticated user or service principal. * `bricks dashboards` - In general, there is little need to modify dashboards using the API. * `bricks data-sources` - This API is provided to assist you in making new query objects. * `bricks experiments` - MLflow Experiment tracking. * `bricks external-locations` - An external location is an object that combines a cloud storage path with a storage credential that authorizes access to the cloud storage path. * `bricks functions` - Functions implement User-Defined Functions (UDFs) in Unity Catalog. * `bricks git-credentials` - Registers personal access token for Databricks to do operations on behalf of the user. * `bricks global-init-scripts` - The Global Init Scripts API enables Workspace administrators to configure global initialization scripts for their workspace. * `bricks grants` - In Unity Catalog, data is secure by default. * `bricks groups` - Groups simplify identity management, making it easier to assign access to Databricks Workspace, data, and other securable objects. * `bricks instance-pools` - Instance Pools API are used to create, edit, delete and list instance pools by using ready-to-use cloud instances which reduces a cluster start and auto-scaling times. * `bricks instance-profiles` - The Instance Profiles API allows admins to add, list, and remove instance profiles that users can launch clusters with. * `bricks ip-access-lists` - IP Access List enables admins to configure IP access lists. * `bricks jobs` - The Jobs API allows you to create, edit, and delete jobs. * `bricks libraries` - The Libraries API allows you to install and uninstall libraries and get the status of libraries on a cluster. * `bricks metastores` - A metastore is the top-level container of objects in Unity Catalog. * `bricks model-registry` - MLflow Model Registry commands. * `bricks permissions` - Permissions API are used to create read, write, edit, update and manage access for various users on different objects and endpoints. * `bricks pipelines` - The Delta Live Tables API allows you to create, edit, delete, start, and view details about pipelines. * `bricks policy-families` - View available policy families. * `bricks providers` - Databricks Providers REST API. * `bricks queries` - These endpoints are used for CRUD operations on query definitions. * `bricks query-history` - Access the history of queries through SQL warehouses. * `bricks recipient-activation` - Databricks Recipient Activation REST API. * `bricks recipients` - Databricks Recipients REST API. * `bricks repos` - The Repos API allows users to manage their git repos. * `bricks schemas` - A schema (also called a database) is the second layer of Unity Catalog’s three-level namespace. * `bricks secrets` - The Secrets API allows you to manage secrets, secret scopes, and access permissions. * `bricks service-principals` - Identities for use with jobs, automated tools, and systems such as scripts, apps, and CI/CD platforms. * `bricks serving-endpoints` - The Serving Endpoints API allows you to create, update, and delete model serving endpoints. * `bricks shares` - Databricks Shares REST API. * `bricks storage-credentials` - A storage credential represents an authentication and authorization mechanism for accessing data stored on your cloud tenant. * `bricks table-constraints` - Primary key and foreign key constraints encode relationships between fields in tables. * `bricks tables` - A table resides in the third layer of Unity Catalog’s three-level namespace. * `bricks token-management` - Enables administrators to get all tokens and delete tokens for other users. * `bricks tokens` - The Token API allows you to create, list, and revoke tokens that can be used to authenticate and access Databricks REST APIs. * `bricks users` - User identities recognized by Databricks and represented by email addresses. * `bricks volumes` - Volumes are a Unity Catalog (UC) capability for accessing, storing, governing, organizing and processing files. * `bricks warehouses` - A SQL warehouse is a compute resource that lets you run SQL commands on data objects within Databricks SQL. * `bricks workspace` - The Workspace API allows you to list, import, export, and delete notebooks and folders. * `bricks workspace-conf` - This API allows updating known workspace settings for advanced users. ## Account-level command groups * `bricks account billable-usage` - This API allows you to download billable usage logs for the specified account and date range. * `bricks account budgets` - These APIs manage budget configuration including notifications for exceeding a budget for a period. * `bricks account credentials` - These APIs manage credential configurations for this workspace. * `bricks account custom-app-integration` - These APIs enable administrators to manage custom oauth app integrations, which is required for adding/using Custom OAuth App Integration like Tableau Cloud for Databricks in AWS cloud. * `bricks account encryption-keys` - These APIs manage encryption key configurations for this workspace (optional). * `bricks account groups` - Groups simplify identity management, making it easier to assign access to Databricks Account, data, and other securable objects. * `bricks account ip-access-lists` - The Accounts IP Access List API enables account admins to configure IP access lists for access to the account console. * `bricks account log-delivery` - These APIs manage log delivery configurations for this account. * `bricks account metastore-assignments` - These APIs manage metastore assignments to a workspace. * `bricks account metastores` - These APIs manage Unity Catalog metastores for an account. * `bricks account networks` - These APIs manage network configurations for customer-managed VPCs (optional). * `bricks account o-auth-enrollment` - These APIs enable administrators to enroll OAuth for their accounts, which is required for adding/using any OAuth published/custom application integration. * `bricks account private-access` - These APIs manage private access settings for this account. * `bricks account published-app-integration` - These APIs enable administrators to manage published oauth app integrations, which is required for adding/using Published OAuth App Integration like Tableau Cloud for Databricks in AWS cloud. * `bricks account service-principals` - Identities for use with jobs, automated tools, and systems such as scripts, apps, and CI/CD platforms. * `bricks account storage` - These APIs manage storage configurations for this workspace. * `bricks account storage-credentials` - These APIs manage storage credentials for a particular metastore. * `bricks account users` - User identities recognized by Databricks and represented by email addresses. * `bricks account vpc-endpoints` - These APIs manage VPC endpoint configurations for this account. * `bricks account workspace-assignment` - The Workspace Permission Assignment API allows you to manage workspace permissions for principals in your account. * `bricks account workspaces` - These APIs manage workspaces for this account.	2023-04-26 13:06:16 +02:00
shreyas-goenka	43bc9a0d9d	Use cmdio logger to log bricks cmd execution errors (#348 ) ## Changes Uses the cmdio logger to log the execution error ## Tests Manually by making the root command return fake errors. Here is the output: ``` shreyas.goenka@THW32HFW6T bricks % bricks bundle validate Error: my foo error ``` ``` shreyas.goenka@THW32HFW6T bricks % bricks bundle validate --progress-format=json { "error": "my foo error" } ``` --------- Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com>	2023-04-24 12:11:52 +02:00
Serge Smertin	9581187c9e	Update to Go SDK v0.8.0 (#351 ) ## Changes - Update to Go SDK v0.8.0 - Fix all breaking changes ## Tests - make test	2023-04-21 10:30:20 +02:00
shreyas-goenka	ddc0237468	Error out if question prompts are used in json mode (#340 ) ## Changes This PR disallows questions in json mode ## Tests Manually and unit test ``` shreyas.goenka@THW32HFW6T job-output % bricks bundle destroy --progress-format=json The following resources will be removed: { "resource_type": "databricks_job", "action": "delete", "resource_name": "foo" } Error: question prompts are not supported in json mode ```	2023-04-18 17:13:49 +02:00
shreyas-goenka	598ad62688	Log mutator messages using progress logger (#312 ) This PR uses progress logger to log messages inside mutators	2023-04-18 16:55:06 +02:00
shreyas-goenka	85889dffb1	Move state to event for whether they support inplace progress logging (#339 ) ## Changes Adds a IsInplaceSupported() function to the event interface. Any event that now uses the progress logger has to declare whether they support in place logging ## Tests Manually	2023-04-18 14:20:35 +02:00
shreyas-goenka	b9c68b4bd5	Fix wrap around issues with inplace logging (#334 ) ## Changes We deal with wraparounds for long lines of text in a bad way. This PR fixes that by saving the cursor position ## Tests Manually	2023-04-14 13:06:04 +02:00
shreyas-goenka	bd11da88eb	Do not fail snapshot destroy if snapshot does not exist (#328 ) ## Changes `bricks bundle destroy` would fail if the sync snapshot did not exist ## Tests Manually After: ``` shreyas.goenka@THW32HFW6T bundle-destroy % bricks bundle destroy --auto-approve No resources to destroy! Remote directory /Users/shreyas.goenka@databricks.com/.bundle/destroy/default will be deleted Successfully deleted files! ``` Before: ``` shreyas.goenka@THW32HFW6T bundle-destroy % bricks bundle destroy --auto-approve No resources to destroy! Remote directory /Users/shreyas.goenka@databricks.com/.bundle/destroy/default will be deleted Error: failed to destroy sync snapshot file: remove /Users/shreyas.goenka/projects/bundle-destroy/.databricks/bundle/default/sync-snapshots/a5bd1966cb8980a9.json: no such file or directory ```	2023-04-12 21:37:01 +02:00
shreyas-goenka	42cd405eba	Add tests for fileSet adding `databricks` to .gitignore (#325 ) ## Changes <!-- Summary of your changes that are easy to understand --> These are flows that were earlier only being tested in package `project`. Since package `project` has been deleted in https://github.com/databricks/bricks/pull/321, we needed to add coverage as done here ## Tests <!-- How is this tested? -->	2023-04-12 12:04:10 +02:00
Miles Yucht	946906221d	Delete sync snapshots file when destroying a bundle (#323 ) ## Changes This PR changes the files.Delete() mutator to delete the sync snapshots file on destroy. This ensures that files will be uploaded when the bundle is uploaded again. ## Tests - [x] Manual test: Ran `bricks bundle destroy`, observed that the sync snapshots file was deleted.	2023-04-11 16:57:01 +02:00
shreyas-goenka	4871f7bc8a	Add bundle destroy command (#300 ) Adds bundle destroy capability to bricks	2023-04-06 12:54:58 +02:00
Pieter Noordhuis	33645ae6ef	Revert "Configure log level to info by default (#267 )" (#307 ) ## Changes This reverts commit `e7a7e5b95a`. Job and pipeline runs print progress information now. No need to continue to rely on logging for this. ## Tests	2023-04-05 15:37:09 +02:00
Serge Smertin	02d9f877b5	Make `bricks auth` use `all-apis` scope (#304 ) ## Changes Use `all-apis` scope, so that we can use the issued token for SCIM APIs. The production environment has to be tuned in order to enable `all-apis` scope for a specific account. ## Tests Manual	2023-04-05 10:18:13 +02:00
shreyas-goenka	902813a490	Hardcode `.databricks` ignore pattern to ensure we never sync the cache directory (#295 ) ## Changes <!-- Summary of your changes that are easy to understand --> 1. Add pattern to always ignore .databricks 2. Best effort creation of .gitignore with .databricks if it's needed ## Tests <!-- How is this tested? -->	2023-04-04 15:44:57 +02:00
dependabot[bot]	57cf66d3a8	Bump github.com/databricks/databricks-sdk-go from 0.5.0 to 0.6.0 (#299 )	2023-04-03 21:33:21 +02:00
Pieter Noordhuis	cfd32c9602	Try to resolve a profile if only the host is specified (#287 ) ## Changes This improves out of the box usability where a user who already configured a `.databrickscfg` file will be able to reference the workspace host in their `bundle.yml` and it will automatically pick up the right profile. ## Tests * Newly added tests pass. * Manual testing confirms intended behavior. --------- Co-authored-by: shreyas-goenka <88374338+shreyas-goenka@users.noreply.github.com>	2023-03-29 20:44:19 +02:00
Pieter Noordhuis	8af934bbbb	Function to find the Git repository containing a bundle (#289 ) ## Changes Useful functions from #277. ## Tests Tests pass.	2023-03-29 16:36:35 +02:00
shreyas-goenka	8fd3dccca9	Add progress logs for job runs (#276 )	2023-03-29 14:58:09 +02:00
Pieter Noordhuis	1b47dd3af7	Trim log source field to basename of file (#273 ) This makes logs more readable and avoids leaking paths. Before: ``` time=2023-03-22T16:38:30.238+01:00 level=INFO source=/Users/pieter.noordhuis/dev/bricks/bundle/phases/phase.go:30 msg="Phase: initialize" time=2023-03-22T16:38:31.303+01:00 level=INFO source=/Users/pieter.noordhuis/dev/bricks/bundle/phases/phase.go:30 msg="Phase: build" time=2023-03-22T16:38:31.303+01:00 level=INFO source=/Users/pieter.noordhuis/dev/bricks/bundle/phases/phase.go:30 msg="Phase: deploy" ``` After: ``` time=2023-03-22T17:02:47.290+01:00 level=INFO source=phase.go:30 msg="Phase: initialize" time=2023-03-22T17:02:48.171+01:00 level=INFO source=phase.go:30 msg="Phase: build" time=2023-03-22T17:02:48.171+01:00 level=INFO source=phase.go:30 msg="Phase: deploy" ```	2023-03-23 08:56:39 +01:00
Pieter Noordhuis	123a5e15e9	Acquire lock prior to deploy (#270 ) Add configuration: ``` bundle: lock: enabled: true force: false ``` The force field can be set by passing the `--force` argument to `bricks bundle deploy`. Doing so means the deployment lock is acquired even if it is currently held. This should only be used in exceptional cases (e.g. a previous deployment has failed to release the lock).	2023-03-22 16:37:26 +01:00
shreyas-goenka	75d516939b	Error out if notebook file does not exist locally (#261 ) Adds check for whether file exists locally case 1: local (relative) file does not exist ``` foo: name: "[job-output] test-job by shreyas" tasks: - task_key: my_notebook_task existing_cluster_id: * notebook_task: notebook_path: "./doesnotexist" ``` output: ``` shreyas.goenka@THW32HFW6T job-output % bricks bundle deploy Error: notebook ./doesnotexist not found. Error: open /Users/shreyas.goenka/projects/job-output/doesnotexist: no such file or directory ``` case 2: remote (absolute) file does not exist ``` foo: name: "[job-output] test-job by shreyas" tasks: - task_key: my_notebook_task existing_cluster_id: * notebook_task: notebook_path: "/Users/shreyas.goenka@databricks.com/doesnotexist" ``` output: ``` shreyas.goenka@THW32HFW6T job-output % bricks bundle deploy shreyas.goenka@THW32HFW6T job-output % bricks bundle run foo Error: failed to reach TERMINATED or SKIPPED, got INTERNAL_ERROR: Task my_notebook_task failed with message: Notebook not found: /Users/shreyas.goenka@databricks.com/doesnotexist. This caused all downstream tasks to get skipped. ``` case 3: remote exists Successful deploy and run	2023-03-21 18:13:16 +01:00
Pieter Noordhuis	7dcc0d4b41	Fix test (#268 ) Follow up to #267.	2023-03-21 16:34:16 +01:00
Pieter Noordhuis	e7a7e5b95a	Configure log level to info by default (#267 ) Note: we log at INFO level by default until we implement progress reporting to stdout/stderr.	2023-03-21 16:14:20 +01:00
shreyas-goenka	ae09eb02d5	Path escape file path in filer interface (#254 )	2023-03-17 17:42:35 +01:00
Pieter Noordhuis	ad666ff796	Use new logger throughout codebase (#256 )	2023-03-17 15:17:31 +01:00
Pieter Noordhuis	c9340d6317	Drain sync event channel before returning (#253 ) Not waiting means the last few events may or may not be printed. This is relevant in the mode where sync runs once and then terminates.	2023-03-16 17:48:17 +01:00
Pieter Noordhuis	32a29c6af4	Add structured logging infrastructure (#246 ) New global flags: * `--log-file FILE`: can be literal `stdout`, `stderr`, or a file name (default `stderr`) * `--log-level LEVEL`: can be `error`, `warn`, `info`, `debug`, `trace`, or `disabled` (default `disabled`) * `--log-format TYPE`: can be `text` or `json` (default `text`) New functions in the `log` package take a `context.Context` and retrieve the logger from said context. Because we carry the logger in a context, adding [attributes](https://pkg.go.dev/golang.org/x/exp/slog#hdr-Attrs_and_Values) to the logger can be done as follows: ```go ctx = log.NewContext(ctx, log.GetLogger(ctx).With("foo", "bar")) ```	2023-03-16 14:46:53 +01:00
shreyas-goenka	715a4dfb21	Path escape filepaths in the URL (#250 ) Before we were using url query escaping to escape the file path. This is wrong since the file path is a part of the URL path rather than URL query. These encoding schemes are similar but do not have identical encodings which was why we got these weird edge cases Fixed, and added nightly test for assert for this ``` 2023/03/15 16:07:50 [INFO] Action: PUT: .gitignore, a b/bar.py, c+d/uno.py, foo.py 2023/03/15 16:07:51 [INFO] Uploaded foo.py 2023/03/15 16:07:51 [INFO] Uploaded a b/bar.py 2023/03/15 16:07:51 [INFO] Uploaded .gitignore 2023/03/15 16:07:51 [INFO] Uploaded c+d/uno.py 2023/03/15 16:07:51 [INFO] Initial Sync Complete ``` ``` [VSCODE] bricks cli path: /Users/shreyas.goenka/.vscode/extensions/databricks.databricks-0.3.4-darwin-arm64/bin/bricks [VSCODE] sync command args: sync,.,/Repos/shreyas.goenka@databricks.com/sync-fail.ide,--watch,--output,json -------------------------------------------------------- Starting synchronization (4 files) Uploaded .gitignore Uploaded foo.py Uploaded c+d/uno.py Uploaded a b/bar.py Completed synchronization ```	2023-03-15 17:25:57 +01:00
shreyas-goenka	316a006125	Add check for file exists incase of conflicting remote names (#244 ) Before: ``` shreyas.goenka@THW32HFW6T deco-538-pipeline-error % bricks bundle deploy Error: both myNb.py and myNb.sql point to the same remote file location myNb. Please remove one of them from your local project ``` Even though myNb.sql was created by renaming myNb.py Now deployments are successful	2023-03-10 11:52:45 +01:00
Pieter Noordhuis	fe738ede6a	Let sync return early if an error occurs (#235 ) The previous approach would proceed to execute all requests prior to returning the first error. This is solved with `errgroup.WithContext` that cancels the context if a routine returns an error.	2023-03-09 13:29:05 +01:00
Fabian Jakobs	f0c35a2b27	Initialize BRICKS_CLI_PATH and increase default OAuth timeout (#237 ) related to https://github.com/databricks/databricks-sdk-go/pull/330	2023-03-08 16:14:24 +01:00
Pieter Noordhuis	65b3f998ba	Escape URL in filer (#236 ) Also see #228.	2023-03-08 14:27:05 +01:00
Fabian Jakobs	da4b58a897	Fix link to workspace after AWS OAuth login (#234 ) `Host` is already normalized and always has the `https://` prefix.	2023-03-08 11:56:46 +01:00
Pieter Noordhuis	e872b587cc	Add optional JSON output for sync command (#230 ) JSON output makes it easy to process synchronization progress information in downstream tools (e.g. the vscode extension). This changes introduces a `sync.Event` interface type for progress events as well as an `sync.EventNotifier` that lets the sync code pass along progress events to calling code. Example output in text mode (default, this uses the existing logger calls): ```text 2023/03/03 14:07:17 [INFO] Remote file sync location: /Repos/pieter.noordhuis@databricks.com/... 2023/03/03 14:07:18 [INFO] Initial Sync Complete 2023/03/03 14:07:22 [INFO] Action: PUT: foo 2023/03/03 14:07:23 [INFO] Uploaded foo 2023/03/03 14:07:23 [INFO] Complete 2023/03/03 14:07:25 [INFO] Action: DELETE: foo 2023/03/03 14:07:25 [INFO] Deleted foo 2023/03/03 14:07:25 [INFO] Complete ``` Example output in JSON mode: ```json {"timestamp":"2023-03-03T14:08:15.459439+01:00","seq":0,"type":"start"} {"timestamp":"2023-03-03T14:08:15.459461+01:00","seq":0,"type":"complete"} {"timestamp":"2023-03-03T14:08:18.459821+01:00","seq":1,"type":"start","put":["foo"]} {"timestamp":"2023-03-03T14:08:18.459867+01:00","seq":1,"type":"progress","action":"put","path":"foo","progress":0} {"timestamp":"2023-03-03T14:08:19.418696+01:00","seq":1,"type":"progress","action":"put","path":"foo","progress":1} {"timestamp":"2023-03-03T14:08:19.421397+01:00","seq":1,"type":"complete","put":["foo"]} {"timestamp":"2023-03-03T14:08:22.459238+01:00","seq":2,"type":"start","delete":["foo"]} {"timestamp":"2023-03-03T14:08:22.459268+01:00","seq":2,"type":"progress","action":"delete","path":"foo","progress":0} {"timestamp":"2023-03-03T14:08:22.686413+01:00","seq":2,"type":"progress","action":"delete","path":"foo","progress":1} {"timestamp":"2023-03-03T14:08:22.688989+01:00","seq":2,"type":"complete","delete":["foo"]} ``` --------- Co-authored-by: shreyas-goenka <88374338+shreyas-goenka@users.noreply.github.com>	2023-03-08 10:27:19 +01:00
shreyas-goenka	5166055efb	[DECO-553] Escape file path strings in URL (#228 ) Tested manually Before: ``` shreyas.goenka@THW32HFW6T test-dbx % bricks sync --full . /Repos/shreyas.goenka@databricks.com/test-dbx 2023/02/27 19:51:17 [INFO] Remote file sync location: /Repos/shreyas.goenka@databricks.com/test-dbx 2023/02/27 19:51:17 [INFO] Action: PUT: #foo.py, .gitignore 2023/02/27 19:51:19 [INFO] Uploaded .gitignore Error: Creating file failed. An item with path /Repos/shreyas.goenka@databricks.com/test-dbx already exists ``` After: ``` shreyas.goenka@THW32HFW6T test-dbx % bricks sync --full . /Repos/shreyas.goenka@databricks.com/test-dbx 2023/02/27 19:51:46 [INFO] Remote file sync location: /Repos/shreyas.goenka@databricks.com/test-dbx 2023/02/27 19:51:46 [INFO] Action: PUT: #foo.py, .gitignore 2023/02/27 19:51:47 [INFO] Uploaded .gitignore 2023/02/27 19:51:47 [INFO] Uploaded #foo.py ```	2023-02-28 03:17:13 +01:00
shreyas-goenka	2615d66945	[DECO-531] Increase timeout for file import api calls (#223 ) This PR increases the client side timeout for upload API calls to 10 minutes to give sync enough time to import larger files	2023-02-22 16:01:58 +01:00
Pieter Noordhuis	9d3a0da073	Detect Jupyter notebook files (#219 ) Files with extension `.ipynb` are imported are Jupyter notebooks. This code detects 1) if the file is a valid Jupyter notebook and 2) the Databricks specific language it contains.	2023-02-21 13:49:01 +01:00
Pieter Noordhuis	7398a6d1e4	Add sample ipynb files (#218 ) Co-authored-by: pietern <pietern>	2023-02-20 20:03:20 +01:00
Pieter Noordhuis	414ea4f891	Bump databricks-sdk-go to 0.3.2 (#215 )	2023-02-20 16:00:20 +01:00
Pieter Noordhuis	584c8d1b0b	Allow synchronization to a directory inside a repo (#213 ) Before this commit this would error saying that the repo doesn't exist yet. With this commit it creates the directory, but only after checking that the repo exists.	2023-02-20 14:34:48 +01:00
Pieter Noordhuis	1715a987cf	Make sync command work in bundle context; reorder args (#207 ) Invoke with `bricks sync SRC DST`. In bundle context `SRC` and `DST` arguments are taken from bundle configuration. This PR adds `bricks bundle sync` to disambiguate between the two. Once the VS Code extension is bundle aware they can again be consolidated. Consolidating them today would regress the VS Code experience if a `bundle.yml` file is present in the file tree.	2023-02-20 11:33:30 +01:00
Pieter Noordhuis	58950ce507	Move notebook detection logic to package (#206 )	2023-02-15 17:14:59 +01:00
Fabian Jakobs	8c1b620b17	Don't sync symlink folders (#205 ) Fixes https://github.com/databricks/databricks-vscode/issues/452	2023-02-15 17:02:54 +01:00
Pieter Noordhuis	abb1de99ba	Locate and use global excludes file (#191 ) This implements rudimentary gitconfig loading as specified at https://git-scm.com/docs/git-config.	2023-02-02 12:25:53 +01:00
Pieter Noordhuis	241562e2b1	Move git package to libs/git (#189 ) Fixes #185.	2023-01-31 19:19:16 +01:00
Pieter Noordhuis	a7bf7ba6c5	Reload .gitignore files if they have changed (#190 ) This commit changes the code in repository.go to lazily load gitignore files as opposed to the previous eager approach. This means that the signature of the `Ignore` function family has changed to return `(bool, error)`. This lazy approach fits better when other code is responsible for recursively walking the file tree, because we never know up front which gitignore files need to be loaded to compute the ignores. It also means we no longer have to "prime" the `Repository` instance with a particular directory we're interested in and rather let calls to `Ignore` load whatever is needed. The fileset wrapper under `git/` internally taints all gitignore objects to force a call to [os.Stat] followed by a reload if they have changed, before calling into the [fileset.FileSet] functions for recursively listing files.	2023-01-31 18:34:36 +01:00
Pieter Noordhuis	eb76e5d3e8	Move git.FileSet to libs/fileset and make it aware of gitignores (#184 ) This moves `git.FileSet` to `libs/fileset` and decouples it from the Git package. It is made aware of gitignore rules in parent directories up to the repository root as well as gitignore files in underlying directories through the `fileset.Ignorer` interface. The recursive directory walker is reimplemented with [filepath.WalkDir]. Follow up to #182.	2023-01-27 16:04:58 +01:00

... 2 3 4 5 6 ...

361 Commits