databricks-cli

Commit Graph

Author	SHA1	Message	Date
Pieter Noordhuis	6ae353d84b	Use schema field for pipeline in builtin template (#2347 ) ## Changes The `schema` field implies the lifecycle of tables is no longer tied to the lifecycle of the pipeline, as was the case with the `target` field. More information about using the "catalog" and "schema" properties can be found here: https://docs.databricks.com/en/delta-live-tables/target-schema.html ## Tests n/a --------- Co-authored-by: Lennart Kats (databricks) <lennart.kats@databricks.com>	2025-03-05 14:19:33 +00:00
Denis Bilenko	b6bf035e7f	Skip serverless prompt in default-python (default is no) (#2388 ) ## Tests Manually running 'bundle init default-python' - no question about serverless.	2025-02-26 15:58:53 +00:00
Denis Bilenko	ab3d82e32e	default-python: Swap order of yes/no in serverless template (#2386 ) ## Changes Since at this moment we set default to 'no', interactively it should also default to 'no'. However, it just uses the first option. ## Tests Manually running `cli bundle init default-python`	2025-02-26 14:45:32 +00:00
Denis Bilenko	03f2ff5a39	Support serverless mode in default-python template (explicit prompt) (#2377 ) ## Changes - Add 'serverless' prompt to default-python template (default is currently set to "no"). - This is a simplified version of https://github.com/databricks/cli/pull/2348 with 'auto' functionality removed. ## Tests - Split default-python into default-python/classic, default-python/serverless, default-python/serverless-customcatalog. - Manually check that "bundle init default-python" with serverless=yes can be deployed and run on dogfood and test env.	2025-02-26 14:07:30 +01:00
Anton Nekipelov	428e730c9e	Set default data_security_mode to "SINGLE_USER" in bundle templates (#2372 ) ## Changes 1. Change the default-python bundle template to set `data_security_mode` of a cluster to SINGLE_USER 2. Change the experimental-jobs-as-code bundle template to set `data_security_mode` of a cluster to SINGLE_USER ## Why Explicitly adding this field saves experienced users from confusion onto what security mode is applied to the cluster ## Tests Changed existing unit and integration tests to pass with this change	2025-02-26 13:19:38 +01:00
Ilya Kuznetsov	25a701be92	Add missing `.gitignore` to dbt-sql and default-sql templates (#2356 ) ## Changes Added missing .gitignore files to templates ## Tests There were some incorrect snapshots of gitignore files in acceptance tests, probably generated by testing infra. Updated them to new files --------- Co-authored-by: Lennart Kats (databricks) <lennart.kats@databricks.com>	2025-02-25 09:42:02 +00:00
Lennart Kats (databricks)	f99716b0a5	Remove `run_as` from the built-in templates (#2044 ) ## Changes This removes the `run-as` property from the default templates. It's a useful property but it still only works for jobs and it makes the default databricks.yml a bit longer. It seems like users can just learn about it from the docs and/or vary their deployment identity. Depends on https://github.com/databricks/cli/pull/1712.	2025-02-24 08:31:46 +00:00
Lennart Kats (databricks)	bc30d44097	Provide instructions for testing in the default-python template (#2355 ) ## Changes Adds instructions for testing to the default-python template. ## Tests - Unit & acceptance tests.	2025-02-17 12:38:03 +00:00
Gleb Kanterov	31c10c1b82	Add experimental-jobs-as-code template (#2177 ) ## Changes Add experimental-jobs-as-code template allowing defining jobs using Python instead of YAML through the `databricks-bundles` PyPI package. ## Tests Manually and acceptance tests.	2025-01-20 10:15:11 +00:00
Gleb Kanterov	25f8ee8d66	Format default-python template (#2110 ) ## Changes Format code in default-python template, so it's already pre-formatted. ## Tests ``` $ databricks bundle init libs/template/templates/default-python $ ruff format --diff my_project 6 files already formatted ```	2025-01-15 10:40:29 +01:00
shreyas-goenka	b0c1c23630	Add `uuid` to builtin templates (#2088 ) ## Changes This is useful to track telemetry associated with the templates and can later be useful for functional usecases as well. Mlops stacks does the same here: https://github.com/databricks/mlops-stacks/pull/185 ## Tests Existing tests.	2025-01-09 18:19:34 +00:00
Ilia Babanov	daf0f48143	Remove unused vscode settings in the templates (#2013 ) ## Changes VSCode extension no longer uses `databricks.python.envFile ` setting. And older extension versions will use the same default value anyway. ## Tests None	2024-12-13 16:13:21 +00:00
Pieter Noordhuis	fae1b6742d	Update target references to use `${bundle.target}` (#1935 ) ## Changes The built-in template contains a reference to `${bundle.environment}`. This property has been deprecated in favor of `${bundle.target}` a long time ago (#670), so we should no longer emit it. The environment field will continue to be usable until we cut a new major version in some far away future. ## Tests * Unit tests * The test `TestInterpolationWithTarget` still covers correct interpolation of `${bundle.environment}`	2024-11-27 11:51:08 +00:00
Lennart Kats (databricks)	60c153c0e7	Fix pipeline in default-python template not working for certain workspaces (#1854 ) Change the default-python template to not set the `catalog` field for the pipeline for workspaces that set `hive_metastore` as the default catalog. The Pipelines service currently returns an error when that value is used for the `catalog` field. This is the most simple fix for this issue, which was reported by a customer. As a followup, we should look at whether we want to prompt for a catalog instead, possibly just for this specific scenario.	2024-10-22 15:52:46 +00:00
Andrew Nester	a8cff48c0b	Always prepend bundle remote paths with /Workspace (#1724 ) ## Changes Due to platform changes, all libraries, notebooks and etc. paths used in Databricks must be started with either /Workspace or /Volumes prefix. This PR makes sure that all bundle paths are correctly prefixed. Note: this change is a breaking change if user previously configured and used `/Workspace/Workspace` folder in their workspace file system or having `/Workspace/${workspace.root_path}...` pattern configured anywhere in their bundle config Fixes: #1751 AI: - [x] Scan DABs config and error out on `/Workspace/${workspace.root_path}...` pattern usage ## Tests Added unit tests --------- Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com>	2024-10-02 15:34:00 +00:00
shreyas-goenka	a4ba0bbe9f	Add sub-extension to resource files in built-in templates (#1777 ) ## Changes We want to encourage a pattern of only specifying a single resource in a YAML file when an `.<resource-type>.yml` (like `.job.yml`) is used. This convention could allow us to bijectively map a resource YAML file to it's corresponding resource in the Databricks workspace. This PR simply makes the built-in templates compliant to this format. ## Tests Existing tests.	2024-09-25 12:58:14 +00:00
Lennart Kats (databricks)	7665c639bd	Use Unity Catalog for pipelines in the default-python template (#1766 ) ## Summary Enables Unity Catalog for pipelines in the default template. Pipelines will default to non-Unity Catalog pipelines if a catalog is not specified. Small caveat: there are cases where admins lock down the default catalog of a workspace and don't allow the creation of a new schema there. If that happens, the pipeline would fail at runtime with a clear error indicating what happened. ("PERMISSION_DENIED: User does not have CREATE SCHEMA on Catalog 'main'."). I've seen this with an internal Databricks workspace, where creating new non-UC schemas wasn't locked down, but creation in the `main` was. ## Testing - Validated on a non-UC + UC workspace. The catalog selection logic here is the same as applied for the SQL templates.	2024-09-23 09:52:04 +00:00
Lennart Kats (databricks)	f2dee890b8	Use periodic triggers in all templates (#1739 ) ## Summary Simplifies template by using the periodic trigger syntax instead of the cron schedule syntax. Periodic triggers are simpler to configure, simpler to read, and make sure that workloads are spread out through the day. We only recommend cron syntax for advanced cases or when more control is needed. ## Testing * Templates validation via unit tests * Manual validation that the new triggers work as expected in dev/prod	2024-09-12 08:33:00 +00:00
Lennart Kats (databricks)	072fa812e2	Include a permissions section in all templates (#1713 ) ## Changes This updates the templates to include a `permissions` section. Having a permissions section is a best practice, is helpful to understand the notion of permissions, and helps diagnose permission errors (https://github.com/databricks/cli/pull/1386). This is a cherry-pick from https://github.com/databricks/cli/pull/1387. This change was verified to work both in dev and prod. Existing unit tests validate the validity of the templates in these modes.	2024-09-03 07:51:54 +00:00
Lennart Kats (databricks)	ed4a4585c0	Update templates to latest LTS DBR (#1715 ) ## Changes This updates the templates to use the latest DBR 15 LTS version. - [x] DB Connect 15.4 must be released + validated before this can go out	2024-08-30 07:32:10 +00:00
Lennart Kats (databricks)	2e000f1ebd	Use materialized views in the default-sql template (#1709 ) ## Changes Materialized views now support `CREATE OR REPLACE` ([docs](https://docs.databricks.com/en/sql/language-manual/sql-ref-syntax-ddl-create-materialized-view.html))! This makes it possible to use them with Workflows in DABs.This PR updates the template to use a materialized view rather than a regular view. ## Tests Manually validated in production.	2024-08-29 19:07:21 +00:00
Lennart Kats (databricks)	8238a6ad0a	Remove reference to "dbt" in the default-sql template (#1696 ) ## Changes The `default-sql` template inadvertently mentioned dbt in one of the prompts. This PR removes that reference.	2024-08-19 15:47:18 +00:00
kijewskimateusz	c7a36921b4	Fix non-default project names not working in dbt-sql template (#1500 ) ## Changes Hello Team, While tinkering with your solution, I've noticed that profiles provided in dbt_project.yml and profiles.yml for generated dbt asset bundles. do not align. This led to the following error, when deploying DAB: ``` + dbt deps --target=dev 11:24:02 Running with dbt=1.8.2 11:24:02 Warning: No packages were found in packages.yml 11:24:02 Warning: No packages were found in packages.yml + dbt seed --target=dev --vars '{ dev_schema: mateusz_kijewski }' 11:24:05 Running with dbt=1.8.2 11:24:05 Encountered an error: Runtime Error Could not find profile named 'dbt_sql' ``` I have corrected profile name in profiles.yml.tmpl to the name used in dbt_project.yml.tmpl. Using the opportunity of forking your repo, I've also updated tests configuration in model config as starting of dbt v1.8 it's been raising warnings of configuration change from tests to data_tests ``` 11:31:34 [WARNING]: Deprecated functionality The `tests` config has been renamed to `data_tests`. Please see https://docs.getdbt.com/docs/build/data-tests#new-data_tests-syntax for more information. ``` ## Tests <!-- How is this tested? -->	2024-07-01 07:52:22 +00:00
Pieter Noordhuis	533d357a71	Fix typo in DBT template (#1498 ) ## Changes Found in https://github.com/databricks/bundle-examples/pull/26. ## Tests n/a	2024-06-17 15:56:49 +00:00
Lennart Kats (databricks)	99c7d136d6	Fix conditional in query in `default-sql` template (#1479 ) ## Changes This corrects a mistake in the sample SQL identified by @pietern	2024-06-06 07:40:15 +00:00
Lennart Kats (databricks)	41678fa695	Copy-editing for SQL templates (#1474 ) ## Changes This applies changes suggested by @juliacrawf-db	2024-06-05 11:13:32 +00:00
Lennart Kats (databricks)	4bc0ea0af3	Fix SQL schema selection in default-sql template (#1471 ) ## Changes This fixes a last-minute regression that snuck into https://github.com/databricks/cli/pull/1463: unfortunately we need to use `USE IDENTIFIER('schema')` to select a schema for now. In the future we expect we can just use `USE SCHEMA 'schema'`.	2024-06-04 15:40:40 +00:00
Lennart Kats (databricks)	aa36aee159	Make dbt-sql and default-sql templates public (#1463 ) ## Changes This makes the dbt-sql and default-sql templates public. These templates were previously not listed and marked "experimental" since structured streaming tables were still in gated preview and would result in weird error messages when a workspace wasn't enabled for the preview. This PR also incorporates some of the feedback and learnings for these templates so far.	2024-06-04 08:57:13 +00:00
Fabian Jakobs	e61f0e1eb9	Fix DBConnect support in VS Code (#1253 ) ## Changes With the current template, we can't execute the Python file and the jobs notebook using DBConnect from VSCode because we import `from pyspark.sql import SparkSession`, which doesn't support Databricks unified auth. This PR fixes this by passing spark into the library code and by explicitly instantiating a spark session where the spark global is not available. Other changes: * add auto-reload to notebooks * add DLT typings for code completion	2024-03-05 14:31:27 +00:00
Lennart Kats (databricks)	162b115e19	Add an experimental default-sql template (#1051 ) ## Changes This adds a `default-sql` template! In this latest revision, I've hidden the new template from the list so we can merge it, iterate over it, and properly release the template at the right time. - [x] WorkspaceFS support for .sql files is in prod - [x] SQL extension is preconfigured based on extension settings (if possible) - [ ] Streaming tables support is either ungated or the template provides instructions about signup - _Mitigation for now: this template is hidden from the list of templates._ - [x] Support non-UC workspaces ## Tests - [x] Unit tests - [x] Manual testing - [x] More manual testing - [x] Reviewer testing --------- Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com> Co-authored-by: PaulCornellDB <paul.cornell@databricks.com>	2024-02-19 12:01:11 +00:00
Lennart Kats (databricks)	1c680121c8	Add an experimental dbt-sql template (#1059 ) ## Changes This adds a new dbt-sql template. This work requires the new WorkspaceFS support for dbt tasks. In this latest revision, I've hidden the new template from the list so we can merge it, iterate over it, and propertly release the template at the right time. Blockers: - [x] WorkspaceFS support for dbt projects is in prod - [x] Move dbt files into a subdirectory - [ ] Wait until the next (>1.7.4) release of the dbt plugin which will have major improvements! - _Rather than wait, this template is hidden from the list of templates._ - [x] SQL extension is preconfigured based on extension settings (if possible) - MV / streaming tables: - [x] Add to template - [x] Fix https://github.com/databricks/dbt-databricks/issues/535 (to be released with in 1.7.4) - [x] Merge https://github.com/databricks/dbt-databricks/pull/338 (to be released with in 1.7.4) - [ ] Fix "too many 503 errors" issue (https://github.com/databricks/dbt-databricks/issues/570, internal tracker: ES-1009215, ES-1014138) - [x] Support ANSI mode in the template - [ ] Streaming tables support is either ungated or the template provides instructions about signup - _Mitigation for now: this template is hidden from the list of templates._ - [x] Support non-workspace-admin deployment - [x] Make sure `data_security_mode: SINGLE_USER` works on non-UC workspaces (it's required to be explicitly specified on UC workspaces with single-node clusters) - [x] Support non-UC workspaces ## Tests - [x] Unit tests - [x] Manual testing - [x] More manual testing - [ ] Reviewer manual testing - _I'd like to do a small bug bash post-merging._ - [x] Unit tests	2024-02-19 09:15:17 +00:00
Lennart Kats (databricks)	167deec8c3	Change recommended production deployment path from /Shared to /Users (#1091 ) ## Changes This PR changes the default and `mode: production` recommendation to target `/Users` for deployment. Previously, we used `/Shared`, but because of a lack of POSIX-like permissions in WorkspaceFS this meant that files inside would be readable and writable by other users in the workspace. Detailed change: * `default-python` no longer uses a path that starts with `/Shared` * `mode: production` no longer requires a path that starts with `/Shared` ## Related PRs Docs: https://github.com/databricks/docs/pull/14585 Examples: https://github.com/databricks/bundle-examples/pull/17 ## Tests * Manual tests * Template unit tests (with an extra check to avoid /Shared)	2024-01-02 19:58:24 +00:00
Lennart Kats (databricks)	8b9930a49a	Improve default template (#1046 ) ## Changes - Tweak strings, documentation in template - Extend requirements-dev.txt with setuptools/wheel for building whl files - Clarify what the "_job.yml" file is for for users who are only interested in DLT pipelines (answering a question that came up recently) ## Tests Existing tests exercise this template	2023-12-11 19:13:14 +00:00
Andrew Nester	cdf29da27b	Change default_python template to auto-update version on each wheel build (#1034 ) ## Changes Change default_python template to auto-update version on each wheel build	2023-12-01 13:24:55 +00:00
Lennart Kats (databricks)	92539d4b9b	Work around DLT issue with `$PYTHONPATH` not being set correctly (#999 ) ## Changes DLT currently doesn't always set `$PYTHONPATH` correctly (ES-947370). This restores the original workaround to make new pipelines work while that issue is being addressed. The workaround was removed in #832. Manually tested.	2023-11-20 19:25:43 +00:00
shreyas-goenka	a5815a0b47	Add welcome message to bundle templates (#907 ) ## Changes Adds a welcome_message field to templates and the default python template. ## Tests Manually. Here's the output logs during template init now: ``` shreyas.goenka@THW32HFW6T bricks % cli bundle init Template to use [default-python]: Welcome to the sample Databricks Asset Bundle template! Please enter the following information to initialize your sample DAB. Unique name for this project [my_project]: abcde Include a stub (sample) notebook in 'abcde/src': no Include a stub (sample) Delta Live Tables pipeline in 'abcde/src': yes Include a stub (sample) Python package in 'abcde/src': no ✨ Your new project has been created in the 'abcde' directory! Please refer to the README.md of your project for further instructions on getting started. Or read the documentation on Databricks Asset Bundles at https://docs.databricks.com/dev-tools/bundles/index.html. ```	2023-10-25 12:27:25 +00:00
Lennart Kats (databricks)	a2ee8bb45b	Improve the output of the `databricks bundle init` command (#795 ) Improve the output of help, prompts, and so on for `databricks bundle init` and the default template. Among other things, this PR adds support for a new `welcome_message` property that lets a template print a custom message on success: ``` $ databricks bundle init Template to use [default-python]: Unique name for this project [my_project]: lennart_project Include a stub (sample) notebook in 'lennart_project/src': yes Include a stub (sample) Delta Live Tables pipeline in 'lennart_project/src': yes Include a stub (sample) Python package in 'lennart_project/src': yes ✨ Your new project has been created in the 'lennart_project' directory! Please refer to the README.md of your project for further instructions on getting started. Or read the documentation on Databricks Asset Bundles at https://docs.databricks.com/dev-tools/bundles/index.html. ``` --------- Co-authored-by: shreyas-goenka <88374338+shreyas-goenka@users.noreply.github.com>	2023-10-19 07:08:36 +00:00
Lennart Kats (databricks)	0894584132	Minor template tweaks (#832 ) ## Changes Minor tweaks to the template.	2023-10-04 15:27:09 +00:00
Lennart Kats (databricks)	0c1516c4ba	Make the default `databricks bundle init` template more self-explanatory (#796 ) This makes the default-python template more self-explanatory and adds a few other tweaks for a better out-of-the-box experience.	2023-09-26 09:12:34 +00:00
shreyas-goenka	757d5efe8d	Add support for regex patterns in template schema (#768 ) ## Changes This PR introduces support for regex pattern validation in our custom jsonschema validator. This allows us to fail early if a user enters an invalid value for a field. For example, now this is what initializing the default template looks like with an invalid project name: ``` shreyas.goenka@THW32HFW6T bricks % cli bundle init Template to use [default-python]: Unique name for this project [my_project]: (__) Error: invalid value for project_name: (__). Must consist of letter and underscores only. ``` ## Tests New unit tests and manually.	2023-09-25 09:53:38 +00:00
shreyas-goenka	be55310cc9	Use enums for default python template (#765 ) ## Changes This PR changes schema to use the enum type for the default template yes/no questions. ## Tests Manually	2023-09-13 17:57:31 +00:00
Lennart Kats (databricks)	a4e94e1b36	Fix author in setup.py (#761 ) Fix author in setup.py showing <no value>	2023-09-11 08:59:48 +00:00
Lennart Kats (databricks)	9e56bed593	Minor default template tweaks (#758 ) Minor template tweaks, mostly making the imports section for DLT notebooks a bit more elegant. Tested with DAB deployment + in-workspace UI.	2023-09-11 07:36:44 +00:00
shreyas-goenka	d9a276b17d	Fix minor typos in default-python template (#754 ) Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com>	2023-09-09 21:55:43 +00:00
Lennart Kats (databricks)	50b2c0b83b	Fix notebook showing up in template when not selected (#743 ) ## Changes This fixes a typo that caused the notebook.ipynb file to show up even if the user answered "no" to the question about including a notebook. ## Tests We have matrix validation tests for all the yes/no combinations and whether the build + validate. There is no current test for the absence of files.	2023-09-07 08:26:43 +00:00
Lennart Kats (databricks)	3c79181148	Remove unused file (#742 ) defaults.json was originally used in tests. It's no longer used and should be removed.	2023-09-06 18:18:15 +00:00
Lennart Kats (databricks)	f9e521b43e	databricks bundle init template v2: optional stubs, DLT support (#700 ) ## Changes This follows up on https://github.com/databricks/cli/pull/686. This PR makes our stubs optional + it adds DLT stubs: ``` $ databricks bundle init Template to use [default-python]: default-python Unique name for this project [my_project]: my_project Include a stub (sample) notebook in 'my_project/src' [yes]: yes Include a stub (sample) DLT pipeline in 'my_project/src' [yes]: yes Include a stub (sample) Python package 'my_project/src' [yes]: yes ✨ Successfully initialized template ``` ## Tests Manual testing, matrix tests. --------- Co-authored-by: Andrew Nester <andrew.nester@databricks.com> Co-authored-by: PaulCornellDB <paul.cornell@databricks.com> Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com>	2023-09-06 09:52:31 +00:00
Lennart Kats (databricks)	8c2cc07f7b	databricks bundle init template v1 (#686 ) ## Changes This adds a built-in "default-python" template to the CLI. This is based on the new default-template support of https://github.com/databricks/cli/pull/685. The goal here is to offer an experience where customers can simply type `databricks bundle init` to get a default template: ``` $ databricks bundle init Template to use [default-python]: default-python Unique name for this project [my_project]: my_project ✨ Successfully initialized template ``` The present template: - [x] Works well with VS Code - [x] Works well with the workspace - [x] Works well with DB Connect - [x] Uses minimal stubs rather than boiler-plate-heavy examples I'll have a followup with tests + DLT support. --------- Co-authored-by: Andrew Nester <andrew.nester@databricks.com> Co-authored-by: PaulCornellDB <paul.cornell@databricks.com> Co-authored-by: Pieter Noordhuis <pieter.noordhuis@databricks.com>	2023-09-05 11:58:34 +00:00
Lennart Kats (databricks)	a5b86093ec	Add a foundation for built-in templates (#685 ) ## Changes This pull request extends the templating support in preparation of a new, default template (WIP, https://github.com/databricks/cli/pull/686): * builtin templates that can be initialized using e.g. `databricks bundle init default-python` * builtin templates are embedded into the executable using go's `embed` functionality, making sure they're co-versioned with the CLI * new helpers to get the workspace name, current user name, etc. help craft a complete template * (not enabled yet) when the user types `databricks bundle init` they can interactively select the `default-python` template And makes two tangentially related changes: * IsServicePrincipal now uses the "users" API rather than the "principals" API, since the latter is too slow for our purposes. * mode: prod no longer requires the 'target.prod.git' setting. It's hard to set that from a template. (Pieter is planning an overhaul of warnings support; this would be one of the first warnings we show.) The actual `default-python` template is maintained in a separate PR: https://github.com/databricks/cli/pull/686 ## Tests Unit tests, manual testing	2023-08-25 09:03:42 +00:00

49 Commits