Explore OSS #4

Closed
opened 2024-10-03 20:28:10 +00:00 by steve · 3 comments
Owner

Products

  • Apache Drill (Comes with Zookeeper)
  • Apache Druid
  • ClickHouse
  • PostgreSQL, MySQL
  • MsSQL - Microsoft
  • Apache Iceberg
  • pyiceberg (python plugin, will be needed for ETL)

Installation and Configuration

  • Work in a cohesive unit
**Products** - [x] Apache Drill (Comes with Zookeeper) - [ ] Apache Druid - [ ] ClickHouse - [x] PostgreSQL, MySQL - [ ] MsSQL - Microsoft - [x] Apache Iceberg - [ ] ~~pyiceberg (python plugin, will be needed for ETL)~~ Installation and Configuration - [x] Work in a cohesive unit
steve self-assigned this 2024-10-03 20:28:10 +00:00
steve added this to the MZ-HEALTH project 2024-10-03 20:28:10 +00:00
Author
Owner
  • Apache Drill requires both zookeeper and some jdbc drivers (to store in 3rdparty)
  • Clickhouse may not be very practical because it requires structural specifications (schema)
  • Iceberg, build with spark (have the code hosted on hiplab later on)
- **Apache Drill** requires both zookeeper and some **jdbc drivers** (to store in 3rdparty) - **Clickhouse** may not be very practical because it requires structural specifications (schema) - **Iceberg**, build with spark (have the code hosted on hiplab later on)
Author
Owner

data-transport 2.4 has support for iceberg (no streaming yet)

data-transport 2.4 has support for iceberg (no streaming yet)
Author
Owner
  • Drill allows us to have visibility over all the databases (jdbc compatibility) including Iceberg

  • Instead of using pyIceberg, I opted for pyspark because pyIceberg is not compatible with python 3.9. I also have built support for iceberg to data-transport.

- **Drill** allows us to have visibility over all the databases (jdbc compatibility) including Iceberg - Instead of using pyIceberg, I opted for pyspark because pyIceberg is not compatible with python 3.9. I also have built support for iceberg to data-transport.
steve added this to the version 0.1 milestone 2024-10-18 17:15:24 +00:00
steve closed this issue 2024-11-07 18:57:40 +00:00
Sign in to join this conversation.
No Label
No Milestone
No project
No Assignees
1 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Dependencies

No dependencies set.

Reference: mz-health/mz-pop-health#4
No description provided.