Building Blocks
Below is a list of building blocks currently available to projects and developers within the CLOE toolbox. Each block is designed to streamline development, enhance functionality, and provide robust solutions to common challenges.
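Architecture Overview
The toolbox follows a three-tier architecture designed for reusability and maintainability. Each tier builds on the one below it; the diagram shows example components and the dependencies between them: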
graph TD
    subgraph "Type 3: Project-Specific"
        A[Data Masking Tools]
        B[Access Control]
        C[Custom Validators]
    end
    subgraph "Type 2: Intermediary"
        D[Database Crawlers]
        E[API Clients]
        F[Data Processors]
    end
    subgraph "Type 1: Foundational"
        G[Database Connectors]
        H[Git Clients]
        I[Authentication]
    end
    A --> D
    B --> D
    C --> E
    D --> G
    D --> H
    E --> G
    E --> I
    F --> G
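The layering in the diagram can be checked mechanically. The following is a minimal sketch that encodes the diagram's nodes and edges as plain dictionaries and reports any dependency that does not point to a lower tier. The rule "depend only on a lower tier" is an assumption read off the diagram, and the names `TIERS`, `DEPENDENCIES`, and `find_violations` are hypothetical, not part of the toolbox.

```python
# Tier of each example component, copied from the diagram.
TIERS = {
    "Data Masking Tools": 3, "Access Control": 3, "Custom Validators": 3,
    "Database Crawlers": 2, "API Clients": 2, "Data Processors": 2,
    "Database Connectors": 1, "Git Clients": 1, "Authentication": 1,
}

# Edges of the diagram: component -> components it depends on.
DEPENDENCIES = {
    "Data Masking Tools": ["Database Crawlers"],
    "Access Control": ["Database Crawlers"],
    "Custom Validators": ["API Clients"],
    "Database Crawlers": ["Database Connectors", "Git Clients"],
    "API Clients": ["Database Connectors", "Authentication"],
    "Data Processors": ["Database Connectors"],
}


def find_violations(tiers, dependencies):
    """Return (component, dependency) pairs that do not point to a lower tier.

    Assumption: a component may only depend on a strictly lower tier.
    """
    violations = []
    for component, deps in dependencies.items():
        for dep in deps:
            if tiers[dep] >= tiers[component]:
                violations.append((component, dep))
    return violations


print(find_violations(TIERS, DEPENDENCIES))  # → []
```

An empty result means every edge in the diagram points downward. A check like this could run in CI against real package metadata, if the stricter rule is in fact the intended one.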
Building Block Types
Building blocks are grouped into three types:
- Type 1: Foundational Building Blocks - Core infrastructure components used by other packages
- Type 2: Intermediary Building Blocks - Versatile components for both projects and other packages
- Type 3: Project-Specific Building Blocks - Specialized solutions for specific project challenges
Type 3
- DBT Project Template
  Kickstarts and sets up a new dbt environment with a devcontainer and VS Code extensions.
  Tags: dbt, build tools, sql
- Azure Data Factory Auto Config
  Automatically configures Azure Data Factory using metadata.
  Tags: adf, azure data factory, etl, elt, metadata-driven
- Snowflake Policy Creator
  Creates Snowflake policies, such as data masking policies, using metadata.
  Tags: policies, snowflake
- Nessy
  Builds a lakehouse in Databricks.
  Tags: spark, Databricks, Fabric
- Python Package Template
  Quickly sets up a Python environment with devcontainers and pre-commit hooks.
  Tags: python, template, package
- Snowflake Clone Creator
  Automatically creates customizable Snowflake database clones for feature-driven development.
  Tags: clone, cicd, ci/cd, snowflake
- Snowflake RBAC Tool
  Automates the lifecycle of role-based access control in Snowflake.
  Tags: snowflake, security, rbac, metadata-driven
- SQL Composer
  Simplifies the creation of SQL for Spark or Snowflake using a simplified metadata format.
  Tags: transform, metadata-driven, simplified
- SQL Transformations
  Provides various transformations using the standard metadata format, including automatic lookups and complex table mappings.
  Tags: transform, metadata-driven, sql
- CLOE GUI
  A GUI that simplifies and automates reading and writing metadata.
  Tags: GUI, metadata, api, utility
- Snowflake Firewall Updater
  Automatically updates the Snowflake firewall with the IP ranges from Azure Service Tags.
  Tags: snowflake, network, terraform, utility, data-platform
- Synthetic Data Generator
  Automatically generates synthetic data.
  Tags: data generation, synthetic, test, utility, databricks
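To make the idea behind the Snowflake Clone Creator concrete, here is a minimal sketch that derives a clone name from a feature branch and builds the matching statement. `CREATE DATABASE ... CLONE ...` is standard Snowflake SQL, but the naming scheme and the function `clone_statement` are assumptions made for illustration and do not describe the tool's actual interface.

```python
import re

# Simple unquoted identifier: letters, digits, underscores, not starting with a digit.
_IDENTIFIER = re.compile(r"^[A-Za-z_][A-Za-z0-9_]*$")


def clone_statement(source_db: str, branch: str) -> str:
    """Build a CREATE DATABASE ... CLONE statement for a feature branch.

    Hypothetical naming scheme: <SOURCE_DB>_<SANITIZED_BRANCH>.
    """
    if not _IDENTIFIER.match(source_db):
        raise ValueError(f"not a simple identifier: {source_db!r}")
    # Replace every run of non-alphanumeric characters with one underscore.
    suffix = re.sub(r"[^A-Za-z0-9]+", "_", branch).strip("_").upper()
    if not suffix:
        raise ValueError(f"branch name yields an empty suffix: {branch!r}")
    return f"CREATE DATABASE IF NOT EXISTS {source_db}_{suffix} CLONE {source_db};"


print(clone_statement("ANALYTICS", "feature/add-orders"))
# → CREATE DATABASE IF NOT EXISTS ANALYTICS_FEATURE_ADD_ORDERS CLONE ANALYTICS;
```

Sanitizing the branch name keeps the result a valid unquoted identifier, so the statement can be executed without quoting rules getting in the way.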
Type 2
- ADF Job Template
  Provides Azure Data Factory pipeline templates for data extraction and transfer, deployable with Terraform.
  Tags: adf, template, templates
- Logging
  Provides a standardized logging interface for Python projects.
  Tags: python, logging, utility
- DDL Generator
  Generates DDL statements for creating tables and procedures in Snowflake from metadata.
  Tags: ddl, metadata to code, code, sql
- Snowflake Crawler
  Crawls a Snowflake instance and saves the information in the common metadata format.
  Tags: snowflake, crawler, utility
- SAP Crawler
  Scans an SAP system and saves the information in the common metadata format.
  Tags: crawler, sap, utility
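As an illustration of what a standardized logging interface, like the one the Logging block describes, might look like, the sketch below wraps Python's standard `logging` module so that every package gets the same format. The function `get_logger` and the format string are assumptions for this example, not the Logging block's real API.

```python
import logging
import sys

# One shared format so log lines from every package look the same.
LOG_FORMAT = "%(asctime)s | %(levelname)s | %(name)s | %(message)s"


def get_logger(name: str, level: int = logging.INFO) -> logging.Logger:
    """Return a logger with the shared format, configured at most once."""
    logger = logging.getLogger(name)
    logger.setLevel(level)
    if not logger.handlers:
        # Attach the handler only on first use to avoid duplicate lines.
        handler = logging.StreamHandler(sys.stderr)
        handler.setFormatter(logging.Formatter(LOG_FORMAT))
        logger.addHandler(handler)
    # Keep records out of the root logger so they are not printed twice.
    logger.propagate = False
    return logger


log = get_logger("cloe.example")
log.info("crawler started")
```

Calling `get_logger` repeatedly with the same name returns the same configured logger, which is why the handler check matters.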
Type 1
- API
  Adds functionality to the GUI, such as automatic mapping between columns.
  Tags: GUI, UI
- Metadata Definition
  Defines the metadata format as Pydantic classes for secure metadata handling.
  Tags: metadata, objects
- Git Client Tool
  Offers basic Git features, such as retrieving files from a specific commit or tag.
  Tags: git, utility
- Snowflake Connector
  Provides a standardized interface to connect with Snowflake, including CI/CD scenarios and various authentication methods.
  Tags: snowflake, connector, utility
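Retrieving a file from a specific commit or tag, as the Git Client Tool entry mentions, can be done with plain `git show <revision>:<path>`. The sketch below shells out to the `git` CLI. The function names are hypothetical, and this is not how the Git Client Tool itself is implemented.

```python
import subprocess


def build_show_command(repo_dir: str, revision: str, file_path: str) -> list[str]:
    """Build the git command that prints a file as of a commit or tag.

    file_path is relative to the repository root and uses forward slashes.
    """
    return ["git", "-C", repo_dir, "show", f"{revision}:{file_path}"]


def read_file_at_revision(repo_dir: str, revision: str, file_path: str) -> str:
    """Return the file's content at the given revision.

    Raises subprocess.CalledProcessError if the revision or path is unknown.
    """
    result = subprocess.run(
        build_show_command(repo_dir, revision, file_path),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

For example, `read_file_at_revision(".", "v1.2.0", "models/orders.sql")` would return that file as it existed at tag `v1.2.0`, assuming such a tag and file exist in the repository.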
Legacy Tools
- Airflow Components
  Adds functionality to Airflow, such as new tasks.
  Tags: Tag1, Tag2
- Airflow Metadata Configurator
  Creates DAGs from metadata in Snowflake. Requires the Airflow Components.
  Tags: Tag1, Tag2
- MSSQL Crawler
  Crawls MSSQL instances and saves the information as metadata.
  Tags: Tag1, Tag2
- MSSQL Connector
  Provides a standardized interface to connect with MSSQL, including CI/CD scenarios and various authentication methods.
  Tags: Tag1, Tag2
- SQL Orchestrator
  Metadata-driven, standalone orchestrator for executing stored procedures on a SQL server.
  Tags: Tag1, Tag2
- Information Schema Converter
  Converts an information schema CSV export to metadata.
  Tags: Tag1, Tag2