Last updated: July 22, 2025
dqo command-line command
The reference of the dqo command in DQOps. In the interactive shell of the DQOps Data Quality Operations Center, hit TAB to see the available commands, and hit ALT-S to toggle tailtips.
dqo
DQOps command-line entry point script
Description
dqo is an executable script installed in the local Python scripts folder when DQOps is installed locally by installing the dqops package from PyPI. When the Python environment's Scripts folder is on the PATH, running dqo from the command line (bash, etc.) starts a local DQOps instance.
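For example, a typical local installation and first launch might look like the following sketch; the virtual environment layout is illustrative, not required:

```bash
# Install the dqops package from PyPI, ideally inside a virtual environment.
python -m venv venv
source venv/bin/activate      # on Windows: venv\Scripts\activate
pip install dqops

# The 'dqo' entry point script is now on the PATH of the active environment;
# running it starts a local DQOps instance.
dqo
```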
Command line sample
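The dqo script accepts the options listed below directly on the command line. A minimal illustrative invocation; the port and user home folder are example values chosen for this sketch:

```bash
dqo --server.port=8888 --dqo.user.home=./dqops-userhome --dqo.logging.console=PATTERN
```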
Options
| Command argument | Description | Required | Accepted values |
|------------------|-------------|----------|-----------------|
| DQO_JAVA_OPTS | Configures additional JVM (Java Virtual Machine) options, such as the memory limit. The default value for both the 'dqops' Python package and the dqops/dqo Docker image is -XX:MaxRAMPercentage=80.0, which sets the upper memory limit to 80% of the RAM available at the moment the container starts. This parameter is not supported as a command-line parameter; it is only supported as an environment variable. Set (and export) the DQO_JAVA_OPTS environment variable before starting DQOps, as shown in the example after this table. | | |
| --dqo.cache.enabled | Enables or disables the in-memory cache for parsed YAML files and Parquet data files. This parameter can also be configured by setting the DQO_CACHE_ENABLED environment variable. | | |
| --dqo.cache.expire-after-seconds | The time in seconds after which cache entries expire, counted from when they were added to the cache. This parameter can also be configured by setting the DQO_CACHE_EXPIRE_AFTER_SECONDS environment variable. | | |
| --dqo.cache.file-lists-limit | The maximum number of folders for which the list of files is cached, to avoid repeated file listing. This parameter can also be configured by setting the DQO_CACHE_FILE_LISTS_LIMIT environment variable. | | |
| --dqo.cache.parquet-cache-memory-fraction | The maximum fraction of the JVM heap memory (configured using the -Xmx Java parameter) that is used to cache Parquet files in memory. The default value 0.6 means that up to 60% of the JVM heap memory can be used for caching files. The value of reserved-heap-memory-bytes is subtracted from the total memory size (the JVM's -Xmx or -XX:MaxRAMPercentage=80.0 parameter values) before the memory fraction is calculated. The value can be increased to 0.8 for systems where the JVM is given more than 8 GB of RAM. This parameter can also be configured by setting the DQO_CACHE_PARQUET_CACHE_MEMORY_FRACTION environment variable. | | |
| --dqo.cache.process-file-changes-delay-millis | The delay in milliseconds between processing file changes that would invalidate the cache. This parameter can also be configured by setting the DQO_CACHE_PROCESS_FILE_CHANGES_DELAY_MILLIS environment variable. | | |
| --dqo.cache.reserved-heap-memory-bytes | The memory size (in bytes) that is subtracted from the total JVM heap memory before the memory fraction dedicated to the Parquet cache is calculated. The default value is 200mb. This parameter can also be configured by setting the DQO_CACHE_RESERVED_HEAP_MEMORY_BYTES environment variable. | | |
| --dqo.cache.watch-file-system-changes | Uses a file watcher to detect file system changes and invalidate the in-memory file cache. When the file watcher is enabled, all changes made to YAML files directly on the file system (e.g., by editing a file in Visual Studio Code) are instantly detected by DQOps. This parameter can also be configured by setting the DQO_CACHE_WATCH_FILE_SYSTEM_CHANGES environment variable. | | |
| --dqo.cache.yaml-files-limit | The maximum number of specification files to cache. This parameter can also be configured by setting the DQO_CACHE_YAML_FILES_LIMIT environment variable. | | |
| --dqo.cli.terminal.width | Width of the terminal when no terminal window is available, e.g., in one-shot running mode. This parameter can also be configured by setting the DQO_CLI_TERMINAL_WIDTH environment variable. | | |
| --dqo.cloud.api-key | DQOps Cloud API key. Log in to https://cloud.dqops.com/ to get the key. This parameter can also be configured by setting the DQO_CLOUD_API_KEY environment variable. | | |
| --dqo.cloud.parallel-file-downloads | The number of files that are downloaded from DQOps Cloud in parallel using HTTP/2 multiplexing. This parameter can also be configured by setting the DQO_CLOUD_PARALLEL_FILE_DOWNLOADS environment variable. | | |
| --dqo.cloud.parallel-file-uploads | The number of files that are uploaded to DQOps Cloud in parallel using HTTP/2 multiplexing. This parameter can also be configured by setting the DQO_CLOUD_PARALLEL_FILE_UPLOADS environment variable. | | |
| --dqo.core.lock-wait-timeout-seconds | Sets the maximum wait timeout in seconds to obtain a lock to read or write files. This parameter can also be configured by setting the DQO_CORE_LOCK_WAIT_TIMEOUT_SECONDS environment variable. | | |
| --dqo.core.print-stack-trace | Prints a full stack trace for errors on the console. This parameter can also be configured by setting the DQO_CORE_PRINT_STACK_TRACE environment variable. | | |
| --dqo.default-time-zone | Default time zone name used to convert the server's local dates to a local time in a time zone that is relevant for the user. Use official IANA time zone names. When the parameter is not configured, DQOps uses the local time zone of the host running the application. The time zone can be reconfigured at a user settings level. This parameter can also be configured by setting the DQO_DEFAULT_TIME_ZONE environment variable. | | |
| --dqo.docker.user-home.allow-unmounted | When running DQOps in a Docker container, allows the DQOps user home folder to be initialized inside the container's filesystem if the folder has not been mounted to an external volume. This parameter can also be configured by setting the DQO_DOCKER_USER_HOME_ALLOW_UNMOUNTED environment variable. | | |
| --dqo.duckdb.memory-limit | The maximum memory limit of the system (e.g., 1GB). When not set, DuckDB uses 80% of the available RAM. This parameter can also be configured by setting the DQO_DUCKDB_MEMORY_LIMIT environment variable. | | |
| --dqo.duckdb.threads | The total number of threads used by the system. The default value is 1000. This parameter can also be configured by setting the DQO_DUCKDB_THREADS environment variable. | | |
| --dqo.error-sampling.samples-limit | The maximum number of error samples (invalid column values) captured for each data grouping when data grouping is configured, or for the whole table when data grouping is not configured. This parameter can also be configured by setting the DQO_ERROR_SAMPLING_SAMPLES_LIMIT environment variable. | | |
| --dqo.error-sampling.total-samples-limit | The maximum total number of error sampling results captured from a table when data grouping is enabled on the table and error samples are captured from multiple data groupings. This parameter can also be configured by setting the DQO_ERROR_SAMPLING_TOTAL_SAMPLES_LIMIT environment variable. | | |
| --dqo.error-sampling.truncated-strings-length | The maximum length of error samples captured from text columns (varchar, string, text, etc.) that are stored in the error samples table. DQOps truncates longer column values and stores only the first few characters, up to the character count limit defined by this parameter. This parameter can also be configured by setting the DQO_ERROR_SAMPLING_TRUNCATED_STRINGS_LENGTH environment variable. | | |
| --dqo.home | Overrides the path to the DQOps system home (DQO_HOME). The default DQO_HOME contains the definition of built-in data quality sensors, rules and libraries. This parameter can also be configured by setting the DQO_HOME environment variable. | | |
| --dqo.incidents.check-histogram-size | The size of the data quality check histogram that is generated for a preview of a data quality incident. This parameter can also be configured by setting the DQO_INCIDENTS_CHECK_HISTOGRAM_SIZE environment variable. | | |
| --dqo.incidents.column-histogram-size | The size of the column histogram that is generated for a preview of a data quality incident. This parameter can also be configured by setting the DQO_INCIDENTS_COLUMN_HISTOGRAM_SIZE environment variable. | | |
| --dqo.incidents.count-open-incidents-days | The time window between now and X days ago to scan for open incidents that are shown on the list of connections in the incidents section. This parameter can also be configured by setting the DQO_INCIDENTS_COUNT_OPEN_INCIDENTS_DAYS environment variable. | | |
| --dqo.incidents.partitioned-checks-time-window-days | The time window for the maximum age of a daily or monthly partition whose data quality issues are included in new data quality incidents when an issue is detected. Data quality issues on older partitions will not trigger creating a new incident. This parameter can also be configured by setting the DQO_INCIDENTS_PARTITIONED_CHECKS_TIME_WINDOW_DAYS environment variable. | | |
| --dqo.incidents.top-incidents-days | The time window between now and X days ago to scan for incidents that are shown on the main screen, grouped by a requested grouping. This parameter can also be configured by setting the DQO_INCIDENTS_TOP_INCIDENTS_DAYS environment variable. | | |
| --dqo.instance.name | DQOps instance name. DQOps uses this instance name when finding which data quality checks should be run on this DQOps instance. When a connection is limited to run scheduled data quality checks only on a named instance, the instance name must match. This parameter can be overridden in the instance's local settings file. This parameter can also be configured by setting the DQO_INSTANCE_NAME environment variable. | | |
| --dqo.instance.return-base-url | Base URL of this instance that is used as a return URL when authentication with DQOps Cloud credentials is forwarded and the user must be redirected back to the current instance from the https://cloud.dqops.com login screen. When this parameter is not provided, DQOps will use the URL from the "Host" HTTP header. This parameter can also be configured by setting the DQO_INSTANCE_RETURN_BASE_URL environment variable. | | |
| --dqo.instance.signature-key | DQOps local instance signature key that is used to issue and verify digital signatures on API keys. It is a base64-encoded byte array (32 bytes). When not configured, DQOps will generate a secure random key and store it in the .localsettings.dqosettings.yaml file. This parameter can also be configured by setting the DQO_INSTANCE_SIGNATURE_KEY environment variable. | | |
| --dqo.integrations.table-health-webhook-urls | A comma-separated list of webhook URLs where DQOps sends updates of the table data quality status changes. This parameter can also be configured by setting the DQO_INTEGRATIONS_TABLE_HEALTH_WEBHOOK_URLS environment variable. | | |
| --dqo.jdbc.expire-after-access-seconds | Sets the number of seconds after which a connection in a JDBC pool is expired since the last access. This parameter can also be configured by setting the DQO_JDBC_EXPIRE_AFTER_ACCESS_SECONDS environment variable. | | |
| --dqo.jdbc.max-connection-in-pool | Sets the maximum number of connections in the JDBC connection pool, shared across all data sources using JDBC drivers. This parameter can also be configured by setting the DQO_JDBC_MAX_CONNECTION_IN_POOL environment variable. | | |
| --dqo.logging.console | Enables logging to the console, selecting the format. The default configuration 'OFF' disables console logging, allowing the DQOps shell to be used without the distraction of log entries. Set the 'PATTERN' mode to send formatted entries to the console in a format similar to Apache logs. When running DQOps as a Docker container on a Kubernetes engine that is configured to capture DQOps container logs, use the 'JSON' mode to publish structured JSON log entries that can be parsed by fluentd or other similar log engines. JSON-formatted messages use a Logstash-compatible format. This parameter can also be configured by setting the DQO_LOGGING_CONSOLE environment variable. | | OFF, JSON, PATTERN |
| --dqo.logging.console-immediate-flush | When console logging is enabled with --dqo.logging.console=PATTERN or --dqo.logging.console=JSON, turns on (for 'true') or turns off (for 'false') immediate console flushing after each log entry is written. Immediate console flushing is desirable when DQOps is started as a Docker container and Docker logs from DQOps should be forwarded to Kubernetes for centralized logging. This parameter can also be configured by setting the DQO_LOGGING_CONSOLE_IMMEDIATE_FLUSH environment variable. | | |
| --dqo.logging.enable-user-home-logging | Enables file logging inside the DQOps User Home's .logs folder. This parameter can also be configured by setting the DQO_LOGGING_ENABLE_USER_HOME_LOGGING environment variable. | | |
| --dqo.logging.max-history | Sets the maximum number of log files that can be stored (archived) in the .logs folder. This parameter can also be configured by setting the DQO_LOGGING_MAX_HISTORY environment variable. | | |
| --dqo.logging.pattern | Log entry pattern for logback used for writing log entries. This parameter can also be configured by setting the DQO_LOGGING_PATTERN environment variable. | | |
| --dqo.logging.total-size-cap | Total size cap of the log files generated in the DQOps User Home's .logs folder. Supported suffixes are: kb, mb, gb. For example: 10mb, 2gb. This parameter can also be configured by setting the DQO_LOGGING_TOTAL_SIZE_CAP environment variable. | | |
| --dqo.logging.user-errors.checks-log-level | The logging level at which any errors captured during the data quality check evaluation are reported. When a data quality check is executed and the error is related to a sensor (query) or a rule (Python) function, it is reported as a sensor or rule issue. Only data quality check configuration issues that prevent running a data quality check are reported as check issues. The logging level for the whole application must be equal to or higher than this level for effective logging. Check logs are logged under the com.dqops.user-errors.checks log. This parameter can also be configured by setting the DQO_LOGGING_USER_ERRORS_CHECKS_LOG_LEVEL environment variable. | | ERROR, WARN, INFO, DEBUG, TRACE |
| --dqo.logging.user-errors.rules-log-level | The logging level at which any errors captured during the data quality rule (Python function) evaluation are reported. The logging level for the whole application must be equal to or higher than this level for effective logging. Rule logs are logged under the com.dqops.user-errors.rules log. This parameter can also be configured by setting the DQO_LOGGING_USER_ERRORS_RULES_LOG_LEVEL environment variable. | | ERROR, WARN, INFO, DEBUG, TRACE |
| --dqo.logging.user-errors.sensors-log-level | The logging level at which any errors captured during the data quality sensor (query) execution are reported. The logging level for the whole application must be equal to or higher than this level for effective logging. Sensor logs are logged under the com.dqops.user-errors.sensors log. This parameter can also be configured by setting the DQO_LOGGING_USER_ERRORS_SENSORS_LOG_LEVEL environment variable. | | ERROR, WARN, INFO, DEBUG, TRACE |
| --dqo.logging.user-errors.statistics-log-level | The logging level at which any errors captured during statistics collection are reported. The logging level for the whole application must be equal to or higher than this level for effective logging. Statistics logs are logged under the com.dqops.user-errors.statistics log. This parameter can also be configured by setting the DQO_LOGGING_USER_ERRORS_STATISTICS_LOG_LEVEL environment variable. | | ERROR, WARN, INFO, DEBUG, TRACE |
| --dqo.logging.user-errors.yaml-log-level | The logging level at which any errors captured during YAML file parsing are reported. The logging level for the whole application must be equal to or higher than this level for effective logging. YAML parsing logs are logged under the com.dqops.user-errors.yaml log. This parameter can also be configured by setting the DQO_LOGGING_USER_ERRORS_YAML_LOG_LEVEL environment variable. | | ERROR, WARN, INFO, DEBUG, TRACE |
| --dqo.metadata.auto-import-tables-limit | Sets the maximum number of tables that are imported from a data source by the auto import that is scheduled on the DQOps CRON scheduler. This parameter can also be configured by setting the DQO_METADATA_AUTO_IMPORT_TABLES_LIMIT environment variable. | | |
| --dqo.metadata.import.tables-import-limit | Sets the maximum number of tables that are imported from a data source. DQOps supports importing more tables by importing additional tables specified by a different table filter. This parameter can also be configured by setting the DQO_METADATA_IMPORT_TABLES_IMPORT_LIMIT environment variable. | | |
| --dqo.python.debug-mode | Configures the debugging (logging) mode used when running Python rules. Under the default configuration ('silent' mode), the data quality rules implemented in Python evaluate silently, without generating any log entries. This parameter changes the logging mode and instructs the rule runner to write the parameters passed to rules. The data is written as JSON files in the DQOps user home, in the .index/models folder, for each rule evaluation. These files can be used to debug rules by running them externally. This parameter can also be configured by setting the DQO_PYTHON_DEBUG_MODE environment variable. | | all, failed, exception, silent |
| --dqo.python.interpreter-name | A comma-separated list of possible Python interpreter executable names, such as 'python', 'python3', 'python3.exe', or an absolute path to the Python interpreter. When a list of interpreter names (not an absolute path) is used, DQOps will try to find the first Python interpreter executable in the directories defined in the PATH. This parameter can also be configured by setting the DQO_PYTHON_INTERPRETER_NAME environment variable. | | |
| --dqo.python.python-script-timeout-seconds | Python script execution time limit in seconds for running Jinja2 and rule evaluation scripts. This parameter can also be configured by setting the DQO_PYTHON_PYTHON_SCRIPT_TIMEOUT_SECONDS environment variable. | | |
| --dqo.python.use-host-python | Disables the creation of a Python virtual environment by DQOps on startup; the system Python interpreter is used instead. DQOps will not install any required Python packages on startup and will use the packages from the user's Python installation. This parameter can also be configured by setting the DQO_PYTHON_USE_HOST_PYTHON environment variable. | | |
| --dqo.queue.max-concurrent-jobs | Sets the maximum number of concurrent jobs that the job queue can process at once (running data quality checks, importing metadata, etc.). The maximum number of threads is also limited by the DQOps license. This parameter can also be configured by setting the DQO_QUEUE_MAX_CONCURRENT_JOBS environment variable. | | |
| --dqo.queue.wait-timeouts.collect-statistics | Sets the default timeout (in seconds) for the "collect statistics" REST API operation called from the DQOps client when the "wait" parameter is true and the timeout is not provided by the client. This parameter can also be configured by setting the DQO_QUEUE_WAIT_TIMEOUTS_COLLECT_STATISTICS environment variable. | | |
| --dqo.queue.wait-timeouts.default-wait-timeout | Sets the default wait timeout (in seconds) for waiting for a job when the "waitTimeout" parameter is not given in the call to the "waitForJob" operation from the DQOps client. This parameter can also be configured by setting the DQO_QUEUE_WAIT_TIMEOUTS_DEFAULT_WAIT_TIMEOUT environment variable. | | |
| --dqo.queue.wait-timeouts.delete-stored-data | Sets the default timeout (in seconds) for the "delete stored data" REST API operation called from the DQOps client when the "wait" parameter is true and the timeout is not provided by the client. This parameter can also be configured by setting the DQO_QUEUE_WAIT_TIMEOUTS_DELETE_STORED_DATA environment variable. | | |
| --dqo.queue.wait-timeouts.import-tables | Sets the default timeout (in seconds) for the "import tables" REST API operation called from the DQOps client when the "wait" parameter is true and the timeout is not provided by the client. This parameter can also be configured by setting the DQO_QUEUE_WAIT_TIMEOUTS_IMPORT_TABLES environment variable. | | |
| --dqo.queue.wait-timeouts.run-checks | Sets the default timeout (in seconds) for the "run checks" REST API operation called from the DQOps client when the "wait" parameter is true and the timeout is not provided by the client. This parameter can also be configured by setting the DQO_QUEUE_WAIT_TIMEOUTS_RUN_CHECKS environment variable. | | |
| --dqo.queue.wait-timeouts.synchronize-multiple-folders | Sets the default timeout (in seconds) for the "synchronize multiple folders" REST API operation called from the DQOps client when the "wait" parameter is true and the timeout is not provided by the client. This parameter can also be configured by setting the DQO_QUEUE_WAIT_TIMEOUTS_SYNCHRONIZE_MULTIPLE_FOLDERS environment variable. | | |
| --dqo.rule-mining.days-in-range-max-date-days-ahead | The number of days to add to the current system date that the DQOps rule mining engine uses to set the maximum date in the date_in_range data quality check. The default configuration sets a date that is 10 years ahead. This parameter can also be configured by setting the DQO_RULE_MINING_DAYS_IN_RANGE_MAX_DATE_DAYS_AHEAD environment variable. | | |
| --dqo.rule-mining.days-in-range-min-date-days-before | The number of days to subtract from the earliest date found in a column that the DQOps rule mining engine uses to set the minimum date in the date_in_range data quality check. The default configuration sets a date that is 2 days before. This parameter can also be configured by setting the DQO_RULE_MINING_DAYS_IN_RANGE_MIN_DATE_DAYS_BEFORE environment variable. | | |
| --dqo.rule-mining.default-fail-checks-at-percent-error-rows | The default percentage value captured by a profiling check (for example, 0.03% of errors or 99.97% of valid rows) that is used to propose a percentage rule that will treat the values as errors (i.e., max_percent = 0%, or min_percent = 100%). The default value is 2%. This parameter can also be configured by setting the DQO_RULE_MINING_DEFAULT_FAIL_CHECKS_AT_PERCENT_ERROR_ROWS environment variable. | | |
| --dqo.rule-mining.default-max-percent-error-rows | The default maximum percentage of invalid rows for which the rule engine should configure rule values, especially min_percent, min_count or max_percent. This parameter can also be configured by setting the DQO_RULE_MINING_DEFAULT_MAX_PERCENT_ERROR_ROWS environment variable. | | |
| --dqo.rule-mining.max-column-samples-to-propose-accepted-values | The maximum number of samples in the sample values to use when generating a proposed configuration of the accepted_values checks. If a column has more than this number of distinct values, DQOps will not configure the found in set checks. The default value is 50 samples. This parameter can also be configured by setting the DQO_RULE_MINING_MAX_COLUMN_SAMPLES_TO_PROPOSE_ACCEPTED_VALUES environment variable. | | |
| --dqo.rule-mining.max-distinct-count | The default maximum distinct count that is used to activate the distinct_count check. Above the limit, DQOps will configure the distinct_percent check. The default value is 1000 rows. This parameter can also be configured by setting the DQO_RULE_MINING_MAX_DISTINCT_COUNT environment variable. | | |
| --dqo.rule-mining.max-expected-texts-in-top-values | The maximum number of top (most common) text values that are added to the expected_texts_in_top_values check. The default value is 5 top text values. This parameter can also be configured by setting the DQO_RULE_MINING_MAX_EXPECTED_TEXTS_IN_TOP_VALUES environment variable. | | |
| --dqo.rule-mining.min-count-rate | The rate applied to the current count value (such as the row count or the count of not-null values) obtained from statistics, which the check mining engine uses to configure the row_count and not_nulls_count checks. The default value is 0.90 of the current count captured by the statistics (row count, not-nulls count). This parameter can also be configured by setting the DQO_RULE_MINING_MIN_COUNT_RATE environment variable. | | |
| --dqo.rule-mining.min-max-value-rate-delta | The default delta that is added to the proposed maximum value, or subtracted from a proposed minimum value. It is calculated as a rate of the current value. The default value is 0.1, which means that when the detected minimum value is 5.0, the proposed minimum value in the rule will be 4.5. This parameter can also be configured by setting the DQO_RULE_MINING_MIN_MAX_VALUE_RATE_DELTA environment variable. | | |
| --dqo.rule-mining.min-reasonable-not-null-count | The default minimum reasonable count of not-null values that must be satisfied to apply some checks that validate a range of row counts (like a distinct count between). This parameter can also be configured by setting the DQO_RULE_MINING_MIN_REASONABLE_NOT_NULL_COUNT environment variable. | | |
| --dqo.rule-mining.min-word-count | The minimum word count that is required to apply the word count in range checks. The default value is 3 words. This parameter can also be configured by setting the DQO_RULE_MINING_MIN_WORD_COUNT environment variable. | | |
| --dqo.rule-mining.not-null-count-rate-for-duplicate-count | The default rate (fraction) of the number of rows with not-null values that must contain distinct values to apply the distinct count between check. The default is 0.01, which is 1% of the count of not-null values. This parameter can also be configured by setting the DQO_RULE_MINING_NOT_NULL_COUNT_RATE_FOR_DUPLICATE_COUNT environment variable. | | |
| --dqo.rule-mining.percent-check-delta-rate | The multiplier of the last known percentage that is extended by this delta (as a rate/proportion of the percentage) to configure a passing percentage check. The default value is 0.3. For this value, when the last known max_percent was 10%, the DQOps rule mining engine will propose a safe max_percent of 13%. The additional 3% delta is 0.3 * 10%. This parameter can also be configured by setting the DQO_RULE_MINING_PERCENT_CHECK_DELTA_RATE environment variable. | | |
| --dqo.rule-mining.timeliness-max-days-multiplier | The multiplier of the last known table timeliness check results (freshness, staleness, ingestion delay) that is used to propose the configuration of the max days rule threshold by the rule mining engine. The default value is 2.0x the last known delay. This parameter can also be configured by setting the DQO_RULE_MINING_TIMELINESS_MAX_DAYS_MULTIPLIER environment variable. | | |
| --dqo.scheduler.check-run-mode | Configures the console logging mode for the "check run" jobs performed by the job scheduler in the background. This parameter can also be configured by setting the DQO_SCHEDULER_CHECK_RUN_MODE environment variable. | | silent, summary, info, debug |
| --dqo.scheduler.default-schedules.monitoring-daily | Sets the default schedule for running daily monitoring checks that is copied to the configuration of new data source connections that are registered in DQOps. The default schedule runs checks once a day at 12 PM (noon). This parameter is used only once, during the first initialization of the DQOps user home. The value is copied to the settings/defaultschedules.dqoschedules.yaml file. This parameter can also be configured by setting the DQO_SCHEDULER_DEFAULT_SCHEDULES_MONITORING_DAILY environment variable. | | |
| --dqo.scheduler.default-schedules.monitoring-monthly | Sets the default schedule for running monthly monitoring checks that is copied to the configuration of new data source connections that are registered in DQOps. The default schedule runs checks once a day at 12 PM (noon). This parameter is used only once, during the first initialization of the DQOps user home. The value is copied to the settings/defaultschedules.dqoschedules.yaml file. This parameter can also be configured by setting the DQO_SCHEDULER_DEFAULT_SCHEDULES_MONITORING_MONTHLY environment variable. | | |
| --dqo.scheduler.default-schedules.partitioned-daily | Sets the default schedule for running daily partitioned checks that is copied to the configuration of new data source connections that are registered in DQOps. The default schedule runs checks once a day at 12 PM (noon). This parameter is used only once, during the first initialization of the DQOps user home. The value is copied to the settings/defaultschedules.dqoschedules.yaml file. This parameter can also be configured by setting the DQO_SCHEDULER_DEFAULT_SCHEDULES_PARTITIONED_DAILY environment variable. | | |
| --dqo.scheduler.default-schedules.partitioned-monthly | Sets the default schedule for running monthly partitioned checks that is copied to the configuration of new data source connections that are registered in DQOps. The default schedule runs checks once a day at 12 PM (noon). This parameter is used only once, during the first initialization of the DQOps user home. The value is copied to the settings/defaultschedules.dqoschedules.yaml file. This parameter can also be configured by setting the DQO_SCHEDULER_DEFAULT_SCHEDULES_PARTITIONED_MONTHLY environment variable. | | |
| --dqo.scheduler.default-schedules.profiling | Sets the default schedule for running profiling checks that is copied to the configuration of new data source connections that are registered in DQOps. The default schedule runs profiling checks once a month, on the first day of the month at 1 AM. This parameter is used only once, during the first initialization of the DQOps user home. The value is copied to the settings/defaultschedules.dqoschedules.yaml file. This parameter can also be configured by setting the DQO_SCHEDULER_DEFAULT_SCHEDULES_PROFILING environment variable. | | |
| --dqo.scheduler.enable-cloud-sync | Enables synchronization of metadata and results with DQOps Cloud in the job scheduler. This parameter can also be configured by setting the DQO_SCHEDULER_ENABLE_CLOUD_SYNC environment variable. | | |
| --dqo.scheduler.start | Starts the job scheduler on startup (true) or disables the job scheduler (false). This parameter can also be configured by setting the DQO_SCHEDULER_START environment variable. | | |
| --dqo.scheduler.synchronization-mode | Configures the console logging mode for the "cloud sync all" operations performed by the job scheduler in the background. This parameter can also be configured by setting the DQO_SCHEDULER_SYNCHRONIZATION_MODE environment variable. | | silent, summary, debug |
| --dqo.scheduler.synchronize-cron-schedule | Unix cron expression that configures how often the scheduler will synchronize the local copy of the metadata with DQOps Cloud and detect new cron schedules. The default schedule synchronizes local files with DQOps Cloud and refreshes the data quality data warehouse 5 minutes past each hour. A DQOps instance that uses a FREE or a trial PERSONAL license ignores this setting and synchronizes files once an hour, at a random time. Synchronization with DQOps Cloud can be disabled by setting --dqo.scheduler.enable-cloud-sync=false. This parameter can also be configured by setting the DQO_SCHEDULER_SYNCHRONIZE_CRON_SCHEDULE environment variable. | | |
| --dqo.scheduler.synchronized-folders | Configures which folders from the DQOps user home folder are synchronized to DQOps Cloud during a monitoring synchronization (triggered by a cron schedule configured by --dqo.scheduler.synchronize-cron-schedule). By default, DQOps synchronizes (pushes) only changes from folders that have local changes. This parameter can also be configured by setting the DQO_SCHEDULER_SYNCHRONIZED_FOLDERS environment variable. | | all, locally_changed |
| --dqo.secrets.enable-gcp-secret-manager | Enables the GCP Secret Manager, which is used to resolve secret parameters referenced in the YAML files. This parameter can also be configured by setting the DQO_SECRETS_ENABLE_GCP_SECRET_MANAGER environment variable. | | |
| --dqo.secrets.gcp-project-id | GCP project name with a GCP Secret Manager enabled to pull the secrets. This parameter can also be configured by setting the DQO_SECRETS_GCP_PROJECT_ID environment variable. | | |
| --dqo.sensor.limit.fail-on-sensor-readout-limit-exceeded | Configures the behavior when the number of rows returned from a data quality sensor exceeds the limit configured in the 'sensor-readout-limit' parameter. When true, the whole check execution is failed. When false, only results up to the limit are analyzed. The default value is true. This parameter can also be configured by setting the DQO_SENSOR_LIMIT_FAIL_ON_SENSOR_READOUT_LIMIT_EXCEEDED environment variable. | | |
| --dqo.sensor.limit.max-merged-queries | The maximum number of queries that are merged into a bigger query, to calculate multiple sensors on the same table and to analyze multiple columns from the same table. This parameter can also be configured by setting the DQO_SENSOR_LIMIT_MAX_MERGED_QUERIES environment variable. | | |
| --dqo.sensor.limit.sensor-readout-limit | Default row count limit retrieved by a data quality sensor from the results of an SQL query for non-partitioned checks (profiling and monitoring). This is the row count limit applied when querying the data source. When the data grouping configuration sets up a GROUP BY on too many columns, or on columns with too many distinct values, the data source will return too many results to store them as data quality check results and sensor readouts. DQOps will discard the additional values returned from the data source or raise an error. This parameter can also be configured by setting the DQO_SENSOR_LIMIT_SENSOR_READOUT_LIMIT environment variable. | | |
| --dqo.sensor.limit.sensor-readout-limit-partitioned | Default row count limit retrieved by a data quality sensor from the results of an SQL query for partitioned checks. This is the row count limit applied when querying the data source. When the data grouping configuration sets up a GROUP BY on too many columns, or on columns with too many distinct values, the data source will return too many results to store them as data quality check results and sensor readouts. DQOps will discard the additional values returned from the data source or return an error. The default value is 7x bigger than the sensor-readout-limit to allow analyzing the last 7 daily partitions. This parameter can also be configured by setting the DQO_SENSOR_LIMIT_SENSOR_READOUT_LIMIT_PARTITIONED environment variable. | | |
| --dqo.smtp-server.host | Sets the host name of the SMTP server that is used to send email notifications. This parameter can also be configured by setting the DQO_SMTP_SERVER_HOST environment variable. | | |
| --dqo.smtp-server.password | Sets the password of the SMTP server that is used to send email notifications. This parameter can also be configured by setting the DQO_SMTP_SERVER_PASSWORD environment variable. | | |
| --dqo.smtp-server.port | Sets the port number of the SMTP server that is used to send email notifications. This parameter can also be configured by setting the DQO_SMTP_SERVER_PORT environment variable. | | |
| --dqo.smtp-server.use-ssl | Configures if the SMTP server that is used to send email notifications uses SSL protocol. This parameter can also be configured by setting the DQO_SMTP_SERVER_USE_SSL environment variable. | | |
| --dqo.smtp-server.username | Sets the username of the SMTP server that is used to send email notifications. This parameter can also be configured by setting the DQO_SMTP_SERVER_USERNAME environment variable. | | |
| --dqo.statistics.samples-limit | The limit of column value samples that are collected when the basic table statistics are gathered. DQOps collects only the most popular values, which is determined by the number of value occurrences. This parameter can also be configured by setting the DQO_STATISTICS_SAMPLES_LIMIT environment variable. | | |
| --dqo.statistics.truncated-strings-length | The length of samples captured from text columns (varchar, string, text, etc.) that are stored as samples. DQOps truncates longer column values and stores only the first few characters, up to the character count limit defined by this parameter. This parameter can also be configured by setting the DQO_STATISTICS_TRUNCATED_STRINGS_LENGTH environment variable. | | |
| --dqo.statistics.viewed-statistics-age-months | The maximum age (in months) of the basic statistics that are shown on the basic statistics screen. Statistics values captured earlier are still stored, but are not shown in the DQOps UI. This parameter can also be configured by setting the DQO_STATISTICS_VIEWED_STATISTICS_AGE_MONTHS environment variable. | | |
| --dqo.user.home | Overrides the path to the DQOps user home. The default user home is created in the current folder (.). This parameter can also be configured by setting the DQO_USER_HOME environment variable. | | |
| --dqo.user.initialize-user-home | Initializes an empty DQOps user home (identified by the DQO_USER_HOME environment variable) without asking the user for confirmation. This parameter can also be configured by setting the DQO_USER_INITIALIZE_USER_HOME environment variable. | | |
| --dqo.webserver.authentication-method | User authentication method. A standalone instance has no user authentication. Paid versions of DQOps support federated authentication using Single-Sign-On. Please contact DQOps sales for details: https://dqops.com/contact-us/. This parameter can also be configured by setting the DQO_WEBSERVER_AUTHENTICATION_METHOD environment variable. | | none, dqops_cloud, oauth2 |
| -fw, --file-write | Writes the command response to a file. | | |
| --headless, -hl | Starts DQOps in headless mode. When DQOps runs in headless mode and the application cannot start because the DQOps Cloud API key is missing or the DQOps user home folder is not configured, DQOps will stop silently instead of asking the user to approve the setup of the DQOps user home folder structure and/or log in to DQOps Cloud. | | |
| -h, --help | Shows the help for the command and its parameters. | | |
| --logging.level.com.dqops | Default logging level for the DQOps runtime. This parameter can also be configured by setting the LOGGING_LEVEL_COM_DQOPS environment variable. | | ERROR, WARN, INFO, DEBUG, TRACE |
| --logging.level.root | Default logging level at the root level of the logging hierarchy. This parameter can also be configured by setting the LOGGING_LEVEL_ROOT environment variable. | | ERROR, WARN, INFO, DEBUG, TRACE |
| -of, --output-format | Output format for tabular responses. | | TABLE, CSV, JSON |
| --server.port | Sets the web server port to host the DQOps local web UI. This parameter can also be configured by setting the SERVER_PORT environment variable. | | |
| --silent | Starts DQOps in a silent mode, without showing the banner and any other information. This parameter can also be configured by setting the SILENT environment variable. | | |
| --spring.config.location | Sets a path to the folder that contains the Spring configuration files (application.properties or application.yml), or directly to an application.properties or application.yml file. The format of this value is: --spring.config.location=file:./foldername/,file:./alternativeapplication.yml. This parameter can also be configured by setting the SPRING_CONFIG_LOCATION environment variable. | | |
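The --dqo.* parameters above can also be provided through their corresponding environment variables, which is the usual approach when running DQOps in a container. A minimal sketch, assuming a bash shell; the values shown are illustrative, not recommendations:

```bash
# JVM options are supported only as an environment variable
# (see the DQO_JAVA_OPTS entry above).
export DQO_JAVA_OPTS="-XX:MaxRAMPercentage=60.0"

# Equivalent to --dqo.cache.parquet-cache-memory-fraction=0.8
# and --dqo.scheduler.start=false on the command line.
export DQO_CACHE_PARQUET_CACHE_MEMORY_FRACTION=0.8
export DQO_SCHEDULER_START=false

dqo
```

When starting the dqops/dqo Docker image, the same variables can be passed with docker run -e; remember to mount the DQOps user home folder to an external volume or set --dqo.docker.user-home.allow-unmounted=true, as described in the table above.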