
Last updated: July 22, 2025

dqo command-line command

The reference of the dqo command in DQOps. In the DQOps Data Quality Operations Center interactive shell, hit TAB to see available commands and ALT-S to toggle tailtips.


dqo

DQOps command-line entry point script

Description

dqo is an executable script installed in the local Python Scripts folder when DQOps is installed locally by installing the dqops package from PyPI. When the Python environment's Scripts folder is on the PATH, running dqo from the command line (bash, etc.) starts a local DQOps instance.

Command line sample

$ dqo [root_level_parameter] [command]

Example

$ dqo --dqo.cloud.api-key=3242424324242 check run -c=connection_name

Options

The following list describes each command argument, the environment variable that can also configure it, and the accepted values where the argument takes a restricted set.
DQO_JAVA_OPTS
Configures additional JVM (Java Virtual Machine) options, such as the memory limit. The default value for both the 'dqops' Python package and the dqops/dqo Docker image is -XX:MaxRAMPercentage=80.0, which sets the upper memory limit to 80% of the RAM available at the moment when the container starts. This parameter is not supported as a command-line parameter; it is only supported as an environment variable. Set (and export) the DQO_JAVA_OPTS environment variable before starting DQOps.
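For example, on Linux or macOS the memory limit could be lowered before starting DQOps like this (the 60% value is only illustrative):
$ export DQO_JAVA_OPTS="-XX:MaxRAMPercentage=60.0"
$ dqo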
--dqo.cache.enabled
Enables or disables the in-memory cache for parsed YAML files and Parquet data files.
This parameter can also be configured by setting the DQO_CACHE_ENABLED environment variable.
--dqo.cache.expire-after-seconds
The time in seconds to expire the cache entries since they were added to the cache.
This parameter can also be configured by setting the DQO_CACHE_EXPIRE_AFTER_SECONDS environment variable.
--dqo.cache.file-lists-limit
The maximum number of folders for which the list of files are cached to avoid listing the files.
This parameter can also be configured by setting the DQO_CACHE_FILE_LISTS_LIMIT environment variable.
--dqo.cache.parquet-cache-memory-fraction
The maximum fraction of the JVM heap memory (configured using the -Xmx java parameter) that is used to cache parquet files in memory. The default value 0.6 means that up to 60% of the JVM heap memory can be used for caching files. The value of the reserved-heap-memory-bytes is subtracted from the total memory size (the JVM's -Xmx or -XX:MaxRAMPercentage=80.0 parameter values) before the memory fraction is calculated. The value can be increased to 0.8 for systems where the JVM is given more than 8 GB of RAM.
This parameter can also be configured by setting the DQO_CACHE_PARQUET_CACHE_MEMORY_FRACTION environment variable.
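For instance, on a host where the JVM is given more than 8 GB of RAM, the cache fraction could be raised as suggested above (the value shown is illustrative):
$ dqo --dqo.cache.parquet-cache-memory-fraction=0.8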
--dqo.cache.process-file-changes-delay-millis
The delay in milliseconds between processing file changes that would invalidate the cache.
This parameter can also be configured by setting the DQO_CACHE_PROCESS_FILE_CHANGES_DELAY_MILLIS environment variable.
--dqo.cache.reserved-heap-memory-bytes
The memory size (in bytes) that is subtracted from the total JVM heap memory before the memory fraction dedicated to the parquet cache is calculated. The default value is 200 MB.
This parameter can also be configured by setting the DQO_CACHE_RESERVED_HEAP_MEMORY_BYTES environment variable.
--dqo.cache.watch-file-system-changes
Uses a file watcher to detect file system changes and invalidate the in-memory file cache. When the file watcher is enabled, all changes made to YAML files directly on the file system (e.g., by editing a file in Visual Studio Code) are instantly detected by DQOps.
This parameter can also be configured by setting the DQO_CACHE_WATCH_FILE_SYSTEM_CHANGES environment variable.
--dqo.cache.yaml-files-limit
The maximum number of specification files to cache.
This parameter can also be configured by setting the DQO_CACHE_YAML_FILES_LIMIT environment variable.
--dqo.cli.terminal.width
Width of the terminal when no terminal window is available, e.g. in one-shot running mode.
This parameter can also be configured by setting the DQO_CLI_TERMINAL_WIDTH environment variable.
--dqo.cloud.api-key
DQOps Cloud API key. Log in to https://cloud.dqops.com/ to get the key.
This parameter can also be configured by setting the DQO_CLOUD_API_KEY environment variable.
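For example, instead of passing the key on the command line as in the sample at the top of this page, the key can be exported as an environment variable (the key value is a placeholder):
$ export DQO_CLOUD_API_KEY=3242424324242
$ dqo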
--dqo.cloud.parallel-file-downloads
The number of files that are downloaded from DQOps Cloud in parallel using HTTP/2 multiplexing.
This parameter can also be configured by setting the DQO_CLOUD_PARALLEL_FILE_DOWNLOADS environment variable.
--dqo.cloud.parallel-file-uploads
The number of files that are uploaded to DQOps Cloud in parallel using HTTP/2 multiplexing.
This parameter can also be configured by setting the DQO_CLOUD_PARALLEL_FILE_UPLOADS environment variable.
--dqo.core.lock-wait-timeout-seconds
Sets the maximum wait timeout in seconds to obtain a lock to read or write files.
This parameter can also be configured by setting the DQO_CORE_LOCK_WAIT_TIMEOUT_SECONDS environment variable.
--dqo.core.print-stack-trace
Prints a full stack trace for errors on the console.
This parameter can also be configured by setting the DQO_CORE_PRINT_STACK_TRACE environment variable.
--dqo.default-time-zone
Default time zone name used to convert the server's local dates to a local time in a time zone that is relevant for the user. Use official IANA time zone names. When the parameter is not configured, DQOps uses the local time zone of the host running the application. The time zone can be reconfigured at a user settings level.
This parameter can also be configured by setting the DQO_DEFAULT_TIME_ZONE environment variable.
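For example, to run DQOps with a fixed IANA time zone (the zone shown is just an example):
$ dqo --dqo.default-time-zone=Europe/Warsaw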
--dqo.docker.user-home.allow-unmounted
When running DQOps in a docker container, allow DQOps user home folder to be initialized inside the container's filesystem if the folder hasn't been mounted to an external volume.
This parameter can also be configured by setting the DQO_DOCKER_USER_HOME_ALLOW_UNMOUNTED environment variable.
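A minimal sketch of starting the dqops/dqo Docker image without mounting a user home volume; the port mapping is illustrative and should be adjusted to your deployment:
$ docker run -d -p 8888:8888 dqops/dqo --dqo.docker.user-home.allow-unmounted=true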
--dqo.duckdb.memory-limit
The maximum memory of the system (e.g., 1GB). When not set, DuckDB uses 80% of the available RAM.
This parameter can also be configured by setting the DQO_DUCKDB_MEMORY_LIMIT environment variable.
--dqo.duckdb.threads
The number of total threads used by the system. The default value is 1000.
This parameter can also be configured by setting the DQO_DUCKDB_THREADS environment variable.
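For example, to cap DuckDB memory and threads on a small host (both values are illustrative):
$ dqo --dqo.duckdb.memory-limit=4GB --dqo.duckdb.threads=8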
--dqo.error-sampling.samples-limit
The maximum number of error samples (invalid column values) captured for each data grouping when data grouping is configured, or for the whole table when data grouping is not configured.
This parameter can also be configured by setting the DQO_ERROR_SAMPLING_SAMPLES_LIMIT environment variable.
--dqo.error-sampling.total-samples-limit
The maximum total number of error samples captured from a table when data grouping is enabled on the table and error samples are captured from multiple data groupings.
This parameter can also be configured by setting the DQO_ERROR_SAMPLING_TOTAL_SAMPLES_LIMIT environment variable.
--dqo.error-sampling.truncated-strings-length
The maximum length of error samples captured from text columns (varchar, string, text, etc.) that are stored in the error samples table. DQOps truncates longer column values and stores only the first few characters, up to the character count limit defined by this parameter.
This parameter can also be configured by setting the DQO_ERROR_SAMPLING_TRUNCATED_STRINGS_LENGTH environment variable.
--dqo.home
Overrides the path to the DQOps system home (DQO_HOME). The default DQO_HOME contains the definition of built-in data quality sensors, rules and libraries.
This parameter can also be configured by setting the DQO_HOME environment variable.
--dqo.incidents.check-histogram-size
The size of the data quality check histogram that is generated for a preview of a data quality incident.
This parameter can also be configured by setting the DQO_INCIDENTS_CHECK_HISTOGRAM_SIZE environment variable.
--dqo.incidents.column-histogram-size
The size of the column histogram that is generated for a preview of a data quality incident.
This parameter can also be configured by setting the DQO_INCIDENTS_COLUMN_HISTOGRAM_SIZE environment variable.
--dqo.incidents.count-open-incidents-days
The time window between now and X days ago to scan for open incidents that are shown on the list of connections in the incidents section.
This parameter can also be configured by setting the DQO_INCIDENTS_COUNT_OPEN_INCIDENTS_DAYS environment variable.
--dqo.incidents.partitioned-checks-time-window-days
The time window for the maximum age of a daily or monthly partition whose data quality issues are included in new data quality incidents when an issue is detected. Data quality issues on older partitions will not trigger creating a new incident.
This parameter can also be configured by setting the DQO_INCIDENTS_PARTITIONED_CHECKS_TIME_WINDOW_DAYS environment variable.
--dqo.incidents.top-incidents-days
The time window between now and X days ago to scan for incidents that are shown on the main screen, grouped by a requested grouping.
This parameter can also be configured by setting the DQO_INCIDENTS_TOP_INCIDENTS_DAYS environment variable.
--dqo.instance.name
DQOps instance name. DQOps uses this instance name to find which data quality checks should be run on this DQOps instance. When a connection is limited to run scheduled data quality checks only on a named instance, the instance name must match. This parameter can be overwritten in the instance's local settings file.
This parameter can also be configured by setting the DQO_INSTANCE_NAME environment variable.
--dqo.instance.return-base-url
The base URL of this instance that is used as a return URL when authentication with DQOps Cloud credentials is forwarded and the user must be redirected back to the current instance from the https://cloud.dqops.com login screen. When this parameter is not provided, DQOps uses the URL from the "Host" HTTP header.
This parameter can also be configured by setting the DQO_INSTANCE_RETURN_BASE_URL environment variable.
--dqo.instance.signature-key
DQOps local instance signature key that is used to issue and verify digital signatures on API keys. It is a base64 encoded byte array (32 bytes). When not configured, DQOps will generate a secure random key and store it in the .localsettings.dqosettings.yaml file.
This parameter can also be configured by setting the DQO_INSTANCE_SIGNATURE_KEY environment variable.
--dqo.integrations.table-health-webhook-urls
A comma-separated list of webhook URLs where DQOps sends updates of the table data quality status changes.
This parameter can also be configured by setting the DQO_INTEGRATIONS_TABLE_HEALTH_WEBHOOK_URLS environment variable.
--dqo.jdbc.expire-after-access-seconds
Sets the number of seconds when a connection in a JDBC pool is expired after the last access.
This parameter can also be configured by setting the DQO_JDBC_EXPIRE_AFTER_ACCESS_SECONDS environment variable.
--dqo.jdbc.max-connection-in-pool
Sets the maximum number of connections in the JDBC connection pool, shared across all data sources using JDBC drivers.
This parameter can also be configured by setting the DQO_JDBC_MAX_CONNECTION_IN_POOL environment variable.
--dqo.logging.console
Enables logging to the console, selecting the correct format. The default configuration 'OFF' disables console logging, allowing you to use the DQOps shell without being distracted by log entries. Set the 'PATTERN' mode to send formatted entries to the console in a format similar to Apache logs. When running DQOps as a Docker container on a Kubernetes engine that is configured to capture DQOps container logs, use the 'JSON' mode to publish structured JSON log entries that can be parsed by fluentd or other similar log engines. JSON-formatted messages use a Logstash-compatible format.
This parameter can also be configured by setting the DQO_LOGGING_CONSOLE environment variable.
OFF
JSON
PATTERN
--dqo.logging.console-immediate-flush
When console logging is enabled with --dqo.logging.console=PATTERN or --dqo.logging.console=JSON, turns on (for 'true') or turns off (for 'false') immediate console flushing after each log entry is written. Immediate console flushing is desirable when DQOps is started as a Docker container and Docker logs from DQOps should be forwarded to Kubernetes for centralized logging.
This parameter can also be configured by setting the DQO_LOGGING_CONSOLE_IMMEDIATE_FLUSH environment variable.
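For example, when running DQOps under Kubernetes with centralized logging, JSON console logging with immediate flushing could be enabled like this:
$ dqo --dqo.logging.console=JSON --dqo.logging.console-immediate-flush=true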
--dqo.logging.enable-user-home-logging
Enables file logging inside the DQOps User Home's .logs folder.
This parameter can also be configured by setting the DQO_LOGGING_ENABLE_USER_HOME_LOGGING environment variable.
--dqo.logging.max-history
Sets the maximum number of log files that can be stored (archived) in the .logs folder.
This parameter can also be configured by setting the DQO_LOGGING_MAX_HISTORY environment variable.
--dqo.logging.pattern
Log entry pattern for logback used for writing log entries.
This parameter can also be configured by setting the DQO_LOGGING_PATTERN environment variable.
--dqo.logging.total-size-cap
Total log file size cap of log files generated in the DQOps User Home's .logs folder. Supported suffixes are: kb, mb, gb. For example: 10mb, 2gb.
This parameter can also be configured by setting the DQO_LOGGING_TOTAL_SIZE_CAP environment variable.
--dqo.logging.user-errors.checks-log-level
The logging level at which any errors captured during the data quality check evaluation are reported. When a data quality check is executed and the error is related to a sensor (query) or a rule (Python) function, it is reported as a sensor or rule issue. Only data quality check configuration issues that prevent running a data quality check are reported as check issues. The logging level for the whole application must be equal to or higher than this level for effective logging. Check logs are logged under the com.dqops.user-errors.checks log.
This parameter can also be configured by setting the DQO_LOGGING_USER_ERRORS_CHECKS_LOG_LEVEL environment variable.
ERROR
WARN
INFO
DEBUG
TRACE
--dqo.logging.user-errors.rules-log-level
The logging level at which any errors captured during the data quality rule (Python function) evaluation are reported. The logging level for the whole application must be equal to or higher than this level for effective logging. Rule logs are logged under the com.dqops.user-errors.rules log.
This parameter can also be configured by setting the DQO_LOGGING_USER_ERRORS_RULES_LOG_LEVEL environment variable.
ERROR
WARN
INFO
DEBUG
TRACE
--dqo.logging.user-errors.sensors-log-level
The logging level at which any errors captured during the data quality sensor (query) execution are reported. The logging level for the whole application must be equal to or higher than this level for effective logging. Sensor logs are logged under the com.dqops.user-errors.sensors log.
This parameter can also be configured by setting the DQO_LOGGING_USER_ERRORS_SENSORS_LOG_LEVEL environment variable.
ERROR
WARN
INFO
DEBUG
TRACE
--dqo.logging.user-errors.statistics-log-level
The logging level at which any errors captured during the statistics collection are reported. The logging level for the whole application must be equal to or higher than this level for effective logging. Statistics logs are logged under the com.dqops.user-errors.statistics log.
This parameter can also be configured by setting the DQO_LOGGING_USER_ERRORS_STATISTICS_LOG_LEVEL environment variable.
ERROR
WARN
INFO
DEBUG
TRACE
--dqo.logging.user-errors.yaml-log-level
The logging level at which any errors captured during YAML file parsing are reported. The logging level for the whole application must be equal to or higher than this level for effective logging. YAML parsing logs are logged under the com.dqops.user-errors.yaml log.
This parameter can also be configured by setting the DQO_LOGGING_USER_ERRORS_YAML_LOG_LEVEL environment variable.
ERROR
WARN
INFO
DEBUG
TRACE
--dqo.metadata.auto-import-tables-limit
Sets the maximum number of tables that are imported from a data source by the auto import that is scheduled on the DQOps CRON scheduler.
This parameter can also be configured by setting the DQO_METADATA_AUTO_IMPORT_TABLES_LIMIT environment variable.
--dqo.metadata.import.tables-import-limit
Sets the maximum number of tables that are imported from a data source. DQOps supports importing more tables by importing additional tables specified by a different table filter.
This parameter can also be configured by setting the DQO_METADATA_IMPORT_TABLES_IMPORT_LIMIT environment variable.
--dqo.python.debug-mode
Configures the debugging (logging) mode used when running Python rules. Under the default configuration ('silent' mode), the data quality rules implemented in Python evaluate silently, without generating any log entries. This parameter changes the logging mode and instructs the rule runner to write the parameters passed to rules. The data is written as JSON files in the DQOps user home, in the .index/models folder for each rule evaluation. These files can be used to debug rules by running them externally.
This parameter can also be configured by setting the DQO_PYTHON_DEBUG_MODE environment variable.
all
failed
exception
silent
--dqo.python.interpreter-name
A comma-separated list of possible Python interpreter executable names, such as 'python', 'python3', 'python3.exe', or an absolute path to the Python interpreter. When a list of interpreter names (not an absolute path) is used, DQOps tries to find the first Python interpreter executable in the directories defined in the PATH.
This parameter can also be configured by setting the DQO_PYTHON_INTERPRETER_NAME environment variable.
--dqo.python.python-script-timeout-seconds
Python script execution time limit in seconds for running jinja2 and rule evaluation scripts.
This parameter can also be configured by setting the DQO_PYTHON_PYTHON_SCRIPT_TIMEOUT_SECONDS environment variable.
--dqo.python.use-host-python
Disables the creation of a Python virtual environment by DQOps on startup and uses the system Python interpreter instead. DQOps will not install any required Python packages on startup and will use packages from the user's Python installation.
This parameter can also be configured by setting the DQO_PYTHON_USE_HOST_PYTHON environment variable.
--dqo.queue.max-concurrent-jobs
Sets the maximum number of concurrent jobs that the job queue can process at once (running data quality checks, importing metadata, etc.). The maximum number of threads is also limited by the DQOps license.
This parameter can also be configured by setting the DQO_QUEUE_MAX_CONCURRENT_JOBS environment variable.
--dqo.queue.wait-timeouts.collect-statistics
Sets the default timeout (in seconds) for the "collect statistics" REST API operation called from the DQOps client when the "wait" parameter is true and the timeout is not provided by the client.
This parameter can also be configured by setting the DQO_QUEUE_WAIT_TIMEOUTS_COLLECT_STATISTICS environment variable.
--dqo.queue.wait-timeouts.default-wait-timeout
Sets the default wait timeout (in seconds) for waiting for a job when the "waitTimeout" parameter is not given to the call to the "waitForJob" operation from the DQOps client.
This parameter can also be configured by setting the DQO_QUEUE_WAIT_TIMEOUTS_DEFAULT_WAIT_TIMEOUT environment variable.
--dqo.queue.wait-timeouts.delete-stored-data
Sets the default timeout (in seconds) for the "delete stored data" REST API operation called from the DQOps client when the "wait" parameter is true and the timeout is not provided by the client.
This parameter can also be configured by setting the DQO_QUEUE_WAIT_TIMEOUTS_DELETE_STORED_DATA environment variable.
--dqo.queue.wait-timeouts.import-tables
Sets the default timeout (in seconds) for the "import tables" REST API operation called from the DQOps client when the "wait" parameter is true and the timeout is not provided by the client.
This parameter can also be configured by setting the DQO_QUEUE_WAIT_TIMEOUTS_IMPORT_TABLES environment variable.
--dqo.queue.wait-timeouts.run-checks
Sets the default timeout (in seconds) for the "run checks" REST API operation called from the DQOps client when the "wait" parameter is true and the timeout is not provided by the client.
This parameter can also be configured by setting the DQO_QUEUE_WAIT_TIMEOUTS_RUN_CHECKS environment variable.
--dqo.queue.wait-timeouts.synchronize-multiple-folders
Sets the default timeout (in seconds) for the "synchronize multiple folders" REST API operation called from the DQOps client when the "wait" parameter is true and the timeout is not provided by the client.
This parameter can also be configured by setting the DQO_QUEUE_WAIT_TIMEOUTS_SYNCHRONIZE_MULTIPLE_FOLDERS environment variable.
--dqo.rule-mining.days-in-range-max-date-days-ahead
The number of days to add to the current system date that DQOps rule mining engine uses to set the maximum date in the date_in_range data quality check. The default configuration sets a date that is 10 years ahead.
This parameter can also be configured by setting the DQO_RULE_MINING_DAYS_IN_RANGE_MAX_DATE_DAYS_AHEAD environment variable.
--dqo.rule-mining.days-in-range-min-date-days-before
The number of days to subtract from the earliest found date in a column that DQOps rule mining engine uses to set the minimum date in the date_in_range data quality check. The default configuration sets a date that is 2 days before.
This parameter can also be configured by setting the DQO_RULE_MINING_DAYS_IN_RANGE_MIN_DATE_DAYS_BEFORE environment variable.
--dqo.rule-mining.default-fail-checks-at-percent-error-rows
The default percentage value captured by a profiling check (for example 0.03% of errors or 99.97% of valid rows) that is used to propose a percentage rule that will treat the values as errors (i.e., max_percent = 0%, or min_percent = 100%). The default value is 2%.
This parameter can also be configured by setting the DQO_RULE_MINING_DEFAULT_FAIL_CHECKS_AT_PERCENT_ERROR_ROWS environment variable.
--dqo.rule-mining.default-max-percent-error-rows
The default maximum percentage of invalid rows for which the rule engine should configure rule values, especially min_percent, min_count or max_percent.
This parameter can also be configured by setting the DQO_RULE_MINING_DEFAULT_MAX_PERCENT_ERROR_ROWS environment variable.
--dqo.rule-mining.max-column-samples-to-propose-accepted-values
The maximum number of samples in the sample values to use when generating a proposed configuration of the accepted_values checks. If a column has more than this number of distinct values, DQOps will not configure the found in set checks. The default value is 50 samples.
This parameter can also be configured by setting the DQO_RULE_MINING_MAX_COLUMN_SAMPLES_TO_PROPOSE_ACCEPTED_VALUES environment variable.
--dqo.rule-mining.max-distinct-count
The default maximum distinct count that is used to activate the distinct_count check. Above the limit, DQOps will configure the distinct_percent check. The default value is 1000 rows.
This parameter can also be configured by setting the DQO_RULE_MINING_MAX_DISTINCT_COUNT environment variable.
--dqo.rule-mining.max-expected-texts-in-top-values
The maximum number of top (most common) text values that are added to the expected_texts_in_top_values check. The default value is 5 top text values.
This parameter can also be configured by setting the DQO_RULE_MINING_MAX_EXPECTED_TEXTS_IN_TOP_VALUES environment variable.
--dqo.rule-mining.min-count-rate
The rate of the current min_count value, such as the row count, or the count of not-null values, obtained from statistics that is used to configure the row_count and not_nulls_count checks by the check mining engine. The default value is 0.90 of the current count captured by the statistics (row count, not-nulls count).
This parameter can also be configured by setting the DQO_RULE_MINING_MIN_COUNT_RATE environment variable.
--dqo.rule-mining.min-max-value-rate-delta
The default delta that is added to the proposed maximum value, or subtracted from a proposed minimum value. It is calculated as a rate of the current value. The default value is 0.1, which means that when the detected minimum value is 5.0, the proposed minimum value in the rule will be 4.5.
This parameter can also be configured by setting the DQO_RULE_MINING_MIN_MAX_VALUE_RATE_DELTA environment variable.
--dqo.rule-mining.min-reasonable-not-null-count
The default minimum reasonable count of not-null values that must be satisfied to apply some checks that validate a range of row counts (like the distinct count between check).
This parameter can also be configured by setting the DQO_RULE_MINING_MIN_REASONABLE_NOT_NULL_COUNT environment variable.
--dqo.rule-mining.min-word-count
The minimum word count that is required to apply the word count in range checks. The default value is 3 words.
This parameter can also be configured by setting the DQO_RULE_MINING_MIN_WORD_COUNT environment variable.
--dqo.rule-mining.not-null-count-rate-for-duplicate-count
The default rate (fraction) of the number of rows with not-null values that must contain distinct values to apply the distinct count between check. The default is 0.01, which is 1% of the count of not-null values.
This parameter can also be configured by setting the DQO_RULE_MINING_NOT_NULL_COUNT_RATE_FOR_DUPLICATE_COUNT environment variable.
--dqo.rule-mining.percent-check-delta-rate
The multiplier of the last known percent that is extended by this delta (as a rate/proportion of the percentage) to configure a passing percentage check. The default value is 0.3. For this value, when the last known max_percent was 10%, the DQOps rule mining engine will propose a safe max_percent of 13%. The additional 3% of the delta is 0.3 * 10%.
This parameter can also be configured by setting the DQO_RULE_MINING_PERCENT_CHECK_DELTA_RATE environment variable.
--dqo.rule-mining.timeliness-max-days-multiplier
The multiplier of the last known table timeliness checks (freshness, staleness, ingestion delay) that is used to propose the configuration of the max days rule threshold by the rule mining engine. The default value is 2.0x the last known delay.
This parameter can also be configured by setting the DQO_RULE_MINING_TIMELINESS_MAX_DAYS_MULTIPLIER environment variable.
--dqo.scheduler.check-run-mode
Configures the console logging mode for the "check run" jobs performed by the job scheduler in the background.
This parameter can also be configured by setting the DQO_SCHEDULER_CHECK_RUN_MODE environment variable.
silent
summary
info
debug
--dqo.scheduler.default-schedules.monitoring-daily
Sets the default schedule for running daily monitoring checks that is copied to the configuration of new data source connections that are registered in DQOps. The default schedule runs checks once a day at 12 PM (noon). This parameter is used only once, during the first initialization of DQOps user home. The value is copied to the settings/defaultschedules.dqoschedules.yaml file.
This parameter can also be configured by setting the DQO_SCHEDULER_DEFAULT_SCHEDULES_MONITORING_DAILY environment variable.
--dqo.scheduler.default-schedules.monitoring-monthly
Sets the default schedule for running monthly monitoring checks that is copied to the configuration of new data source connections that are registered in DQOps. The default schedule runs checks once a day at 12 PM (noon). This parameter is used only once, during the first initialization of DQOps user home. The value is copied to the settings/defaultschedules.dqoschedules.yaml file.
This parameter can also be configured by setting the DQO_SCHEDULER_DEFAULT_SCHEDULES_MONITORING_MONTHLY environment variable.
--dqo.scheduler.default-schedules.partitioned-daily
Sets the default schedule for running daily partitioned checks that is copied to the configuration of new data source connections that are registered in DQOps. The default schedule runs checks once a day at 12 PM (noon). This parameter is used only once, during the first initialization of DQOps user home. The value is copied to the settings/defaultschedules.dqoschedules.yaml file.
This parameter can also be configured by setting the DQO_SCHEDULER_DEFAULT_SCHEDULES_PARTITIONED_DAILY environment variable.
--dqo.scheduler.default-schedules.partitioned-monthly
Sets the default schedule for running monthly partitioned checks that is copied to the configuration of new data source connections that are registered in DQOps. The default schedule runs checks once a day at 12 PM (noon). This parameter is used only once, during the first initialization of DQOps user home. The value is copied to the settings/defaultschedules.dqoschedules.yaml file.
This parameter can also be configured by setting the DQO_SCHEDULER_DEFAULT_SCHEDULES_PARTITIONED_MONTHLY environment variable.
--dqo.scheduler.default-schedules.profiling
Sets the default schedule for running profiling checks that is copied to the configuration of new data source connections that are registered in DQOps. The default schedule runs profiling checks once a month, on the first day of the month at 1 AM. This parameter is used only once, during the first initialization of DQOps user home. The value is copied to the settings/defaultschedules.dqoschedules.yaml file.
This parameter can also be configured by setting the DQO_SCHEDULER_DEFAULT_SCHEDULES_PROFILING environment variable.
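The default schedule parameters accept Unix cron expressions. For example, to move the default daily monitoring schedule to 6 AM (the cron expression is illustrative):
$ dqo --dqo.scheduler.default-schedules.monitoring-daily="0 6 * * *"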
--dqo.scheduler.enable-cloud-sync
Enable synchronization of metadata and results with DQOps Cloud in the job scheduler.
This parameter can also be configured by setting the DQO_SCHEDULER_ENABLE_CLOUD_SYNC environment variable.
--dqo.scheduler.start
Starts the job scheduler on startup (true) or disables the job scheduler (false).
This parameter can also be configured by setting the DQO_SCHEDULER_START environment variable.
--dqo.scheduler.synchronization-mode
Configures the console logging mode for the "cloud sync all" operations performed by the job scheduler in the background.
This parameter can also be configured by setting the DQO_SCHEDULER_SYNCHRONIZATION_MODE environment variable.
silent
summary
debug
--dqo.scheduler.synchronize-cron-schedule
Unix cron expression that configures how often the scheduler synchronizes the local copy of the metadata with DQOps Cloud and detects new cron schedules. The default schedule synchronizes local files with DQOps Cloud and refreshes the data quality data warehouse at 5 minutes past each hour. A DQOps instance that uses a FREE or a trial PERSONAL license ignores this setting and synchronizes files once an hour, at a random time. Synchronization with DQOps Cloud can be disabled by setting --dqo.scheduler.enable-cloud-sync=false.
This parameter can also be configured by setting the DQO_SCHEDULER_SYNCHRONIZE_CRON_SCHEDULE environment variable.
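For example, to synchronize at 10 minutes past each hour instead (the cron expression is illustrative):
$ dqo --dqo.scheduler.synchronize-cron-schedule="10 * * * *"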
--dqo.scheduler.synchronized-folders
Configures which folders from the DQOps user home folder are synchronized to DQOps Cloud during a monitoring synchronization (triggered by a cron schedule configured by --dqo.scheduler.synchronize-cron-schedule). By default, DQOps synchronizes (pushes) only changes from folders that have local changes.
This parameter can also be configured by setting the DQO_SCHEDULER_SYNCHRONIZED_FOLDERS environment variable.
all
locally_changed
--dqo.secrets.enable-gcp-secret-manager
Enables the GCP Secret Manager to resolve secret parameters in the YAML files.
This parameter can also be configured by setting the DQO_SECRETS_ENABLE_GCP_SECRET_MANAGER environment variable.
--dqo.secrets.gcp-project-id
The name of the GCP project with the GCP Secret Manager enabled from which the secrets are pulled.
This parameter can also be configured by setting the DQO_SECRETS_GCP_PROJECT_ID environment variable.
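The two secret manager parameters are typically set together; a sketch with a placeholder project id:
$ dqo --dqo.secrets.enable-gcp-secret-manager=true --dqo.secrets.gcp-project-id=my-gcp-project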
--dqo.sensor.limit.fail-on-sensor-readout-limit-exceeded
Configures the behavior when the number of rows returned from a data quality sensor exceeds the limit configured in the 'sensor-readout-limit' parameter. When true, the whole check execution is failed. When false, only results up to the limit are analyzed. The default value is true.
This parameter can also be configured by setting the DQO_SENSOR_LIMIT_FAIL_ON_SENSOR_READOUT_LIMIT_EXCEEDED environment variable.
--dqo.sensor.limit.max-merged-queries
The maximum number of queries that are merged into a bigger query, to calculate multiple sensors on the same table and to analyze multiple columns from the same table.
This parameter can also be configured by setting the DQO_SENSOR_LIMIT_MAX_MERGED_QUERIES environment variable.
--dqo.sensor.limit.sensor-readout-limit
Default row count limit retrieved by a data quality sensor from the results of an SQL query for non-partitioned checks (profiling and monitoring). This is the row count limit applied when querying the data source. When the data grouping configuration sets up a GROUP BY with too many columns or columns with too many distinct values, the data source will return too many results to store them as data quality check results and sensor readouts. DQOps will discard additional values returned from the data source or raise an error.
This parameter can also be configured by setting the DQO_SENSOR_LIMIT_SENSOR_READOUT_LIMIT environment variable.
--dqo.sensor.limit.sensor-readout-limit-partitioned
Default row count limit retrieved by a data quality sensor from the results of an SQL query for partitioned checks. This is the row count limit applied when querying the data source. When the data grouping configuration sets up a GROUP BY with too many columns or columns with too many distinct values, the data source will return too many results to store them as data quality check results and sensor readouts. DQOps will discard additional values returned from the data source or raise an error. The default value is 7x bigger than the sensor-readout-limit to allow analyzing the last 7 daily partitions.
This parameter can also be configured by setting the DQO_SENSOR_LIMIT_SENSOR_READOUT_LIMIT_PARTITIONED environment variable.
--dqo.smtp-server.host
Sets the host name of the SMTP server that is used to send email notifications.
This parameter can also be configured by setting the DQO_SMTP_SERVER_HOST environment variable.
--dqo.smtp-server.password
Sets the password of the SMTP server that is used to send email notifications.
This parameter can also be configured by setting the DQO_SMTP_SERVER_PASSWORD environment variable.
--dqo.smtp-server.port
Sets the port number of the SMTP server that is used to send email notifications.
This parameter can also be configured by setting the DQO_SMTP_SERVER_PORT environment variable.
--dqo.smtp-server.use-ssl
Configures if the SMTP server that is used to send email notifications uses SSL protocol.
This parameter can also be configured by setting the DQO_SMTP_SERVER_USE_SSL environment variable.
--dqo.smtp-server.username
Sets the username of the SMTP server that is used to send email notifications.
This parameter can also be configured by setting the DQO_SMTP_SERVER_USERNAME environment variable.
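A complete SMTP configuration combines the five parameters above; a sketch in which all values are placeholders:
$ dqo --dqo.smtp-server.host=smtp.example.com --dqo.smtp-server.port=587 --dqo.smtp-server.use-ssl=true --dqo.smtp-server.username=notifications@example.com --dqo.smtp-server.password=secret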
--dqo.statistics.samples-limit
The limit of column value samples that are collected when the basic table statistics are gathered. DQOps collects only the most popular values, which is determined by the number of value occurrences.
This parameter can also be configured by setting the DQO_STATISTICS_SAMPLES_LIMIT environment variable.
--dqo.statistics.truncated-strings-length
The length of samples captured from text columns (varchar, string, text, etc.) that are stored as samples. DQOps truncates longer column values and stores only the first few characters, up to the character count limit defined by this parameter.
This parameter can also be configured by setting the DQO_STATISTICS_TRUNCATED_STRINGS_LENGTH environment variable.
--dqo.statistics.viewed-statistics-age-months
The maximum age (in months) of the basic statistics that are shown on the basic statistics screen. Statistics values captured earlier are still stored, but are not shown in the DQOps UI.
This parameter can also be configured by setting the DQO_STATISTICS_VIEWED_STATISTICS_AGE_MONTHS environment variable.
--dqo.user.home
Overrides the path to the DQOps user home. The default user home is created in the current folder (.).
This parameter can also be configured by setting the DQO_USER_HOME environment variable.
--dqo.user.initialize-user-home
Initializes an empty DQOps user home (identified by the DQO_USER_HOME environment variable) without asking the user for confirmation.
This parameter can also be configured by setting the DQO_USER_INITIALIZE_USER_HOME environment variable.
--dqo.webserver.authentication-method
User authentication method. A standalone instance has no user authentication. Paid versions of DQOps support federated authentication using Single-Sign-On. Please contact DQOps sales for details: https://dqops.com/contact-us/.
This parameter can also be configured by setting the DQO_WEBSERVER_AUTHENTICATION_METHOD environment variable.
none
dqops_cloud
oauth2
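For example, to start a standalone instance without user authentication:
$ dqo --dqo.webserver.authentication-method=none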
-fw
--file-write
Writes the command response to a file.
--headless
-hl
Starts DQOps in a headless mode. When DQOps runs in a headless mode and the application cannot start because the DQOps Cloud API key is missing or the DQOps user home folder is not configured, DQOps will stop silently instead of asking the user to approve the setup of the DQOps user home folder structure and/or log into DQOps Cloud.
-h
--help
Shows the help for the command and parameters.
--logging.level.com.dqops
Default logging level for the DQOps runtime.
This parameter can also be configured by setting the LOGGING_LEVEL_COM_DQOPS environment variable.
ERROR
WARN
INFO
DEBUG
TRACE
--logging.level.root
Default logging level at the root level of the logging hierarchy.
This parameter can also be configured by setting the LOGGING_LEVEL_ROOT environment variable.
ERROR
WARN
INFO
DEBUG
TRACE
-of
--output-format
Output format for tabular responses.
TABLE
CSV
JSON
--server.port
Sets the web server port to host the DQOps local web UI.
This parameter can also be configured by setting the SERVER_PORT environment variable.
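For example, to host the DQOps local web UI on a different port (the port number is illustrative):
$ dqo --server.port=8889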
--silent
Starts DQOps in a silent mode, without showing the banner and any other information.
This parameter can also be configured by setting the SILENT environment variable.
--spring.config.location
Sets a path to the folder that contains the Spring configuration files (application.properties or application.yml), or directly to an application.properties or application.yml file. The format of this value is: --spring.config.location=file:./foldername/,file:./alternativeapplication.yml
This parameter can also be configured by setting the SPRING_CONFIG_LOCATION environment variable.