Execution Data Profile
Description
An Apache Hop Execution Data Profile builds data profiles as data flow through pipelines. A number of data profilers can be selected and configure the fine tune the type and detail of the data that is profiled.
Options
Option | Description |
---|---|
Name | The name to be used for this Execution Data Profile |
Description | A description to be used for this Execution Data Profile |
Data Samplers to use | One or more data samplers to use with this Execution Data Profile. See details below. |
Data Samplers
Data Sampler | Description | Options |
---|---|---|
Data profile output rows | Allow for some basic data profiling to be performed on transform output rows |
|
First output rows | Samples the first rows of a transform output | Sample size (default: 100) |
Last output rows | Samples the last rows of a transform output | Sample size (default: 100) |
Random output rows | Do reservoir sampling on the output rows of a transform | Sample size (default: 100) |