This step calculates checksums for one or more fields in the input stream and adds this to the output as a new field.
为数据流输入的列计算校验码,并将输出作为新的一列。
wiki中介绍:http://wiki.pentaho.com/display/EAI/Add+a+checksum
| Option |
Description |
| Step name |
Name of the step. Note: This name has to be unique in a single transformation. |
| Type |
The type of checksum that needs to be calculated. These are the types that are available:
- CRC32 : 32-bit Cyclic Redundancy Check : http://en.wikipedia.org/wiki/Cyclic_redundancy_check
- ADLER 32 : Checksum algorithm by Mark Adler: http://en.wikipedia.org/wiki/Adler-32
- MD5 : Message Digest algorithm 5 : http://en.wikipedia.org/wiki/MD5
- SHA-1 : Secure Hash Algorithm 1 : http://en.wikipedia.org/wiki/SHA-1
|
| Result field |
The name of the result field containing the checksum |
| Fields used in the checksum |
The names of the fields to include in the checksum calculation. Note: You can use the "Get Fields" button to insert all input fields from previous steps. |
注意,双击控件后,可以看到各种设置项。result field为新列的名称,一定要写。
这时候看到compatibility mode选项,打钩后,字段就会变得紧凑,不定长。默认为不选,是32位长的字段!
我为了比较字段,所以用定长。
如果想知道区别,做测试:
然后就preview rows,就可以看出两种结果的不同