scanin

scanin/scanin

Summary

Convert an 8 or 16 bit per component TIFF image of a test chart into .ti3 device values using automatic pattern recognition, or manual chart alignment.
Performs other tasks associated with turning a TIFF raster of test patches into numeric values.

Usage Summary

usage: scanin [options] input.tif recogin.cht valin.cie [diag.tif]
   :- inputs 'input.tif', and outputs scanner 'input.ti3', or

usage: scanin -g [options] input.tif recogout.cht [diag.tif]
   :- outputs file 'recogout.cht', or

usage: scanin -o [options] input.tif recogin.cht [diag.tif]
   :- outputs file 'input.val', or

usage: scanin -c [options] input.tif recogin.cht scanprofile.[icm|mpp] pbase [diag.tif]
   :- inputs pbase.ti2 and outputs printer pbase.ti3, or

usage: scanin -r [options] input.tif recogin.cht pbase [diag.tif]
   :- inputs pbase.ti2+.ti3 and outputs pbase.ti3

-g                   Generate a chart reference (.cht) file
-o                   Output patch values in .val file
-c                   Use image to measure color to convert printer pbase .ti2 to .ti3
-ca                  Same as -c, but accumulates more values to pbase .ti3
                      from subsequent pages
-r                   Replace device values in pbase .ti3
                      Default is to create a scanner .ti3 file
-F x1,y1,x2,y2,x3,y3,x4,y4
                      Don't auto recognize, locate using four fiducual marks
-p                   Compensate for perspective distortion
-a                   Recognize chart in normal orientation only
                      Default is to recognize all possible chart angles
-m                   Return true mean (default is robust mean)
-G gamma             Approximate gamma encoding of image
-v [n]               Verbosity level 0-9
-d [ihvglLIcrsonap]   generate diagnostic output (try -dipn)
     i                 diag - B&W of input image
     h                 diag - Horizontal edge detection
     v                 diag - Vertical edge detection
     g                 diag - Groups detected
     l                 diag - Lines detected
     L                 diag - All lines detected
     I                 diag - lines used to improve fit
     c                 diag - lines perspective corrected
     r                 diag - lines rotated
     s                 diag - sample boxes rotated
     o                 diag - sample box outlines
     n                 diag - sample box names
     a                 diag - sample box areas
     p                 diag - pixel areas sampled
-O outputfile       Override the default output filename & extension.

Usage Details and Discussion

scanin is setup to deal with a raster file that has been roughly cropped to a size that contains the test chart. It's exact orientation is not important [ie. there is usually no need to rotate or crop the image any more finely.] The reference files are normally set up with the assumption that the edges of the chart are visible within the image, and if the image is cropped to exclude the chart edges, it may well not recognize the chart properly. It is often better to crop out anything outside the chart itself (i.e. labeling text, logo's below the chart etc.) It is designed to cope with a variety of resolutions, and will cope with some degree of noise in the scan (due to screening artefacts on the original, or film grain), but it isn't really designed to accept very high resolution input. For anything over 1200 pixels on a side, you should consider down sampling the scan using a filtering down-sample, before submitting the file to scanin. Similarly, any file with a large level of noise (due to screening or scanner artefacts, or a noisy surrounding texture) should consider cropping out the noisy surrounding, or down sampling the image or filtering it with some average preserving filter before submitting it to scanin. Examining the diagnostic output (ie. -dig and -dil) may help in determining whether noise is an issue. To check that the chart has been correctly recognized, use -dipn and examine the diag image.

There are 5 basic modes that scanin operates in.

When no special argument is given scanin is assumed to be parsing an input device characterization chart (ie. an IT8.7/2 chart), for the purpose of creating a .ti3 data file containing the CIE test values and the corresponding RGB scanner values. The .ti3 file can then be used for creating an input profile using colprof. The file arguments are: The TIFF file that is to be processed, the image recognition template file, the CIE reference value definitions for the test chart (sometimes labeled a ".q60" file), and an optional name for the image recognition diagnostic output. The resulting .ti3 file will have the same base name as the input TIFF file.
If the -g flag is specified, then scanin is operating in a mode designed to create the necessary image recognition template file (.cht) boilerplate information. Patch location and labeling information would need to be added manually to such a generated file, to make a complete and useable recognition template file. CHT file format. The input TIFF file in this situation, should be a good quality image, perhaps synthetically generated (rather than being scanned), and perfectly oriented, to make specification of the patch locations easier. The file arguments are: The TIFF file that is to be processed, the image recognition template file to be created, and an optional name for the image recognition diagnostic output.
If the -o flag is used, then scanin will process the input TIFF file and produce a generic CGATS style file containing just the patch values (a .val file). The file arguments are: The TIFF file that is to be processed, the image recognition template file to be created, and an optional name for the image recognition diagnostic output.
If the -c flag is used, then an input image of a print test chart can be used in combination with a device profile, to estimate the CIE tristimulus values of the patches. This allows RGB input devices to be used as a crude replacement for a color measuring instrument. The icc or mpp profile has (presumably) been created by scanning an IT8.7/2 chart (or similar) through the RGB input device, and then using scanin to create the .ti3 file needed to feed to colprof to create the input device profile. The file arguments in -c mode are: The TIFF file that is to be processed containing the image of a print test chart, the image recognition template file for the test chart generated by the printtarg tool, the input device ICC or MPP profile, the base name for the .ti2 file containing the test chart printer device values and their patch identifiers and the base name for the resulting .ti3 file, and finally an optional name for the image recognition diagnostic output. The resulting .ti3 file will have the same base name as the input TIFF file. If there is more than one page in the test chart, then scanin will need to be run multiple times, once for each scan file made from each test chart. The -ca flag combination should be used for all pages after the first, as this then adds that pages test values to the .ti3 file, rather than creating a .ti3 file that contains only that pages test values. If the incoming .ti2 file contains per-channel calibration curves, these will be passed through to the .ti3 so that accurate ink limits can be computed during profiling.
If the -r flag is used, then the input TIFF value is used as a source of device values to replace any existing device values in the given .ti3 file. This is intended for use in the situation in which the device values being fed into an output device are altered in some way that is difficult to predict (ie. such as being screened and then de-screened), and this alteration to the device values needs to be taken into account in creating a profile for such a device. The file arguments in -r mode are: The TIFF file that is to be processed containing a rasterized image of an output test chart, the image recognition template file for the test chart generated by the printtarg tool, the base name for the .ti2 file containing the output test chart device values and their patch identifiers and the base name for the .ti3 file that is to have its device values replaced, and finally an optional name for the image recognition diagnostic output.

A number of flags and options are available, that are independent of the mode that scanin is in.

Normally scanin will try and recognize a chart, irrespective of its orientation. For charts that have some asymmetric patch size or arrangement (such as an IT8.7/2, or a chart generated by printtarg with the -s option), this is both flexible and reliable. Other charts may be symmetrical, and therefore having scanin figure out the orientation automatically is a problem if the recognition template does not contain expected patch values, since it will have an equal chance of orienting it incorrectly as correctly. To solve this, the -a flag can be used, and care taken to provide a raster file that is within 45 degrees of "no rotation".

Normally scanin will use automatic chart recognition to identify the location of the test patches and extract their values. If the chart CHT file has four fiducial marks defined, then the chart can be manually aligned by specifying the pixel location of the four marks as arguments to the -F flag. The top left, top right, bottom right and bottom left fiducial marks X and Y co-ordinates should be specified as a single concatenated argument, separated by comma's, e.g: -F 10,20,435,22,432,239,10,239 The coodinates may be fractional using a decimal point. Four fiducial marks allows for compensation for perspective distortion.

By default the automatic chart recognition copes with rotation, scale and stretch in the chart image, making it suitable for charts that have been scanned, or shot squarely with a camera. If a chart has been shot not exactly facing the camera (perhaps to avoid reflection, or to get more even lighting), then it will suffer from perspective distortion as well. The -p flag enables automatic compensation for perspective distortion.

Normally scanin computes an average of the pixel values within a sample square, using a "robust" mean, that discards pixel values that are too far from the average ("outlier" pixel values). This is done in an attempt to discard value that are due to scanning artefacts such as dust, scratches etc. You can force scanin to return the true mean values for the sample squares that includes all the pixel values, by using the -m flag.

Normally scanin has reasonably robust feature recognition, but the default assumption is that the input chart has an approximately even visual distribution of patch values, and has been scanned and converted to a typical gamma 2.2 corrected image, meaning that the average patch pixel value is expected to be about 50%. If this is not the case (for instance if the input chart has been scanned with linear light or "raw" encoding), then it may enhance the image recognition to provide the approximate gamma encoding of the image. For instance, if linear light encoding ("Raw") is used, a -G value of 1.0 would be appropriate. Values less than 2.2 should be tried if the chart is particularly dark, or greater than 2.2 if the chart is particularly light. Generally it is only necessary to provide this is there are problems in recognizing the chart.

The -v flag enables extra verbosity in processing. This can aid debugging, if a chart fails to be recognized.

The -d flag enables the generation of an image recognition diagnostic raster. The name of diagnostic raster can be specified as the last in the command line, or if not, will default to diag.tif. Various flags control what is written to the diagnostic raster. Note that at least one flag must be specified for a diagnostic raster to be produced.
i    creates a black and white version of the input raster in the diagnostic output, to be able to compare with the feature extraction.
h    will show pixels in the input image classified as being on horizontal edges, in red.
v    will show pixels in the input image classified as being vertical edges, in green.
g    will show groups of pixels that will be used to estimate edge lines, each group in a different color.
l    will show valid lines estimated from the vertical and horizontal pixel groups, in white.
L    will show all lines (valid and invalid) estimated from the vertical and horizontal pixel groups, in white.
I    will show valid lines lines used to improve the final fit, in blue.
c    will show the lines with perspective correction applied in cyan.
r    will show the lines rotated to the reference chart orientation, in yellow.
s    will show the diagnostic sampling box edge outlines, rotated to the reference chart orientation, in orange.
o    will show all the sampling box edge outlines, in orange.
n    will show the ID names of the sampling boxes, plus the diagnostic sample boxes, using a simple stroke font, in orange.
a    will show the sampling areas as crossed boxes, plus the diagnostic sample boxes, in orange.
p    will show the sampling areas as colored pixels.

The combination of -dipn is usually a good place to start.

The TIFF file can be either 8 or 16 bits per color component, with 16 bit files being slower to process, but yielding more precise results.

If at all in doubt that the file has been recognized correctly, use the -dipn diagnostic flag combination, and check the resulting diagnostic raster file.
[ A badly recognised image will typically result in high self fit delta E's when used with colprof. ]

The -O parameter allows the output file name & extension to be specified independently of the last tiff filename. This works for the default, -g and -o modes. It is ignored for the -r, -c and -ca modes that use a basename for .ti2 in and .ti3 output. Note that the full filename must be specified, including the extension.