wiki:DevWikiRealParameters

Version 7 (modified by bnordgren, 13 years ago) ( diff )

--

How to calculate "real" parameters from affine transformation

Introduction

This page discusses how to compute physically significant parameters from an arbitrary linear transformation. The reverse process, calculating the affine parameters from the physically significant parameters, is articulated on another wiki page. The model used to describe how the raster maps onto real world coordinates is illustrated in the following figure:

This model contains all of the parameters except for translation. Translation may be added in after the coordinates have been scaled, rotated and sheared. The above illustration contains the physically significant parameters which will be calculated by this method. These are: θi, θij, and the pixel size along the ib and jb basis vectors.

The inputs to this method are the four non-translation (non-offset) parameters of a linear transform. These are shown below:

Although these parameters have "common names" within the GIS community, the names are both lengthy and misleading. Within the context of this page, the coefficients o11, o12, o21, and o22 will be used.

This page is divided into sections. Each section describes the computation of one physically significant parameter.

Pixel size in the i direction

To calculate the pixel size in the i direction, the i unit vector is projected using the given transform. This gives the basis vector ib, the magnitude of which is the pixel size. The basis vector is expressed as follows:

The pixel size is then calculated as follows:

Pixel size in the j direction

The pixel size in the j direction is computed in a manner similar to the pixel size in the i direction. The only difference is that the j unit vector is used in place of the i unit vector. This gives the basis vector jb, the magnitude of which is the pixel size. The basis vector is expressed as follows:

The pixel size is then calculated as follows:

Rotation

The grid is rotated by the angle θi. This is the angle between the x axis of the reference frame and the ib basis vector. The angle θi is considered positive in the clockwise direction, for consistency with compass headings. The calculation of θi is a two-step process: first the magnitude is calculated, then the sign is determined. Both steps involve using the dot product to determine the angle between θi and one of the axes of the target coordinate system (either the x axis or the y axis. The equations in this section refer to the angles and vectors defined in the following figure:

The first step is to calculate the magnitude of the angle between ib and the x axis. This is the magnitude of θi.

The angle θi is defined as the angle from the x axis to ib. It is positive in the clockwise direction. In the situation described in the above figure, this means that θi is negative if ib is above the x axis, and positive if below. We determine whether ib is above or below the x axis by finding the angle between ib and the y axis. If ib and the y axis are separated by less than 90 degrees, ib is above the x axis, otherwise it is below.

So, if θtest is less than 90, θi = - abs(θi). Otherwise, θi = abs(θi).

Basis vector separation angle

In this section, the method to calculate θij is presented. This is similar to the method for the calculation of θi, but it is accomplished with respect to the rotated reference frame of ib and ibp instead of the x and y axes. The figure which represents this setup is as follows:

The first step is to calculate the magnitude of θij, the angle between ib and jb.

Next, we need to determine the sign of θij in a manner similar to how the sign for θi was determined. The angle θij always represents the angle from ib to jb, and is positive counterclockwise for consistency with a right-handed coordinate system. To do this, we first need to calculate ibp, which is perpendicular to ib and forms a right-hand coordinate system with ib. Observe that ibp is ib after a 90 degree counterclockwise rotation.

Now we can determine the size of the angle between jb and ibp. In this situation, any angle less than 90 degrees means that jb is on the same side of ib as ibp. An angle more than 90 degrees means it lies on the opposite side.

No image "thetatest_ip.png" attached to DevWikiRealParameters

So, if θtest is more than 90 degrees, θij = - abs(θij). Otherwise, θij = abs(θij).

See also

The forward calculation

On the wiki page describing the reverse operation, a transform is developed which supports:

  1. scaling
  2. clockwise rotation around the origin
  3. shearing parallel to the x axis
  4. shearing parallel to the y axis

For the sake of consistency, we will use this same model and attempt to calculate the conceptually meaningful parameters from the transform coefficients. The above model produced the following equations:

  • a11 = o11 = sx ( (1 + kx ky) cosθ + ky sinθ )
  • a12 = o12 = sx ( kx cosθ + sinθ )
  • a13 = tx
  • a21 = o21 = sy ( -(1 + kx ky) sinθ + ky cosθ )
  • a22 = o22 = sy ( - kx sinθ + cosθ )
  • a23 = ty

where:

  • sx : scale factor in x direction
  • sy : scale factor in y direction
  • tx : offset in x direction
  • ty : offset in y direction
  • θ : angle of rotation clockwise around origin
  • kx : shearing parallel to x axis
  • ky : shearing parallel to y axis

Solving for the meaningful parameters

The objective of this page is to find a way to determine values for sx, sy, tx, ty, θ, kx, and ky if all we know is a11,a12, ... a23. Clearly, tx and ty are trivial cases, because we can just use the values for a13 and a23 respectively. This leaves us with these four equations:

  • a11 = o11 = sx ( (1 + kx ky) cosθ + ky sinθ )
  • a12 = o12 = sx ( kx cosθ + sinθ )
  • a21 = o21 = sy ( -(1 + kx ky) sinθ + ky cosθ )
  • a22 = o22 = sy ( - kx sinθ + cosθ )

Unfortunately, these four equations contain five unknowns: sx, sy, θ, kx, and ky. This represents an ill-posed problem and forces us to simplify. Let's assume that there is no shearing. We are just going to declare upfront that kx=0 and ky=0. Remember that we did that because if we're wrong it will screw everything up. This makes the above into:

  • a11 = o11 = sx cosθ
  • a12 = o12 = sx sinθ
  • a21 = o21 = - sy sinθ
  • a22 = o22 = sy cosθ

Now we're on a roll. We can knock out sx and sy very easily:

  • a112 + a122
    • = sx2 cos2θ + sx2 sin2θ
    • = sx2 (cos2θ + sin2θ)
    • = sx2 (1)
  • a212 + a222
    • = (- sy)2 sin2θ + sy2 cos2θ
    • = sy2 (sin2θ + cos2θ)
    • = sy2 (1)

Rewriting as a "final result", this gives:

  • sx2 = a112 + a122
  • sy2 = a212 + a222

Checking against wikipedia

Checking this against the wikipedia page on the world file, which lists:

  • "pixel width" = sx = sqrt(A2 + D2) = sqrt(a112 + a212)
  • "pixel height" = sy = sqrt(B2 + E2) = sqrt(a122 + a222)

Clearly, this does not match the answer on our page. The three possibilities are: we made a mistake; they made a mistake; or no one made a mistake, but we're using different models... The world file page does not declare it's assumptions about what operations are performed, or which order they are performed in. The world file page, however, also defines it's parameter A (for us: a11) as "pixel size in x direction".

NOTE: If the order of the model operations changes, in particular if scaling and rotation are swapped, then these pages match wikipedia. However, the math on wikipedia is unsourced. Since it has become clear that order is vital, and it is by no means guaranteed that all software performs uses the same model, it is probably vital to go out and determine which software performs the calculations which way before we do anything else.

Attachments (13)

Download all attachments as: .zip

Note: See TracWiki for help on using the wiki.