Reference frames

Reference frames#

Note

This page is based on the SymPy mechanics module documentation.

Reference frames are useful when examining the relationships between bodies in multibody systems. They also serve a useful purpose in physics when distinguishing inertial frames, where Newton’s laws apply, from non-inertial frames. We can loosely define a reference frame as the Euclidean space spanned by orthogonal unit vectors oriented following the right-hand rule. This may be a two-dimensional space spanned by two orthogonal unit vectors or a three-dimensional space spanned by three orthogonal unit vectors. In this course reference frames don’t have a position in space, but are defined by orientation alone. We often define reference frames relative to some “base frame” that we consider fixed in space. This may be a frame attached to the center of mass of a drone or to the center of the earth in a navigation system. We can think of reference frames intuitively as perspectives. The characters on this page change relative to me as I change my perspective (or reference frame) in 3D space. If I rotate my phone while taking a picture, the world rotates relative to the reference frame of the camera.

Unit Vectors#

Vectors have both a magnitude and a direction. Unit vectors are vectors with a magnitude of one, oriented parallel to a direction (or dimension) in the reference frame they span. For convenience, we can think of a reference frame as a box in 3D, with the unit vectors placed in one corner of the box and pointing along its edges. The unit vectors of a reference frame follow the right-hand rule. This means that for a reference frame \(A\) defined by the orthogonal unit vectors \(\hat{a}_x\), \(\hat{a}_y\), \(\hat{a}_z\), the following cross products hold

(2)#\[ \begin{align}\begin{aligned}\hat{a}_x \times \hat{a}_y = \hat{a}_z\\\hat{a}_z \times \hat{a}_x = \hat{a}_y\\\hat{a}_y \times \hat{a}_z = \hat{a}_x\end{aligned}\end{align} \]
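As a quick sanity check, the cross products in (2) can be reproduced with SymPy's ReferenceFrame class, which is introduced properly in a later section. This is only a minimal sketch; the frame name A simply mirrors the notation above.

```python
from sympy.physics.vector import ReferenceFrame

# Frame A with the unit vectors A.x, A.y, A.z
A = ReferenceFrame('A')

# Right-hand rule: each cyclic pair of unit vectors yields the third
print(A.x.cross(A.y) == A.z)
print(A.z.cross(A.x) == A.y)
print(A.y.cross(A.z) == A.x)

# Swapping the operands flips the sign of the result
print(A.y.cross(A.x) == -A.z)
```

Each comparison is expected to evaluate to True for a right-handed frame.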

Note that the unit vectors \(\hat{a}_x\), \(\hat{a}_y\), \(\hat{a}_z\) are by definition fixed in reference frame \(A\). If we want to define a new reference frame \(B\) relative to frame \(A\) we can express their relative orientation with the relationship between their respective unit vectors. We’ll examine this in the next section.

Note

Basis vectors are vectors that define a reference frame, but they are not necessarily of unit length. When the basis vectors are normalized to have a magnitude of one, they become unit vectors. In an orthonormal reference frame, the basis vectors are both unit vectors and mutually perpendicular.
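The difference between a general vector and a unit vector can be illustrated with a short sketch. The vector v below is a made-up example, not something from the course material; normalize and magnitude are methods of SymPy's Vector class.

```python
from sympy.physics.vector import ReferenceFrame

A = ReferenceFrame('A')

# A hypothetical vector that does not have unit length
v = 3*A.x + 4*A.y

length = v.magnitude()   # Euclidean length of v
v_hat = v.normalize()    # same direction as v, scaled to length one

print(length)
print(v_hat)
print(v_hat.magnitude())
```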

Simple Rotation Example#

_images/box_w_lid.svg — Fig. 1 Box with sides \(d\) with rotating lid#

Fig. 1 depicts a box with sides \(d\) and a rotating square lid with sides \(d\). The lid is rotated by an angle \(\theta\) relative to the box. If we want to express some vector \(\vec{p}\) in terms of reference frame \(A\), we write the parts of it that are given in frame \(B\) in terms of the unit vectors of \(A\) and substitute.

_images/box_w_lid_vector.svg — Fig. 2 Box with sides \(d\) with rotating lid and vector \(\vec{p}\)#

Using the unit vectors, we see that \(\vec{p} = d \hat{a}_y + d \hat{a}_z + d \hat{b}_x - d \hat{b}_y\). By looking at the hinge, we can find the relationship between the unit vectors of frame \(A\) and frame \(B\).

_images/box_lid_rotationtransform.svg — Fig. 3 2D representation of Fig. 1 rotating lid#

Looking at the hinge in Fig. 3, we use trigonometry to find

(3)#\[ \begin{align}\begin{aligned}\hat{b}_x = \hat{a}_x\\\hat{b}_y = \cos(\theta) \hat{a}_y - \sin(\theta) \hat{a}_z\\\hat{b}_z = \sin(\theta) \hat{a}_y + \cos(\theta) \hat{a}_z\end{aligned}\end{align} \]

We can then substitute the unit vectors in frame \(B\)

(4)#\[ \begin{align}\begin{aligned}\vec{p} = d \hat{a}_y + d \hat{a}_z + d \hat{b}_x - d \hat{b}_y\\\vec{p} = d \hat{a}_y + d \hat{a}_z + d \hat{a}_x - d (\cos(\theta) \hat{a}_y - \sin(\theta) \hat{a}_z)\\\begin{split}p^A = \begin{bmatrix} d \\ d - d \cos(\theta) \\ d + d \sin(\theta) \end{bmatrix}\end{split}\end{aligned}\end{align} \]

Intuitively, we know this to be the case, since we know that when the lid is closed (\(\theta = 0\)) \(\vec{p} = d \hat{a}_x + d \hat{a}_z\), and when the lid is open (\(\theta = \frac{\pi}{2}\)) \(\vec{p} = d \hat{a}_x + d \hat{a}_y + 2d \hat{a}_z\).
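The two limiting cases can also be checked by substitution. The sketch below assumes only the component form of \(p^A\) from the last line of (4); the variable names are chosen for illustration.

```python
import sympy as sm

theta, d = sm.symbols('theta d')

# Components of p in frame A, copied from the last line of (4)
p_A = sm.Matrix([d, d - d*sm.cos(theta), d + d*sm.sin(theta)])

p_closed = p_A.subs(theta, 0)        # lid closed
p_open = p_A.subs(theta, sm.pi/2)    # lid open

print(p_closed.T)
print(p_open.T)
```

The closed case should have no \(\hat{a}_y\) component, and the open case should reach \(2d\) along \(\hat{a}_z\), matching the two expressions above.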

We can write (3) compactly as a matrix product

(5)#\[\begin{split}\begin{bmatrix} \hat{b}_x \\ \hat{b}_y \\ \hat{b}_z \end{bmatrix} = \begin{bmatrix} 1 & 0 & 0 \\ 0 & \cos(\theta) & -\sin(\theta) \\ 0 & \sin(\theta) & \cos(\theta) \end{bmatrix} \begin{bmatrix} \hat{a}_x \\ \hat{a}_y \\ \hat{a}_z \end{bmatrix} = {\bf R}_A^B(\theta) \begin{bmatrix} \hat{a}_x \\ \hat{a}_y \\ \hat{a}_z \end{bmatrix}\end{split}\]

This matrix is the transformation matrix from \(A\) to \(B\), \({\bf R}_A^B(\theta)\), which means we can transform the components of any vector expressed in frame \(A\) to its representation in frame \(B\) by matrix multiplication. This kind of transformation matrix is called a rotation matrix. More specifically, it belongs to the special orthogonal group SO(3). Such matrices have useful properties, for example that the inverse is equal to the transpose, meaning \({{\bf R}_A^B}^T(\theta) = {{\bf R}_A^B}^{-1}(\theta) = {\bf R}_B^A(\theta)\), thus

(6)#\[ \begin{align}\begin{aligned}{\bf v}^B = {\bf R}_A^B(\theta) {\bf v}^A\\{\bf v}^A = {{\bf R}_A^B}^T(\theta) {\bf v}^B = {\bf R}_B^A(\theta) {\bf v}^B\end{aligned}\end{align} \]
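The SO(3) properties can be verified symbolically for the matrix in (5). This is a small sketch, not a general proof; the name R_A_B is just a plain-text stand-in for \({\bf R}_A^B\).

```python
import sympy as sm

theta = sm.symbols('theta')

# R_A^B from (5): maps components in frame A to components in frame B
R_A_B = sm.Matrix([[1, 0, 0],
                   [0, sm.cos(theta), -sm.sin(theta)],
                   [0, sm.sin(theta), sm.cos(theta)]])

# Orthogonality: the transpose times the matrix is the identity,
# so the transpose acts as the inverse
product = (R_A_B.T * R_A_B).applyfunc(sm.simplify)

# "Special": the determinant is +1 (a pure rotation, no reflection)
determinant = sm.simplify(R_A_B.det())

print(product)
print(determinant)
```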

Instead of looking at unit vectors to find \(p^A\), we can simply transform the components of \(\vec{p}\) in the \(B\)-frame from \(B\) to \(A\).

Note

We use the convention \({\bf R}_{from}^{to}\) for transformation matrices: the subscript is the frame we transform from and the superscript is the frame we transform to.

(7)#\[\begin{split}p^A = \begin{bmatrix} 0 \\ d \\ d \end{bmatrix} + {\bf R}_B^A \begin{bmatrix} d \\ -d \\ 0 \end{bmatrix}\end{split}\]

calculating that

(8)#\[\begin{split}{\bf R}_B^A = {{\bf R}_A^B}^T = { \begin{bmatrix} 1 & 0 & 0 \\ 0 & \cos(\theta) & -\sin(\theta) \\ 0 & \sin(\theta) & \cos(\theta) \end{bmatrix} }^T = \begin{bmatrix} 1 & 0 & 0 \\ 0 & \cos(\theta) & \sin(\theta) \\ 0 & -\sin(\theta) & \cos(\theta) \end{bmatrix}\end{split}\]

We insert and get

(9)#\[\begin{split}p^A = \begin{bmatrix} 0 \\ d \\ d \end{bmatrix} + \begin{bmatrix} 1 & 0 & 0 \\ 0 & \cos(\theta) & \sin(\theta) \\ 0 & -\sin(\theta) & \cos(\theta) \end{bmatrix} \begin{bmatrix} d \\ -d\\ 0 \end{bmatrix} = \begin{bmatrix} d \\ d - d \cos(\theta) \\ d + d \sin(\theta) \end{bmatrix} \ \ \blacksquare.\end{split}\]

We can easily implement this in SymPy

import sympy as sm
sm.init_printing(use_latex='mathjax')
from sympy import sin, cos

theta, d = sm.symbols('theta d')
R_b_to_a = sm.Matrix([  [1, 0, 0],
                        [0, cos(theta), sin(theta)],
                        [0, -sin(theta), cos(theta)]])
R_b_to_a

\[\begin{split}\displaystyle \left[\begin{matrix}1 & 0 & 0\\0 & \cos{\left(\theta \right)} & \sin{\left(\theta \right)}\\0 & - \sin{\left(\theta \right)} & \cos{\left(\theta \right)}\end{matrix}\right]\end{split}\]

v_A = sm.Matrix([0, d, d]) + R_b_to_a @ sm.Matrix([d, -d, 0])
v_A

\[\begin{split}\displaystyle \left[\begin{matrix}d\\- d \cos{\left(\theta \right)} + d\\d \sin{\left(\theta \right)} + d\end{matrix}\right]\end{split}\]

SymPy Reference Frames#

As you can see from the section Simple Rotation Example, even simple examples can get quite tedious when working with reference frames. Luckily, SymPy’s sympy.physics.vector module implements reference frames with the ReferenceFrame class.

from sympy.physics.vector import ReferenceFrame

A = ReferenceFrame('A')

Each reference frame has three associated basis vectors that define the frame

A.x, A.y, A.z

\[\displaystyle \left( \mathbf{\hat{a}_x}, \ \mathbf{\hat{a}_y}, \ \mathbf{\hat{a}_z}\right)\]

We can create new vectors by using the basis vectors

a = d*A.y + d*A.z
a

\[\displaystyle d\mathbf{\hat{a}_y} + d\mathbf{\hat{a}_z}\]

We can orient a new reference frame \(B\) relative to our frame \(A\) with an axis rotation around \(\hat{a}_x\)

B = A.orientnew('B', 'Axis', [-theta, A.x]) # negative x-axis rotation from box example

If we want the rotation matrix between two frames, we can call the direction cosine matrix or dcm method

A_to_B = B.dcm(A) # equals R_A^B in (5): maps components in A to components in B
A_to_B

\[\begin{split}\displaystyle \left[\begin{matrix}1 & 0 & 0\\0 & \cos{\left(\theta \right)} & - \sin{\left(\theta \right)}\\0 & \sin{\left(\theta \right)} & \cos{\left(\theta \right)}\end{matrix}\right]\end{split}\]

SymPy makes it trivial to solve the simple example in Fig. 2

b = d*B.x - d*B.y
p = a + b
p

\[\displaystyle d\mathbf{\hat{a}_y} + d\mathbf{\hat{a}_z} + d\mathbf{\hat{b}_x} - d\mathbf{\hat{b}_y}\]

Using the to_matrix method we can find the components \(p^A\). The related express method returns the same result as a vector instead of a matrix. As long as there is a known relationship between the reference frames appearing in a vector, SymPy can automatically express the vector in any of those frames.

p.to_matrix(A) # Print as matrix relative to frame A

\[\begin{split}\displaystyle \left[\begin{matrix}d\\- d \cos{\left(\theta \right)} + d\\d \sin{\left(\theta \right)} + d\end{matrix}\right]\end{split}\]
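For comparison, here is a self-contained sketch of the express method on the same vector. It rebuilds the frames from this section so that it can run on its own; the names p_in_A and p_in_B are introduced only for illustration.

```python
import sympy as sm
from sympy.physics.vector import ReferenceFrame

theta, d = sm.symbols('theta d')

A = ReferenceFrame('A')
B = A.orientnew('B', 'Axis', [-theta, A.x])  # same lid rotation as above

# p mixes unit vectors from both frames
p = d*A.y + d*A.z + d*B.x - d*B.y

# express() rewrites the same vector using only the unit vectors of A
p_in_A = p.express(A)
print(p_in_A)

# The same physical vector written with the unit vectors of B instead
p_in_B = p.express(B)
print(p_in_B)
```

Both results describe the same vector; only the unit vectors used to write it down differ.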

Implementation Details#

The ReferenceFrame class stores the name given upon creation as a string and its orientation as a direction cosine matrix (dcm) with type sympy.Matrix. Crucially, it also stores its relationships to other reference frames in a private dictionary (named _dcm_dict in current SymPy releases). The dictionary uses ReferenceFrames as keys and direction cosine matrices with type sympy.Matrix as values. These are set bi-directionally, which means that if we orient reference frame \(A\) relative to \(B\), the key \(B\) and a Matrix are stored in frame \(A\)’s dictionary, and the key \(A\) and the transposed Matrix are stored in frame \(B\)’s dictionary.
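The bi-directional bookkeeping can be observed through the public dcm method alone, without touching private attributes. In the sketch below the symbols alpha and beta and the extra frame C are hypothetical, added only to show that SymPy can chain relationships through an intermediate frame.

```python
import sympy as sm
from sympy.physics.vector import ReferenceFrame

alpha, beta = sm.symbols('alpha beta')

A = ReferenceFrame('A')
B = A.orientnew('B', 'Axis', [alpha, A.x])  # B oriented relative to A
C = B.orientnew('C', 'Axis', [beta, B.z])   # C oriented relative to B only

# Orienting B relative to A also makes the reverse relationship available,
# and the two matrices are transposes of each other
reverse_check = A.dcm(B) - B.dcm(A).T
print(all(entry == 0 for entry in reverse_check))

# C and A are only linked through B, yet the DCM between them is found
# by chaining the stored relationships
chain_check = (C.dcm(A) - C.dcm(B) * B.dcm(A)).applyfunc(sm.simplify)
print(all(entry == 0 for entry in chain_check))
```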

Exercise

Use SymPy ReferenceFrames to find an expression of the position relative to origin (base of the robot) of the end effector on the SCARA robot depicted below. Use \(\theta\) to denote the joint angles, \(d\) to denote link length and \(J3\) to denote the z-displacement.

https://upload.wikimedia.org/wikipedia/commons/0/09/SCARA_robot_2R.png — Fig. 4 Mitsubishi Electric Automation, Inc. 500 Corporate Woods Pkwy - Vernon Hills, IL - 60061 - US, CC BY-SA 4.0 <https://creativecommons.org/licenses/by-sa/4.0>, via Wikimedia Commons#

Euler angles#

Warning

Rotations in 3D space can often be confusing. This confusion arises from the many different conventions in use, or rather the lack of a single one. There is logic to rotations, so just hold on tight and pay attention to the following subsections.

In three-dimensional space we can reach any orientation we wish by applying three successive rotations. The rotations can be performed around each axis once (e.g. X -> Y -> Z), in which case the angles are referred to as Tait-Bryan angles, or with one axis repeated (e.g. X -> Y -> X), in which case they are referred to as proper (or classical) Euler angles Brekke [Bre24]. Tait-Bryan angles are the most intuitive way to visualize such a sequence of rotations because they can be interpreted as roll, pitch and yaw. We can imagine such a sequence by first rotating the reference frame \(A\) about the \(\hat{a}_z\)-axis, then rotating the newly rotated reference frame \(A'\) about the \(\hat{a'}_y\)-axis and finally rotating the new frame \(A''\) about the \(\hat{a''}_x\)-axis. This type of rotation with mobile axes is called an intrinsic sequence of rotations.

Intrinsic and Extrinsic Rotations#

Rotations of Euler angles can be done in an intrinsic or extrinsic manner. Intrinsic rotation means that the axes are mobile, as in the example above. Each rotation also rotates the axes, and the next rotation is then carried out around an axis that has been moved by the previous rotations. Extrinsic rotation means that all the rotations are applied around the fixed axes of the original frame.

An intrinsic sequence of rotation can be written as

(10)#\[\begin{split}\begin{align*} \{a'_x, a'_y, a'_z\} &= R_{z, \psi} \{a_x, a_y, a_z\}, \\[1mm] \{a''_x, a''_y, a''_z\} &= R_{y', \theta} \{a'_x, a'_y, a'_z\}, \\[1mm] \{a'''_x, a'''_y, a'''_z\} &= R_{x'', \phi} \{a''_x, a''_y, a''_z\}. \end{align*}\end{split}\]

(Derived from Brekke [Bre24])

The example above implements:

First rotation: Z-axis of initial frame A by angle \(\psi\)
Second rotation: Y-axis of the rotated frame A’ by angle \(\theta\)
Third rotation: X-axis of the new rotated frame A’’ by angle \(\phi\)

An extrinsic rotation sequence means that every rotation is carried out about the fixed axes of the original frame:

(11)#\[\{a'''_x, a'''_y, a'''_z\} = R_{z, \psi} R_{y, \theta} R_{x, \phi} \{a_x, a_y, a_z\}.\]

(Derived from Brekke [Bre24])

The example above implements the following sequence (the rightmost matrix in the product is applied first):

First rotation: Fixed X-axis of A by \(\phi\)
Second rotation: Fixed Y-axis of A by \(\theta\)
Third rotation: Fixed Z-axis of A by \(\psi\)

Intrinsic-extrinsic equivalence#

We have now seen the difference between extrinsic and intrinsic rotations. Intrinsic rotations are easier to visualize, but harder to compute by hand since you have to keep track of the intermediate axes. Extrinsic rotations are much easier to compute from a mathematical perspective since you always rotate relative to the same frame. Luckily, we can replace intrinsic rotations with equivalent extrinsic rotations and vice versa. Intrinsic rotations yield the same result as extrinsic rotations carried out in the opposite sequence Brekke [Bre24].

(12)#\[\begin{split}\begin{align*} \{a'''_x, a'''_y, a'''_z\} &= R_{x'', \phi} R_{y', \theta} R_{z, \psi} \{a_x, a_y, a_z\} \\[1mm] &= R_{z, \psi} R_{y, \theta} R_{x, \phi} \{a_x, a_y, a_z\}. \end{align*}\end{split}\]

We can now relate this to the ZYX convention often used in navigation. It’s common to use the intrinsic sequence of rotation yaw-pitch-roll (Z -> Y -> X), which we now know is equivalent to the extrinsic sequence of rotation roll-pitch-yaw (X -> Y -> Z): \(R (\phi, \theta, \psi) = R_{a_z, \psi}R_{a_y, \theta}R_{a_x, \phi}\)

(13)#\[\begin{split}\begin{align*} R_{a_x,\phi} &= \begin{bmatrix} 1 & 0 & 0 \\ 0 & c\phi & -s\phi \\ 0 & s\phi & c\phi \end{bmatrix} \\ R_{a_y,\theta} &= \begin{bmatrix} c\theta & 0 & s\theta \\ 0 & 1 & 0 \\ -s\theta & 0 & c\theta \end{bmatrix} \\ R_{a_z,\psi} &= \begin{bmatrix} c\psi & -s\psi & 0 \\ s\psi & c\psi & 0 \\ 0 & 0 & 1 \end{bmatrix} \\ R(\phi, \theta, \psi) &= \begin{bmatrix} c\psi c\theta & -s\psi c\phi + c\psi s\theta s\phi & s\psi s\phi + c\psi s\theta c\phi \\ s\psi c\theta & c\psi c\phi + s\psi s\theta s\phi & -c\psi s\phi + s\psi s\theta c\phi \\ -s\theta & c\theta s\phi & c\theta c\phi \end{bmatrix} \end{align*}\end{split}\]

Note

c = cos, s = sin
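The equivalence in (12) can be spot-checked with plain matrices. The sketch below assumes the elementary matrices from (13) and uses the fact that a rotation about an axis that has already been moved by a rotation \(Q\) can be written as \(Q R Q^T\) in the fixed frame. The helper names rot_x, rot_y and rot_z and the sample angles are made up for this illustration, and the check is numerical at one set of angles rather than a symbolic proof.

```python
import sympy as sm

phi, theta, psi = sm.symbols('phi theta psi')

def rot_x(a):
    """Elementary rotation about the x-axis, as in (13)."""
    return sm.Matrix([[1, 0, 0],
                      [0, sm.cos(a), -sm.sin(a)],
                      [0, sm.sin(a), sm.cos(a)]])

def rot_y(a):
    """Elementary rotation about the y-axis, as in (13)."""
    return sm.Matrix([[sm.cos(a), 0, sm.sin(a)],
                      [0, 1, 0],
                      [-sm.sin(a), 0, sm.cos(a)]])

def rot_z(a):
    """Elementary rotation about the z-axis, as in (13)."""
    return sm.Matrix([[sm.cos(a), -sm.sin(a), 0],
                      [sm.sin(a), sm.cos(a), 0],
                      [0, 0, 1]])

# Extrinsic X -> Y -> Z about fixed axes: the rightmost factor acts first
R_extrinsic = rot_z(psi) * rot_y(theta) * rot_x(phi)

# Intrinsic Z -> Y' -> X'': each later rotation is about a moved axis, so it
# is the elementary rotation conjugated by the rotation accumulated so far
R1 = rot_z(psi)
R2 = R1 * rot_y(theta) * R1.T     # about y', written in the fixed frame
R12 = R2 * R1
R3 = R12 * rot_x(phi) * R12.T     # about x'', written in the fixed frame
R_intrinsic = R3 * R12

# Numerical spot check at arbitrary sample angles (radians)
sample = {psi: 0.3, theta: -0.7, phi: 1.1}
residual = (R_intrinsic - R_extrinsic).subs(sample).evalf()
max_error = max(abs(entry) for entry in residual)
print(max_error < 1e-12)
```

Trying other sample angles, or swapping the order of the factors in R_extrinsic, is a quick way to see how sensitive the result is to the chosen convention.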

Why bring this up? I’m more confused now…

The reason we bring this up is to stress the importance of being explicit about the conventions and definitions you use when working with rotations. If you don’t, it will inevitably lead to even more confusion.

SymPy 3D rotations#

The SymPy method orient_body_fixed implements three successive body-fixed, right-handed simple axis rotations. We can orient a new reference frame by providing the parent frame, three angles and the order of rotation. The example below orients a new frame \(B\) relative to frame \(A\) by an intrinsic ZYX sequence of rotations (or, equivalently, an extrinsic XYZ sequence of rotations).

A = ReferenceFrame('A')
B = ReferenceFrame('B')

phi, theta, psi = sm.symbols('phi, theta, psi')

B.orient_body_fixed(A, (psi, theta, phi), 'ZYX') # Tait-Bryan intrinsic ZYX rotation
R_zyx = B.dcm(A).T # same as A.dcm(B); compare with R(phi, theta, psi) in (13)
R_zyx

\[\begin{split}\displaystyle \left[\begin{matrix}\cos{\left(\psi \right)} \cos{\left(\theta \right)} & \sin{\left(\phi \right)} \sin{\left(\theta \right)} \cos{\left(\psi \right)} - \sin{\left(\psi \right)} \cos{\left(\phi \right)} & \sin{\left(\phi \right)} \sin{\left(\psi \right)} + \sin{\left(\theta \right)} \cos{\left(\phi \right)} \cos{\left(\psi \right)}\\\sin{\left(\psi \right)} \cos{\left(\theta \right)} & \sin{\left(\phi \right)} \sin{\left(\psi \right)} \sin{\left(\theta \right)} + \cos{\left(\phi \right)} \cos{\left(\psi \right)} & - \sin{\left(\phi \right)} \cos{\left(\psi \right)} + \sin{\left(\psi \right)} \sin{\left(\theta \right)} \cos{\left(\phi \right)}\\- \sin{\left(\theta \right)} & \sin{\left(\phi \right)} \cos{\left(\theta \right)} & \cos{\left(\phi \right)} \cos{\left(\theta \right)}\end{matrix}\right]\end{split}\]

As we can see, this agrees with our definitions in the previous subsection. Simply putting our arguments in the wrong order would have given a different result. The reason we go into such detail is to make it very clear that you need to know how rotations are implemented when using a library. If you’re not sure how they are implemented it’s often better to implement them yourself.
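To make the point about argument order concrete, the sketch below orients two frames with the same method and the same rotation order but with the first and last angles swapped. The frame C and the sample angles are hypothetical, introduced only for the comparison.

```python
import sympy as sm
from sympy.physics.vector import ReferenceFrame

phi, theta, psi = sm.symbols('phi theta psi')

A = ReferenceFrame('A')
B = ReferenceFrame('B')
C = ReferenceFrame('C')  # hypothetical frame with the angles mixed up

B.orient_body_fixed(A, (psi, theta, phi), 'ZYX')  # yaw, pitch, roll
C.orient_body_fixed(A, (phi, theta, psi), 'ZYX')  # roll and yaw swapped

# Compare the two direction cosine matrices at arbitrary sample angles
sample = {psi: 0.3, theta: -0.7, phi: 1.1}
difference = (B.dcm(A) - C.dcm(A)).subs(sample).evalf()
largest_entry = max(abs(entry) for entry in difference)
print(largest_entry)
```

A clearly non-zero value here means the two calls describe different orientations, even though both look plausible at a glance.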

SymPy’s orient_explicit() method implements a way of orienting frames explicitly with direction cosine matrices. This way of orienting frames is prone to mistakes if you’ve defined your dcm incorrectly, so use it with caution.

from sympy import Matrix, cos, sin, symbols

N = ReferenceFrame('N')
A = ReferenceFrame('A')
theta = symbols('theta')

# DCM for rotating about z-axis
dcm = Matrix([
    [cos(theta), -sin(theta), 0],
    [sin(theta), cos(theta), 0],
    [0, 0, 1]
])

A.orient_explicit(N, dcm) # Orient frame A w.r.t. frame N
A_to_N = N.dcm(A) # maps components in A to components in N
A_to_N

\[\begin{split}\displaystyle \left[\begin{matrix}\cos{\left(\theta \right)} & - \sin{\left(\theta \right)} & 0\\\sin{\left(\theta \right)} & \cos{\left(\theta \right)} & 0\\0 & 0 & 1\end{matrix}\right]\end{split}\]
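One way to guard against a wrongly defined dcm is to compare the explicitly oriented frame with a frame built by a simple axis rotation. The sketch below assumes the convention visible in the output above, namely that N.dcm(A) equals the matrix passed to orient_explicit. The frame A_ref is a hypothetical reference frame used only for the comparison.

```python
import sympy as sm
from sympy.physics.vector import ReferenceFrame

theta = sm.symbols('theta')

N = ReferenceFrame('N')
A = ReferenceFrame('A')

# Same matrix as in the example above
dcm = sm.Matrix([[sm.cos(theta), -sm.sin(theta), 0],
                 [sm.sin(theta), sm.cos(theta), 0],
                 [0, 0, 1]])
A.orient_explicit(N, dcm)

# Reference: the same orientation built as a simple rotation about N.z
A_ref = N.orientnew('A_ref', 'Axis', [theta, N.z])

# If the explicit matrix was defined correctly, both frames agree
difference = (A.dcm(N) - A_ref.dcm(N)).applyfunc(sm.simplify)
print(all(entry == 0 for entry in difference))
```

If the check fails, the usual suspect is that the matrix was entered as its own transpose.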

Exercise: Skydio drone

The drone illustrated in the picture below is oriented relative to the inertial frame \(N\). Use the ZXY Euler angle convention to find the orientation of the camera relative to frame \(N\) by using the intermediate frames \(BODY\) and \(CAM\). Take both the drone orientation and the camera gimbal orientation into account.

_images/skydio_drone.jpg — Fig. 5 Image copyright Vox Media, used under fair use for educational purposes.#
