adamheins.com

Confidence Intervals for Wishart Random Matrices

mail@adamheins.com (Adam Heins) — Tue, 02 Jan 2024 05:00:00 GMT

In this post we briefly describe Wishart-distributed random matrices and a result from (Chiani, 2017), which provides an algorithm for calculating the probability that a standard Wishart-distributed matrix's eigenvalues lie within a given interval. The functions described in this post are implemented in Python here, including Algorithm 1 from (Chiani, 2017). We also talk about quantile functions and non-standard Wishart-distributed matrices.

The Wishart distribution

Suppose we have the $p\times p$ random matrix

\begin{equation*} \bm{A} = \sum_{i=1}^n \bm{x}_i\bm{x}_i^T, \end{equation*}

where $n\geq p$ and each $\bm{x}_i\in\mathbb{R}^p$ is independently drawn from a zero-mean multivariate normal distribution

\begin{equation*} \bm{x}_i \sim \mathcal{N}_p(\bm{0},\bm{\Sigma}) \end{equation*}

with positive definite covariance matrix $\bm{\Sigma}$ . Then $\bm{A}$ is Wishart-distributed, and we write

\begin{equation*} \bm{A} \sim \mathcal{W}_p(\bm{\Sigma},n), \end{equation*}

where $n$ is called the degrees of freedom. We refer to the special case $\mathcal{W}_p(\bm{I}_p,n)$ as the standard Wishart distribution, where $\bm{I}_p$ is the $p\times p$ identity matrix.

The Wishart distribution arises as the conjugate prior to the inverse covariance matrix of a multivariate normal distribution in Bayesian statistics, among other places.

One property of the Wishart distribution that we will make use of later is that if $\bm{A}\sim\mathcal{W}_p(\bm{\Sigma},n)$ and $\bm{C}\in\mathbb{R}^{p \times p}$ is a constant, full-rank matrix, then

\begin{equation}\label{1} \bm{C}\bm{A}\bm{C}^T \sim \mathcal{W}_p(\bm{C}\bm{\Sigma}\bm{C}^T,n). \end{equation}

Confidence bounds on the eigenvalues of standard Wishart matrices

Suppose we want to compute the probability that the eigenvalues of some standard Wishart-distributed matrix

\begin{equation*} \bm{S} \sim \mathcal{W}_p(\bm{I}_p,n) \end{equation*}

are contained in a given interval $[a,b]$ , with $0\leq a\leq b$ . We denote this probability as

\begin{equation}\label{2} \mathrm{Pr}[a\leq\lambda_{\min}(\bm{S}), \lambda_{\max}(\bm{S})\leq b], \end{equation}

where $\lambda_{\min}(\bm{S})$ and $\lambda_{\max}(\bm{S})$ are the minimum and maximum eigenvalues of $\bm{S}$ , respectively. The probability $\eqref{2}$ is equivalent to

\begin{equation}\label{3} \mathrm{Pr}[a\bm{I}_p \preccurlyeq \bm{S} \preccurlyeq b\bm{I}_p] \end{equation}

(see Appendix A.5.2 of (Boyd & Vandenberghe, 2004)), where the notation $\bm{A}\preccurlyeq\bm{B}$ means that $\bm{B}-\bm{A}$ is positive semidefinite.

Algorithm 1 of (Chiani, 2017) tells us how to compute the function $\psi(a,b)$ such that

\begin{equation*} \mathrm{Pr}[a\bm{I}_p \preccurlyeq \bm{S} \preccurlyeq b\bm{I}_p] = \psi(a, b), \end{equation*}

and I've implemented this algorithm in Python here. This immediately gives us the cumulative density functions (CDFs) of the minimum and maximum eigenvalues of $\bm{S}$ :

\begin{align*} \mathrm{Pr}[a\leq\lambda_{\min}(\bm{S})] &= \psi(a, \infty) = 1 - C_{\min}(a), \\ \mathrm{Pr}[\lambda_{\max}(\bm{S})\leq b] &= \psi(0, b) = C_{\max}(b), \end{align*}

where $C_{\min}$ and $C_{\max}$ are the CDFs for the minimum and maximum eigenvalues, respectively.

Quantile functions

We now have a method for computing the CDFs of the eigenvalues, but we do not have the inverse CDFs (i.e., the quantile functions), which compute the bounds required to achieve a given probability. For example, suppose we want to find the value $b\geq0$ such that $\lambda_{\max}(\bm{S})\leq b$ with a given probability $\rho\in[0,1]$ . One way to do this is by solving the optimization problem

\begin{equation*} \mathrm{argmin}_{b\geq0} (\rho - C_{\max}(b))^2, \end{equation*}

which is also implemented here, using one of scipy's built-in solvers.

Non-standard Wishart matrices

We can extend the results above to the non-standard Wishart-distributed matrix

\begin{equation*} \bm{A} \sim \mathcal{W}_p(\bm{\Sigma},n). \end{equation*}

In particular, generalizing $\eqref{3}$ we get

\begin{equation}\label{4} \mathrm{Pr}[a\bm{\Sigma} \preccurlyeq \bm{A} \preccurlyeq b\bm{\Sigma}] = \psi(a,b). \end{equation}

To see this, let $\bm{L}$ be the Cholesky decomposition of $\bm{\Sigma}$ , such that $\bm{L}\bm{L}^T=\bm{\Sigma}$ . Since $\bm{\Sigma}$ is positive definite, $\bm{L}$ is full-rank and invertible. Then from $\eqref{1}$ we know that $\bm{L}^{-1}\bm{A}\bm{L}^{-T}\sim\mathcal{W}_p(\bm{L}^{-1}\bm{\Sigma}\bm{L}^{-T},n)=\mathcal{W}_p(\bm{I}_p,n)$ , which means that $\mathrm{Pr}[a\bm{I}_p\preccurlyeq\bm{L}^{-1}\bm{A}\bm{L}^{-T}\preccurlyeq b\bm{I}_p]=\psi(a,b)$ . Finally, since $a\bm{I}_p\preccurlyeq\bm{L}^{-1}\bm{A}\bm{L}^{-T}\preccurlyeq b\bm{I}_p$ if and only if $a\bm{\Sigma}\preccurlyeq\bm{A}\preccurlyeq b\bm{\Sigma}$ (this follows from Observation 7.1.8 of (Horn & Johnson, 2013)), we obtain $\eqref{4}$ .

Notice that $\eqref{4}$ is no longer expressed in terms of eigenvalues of $\bm{A}$ , but only expresses the probability that $\bm{A}$ is bounded by two matrices. This could be useful for formulating chance constraints on a positive definite matrix in convex optimization.

Thanks to Abhishek Goudar for reading a draft of this.

References

Boyd, Stephen and Lieven Vandenberghe (2004). Convex Optimization. Cambridge University Press. (Freely available here.)
Chiani, M. (2017). "On the Probability That All Eigenvalues of Gaussian, Wishart, and Double Wishart Random Matrices Lie Within an Interval," in IEEE Transactions on Information Theory, vol. 63, no. 7, pp. 4521–4531, July 2017, doi: 10.1109/TIT.2017.2694846, arxiv: 1502.04189.
Horn, Roger A. and Charles R. Johnson (2013). Matrix Analysis. Cambridge University Press.

New Features in pyb_utils

mail@adamheins.com (Adam Heins) — Mon, 17 Jul 2023 04:00:00 GMT

I've recently added a few things to pyb_utils, which is a small package designed to make things easier when working with the PyBullet simulator. In my previous post (quite some time ago now) I discussed my initial reason for creating pyb_utils, which was to easily perform collision detection in parallel with the main simulation, for things like planning or control. Since then I've added some new things that I think are generally useful.

Rigid body construction

First, there is now the BulletBody class for quickly creating rigid bodies. Creating and adding a box to a simulation is as easy as

>>> import pybullet as pyb
>>> import pyb_utils

>>> pyb.connect(pyb.GUI)

# create a 1x1x1 cube at the origin
>>> box = pyb_utils.BulletBody.box(position=[0, 0, 0], half_extents=[0.5, 0.5, 0.5])

The BulletBody class takes care of the boilerplate for creating the visual and collision objects and combining them into a multibody. It also provides methods for getting and setting position, orientation, and velocity, as well as applying external forces and torques.

In addition to the box, there are dedicated constructors for spheres, cylinders, and capsules. For example:

# put a ball on top of the cube
>>> ball = pyb_utils.BulletBody.sphere(position=[0, 0, 1.5], radius=0.5)

# now put it somewhere else
>>> ball.set_pose(position=[2, 0, 0.5])

The BulletBody class makes it easy to add simple objects to act as obstacles or manipulation targets, for instance.

Named tuples

Second, pyb_utils now wraps some PyBullet functions and returns named tuples rather than just vanilla tuples (but the calling API is exactly the same). This makes it a lot easier to remember what the values in the returned tuples actually mean. In particular, there are wrappers for getDynamicsInfo, getContactPoints, getClosestPoints, and getConstraintInfo. These functions all return tuples containing over ten values each. Continuing our example from above, with regular PyBullet we have something like

# hard to read!
>>> pyb.getDynamicsInfo(box.uid, -1)
(1.0,
 0.5,
 (0.16666666666666666, 0.16666666666666666, 0.16666666666666666),
 (0.0, 0.0, 0.0),
 (0.0, 0.0, 0.0, 1.0),
 0.0,
 0.0,
 0.0,
 -1.0,
 -1.0,
 2,
 0.001)

Instead, we can easily replace the call with the pyb_utils equivalent:

>>> info = pyb_utils.getDynamicsInfo(box.uid, -1)
# now we can access fields by name
>>> info.mass
1.0
>>> info.localInertiaPos
(0.0, 0.0, 0.0)

Now there is no need to count through the field names in the PyBullet documentation every time one of these functions is used. And since the calling API is exactly the same, there is no barrier to switching to the pyb_utils wrappers.

Quaternion utilities

Finally, I want to mention a couple of simple quaternion utilities. They've actually been present in pyb_utils for quite a while, but I've come to appreciate them more over time for fast prototyping. PyBullet represents 3D orientation using quaternions in $(x,y,z,w)$ order (i.e., with the scalar part last). Using the spatialmath library under the hood, pyb_utils provides the functions:

quaternion_to_matrix  # convert quaternion to rotation matrix
matrix_to_quaternion  # convert rotation matrix to quaternion
quaternion_multiply   # multiply two quaternions together (Hamilton product)
quaternion_rotate     # rotate a point by a rotation represented by a quaternion

which make it easy to apply and compound rotations. The main reason to use these functions rather than just those from spatialmath directly is because by default spatialmath uses the convention that the scalar part of the quaternion is first; to switch to the PyBullet convention one needs to constantly pass the order keyword argument to all of the functions, which is tiresome and hard to debug if forgotten.

Conclusion

I wrote and continue adding to pyb_utils because it provides a set of tools useful for my research on robotics. I hope it can be useful for others as well. Pull requests are welcome!

Collision Detection in PyBullet

mail@adamheins.com (Adam Heins) — Mon, 27 Dec 2021 05:00:00 GMT

All of the code for this post can be found here. However, as of pyb_utils v2.0 released on Oct. 20, 2023, the collision API discussed below has been revised; refer to the updated example here.

PyBullet is a popular physics simulator often used for robotics research. An important requirement for any robotics simulator is the ability to check for collisions between a robot and itself or its surroundings, in order to plan a collision-free path. More generally, we want to be able to compute the shortest distances between arbitrary objects in arbitrary configurations—if the distance is non-positive, the objects are in collision.

Shortest distance computations are fairly straightforward to do with PyBullet, though the solution was not immediately obvious to me: run a separate (headless) physics server only for collision checking. Consider a basic PyBullet setup which loads a robot, a ground plane, and some obstacles:

def load_environment(client_id):
    pyb.setAdditionalSearchPath(
        pybullet_data.getDataPath(), physicsClientId=client_id
    )

    # ground plane
    ground_id = pyb.loadURDF(
        "plane.urdf",
        [0, 0, 0],
        useFixedBase=True,
        physicsClientId=client_id,
    )

    # KUKA iiwa robot arm
    kuka_id = pyb.loadURDF(
        "kuka_iiwa/model.urdf",
        [0, 0, 0],
        useFixedBase=True,
        physicsClientId=client_id,
    )

    # some cubes for obstacles
    cube1_id = pyb.loadURDF(
        "cube.urdf",
        [1, 1, 0.5],
        useFixedBase=True,
        physicsClientId=client_id,
    )
    cube2_id = pyb.loadURDF(
        "cube.urdf",
        [-1, -1, 0.5],
        useFixedBase=True,
        physicsClientId=client_id,
    )
    cube3_id = pyb.loadURDF(
        "cube.urdf",
        [1, -1, 0.5],
        useFixedBase=True,
        physicsClientId=client_id,
    )

    # store body indices in a dict with more convenient key names
    bodies = {
        "robot": kuka_id,
        "ground": ground_id,
        "cube1": cube1_id,
        "cube2": cube2_id,
        "cube3": cube3_id,
    }
    return bodies


# start the main physics server and load the environment
gui_id = pyb.connect(pyb.GUI)
bodies = load_environment(gui_id)

To set up an additional physics server for collision checking, we can add:

col_id = pyb.connect(pyb.DIRECT)

# collision simulator has the same objects as the main one
collision_bodies = load_environment(col_id)

for which we don't require a GUI. In fact, we don't require dynamics simulation at all: all we need to do is put the robot in a particular configuration and then compute the shortest distances between the objects of interest.

Computing the shortest distances between pairs of objects is rather cumbersome in PyBullet, since one cannot directly specify link names to check (PyBullet uses joint indices, which are less intuitive for this purpose). As such, I wrote a little code that allows us to set up distance computation using something like:

# NamedCollisionObjects contain the name of the body, and optionally
# the name of the link on the body to check for collisions
ground = NamedCollisionObject("ground")
cube1 = NamedCollisionObject("cube1")
cube2 = NamedCollisionObject("cube2")
cube3 = NamedCollisionObject("cube3")
link7 = NamedCollisionObject("robot", "lbr_iiwa_link_7")  # last link

# then we set up collision detection for desired pairs of objects
col_detector = CollisionDetector(
    col_id,  # client ID for collision physics server
    collision_bodies,  # bodies in the simulation
    # these are the pairs of objects to compute distances between
    [(link7, ground), (link7, cube1), (link7, cube2), (link7, cube3)],
)

Now we can compute shortest distances between pairs of objects (specified by name!) for whatever configuration of the robot we want, without affecting the main GUI-based simulation:

while True:
    # compute shortest distances for a random configuration
    q = np.pi * (np.random.random(7) - 0.5)
    d = col_detector.compute_distances(q)
    in_col = col_detector.in_collision(q)

    print(f"Configuration = {q}")
    print(f"Distance to obstacles = {d}")
    print(f"In collision = {in_col}")

    # wait for user to press enter to continue
    input()

    # the main GUI-based simulation is not affected
    # we could do whatever motions we want here
    pyb.stepSimulation(physicsClientId=gui_id)

If you found this useful, consider checking out the full pyb_utils project that came out of it. In addition to collision detection, it also provides ghost (i.e. visual-only) objects, cameras, and more.