madhadron

The unit circle deformed

Status: Done
Confidence: Certain

Math in this page not rendering? See the fix

The distance between two points x=(x1,x2,,xn)x=(x_1,x_2,\dots,x_n) and y=(y1,y2,,yn)y=(y_1,y_2,\dots,y_n) in the classical Euclidean sense is

xy=(x1y1)2+(x2y2)2++(xnyn)2\| x - y \| = \sqrt{(x_1 - y_1)^2 + (x_2 - y_2)^2 + \dots + (x_n - y_n)^2}

However, other measures are possible. On a grid (say, the distance to get from place to place in New York City), the distance is the lateral distance, since you can’t go on diagonals:

xy=|x1y1|++|xnyn|\| x - y \| = |x_1 - y_1| + \dots + |x_n - y_n|

We can dream up scenarios in which we take the cube of each term, and the cube root overall, or any other power that we care to choose. We’ll generalize, and define the pp-norm of a vector xx to be

xp=(|x1|p++|xn|p)1p\|x\|_p = (|x_1|^p + \dots + |x_n|^p)^{\frac{1}{p}}

After you play with this for a while, you might ask, “what does the unit circle look like for some pp?” We’ll explore this in two dimensions where it’s easy to see. Nothing new happens in higher dimensions.

p=2p=2 is our usual norm, and the unit circle is our usual circle. If we go to p=1p=1, our grid norm above, then we are plotting the equation |x1|+|x2|=1|x_1| + |x_2| = 1 which is just four line segments connecting the points (1,0), (0,1), (-1,0), and (0,-1). The unit circles for p=1,2,3,4p=1,2,3,4 look like

Unit circles for various p=1,2,3,4

To see the more general behavior, note that we can take the norm of a fixed vector xx as a function of pp. If we fix x=(1,0)x=(1,0), or any of the other points on the axes, then the norm doesn’t vary with pp—it’s always 1. If we go at forty five degrees between the axes (x=(1/2,1/2)x=(1/\sqrt{2}, 1/\sqrt{2})), then we have

xp\|x\|_p
=(12p+12p)1p= (\frac{1}{\sqrt{2}^p} + \frac{1}{\sqrt{2}^p})^{\frac{1}{p}}
=(212p2)1p= (2\frac{1}{2^{\frac{p}{2}}})^{\frac{1}{p}}
=(21p2)1p= (2^{1-\frac{p}{2}})^{\frac{1}{p}}
=2(1p12)= 2^{(\frac{1}{p}-\frac{1}{2})}

This function looks like

The function y=2^(1/x - 1/2)

Remember, we have here a fixed vector: its value goes down, so in order to get a length of 1, we have to put in a longer vector. The unit circle thus heads steadily towards a square.

The limiting case is the \infty-norm, which is simply defined as 𝐱=maxi|xi|\|\mathbf{x}\|_\infty = \textrm{max}_i |x_i|, the largest component of the vector. A few moments thought show that the unit circle is simply the square centered at the origin with side length two, and all sides parallel to the axes.

Given these insights into the unit circle, we can actually use this to get another characteristic of a vector. How much does it lie on the axes? Are only two of its components in this basis important? Three?

To measure this, consider our vector 𝐱\mathbf{x}. Take

d𝐱pdp|p=1-\frac{d\|\mathbf{x}\|_p}{dp}|_{p=1}

What is this horrible object? Consider a simple case, where 𝐱\mathbf{x} has kk components which are 1 and nkn-k components which are 0. Then

d𝐱pdp- \frac{d\|\mathbf{x}\|_p}{dp}
=ddpk1p= -\frac{d}{dp} k^{\frac{1}{p}}
=1pk1plogk1p= \frac{1}{p} k^{\frac{1}{p}} \log k^{\frac{1}{p}}

When we evaluate this at p=1p=1, we get d𝐱p/dp|p=1=klogk-d\|\mathbf{x}\|_p/dp|_{p=1} = k\log k, which is at least monotonic in kk, and gives us an estimate of the number of components.