Ehressmann Connection
tldr; Connections on Fiber bundles are are specified by vertical bundle-valued one-forms
One might be familiar with the connection as the Christoffel symbols or the gauge field . We will review the idea of Ehresmann connection that unifies these 2 seemingly disparate structures in a more abstract framework. It is essential if we are going to generalize the Levi Civita connection (Christoffel symbols) on the tangent bundle to a spin connection on the spin bundle.
The Ehresmann connection can be defined for any smooth Fiber bundle with smooth fiber . If we consider tangent bundle , one can see that , which implies that . Intuitively, the tangent bundle has some linear subspace that are “along ”, and this leads to the definition of One can look for another subspace that is complementary to , i.e. The intuition is that vectors in point in a direction that leads to another fiber, while vectors in point in a direction tangent to the current fiber. I would love to add a diagram showing an example with (the only intuitive diagram one can draw in my opinion).
In the absence of any additional structure, there are many valid choices of , and our choice of affects . This is analogous to saying that the vector space of degree 2 polynomials can be decomposed in many ways
Connection Form
Instead of saying we provide a horizontal subspace at every tangent space of the the tangent bundle , it is an equivalent definition to just provide a projection map from which can be inferred.
Punchline: One can say that is a -valued one-form on since it takes a vector in and gives an element of .
Ehressmann Connection on -bundle
tldr; Connection on -bundle is a Lie algebra-valued one-form on
If we consider to be a principal -bundle , then the is a Lie group-valued one-form. Since consists of vectors that live in the tangent space that have the same dimension as , the vertical subspace is isomorphic to the Lie algebra. So the connection on a -bundle is specified by a Lie algebra-valued one-form on P (the full bundle ).
However, it turns out that given a local trivialization of , specifying a Lie algebra-valued one-form on actually specifies enough information to specify on , which is then glued together with other charts to obtain on the full bundle . This might seem wild at first since surely projecting onto to yield will lose information with regard to how this one form behaves on the fibers. The answer is that the condition that there is an action of on the fibers is constraining enough to reconstruct given . These are known as the Yang-Mills fields. There are alot of details that are covered in the excellent lecture 22 by Frederic Schuller.
Details of Theorem
The main result is a theorem
Given a section which induces canonical local trivialization , a -valued one-form on can be pulled back via or .
The pullback induces the pullback by the following definition where .
Theorem: This satisfies the 2 properties of a connection, namely
where and is the fundamental vector field on generated by , defined pointwise as
Proof of :
Since , the pushforward via yields , where is where . It is clear from this that .
This condition is basically saying that the inverse pullback satisfies the connection one-form condition .
Proof of :
To prove this,
So the above calculations has shown that given a -valued one-form on and a section , there exists a unique connection one-form on such that . The constructed connection depends on .
construction: Split into charts . Then each local trivialization induces a section . The pullback of on via will give an on , which is a Yang-mills field on the base manifold after gluing. The gluing is what we call “gauge transformations of ” or “how transforms under change in coordinates”.
construction: Split into charts . On each chart, the one-form on induces a one-form (which is actually ) on the local trivialisation of via the theorem above. These can be glued together to construct a one-form on .
Clearing the Ambiguity in Nakahara’s Notation
I learned alot from Nakahara’s book, but I find their section on connection one-form slightly ambiguous. For the connection one-form on , they write where is the Yang Mills one-form on the base manifold . The first term actually means where is the adjoint action of on , and . The and notations coincide for matrix Lie groups, where the multiplication between and is matrix multiplication. I still prefer to talk purely in non-matrix notation because I find it’s clearer.
Because and , the above actually reduces to
where is a horizontal vector. The first term takes care of the vertical component of .
The second term is extremely ambiguous. is simply just . Trouble comes from . Usually one uses to denote the pushforward of . But in this case, actually means . If I include the implicit annotations, So how does one justify ? is interpreted as a map in the local trivialization . A vector is the derivative of a curve at one point of the curve, so given a curve in which leads to vector , we actually mean to define i.e. derivative of the fiber “coordinate” of our curve . is the pushforward of the map . So .
tldr; is .
Gluing of Connection One-Form
Given 2 overlapping local trivializations (which induces 2 local sections ), we would like to find how is related to . The answer is obtained by investigating how they act on a vector . We need to calculate in terms of . Or in terms of the local trivialisation and .
We know that is the transition map defined as the unique (guaranteed since acts freely and transitively on the fibers of ) element such that This definition can be rephrased in a more friendly manner as
Repeating the derivation with swapped, we get that
If we let , then the last equation becomes which translates the definition into a statement about transition maps between sections
Suppose we have a curve in that yields the vector at . This vector acts on a function . The pushforward is given by
so the vector fields are related as follows
Note that each term is responsible for the horizontal and vertical component of the vector . Right -invariance of the horizontal bundle means horizontal gets mapped onto horizontal. The vector field generated from an element of the Lie algebra lives in the vertical bundle too, so vertical maps onto vertical.
This allows us to relate to , which is what we call “gauge transformations” of .
So this is the mathematically precise expression for the
Comparison with Frederic Schuller Lecture 22
In case you were wondering how to match the result we derived with Schuller’s Lecture 22 timestamp 59:08, where he says that
- In our case is a -valued one-form on whereas his is a one-form on . So our is his .
- His is our
- His Maurer Cartan form is just , in this case is . For our expression to match his, we just rewrite as , i.e. we pullback the -valued one-form on via to yield a -valued one-form on So our is identified with his
Calculating Christoffel Symbols
Christoffel symbols is just the connection one-form on the base manifold of a tangent frame bundle (which can be shown to be a -principal bundle). The goal will be to calculate how changes when we change coordinates.
The nice thing is that a coordinate chart on already defines a local trivialization of the -bundle. Namely, the coordinate functions can be used to define a section
A vector can be written as , with . Hence the “Yang-Mills” one-form on the base manifold takes the form , where .
Since , the Lie algebra is just , consisting of all real matrices. So where are indices for the matrix Lie algebra .
We are interested in the transformations laws for this . Suppose we have another chart . Then on the intersection , if is a curve with tangent vector ,
The last term can be rewritten as
So with all the indices flashed out,
The vector expressed in the 2nd chart is , with So
Putting this back into the above expression,
{\Gamma^{(2) i}}_{j\mu} \tilde X^\mu &= {(\Omega^{-1})^i}_{k} \left( {\Gamma^{(1)k}}_{l\mu} \frac{\partial x^\mu}{\partial y^\nu}\tilde X^\nu \right) {\Omega^l}_j + {(\Omega^{-1})^i}_k\partial_\mu {\Omega^k}_j X^\mu \end{aligned}$$ Now the transition functions $\Omega:M\rightarrow GL(d)$, or in coordinates, ${\Omega^i}_j: M \rightarrow \mathbb R$ are actually $${\Omega^i}_j = \frac{\partial x^i}{\partial y^j}$$ $${(\Omega^{-1})^i}_j = \frac{\partial y^i}{\partial x^j}$$ so $$\begin{aligned} {\Gamma^{(2) i}}_{j\mu} \tilde X^\mu &= \frac{\partial y^i}{\partial x^k} \left( {\Gamma^{(1)k}}_{l\mu} \frac{\partial x^\mu}{\partial y^\nu}\tilde X^\nu \right) \frac{\partial x^l}{\partial y^j} + \frac{\partial y^i}{\partial x^k} \frac{\partial^2 x^k}{\partial y^\mu \partial y^j} \tilde X^\mu \end{aligned}$$ Removing the $X$ (more rigorously, choosing $X=\delta^{\mu\rho} \frac{\partial}{\partial y^\rho}$), $$\begin{aligned} {\Gamma^{(2) i}}_{j\rho} &= {\Gamma^{(1)k}}_{l\mu} \frac{\partial y^i}{\partial x^k} \frac{\partial x^\mu}{\partial y^\rho} \frac{\partial x^l}{\partial y^j} + \frac{\partial y^i}{\partial x^k} \frac{\partial^2 x^k}{\partial y^\rho \partial y^j} \end{aligned}$$ Not all indices of $\Gamma$ are the same type! 1 of them is spacetime, the other 2 actually label the Lie algebra. ### Memorisation Trick When we learn this in GR we usually get giddy just trying to write it with the correct indices. However, if we write it as follows and compare with the abstract form, it becomes much clearer what is really going on $$\begin{aligned} {\Gamma^{(2) i}}_{j\rho} dy^\rho &= \frac{\partial y^i}{\partial x^k} \left({\Gamma^{(1)k}}_{l\mu} \frac{\partial x^\mu}{\partial y^\rho} dy^\rho \right) \frac{\partial x^l}{\partial y^j} + \frac{\partial y^i}{\partial x^k} \frac{\partial}{\partial y^\rho } \left(\frac{\partial x^k}{\partial y^j} \right) dy^\rho\\ \mathbf \Gamma^{(2)}_\rho dy^\rho &= \mathbf\Omega^{-1} \left(\mathbf \Gamma^{(1)}_\mu dx^\mu\right) \mathbf \Omega + \mathbf \Omega^{-1} d \mathbf \Omega \end{aligned}$$