Running in circles

Even though the title would work for the reason this blog post is out so late, it should have been published Monday evening, I still like to do a little math (it’s better for my blood pressure anyways).

But on to the blog post. Last week I had a short chat with a colleague who is working on graph theory, not my field of research but I really like it. She was stuck on the proof of a property which seems to be rather obvious:

Any Graph with at least as many Edges as Vertices contains a cycle.

If I understood her correctly it was for an exercise for her students, so if you are here because you googled the problem set “Please try to solving it on your own!” Otherwise I hope you learn something.

Ok now so let’s first clear up for everybody what we are talking about.

(Undirected) Graph

A graph GG is a tuple G=(V,E)G=(V,E) with VV being a set the vertices or nodes, and EP(V)E\subseteq P(V) with eE|e|=2\forall e\in E \bullet |e|=2 a set of edges.

If you are a mathematician or at least have a good ability to read mathematical definitions, this might be enough for you and you can skip the next paragraph.

For anybody else let me try to explain what this means. First we have a set of vertices VV. Every vertex is just a “thing” that we can somehow connect with some other “thing” (vertex) by a relation e.g. train stations, people or dots. The second part, the edges EE, are precisely these relations, every edge (eEe\in E) connects exactly two “things” (vertices) with one another (eE|e|=2\forall e\in E \bullet |e|=2), going on with the examples above, you can think of an edge as train tracks connecting two train stations, friendships connecting two people or simply a line connecting two dots. In the special case stated above we can’t distinguish between an “origin” and a “target” of the relations, so they are symmetrical which only means if aa and bb are train stations and there is a track between them we can’t say the track only goes from aa to bb but not the other way around. There is one more technicality we don’t allow loops e.g. one vertex vVv\in V having an edge going from vv back to vv is forbidden. For the reminder of this exercise this makes no big difference, but makes the formulation above a little easier.

For obvious reasons we resort to showing graphs as dots and lines between them. So here is an example:

Here we have the vertices V={a,b,c,d}V=\{a,b,c,d\} and the edges E={{a,b},{b,c},{c,d}}E=\{ \{a,b\}, \{b,c\}, \{c,d\} \}. Note that I could have written E={{b,a},{c,b},{c,d}}E=\{ \{b,a\}, \{c,b\}, \{c,d\} \} as well, since sets have no inherent order, which is what I said above with no “origin” and “target”.

But let’s to go on with our property from above. For a mathematical formulation we need one more thing:

Path

For a Graph G=(V,E)G=(V, E) a path is sequence of vertices <v0,v1,v2,,vn><v_0, v_1, v_2, \ldots , v_n> such that for every two vertices {vi,vi+1}E\{v_{i}, v_{i+1}\}\in E holds. A path with v0=vnv_0=v_n is called a cycle.

Once again a more intuitive way to state that. A path is probably exactly what you would think it is, if you can go from the first vertex in the sequence to the second via an edge then to a third an so on you have a path. In our example from above <a,b,c,d><a, b, c, d> is a path since we can go from aa to bb with the edge {a,b}\{a,b\} from bb to cc via {b,c}\{b,c\} and from cc to dd with {c,d}\{c,d\}.

A cycle is then, probably not to you surprise, a path that starts and ends at the same vertex.

So now once again to our problem. A mathematical formulation of the above is:

For any graph G=(V,E)G=(V,E), if |E||V||E|\geq |V| then GG contains a cycle.

In the above example it is easy to see what our property states, the graph has four vertices and three edges, so one less edge than vertices and has no cycle. However you can’t add any edge between two of the vertices without introducing one.

Two remarks here:

  1. The property does not state that we can’t make a cycle in a graph with less edges than vertices. It only states that if we have at least as many edges as vertices we are guaranteed to have a cycle.
  2. From your math education in school you might remember that any number of example can’t be a proof that this holds in general.

In this case however the property holds and now let’s proof that.

Proof

First let’s give an equivalent formulation:

For any graph G=(V,E)G=(V,E) that is cycle-free |E|<|V||E| < |V| holds.

I trust in you seeing that this is equivalent formulation to the one above.

Our proof now proceeds in two steps:

  1. We proof that there is at least on vertex vv in any cycle-free graph that has at most one edge connecting it to one other vertex. Or mathematically stated vVdeg(v)<2\exists v\in V \bullet deg(v)<2.
  2. We proof by induction, that is, if we have a proof that our property holds for a/all smaller graph(s) we can proof it for a graph having one more vertex.

For the first part.

Lemma

Every cycle-free Graph GG has (at least) one vertex vv with deg(v)<2deg(v)<2.

Proof (by reductio ad absurdum)

We proof this lemma by reductio ad absurdum or contradiction, that is, we say: “There is a cycle-free graph that has no such vertex” and than show that this is a contradiction, which leads to understanding that our initial idea has to be wrong and the opposite has to be true, which is what we wanted to proof in the first place. Here we go:

Claim: There is a cycle-free graph G=(V,E)G=(V,E) with |V|=n|V|=n where all vertices vv have deg(v)2deg(v)\geq 2.

If we have such a graph, then we construct a path <v0,v1,,vn><v_0, v_1, \ldots , v_n> as follows: We take any vertex v0v_0, since this vertex has deg(v0)2deg(v_0)\geq 2 there is an edge {v0,v1}E\{v_0, v_1\} \in E. And we can select v1v_1 as the second vertex, and so on. Since for all vertices deg(vi)2deg(v_i)\geq 2 we can always select vi1,vi,vi+1v_{i-1}, v_{i}, v_{i+1} such that $v_{i-1} \not= v_{i+1}$, so the incoming edge is different from the outgoing edge. But now we have a problem <v0,v1,,vn><v_0, v_1, \ldots , v_n> contains n+1n+1 vertices (just count from 00 to nn) but VV only contains nn vertices, so two vertices in this path need to be the same. Which means, yes you guessed it, there has to be a cycle. So the graph can’t be cycle-free and we have our contradiction and by that know that our lemma holds.

Theorem

Now to our main theorem:

For any graph G=(V,E)G=(V,E) that is cycle-free |E|<|V||E| < |V| holds.

We will proof this by induction, which goes as follows, we first proof our theorem directly for the base instance, the smallest graph we want our property to hold for. Since we didn’t state otherwise this is the graph with one vertex. Following that we construct a proof for a graph with n+1n+1 vertices given that we have a proof for a graph with nn vertices.

So let’s see how this goes. We start with the base case.

Proof

Base case: A graph G=(V,E)G=(V, E) with only one vertex |V|=1|V|=1 can obviously have no edge without having a cycle.

This takes care of the base case.

Induction hypothesis: For every graph G=(V,E)G=(V,E) with |V|=n|V|=n, if GG is cycle-free |E|<|V||E|<|V| holds.

Induction step: Let G=(V,E)G=(V,E) be a cycle-free graph with |V|=n+1|V|=n+1 vertices. Than, due to our lemma from above, VV contains at least one node vVv\in V with deg(v)2deg(v)\leq 2. We construct a new graph G=(V,E)G^{\prime}=(V^{\prime}, E^{\prime}), where V=V{v}V^{\prime}=V\backslash\{v\}, so we remove vv from VV and we construct EE^{\prime} by removing all edges connecting vv with any other vertex E=E{{u,v}|uV}E^{\prime}=E\backslash \{\{u,v\} | u\in V\}. Obviously |V|=|V|1|V^{\prime}|=|V|-1, since we removed 1 vertex, and |E||E|1| E^{\prime} | \geq |E|-1, since we removed at most one edge, due to deg(v)1deg(v)\leq 1. As we couldn’t have added a cycle by removing one vertex GG^{\prime} is also cycle-free. We can now use the induction hypothesis, because GG^{\prime} only contains nn vertices and is cycle-free. By that we know |E|<|V|=n| E^{\prime} | < | V^{\prime} | = n , therefore |E|<n+1|E| < n + 1.

And we are done. The rest is taken care of by the induction principle. Since we have the base case, we can use the induction step to get a solution for two vertices, after we have that we can continue for three, four and so on.

I hope some of you at least tried to suffer through this and have learned a thing or two. And if this was a problem set for one of your math courses, I’m glad I could help, but please try to do your problem sets on your own. You really need the training to sharpen your brain and trust me I know how hard it is to do this in the beginning.