Assignment 3: Control Flow Optimization

Global optimization -- optimization spanning multiple basic blocks -- is very important. We have spent quite a bit of time talking about it in class. However, I also want you to build something. Most of those optimizations take a while to implement, so we're going to keep this project as simple as possible and focus merely on improving the control flow itself, i.e., code straightening.

Starting with any of the previous versions (but best to have already done dead code elimination), you're now going to add code straightening. What does that mean? Well, even though it's pretty simple as global optimization goes, it means that you've actually got quite a bit to do.

Building The Control Flow Graph

The first step is to build the control flow graph. There will only be one function in your input (I promise), and it is entered at the top, but you'll need to figure-out the control flow from all the SEL and LAB tuples. Each basic block has at most two successors. A SEL tuple ending a block explicitly specifies the successor(s), but falling into a LAB without passing through a SEL also creates a control flow entry to the LAB.

I strongly recommend that you think of the flow graph as a separate data structure in which each node represents an original basic block. Each node contains a sequence of tuples (at least one: the LAB that starts the block), which you can identify by pointers to the first and last tuples in the block. Each node has no more than two exit arcs pointing at other nodes:

A conditional SEL creates two exit arcs.
An unconditional SEL or simply "falling through" from another block without a SEL creates a single exit arc.
The end of the input essentially creates no exit arcs.

Note that although each node is followed by at most 2 successors, it may have any number of predecessors. You'll want to record the number of immediate predecessors in each node. Of course, the very first block is special -- it logically has infinitely many predecessors (anything at the operating system level could jump to it), despite the fact that the code you are processing might not have any arcs to it.

How do you output this graph? Well, I'm very fond of the GraphViz graph visualization software. There is an interface for C programs to actually produce GraphViz rendings, but all I'm asking is that your program output the graph as ASCII text using the DOT language. Here's a really simple example of how that works. The DOT file:

digraph {
	start [shape=none];
	start -> LAB0;
	LAB0 -> LAB1;
	LAB1 -> LAB2;
	LAB2 -> LAB3;
	LAB3 -> LAB4;
	LAB3 -> LAB5;
	LAB4 -> LAB8;
	LAB5 -> LAB6;
	LAB6 -> LAB2;
	LAB6 -> LAB7;
	LAB8 -> LAB6;
}

when run through the command dot -Tps <dig.dot >dig.ps generates a postscript file that looks like:

You might have noticed that the dot rendering cleverly moved the LAB8 node into a more logical position as it drew the graph. Incidentally, I don't care what you name the nodes. Cool, eh?

Code Straightening

When you straighten the graph, you:

Ignore unreachable nodes. For example, a block that was inside an if statement where the condition had the constant value 0 is unreachable. In fact, for that reason, it might not even appear in even your original graph output (depending on how you structure your dot code).
Any pair of nodes, A, B, such that A is the only predecessor of B and B is the only successor of A, should be merged into a single node with the contents of A followed by the contents of B.

For example, the straightened version of the above graph is:

digraph {
	start [shape=none];
	start -> LAB01;
	LAB01 -> LAB23;
	LAB23 -> LAB48;
	LAB23 -> LAB5;
	LAB48 -> LAB6;
	LAB5 -> LAB6;
	LAB6 -> LAB23;
	LAB6 -> LAB7;
}

and generates the graphic output:

Easy, right?

What Do You Mean, "contents of A followed by contents of B"?

Ok, it's not quite that easy. You knew there had to be a catch. Here it is:

Ideally, you should combine the blocks and spit out optimized code for them just like in the previous asignment.

Fortunately, I'm NOT asking you to do that here.. All I want is for you to output a graph DOT specification that gives the straightened graph. If you were going to combine the blocks and re-optimize them, you'd have to copy things into a new list of instructions for each node and re-optimize that, but I'm not asking you to do that here. All I ask is that you output DOT code that will draw the straightened graph; you can use whatever labels you want for the nodes, preferably derived from v in the label(v) tuple that was entered for the possibly-merged block. You do NOT need to list all the blocks merged into it.

Due Dates

The due date for this assignment is by 11:59PM Monday, April 26, 2021. You may submit as many times as you wish, but only the last submission that you make before the final deadline will be counted toward your course grade.

Note that you can ensure that you get at least half credit for this project by simply submitting a tar of an "implementor's notes" document explaining that your project doesn't work because you have not done it yet. Given that, perhaps you should start by immediately making and submitting your implementor's notes document? (I would!)

Submission Procedure

For each project, you will be submitting a tarball (i.e., a file with the name ending in .tar or .tgz) that contains all things relevant to your work on the project. Minimally, each project tarball includes the source code for the project and a semi-formal "implementors notes" document as a PDF named notes.pdf. It also may include test cases, sample output, a make file, etc., but should not include any files that are built by your Makefile (e.g., no binary executables). Make it very clear in your implementor's notes exactly what each file is.

Submit your tarball below. The file can be either an ordinary .tar file created using tar cvf file.tar yourprojectfiles or a compressed .tgz file file created using tar zcvf file.tgz yourprojectfiles. Be careful about using * as a shorthand in listing yourprojectfiles on the command line, because if the output tar file is listed in the expansion, the result can be an infinite file (which is not ok).

Optimizing Compilers