Troels Henriksens academic homepage

Supervision

I am willing to supervise BSc and MSc projects that are related to my own research, or on topics that happen to catch my fancy. The projects tend to be heavy on implementation and somewhat technically difficult, but most of my former students claim to have enjoyed it. Most of my projects are fairly well-defined in regarding the eventual goal, but are open in terms of technique. Many of my projects are also directly related to either active open source projects, or ongoing research efforts.

When filling out projects contracts and such, use the following information:

How many hours are set aside for supervision?: In practice as many as necessary, but put 5 hours for a 7.5 ECTS project, 10 hours for a 15 ECTS project, and 20 hours for a 30 ECTS project.
How often do we meet for supervision?: Once per week, at a fixed time we agree upon after signing the contract. It is likely that many of the meetings will be cancelled when there is nothing to talk about, but having a regular slot avoids frequent calendar synchronisation.
What is expected of the student at the meeting?: The student must send an agenda or a cancellation at least the evening before the day of the meeting. Further, at the meeting the student must have up-to-date versions of all relevant code ready to demonstrate or clarify any problems.
What is expected of the supervisor at the meeting?: The supervisor should have pondered any questions asked in the meeting agenda.
Agreements or relevant aspects about your data collection/laboratory work, if any.: Not applicable for the vast majority of the projects I supervise.
Expectations of your cooperation in general.: My expectations are listed here. Feel free to put anything else that you consider important.
Handin date.: All bachelor projects have a fixed handin date, which is listed somewhere on Absalon.

You are also asked to write a project description, which contains a background and high level problem statement. You should write a first draft of this on your own, based on our discussions, as it is a useful test of how well you understand the goal of the project. One somewhat artificial part that the project description must contain is a list of 3-5 learning goals. These are actionable demonstrations of your (hopefully) newly acquired competences, and are to some extent what you will be evaluated on at the end of your project. Each learning goal is typically a single sentence. Here is an example of a project description.

Think about the goal of your project: Do you simply wish to try out interesting theory or technology? Contribute working code to an open source project? Contribute to a research project? Get a high grade as easily as possible? Make your goal clear to me so we both know what to aim for.

Valkyrie Savage has also written a helpful guide on being a project student.

Signature

UCPH has an interesting procedure where I have to pretend to sign your contract by copying an image of my signature into your PDF file. As an optimisation, here is an image of my signature so you can do it yourself: A photograph of my signature written in red ink. If you wish to see me operate a fountain pen in person, you can also come by my office with a printout of your contract.

Current list of project proposals

Beyond the list below, see also the list of student-viable projects on the Futhark issue tracker. These are all projects about contributing directly to a real open source project. Not all of them are large enough to serve as, for example, a master's thesis on their own. Others are perhaps too challenging for a bachelor's thesis unless you are a particularly skilled programmer.

Proposal: Port benchmarks from PBBS

The Problem Based Benchmark Suite is a collection of benchmark programs written in parallel C++. I am interested in porting them to Futhark such that they can be added to the existing collection of benchmarks. Some of the benchmarks are relatively trivial; others are more difficult. It might be a good idea for a project to combine a trivial benchmark with a more complex one. The list of benchmarks is here. The ones listed as Basic Building Blocks are all pretty straightforward, but not large enough to serve as a project on their own. Look at the others and pick whatever looks interesting. Particularly interesting are the ones related to computational geometry:

This project involves becoming familiar with data parallel programming and learning how to rephrase (typically) task parallel algorithms in the data parallel setting, usually also by flattening control flow and data.

Proposal: Static Analysis of Irregular Parallelism

The Futhark compiler currently supports regular nested data parallelism, which is a form of parallelism where the size of inner parallel dimensions are invariant to the outer dimensions. Even though we are working on supporting arbitrary irregular nested data parallelism, this will never be as efficient as regular nesting. Unfortunately, whether application parallelism is regular or irregular is an implicit property, and it can be hard for programmers to know when it occurs. This project is about writing a static analysis tool that takes as input a Futhark program and analyses it for potential instances of irregular nested data parallelism. This is a fun project if you enjoy working with ASTs and devising algorithms.

Proposal: Experiment with Tools for Automatic Differentiation

AD is a program transformation for computing derivatives of arbitrary programs. GradBench is a project that investigates the performance of AD tools for various languages. This project is about implementing one or more of the GradBench benchmarks in a new language or AD tool. For example, I am quite curious about the effectiveness of these AD tools for Haskell, but you can pick whatever language you wish, as long as it has some library or tool that supports AD. The concrete work done in the project is implementing the specified benchmark problems, applying AD as best you can, then benchmarking the resulting programs.

Proposal: Implementing an error-tolerant parser

The Futhark compiler uses a traditional parser architecture, based on the Happy parser generator that stops at the first error and is unable to produce a parse tree in case a syntax error occurs. This means the compiler and related tools, such as the Language Server and formatter, is unable to provide any information when the program has a syntax error. This is not ideal, as it is common for programs to have intermittent syntax errors while working on them.

Happy recently grew support for error recovery, which has already been used to modify GHC. This project is about adding a similar form of error tolerance to the Futhark parser, and as time permits, investigating how to modify programs such as the type checker and formatter to handle incorrect programs. Although the Futhark compiler is written in Haskell, it is not expected that this project requires particularly fancy Haskell programming. It is a good project for students interested in parser theory and practice.

Proposal: Mixing compiled and interpreted code

The Futhark compiler produces very efficient code, but it takes a long time to compile, which makes it impractical for quick or interactive use. The Futhark interpreter can execute code immediately, but its execution speed is by design quite slow. This project is about designing and implementing a mechanism for interoperability between compiled and interpreted code, such that one can compile a library of Futhark functions ahead of time, which can then be invoked from interpreted code. One subpart of this problem is exposing the types of a compiled Futhark program to the type checker, without the type checker having access to the original program.

Much of the low-level machinery for making this work is already in place: the server protocol provides a way to interact with an instance of a compiled Futhark program, and FutharkScript is an example of a tiny language that can invoke compiled Futhark programs.

The successful completion of this project would have substantial practical impact on Futhark usage, and would definitely see real use.

Proposal: Implement chained-scan in the Futhark compiler

In addition to its GPU backends, Futhark has a backend that generates code for multicore CPUs. However, its code generation for the scan building block is not great. This project is about implementing the chained-chan algorithm in the Futhark code generator. The result will be that Futhark programs run faster. The programming work will mainly involve Haskell programming, but the main technical challenge is understanding the optimisation of the chained-scan algorithm.

Proposal: Implement a differentiable renderer

Differentiable rendering is the idea of implementing a graphics renderer that is differentiable (using automatic differentiation) with respect to the scene being rendered. This allows the use of optimisation routines (machine learning) to e.g. construct a scene that when rendered matches some reference image or video. This has major applications to areas such as computer vision.

This project is about constructing such a differentiable renderer in Futhark, taking advantage of Futhark's support for automatic differentiation. The goal is not just to construct a renderer for future use, but also to exercise (and motivate improvements to) Futhark's AD implementation.

A subproblem (or perhaps the main problem if you find it more interesting) could also be a differentiable physics engine.

An alternative interpretation of this is to not focus on constructing a good single differentiable renderer, but instead define a simple (but still realistic) differentiable rendering algorithm, then implement it in multiple languages (and AD tools), such that it can serve as the basis for a benchmark for GradBench.

Proposal: Improving profiling reports for Futhark

Futhark can produce a reasonable and ever-increasing amount of profiling information that describes which parts of a program execution are the most time consuming, and (to some extent), why. However, this information is currently not made available to the user in a particularly useful way. This project is about writing a program that takes the information that is being produced and presenting it in a more useful way. This can include:

Heat maps of the original source code, showing which expressions contribute the most to the runtime.
Flame graphs showing time spent.
Backend-specific and heuristic-based performance suggestions, that look for e.g. kernels launched with insufficient parallelism.

The project is best implemented as an extension to the existing futhark profile tool, which is written in (rather simple) Haskell.

If successful, this project will be frequently used in practice by real Futhark programmers.

What do to if you liked a course

This section contains suggestions for where to go next if you liked a course that I have been involved with; particularly which other related courses would be worth taking, or who to contact to do a related BSc or MSc project. Use the Course Search to look up further details.

If you liked High Performance Programming and Systems

Related courses

High Performance Parallel Computing, taught by the physics department. A classic high performance computing course, that does not require more background in programming or computer systems than you have learned in HPPS.
Programming Massively Parallel Hardware, taught by Cosmin Oancea. Focuses on low level parallel programming and optimising for hardware specifics. A good course if you like programming and are good at it.
Data Parallel Programming, taught by myself and Cosmin Oancea. Focuses on high level parallel programming in a functional language, although your F# experience from the first year should be sufficient (especially if you were good at it).

Related projects

While I use a lot of the HPPS material in my own research, most of my projects also involve significant amounts of material related to functional programming, programming language theory, or compiler construction. For projects centered in the HPPS material, David Gray Marchant is a more suitable supervisor.

If you liked Advanced Programming

Related courses

Data Parallel Programming, taught by myself and Cosmin Oancea. Applies the principles of functional programming to the problem of writing certain kinds of fast programs.
Semantics and Types, taught by Andrzej Filinski. A course on fundamental programming language theory.
Advanced Topics in Programming Languages, taught by an ensemble cast of researchers. The theme changes every once in a while, but is always about cutting-edge ideas in programming languages.

Related projects

Many researchers at DIKU are willing to supervise projects related to functional programming or more generally the theory of programming. If you find monads or type systems particularly interesting, consider contacting Andrzej Filinski. Ken Friis Larsen is interested in most unusual uses of Haskell.

Hints for doing a student project with me

For electronic communication, use email. I prefer athas@sigkill.dk for various reasons. Rationale: This saves me from having to search a multitude of communication mediums whenever I need to figure out the context of our conversation. As a special case, feel free to contact me on IRC (#diku on Libera Chat), because IRC is a pleasant medium.

If your project involves modifying a program maintained in a Git repository (such as Futhark), use a feature branch for your project and issue a pull request against the main repository. Rationale: Using a distinct branch names makes it much easier for me to help you by checking out your code as a local branch, and issuing a pull request enables use of GitHub's tools for code review.

Feel free to write me emails with questions whenever you have any. I will respond as I am able.

When you desire feedback on your report, tell me specifically which part you want me to read, especially when I have previously given you feedback.

You should generally think about what kind of help you need from me, and say so explicitly in your emails. I am often willing to put in significant amounts of work to help with technical obstacles that are peripheral to your project, but I am unlikely to take the initiative. I can also provide suggestions for alternative approaches when you are stuck, but I need to know when you get stuck.

Regarding the defense

A bachelors project defense is about 20 minutes of presentation, followed by 20 minutes of the censor and I asking questions. A master's thesis is about 25 minutes of presentation and 25 minutes of questions. For a joint defense, each student beyond the first adds about 5 minutes. A POCS with a single student does not have a defense.

There is no requirement to present using slides, but it by a significant margin the most common. Consider very carefully before deciding to do a presentation using only white- or blackboard.

There is no requirement for any specific presentation tool. Use something you are comfortable with. Most students use Beamer. Very cool students use Patat or write their own presentation software.

Make sure to check in advance that your electronic equipment works. It is your reponsibility to bring whatever dongles you need.

Use black text on a white background for your slides. The lighting conditions are usually not good enough for white-on-black to be legible.

If you wish to use Beamer, note that it defaults to an aspect ratio of 4:3, which is obsolete even on UCPH's projectors. Use \documentclass[aspectratio=169]{beamer} to enable a modern 16:9 aspect ratio. There is a UCPH beamer template circulating. Do not use it! It is extremely ugly. Also, remember to disable the navigation buttons that Beamer inexplicably places on slides: \usenavigationsymbolstemplate{}

Have your code (if applicable) and report at hand to show on the projector, in case we want to point to specific parts in our questions.

Feel free to invite friends, family, lovers, enemies, colleagues, or anyone else you wish to your defense, but let me know well in advance so I can book a room with sufficient capacity.

The purpose of the defense is to evaluate your dissemination skills. The presentation part is slightly artificial, as both censor and I will have read your report in advance, so you do not have to exhaustively describe the background, context, or every technical detail. My advice is to superficially cover every main aspect of your work, then pick a single technical detail that you discuss in depth.

We may ask questions about any part of your work, not just your presentation.

It is permissible to present new work at the defense. This can be anything from additional experimental data, to solving a problem that was unsolved at the report deadline.

For guidance on how to give a good presentation, see How to Give a Great Research Talk.

Troels Henriksens academic homepage

Synopsis

How to find me

Supervision

Signature

Project proposals

Current list of project proposals

What do to if you liked a course

If you liked High Performance Programming and Systems

Related courses

Related projects

If you liked Advanced Programming

Related courses

Related projects

Hints for doing a student project with me

Regarding the report

Regarding the defense