Below is a list of project descriptions for students. Some of the projects are finished, some are in progress, and some are still available to students who want to do a UG4, MSc or PhD project.
If you want to do an MSc or PhD with us, you need to go through the application procedures set by the School of Informatics. Make sure you discuss your research proposal with Jano van Hemert first. Note that you need to apply under Intelligent Systems & their Applications.
The University of Edinburgh offers an MSc/Diploma in Distributed Scientific Computing.
List of projects
Present-day instrumentation networks in rivers provide huge quantities of multi-dimensional data. Although there are numerous machine learning tools that can extract trends, find patterns and predict future states from such data, it is crucial to tune these techniques to the semantic content of the data.
Principal goal: to extend Rapid, a tool for developing web portals for scientific computing, to operate with jclouds.
This project is part of the Google Summer of Code 2010 (see http://www.omii.ac.uk/wiki/RapidJclouds)
Principal goal: to extend Rapid, a tool for developing web portals for scientific computing, to operate with Apache Hadoop.
This project is part of the Google Summer of Code 2010 (see http://www.omii.ac.uk/wiki/RapidHadoop)
Principal goal: to substantially improve the performance of the data-intensive analysis for genome-wide association studies (GWAS) by using graphics processing units (GPUs).
Principal goal: to develop, test and make available to the cosmology community a parameter estimation method for models that explain our dark Universe.
Principal goal: to apply machine learning to identify small molecules that are likely to have relevant bioactivity, as candidates for follow-up wet-lab experiments.
Principal goal: to evaluate existing data streaming implementations, formulate a model that predicts streaming performance for a given buffering strategy, and then optimise data streaming with a dynamic buffering implementation.
Principal goal: to investigate existing data placement strategies and build a decision model to improve data placement strategies when enacting data-intensive workflows.
Primary objective: to perform data mining on a real-world data set from a biology lab in the School of Biological Sciences, with the aim of extracting patterns that lead to hypotheses about the mode of action of compounds and the function of genes.
Primary goal: to develop a classification algorithm to detect Web Spam.